Introduction to Data Science

  9-12 graders

  Credits awarded on transcript  

  Algebra I completed with B- or better

  UC A-G approved for [C] Mathematics credits

  2 hours per day (summer)  

  8-10 students per class

  4 days per week 2, 4, or 6 weeks

  649 per student, per week  

  Self paced instructor-guided  

  Personalized 1-1 support

  Office hours 1 hour per week

  1749 per student, per semester  

  90 minutes per class

  4-8 students per class

  Twice per week over 36 weeks

  1749 per student, per semester  

Data is everywhere around us. We generate more data every 40 minutes than all of the data generated since the dawn of civilization until 2003. The ability to work with data, understand what it tells us, and use it in your communication has become an essential life and career skill.

90%

of all the world's data was created in just the last two years

1000x

computing power in a smartphone vs. a 1970s mainframe computer

11%

of all U.S. high school students complete any statistics coursework

Decisions that used to be straightforward are increasingly more complex and driven by data. Individuals across all disciplines need to constantly separate fact from friction. The need to analyze and interpret data has permeated every discipline — across engineering, business, finance, social sciences, humanities, and even journalism. Several leading academics now agree that the mathematics we teach in high school is rooted in the 1950s space race and needs to be updated to reflect the realities of the digital and information age of today.

2Sigma School takes an interactive approach to data exploration, rather than a lecture based approach. Our classes are hands-on and use several tools that are used by leading data scientists as well as higher education universities, as illustrated by the following video clip of a live session in a small cohort.

This is a high-school level course that introduces students to the exciting opportunities available at the intersection of data analysis, computing, and mathematics. In this course students will learn to understand, ask questions of, and represent data through project-based units. The units will give students opportunities to be data explorers through active engagement, developing their understanding of data analysis, sampling, correlation/causation, bias and uncertainty, modeling with data, making and evaluating data-based arguments, and the importance of data in society. At the end of the course, students will have a portfolio of their data science work to showcase their newly developed knowledge and understanding.

This is a beginner course and no prior experience with programming is required. During the first half of the course we cover key programming concepts that include variables, data types, comparisons and boolean operators, functions, control structures, and iteration. We will be using industry standard tools like Jupyter Notebooks, Python, and Data Commons. Students will get the chance to explore data sets in areas that they are familiar with. The course ends with a capstone project where the student get to apply what they have learned and round out their portfolio of data science work to showcase their newly developed abilities.

Some key differences between a traditional statistics course and the data science course include:

  • Larger data sets (Big Data) that can only be analyzed programmatically vs small, tailored data sets.
  • Use of modern statistical analysis and simulation tools vs a formula-based approach.
  • Use of Python programming for data analysis vs pen and paper based computations.

In order to maximize our time together during the live sessions, we use a flipped classroom model that includes pre-work for every class. This allows students to program with the support of an instructor during the class. The pre-work includes pre-recorded videos, online reading, and some programming practice.


   University of California A-G approved for [C] Mathematics credits.      NCAA Approved.

Course Outline

    hide details
  1. Data Tells a Story
    Students are introduced to data science through a reflection of their own experiences using self-generated data, an exploration of a larger dataset of people’s media use, and an analysis of business data. Students will make sense of the questions: What part of the story is told by data? What is variation? How is data generated? Students will learn simple spreadsheets and will explore univariate, bivariate, and multivariate data.
  2. Learning from Data Distributions
    Students will learn ways of modeling data, starting with the basic models of measures of center and spread, as well as considering sampling. It takes a deeper dive into the concepts, limitations, and the impact of outliers. Students will explore distributions and the role of probability. Students will collect their own data and compare it to a larger data set.
  3. Bivariate Data and Causality vs. Spurious Correlation
    Students will learn about bivariate data through discussions and data explorations with several analytical tools. Students will explore scatter plots as a visual way to represent the relationship between two variables, draw their own lines of best fit, and learn how data scientists determine and analyze lines of best fit.
  4. Probabilistic Modeling
    Students will consider the modeling process and the role played by variation, reflecting on the data collected from simulations and the ways data can help answer probabilistic questions and leverage this power for decision-making. In the process of creating powerful simulations, students will learn the basics of data analysis in Python.
  5. Categorical Data and Introduction to Linear Algebra
    Students will collect and analyze categorical data. They will also delve into the use of vectors to organize data into a multi-dimensional space to understand how data are similar or different to each other. Students will work in spreadsheets and Python based Jupyter notebooks.
  6. Modeling with Data and Understanding Bias
    Students will build a prioritization model to create a ranking, based on what they value, collect variables based on their values, gather and clean data, create functions to combine variables, normalize data, and create a weighting system for prioritizing their data. Students will do a sensitivity analysis on their weighting system and learn how bias impacts mathematical models.
  7. Introduction to Machine Learning
    Students will gather and clean data to make predictive models. They will be introduced to machine learning and will use machine learning on the same data to make more efficient and accurate predictions.
  8. Being a Data Scientist
    Students will have an opportunity to work through the full cycle of data science: making their own decisions about the questions they are interested in exploring, finding data to answer that question, cleaning the data, creating and analyzing a model, communicating with the data visually and reflecting on their process. This will be an interactive process mirroring how data scientists work on a project.

Summer of Code
    see detailed summer schedule

To take any of our courses, students must be familiar with opening a browser, navigating to a website, and joining a Zoom meeting.

Students must have a quiet place to study and participate in the class for the duration of the class. Some students may prefer a headset to isolate any background noise and help them focus in class.

Most course lectures and content may be viewed on mobile devices but programming assignments and certain quizzes require a desktop or laptop computer.

Students are required to have their camera on at all times during the class, unless they have an explicit exception approved by their parent or legal guardian.

Our technology requirements are similar to that of most Online classes.

A desktop or laptop computer running Windows (PC), Mac OS (Mac), or Chrome OS (Chromebook).
Students must be able to run a Zoom Client.
A working microphone, speaker, webcam, and an external mouse.
A high-speed internet connection with at least 15mbps download speed (check your Internet speed).

This course includes several timed tests where you will be asked to complete a given number of questions within a 60-90 minutes limit. These tests are designed to keep you competitively prepared but you can take them as often as you like. We do not proctor these exams, neither do we require that you install special lockdown browser.

In today's environment, when students have access to multiple devices, most attempts to avoid cheating in online exams are symbolic. Our exams are meant to encourage you to learn and push yourself using an honor system.

We do assign a grade at the end of the year based on a number of criteria which includes class participation, completion of assignments, and performance in the tests. We do not reveal the exact formula to minimize students' incentive to optimize for a higher grade.

We believe that your grade in the course should reflect how well you have learnt the skills, and a couple of timed-tests, while traditional, aren't the best way to evaluate your learning.

FAQ

Your questions. Our answers.

2Sigma School offers three flexible enrollment formats so students can learn in the way that best fits their schedule:

Independent Study is our most popular format and is offered year round. An assigned instructor guides the student through the course with weekly 1-on-1 office hours for teaching new concepts, reviewing work, and giving personalized feedback. At a standard pace, students complete one semester in about 18 weeks or the full course in 36 weeks. Students who want to move faster can set an accelerated goal and their instructor will provide additional office hours to match. This format works well for students with busy schedules or those completing coursework started in another format.

Summer of Code is our accelerated summer program. Students meet 2 hours per day, 4 days per week in an intensive format that allows them to complete a full-year course in as few as 6 weeks. This is ideal for students who want to earn credits quickly or get ahead before the school year starts. It requires a significant time commitment and is best suited for students who can dedicate themselves fully to the program during the summer.

Live Classes are our year-round, cohort-based format. Students attend live online sessions (90 minutes, twice per week) over 36 weeks in small groups of 10–14 students. This format provides consistent interaction with instructors and peers throughout the academic year, with time between sessions to work on projects and assignments. These classes are subject to cohort formation and may have limited availability.

Independent Study offers the most flexibility. During the application process, students provide their availability and we match them with an instructor whose schedule aligns. Sessions are then scheduled directly between the student and their instructor.

Summer of Code and Live Classes are cohort-based. Students indicate all of their available time slots when applying, and we form cohorts based on the collective availability of enrolled students. This approach gives families flexibility while ensuring each cohort has a consistent meeting time that works for everyone in the group.

As an accredited institution, 2Sigma School can award high school credits for our courses on a transcript, just like your child's regular school. Some high schools may accept these credits as part of their graduation requirements, and even use them to calculate your child's GPA. This policy varies by school, so we recommend you check with your child's school counselor.

Regardless of your local school's decision, students can submit our transcript to colleges as college applications allow additional transcripts to be submitted. This can help your child stand out in the college application process.

We are accredited by Cognia and most of our courses are not only NCAA approved, they are also approved for A-G credits by the University of California. Our AP courses are approved by the College Board.

In addition, 2Sigma School is an approved online course provider in multiple states, including California, Washington, Oklahoma, and Florida. See our accreditations page for more details.

Our instructors vary by course as well as by availability. Some of our instructors are high school teachers with deep expertise in computer science, teaching credentials in multiple states, and years of classroom experience. Others are professionals with an undergraduate degree in Computer Science with deep experience in the subject matter and a demonstrated ability and aptitude to teach teenage students.

All of our instructors undergo a rigorous screening process for content and pedagogical expertise. They are background checked as per local and state regulations, and are trained in online instruction and classroom management.

Students need a desktop or laptop computer running Windows (PC), Mac OS (Mac), or Chrome OS (Chromebook), along with a working microphone, speaker, and webcam. A high-speed internet connection with at least 10 Mbps download speed is required.

Students should be familiar with opening a browser, navigating to a website, and joining a Zoom meeting. Most course content can be viewed on mobile devices, but programming assignments and certain quizzes require a desktop or laptop computer.

Yes. All formats include access to instructor support. In Live Classes and Summer of Code, students can ask questions during class and instructors are available to provide additional guidance. In Independent Study, students have dedicated weekly office hours with their assigned instructor.

If a student needs additional 1-on-1 support beyond what is included, we offer tutoring sessions that can be arranged through our team. Contact us at admissions@2sigma.school for details.