Big Idea 2: Data | AP CSP Course | APCSExamPrep.com
Big Idea 2 • DAT • 17–22% of AP Exam
Data
How computers represent, store, compress, and analyze data. Covers binary numbers, lossless vs. lossy compression, extracting information from datasets, metadata, and using programs to process data. Includes the binary conversion game — the highest-impact interactive in the course.
🕒 Prerequisite: Complete Big Idea 1 first — Big Idea 2 builds on program concepts introduced there.
All lessons
Big Idea 2 Topics
Follow in order. Binary (2.1) is the foundation everything else builds on.
Binary Numbers
How computers represent all data — text, images, audio, and numbers — using only 0s and 1s. Covers binary-to-decimal conversion, place values, how bits represent digital data, and how analog signals are approximated digitally. The most calculation-heavy topic in BI2.
Data Compression
How data is made smaller for storage and transmission. Covers lossless compression (perfect reconstruction guaranteed) vs. lossy compression (smaller file, some data lost permanently). Students must choose the right type for a given context — a classic AP exam scenario.
Extracting Information from Data
How programs process raw data to extract information, identify trends, and generate knowledge. Covers correlation vs. causation, metadata, data cleaning challenges, bias in datasets, and why large datasets don't automatically mean better conclusions.
Using Programs with Data
How programs transform and visualize data to reveal patterns and support decision-making. Covers filtering, sorting, searching datasets, visualizations (charts, graphs, tables), and how computational tools enable analysis that would be impractical by hand.
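As a rough illustration of filtering, sorting, and searching, here is a minimal Python sketch. The dataset, field names, and helper functions (`filter_hot`, `sort_by_temp`, `find_city`) are hypothetical and invented for this example; they are not part of the course materials.

```python
# Hypothetical dataset: city names and temperatures are made up for illustration.
records = [
    {"city": "Austin", "temp_f": 95},
    {"city": "Boston", "temp_f": 72},
    {"city": "Denver", "temp_f": 88},
    {"city": "Seattle", "temp_f": 68},
]

def filter_hot(rows, threshold):
    """Filtering: keep only rows whose temperature is above the threshold."""
    return [row for row in rows if row["temp_f"] > threshold]

def sort_by_temp(rows):
    """Sorting: return a new list ordered from coolest to warmest."""
    return sorted(rows, key=lambda row: row["temp_f"])

def find_city(rows, name):
    """Searching: return the first row matching the city name, or None."""
    for row in rows:
        if row["city"] == name:
            return row
    return None

hot_cities = [row["city"] for row in filter_hot(records, 80)]
coolest = sort_by_temp(records)[0]["city"]
```

With four rows this is easy to do by hand. The same three functions work unchanged on a list of millions of rows, which is the point the topic makes about computational tools.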
Key concepts
What the Exam Tests in BI2
These are the highest-frequency BI2 concepts across released AP CSP exams.
Binary conversion
Convert decimal to binary and vice versa using place values. Expect 2–3 direct calculation questions. The binary race game will drill this faster than any other method.
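The place-value method can be sketched in Python as follows. The function names are chosen for this example only, and the code assumes non-negative whole numbers, which is the usual case for this kind of calculation.

```python
def binary_to_decimal(bits):
    """Add up the place value (1, 2, 4, 8, ...) of every position holding a 1."""
    total = 0
    place_value = 1
    for digit in reversed(bits):  # start from the rightmost bit
        if digit == "1":
            total += place_value
        place_value *= 2  # each position to the left is worth twice as much
    return total

def decimal_to_binary(number):
    """Repeatedly divide by 2; the remainders are the bits, right to left."""
    if number == 0:
        return "0"
    digits = []
    while number > 0:
        digits.append(str(number % 2))
        number //= 2
    return "".join(reversed(digits))
```

For example, `binary_to_decimal("1011")` adds 8 + 0 + 2 + 1 to get 11, and `decimal_to_binary(11)` returns `"1011"`.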
Lossless vs. lossy
Know when each compression type is appropriate. Lossless = exact reconstruction. Lossy = smaller file, permanent quality loss. The exam presents scenarios and asks which to choose.
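To make the difference concrete, here is a small Python sketch. Run-length encoding is used as one simple example of a lossless technique, and rounding values down to a coarser step stands in for a lossy one. Both are illustrations chosen for this example, not the algorithms real image or audio formats use, and the function names are hypothetical.

```python
def rle_encode(text):
    """Lossless: store each run of repeated characters as (character, count)."""
    if not text:
        return []
    runs = []
    current, count = text[0], 1
    for ch in text[1:]:
        if ch == current:
            count += 1
        else:
            runs.append((current, count))
            current, count = ch, 1
    runs.append((current, count))
    return runs

def rle_decode(runs):
    """Rebuild the original text exactly from the stored runs."""
    return "".join(ch * count for ch, count in runs)

def lossy_quantize(samples, step):
    """Lossy: round each value down to a multiple of step; detail is discarded."""
    return [(s // step) * step for s in samples]

original = "WWWWBBBW"
encoded = rle_encode(original)
restored = rle_decode(encoded)
quantized = lossy_quantize([12, 17, 23, 48], 10)
```

Here `restored` equals `original`, so nothing was lost. After quantizing, 12 and 17 both become 10, and there is no way to tell them apart again, which is what permanent loss means.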
Correlation ≠ causation
One of the most tested BI2 ideas. Data can show a correlation between variables, but correlation alone cannot prove that one variable causes the other. Establishing causation requires additional research.
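A short Python sketch can show how a program measures correlation. The numbers below are invented for illustration, and `pearson_r` is a hand-written helper for the standard Pearson correlation coefficient, which goes beyond what the exam asks students to compute.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient: +1 is a perfect positive linear relationship."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sum((x - mean_x) ** 2 for x in xs)
    spread_y = sum((y - mean_y) ** 2 for y in ys)
    return covariance / math.sqrt(spread_x * spread_y)

# Hypothetical weekly figures, made up for this example.
ice_cream_sales = [20, 35, 50, 65, 80]
sunburn_cases = [2, 4, 5, 7, 9]

r = pearson_r(ice_cream_sales, sunburn_cases)
```

The value of `r` comes out very close to 1, a strong correlation. Nothing in that number says ice cream causes sunburn; a third factor such as hot, sunny weather could drive both, and the data alone cannot rule that out.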
Metadata
Data about data. A photo's metadata can include the date taken, file size, and GPS location. Changing or deleting metadata does not change the primary data. Privacy implications are tested in BI5.
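One way to picture this is to model a photo as two separate parts. The structure below is a simplified, hypothetical model with made-up values; real image files store metadata in format-specific ways.

```python
# Hypothetical photo: the pixel values and metadata fields are invented for illustration.
photo = {
    "pixels": [[255, 0, 0], [0, 255, 0], [0, 0, 255]],  # the primary data
    "metadata": {
        "date_taken": "2024-05-01",
        "file_size_kb": 2048,
        "gps": (40.7, -74.0),
    },
}

def strip_metadata(image):
    """Return a copy of the image with its metadata removed and pixels untouched."""
    return {"pixels": image["pixels"], "metadata": {}}

stripped = strip_metadata(photo)
same_picture = stripped["pixels"] == photo["pixels"]
```

`same_picture` is `True`: removing the date and location changes what is known about the photo, not the picture itself.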
Data cleaning challenges
Real datasets are messy: incomplete values, inconsistent formatting, invalid entries, and records combined from multiple sources. Cleaning is required before analysis. Collecting more data doesn't eliminate bias.
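Here is a minimal Python sketch of what cleaning can look like for one column of survey answers. The raw values and the `clean_ages` helper are hypothetical, and the rule used (keep only entries made entirely of digits) is one simple choice among many.

```python
# Hypothetical raw survey answers for "age", including typical problems.
raw_ages = ["17", " 16 ", "", "sixteen", "18", "-3", None]

def clean_ages(values):
    """Keep only entries that are whole, non-negative numbers; drop the rest."""
    cleaned = []
    for value in values:
        if value is None:          # incomplete: no answer recorded
            continue
        text = value.strip()       # inconsistent formatting: stray spaces
        if not text.isdigit():     # invalid: empty, spelled-out, or negative
            continue
        cleaned.append(int(text))
    return cleaned

ages = clean_ages(raw_ages)
```

Only three of the seven entries survive. Dropping rows is itself a decision that can skew results, which is one reason cleaning is tested as a challenge and not a solved step.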
Analog vs. digital
Analog data is continuous (sound waves, light). Digital data is discrete (0s and 1s). Converting analog to digital involves sampling the signal at regular intervals, which always creates an approximation, not an exact copy.
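Sampling can be sketched in Python by measuring a smooth wave at fixed intervals and rounding each measurement to one of a limited number of levels. The function name, parameters, and the choice of a sine wave are assumptions made for this example.

```python
import math

def sample_wave(frequency_hz, sample_rate_hz, duration_s, levels):
    """Measure a sine wave at regular intervals and round each value to a level."""
    count = int(sample_rate_hz * duration_s)
    samples = []
    for i in range(count):
        t = i / sample_rate_hz                              # time of this sample
        value = math.sin(2 * math.pi * frequency_hz * t)    # continuous value in [-1, 1]
        level = round((value + 1) / 2 * (levels - 1))       # nearest of the allowed levels
        samples.append(level)
    return samples

coarse = sample_wave(1, 8, 1, 16)    # 8 samples per second, 16 levels (4 bits)
finer = sample_wave(1, 64, 1, 256)   # 64 samples per second, 256 levels (8 bits)
```

`finer` follows the wave more closely than `coarse`, but both are lists of whole numbers. Whatever the wave did between two samples, or between two levels, is not recorded, so the digital version remains an approximation.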