Fall 2022
September 08, 2022
Set course expectations and explain course policies.
Introduction to Data Science (SDS 192) aims to equip students with the knowledge and tools to understand, critically evaluate, manipulate, and explain data. This is an introductory course, and no prior experience is necessary.1 Students will learn how to read and write code, but also how to create, organize, and collaborate on coding projects while critically examining the projects goals and data sources. We will be primarily use the R language, along with supplemental tools.
Learning to code can be intimidating!
Coding is often frustrating.
Don’t worry about the math for now.
More than anything, remember coding is like learning a new language.
If you have only a Chromebook, come talk to me as soon as possible.
Nothing graphic, but sometimes sad.
Learn how to (hopefully) make things better.
Lab/Project Work
Credit to Michael Barber
A little data literacy goes a long way.
I think this is unfair …
I can show how this process impacts people differently.
I think this policy would help …
Models predict this policy would increase X by Y%.
I think this claim is too strong to be true …
The data does not support their conclusion because of X.
And now all the tools are free for everyone!
Credit to AnyDice
My area is computational social science.
Data Science, and this class, are collaborative.
You’ll be working with others often, so take some time to introduce yourselves.
Some suggestions:
SDS Departmental policy and university expectations do not include remote course options outside of Office of Disability Services (ODS) accommodations.
If you are sick, please stay home.
The majority of class communications will take place on Slack, a messaging platform used widely at Smith and beyond. Please install Slack and join the class workspace before the next class.
We will spend some time at the start of next class going over Slack usage and etiquette.
Class readings are all posted on the class reader and will direct you to Perusall.
Perusall is a reading platform that lets you collaboratively take notes on readings. You can see each others highlights, make comments, and ask questions.
This course will be using a version of standards-based grading.
Rather than tallying up the percentage of questions you answer correctly, I assess your responses by using a pre-defined set of course standards and then assign a level of proficiency.
 
 
 
 
 
Mean of A1-A5
 
 
 
 
 
Max of A1-A5
Standards-based grading provides:
No late work will be accepted.
You can request extensions per syllabus policy.
Remember, missing something does not harm your grade, you just miss a chance to show proficiency.
What is Data?
SDS 192-03: Intro to Data Science