MATH/COSC 3570 Introduction to Data Science
Statistics PhD student Qishi Zhan
Help desk hours: To be announced.
Welcome to set up a meeting with your TA via Teams.
Let me know if you need any other help! π
β What are prerequisites?
π COSC 1010 (Intro Programming) and MATH 4720 (Intro Stats) or MATH 2780 (Intro Regression)
β Is this like another intro stats course?
π No. Statistics and data science are closely related.
Nowadays
π Data science is a broader subject than statistics.
π Statistics focuses more on analyzing and learning from data, a part of the entire workflow of data science.
β Is this like another intro CS or programming course?
π Absolutely not. We learn how to code for doing data science, not for understanding computer systems and structures.
~ 60%
~ 40%
π Wouldnβt it be great to add both languages to your resume! π
β Donβt want to learn R and/or Python? Take 3570 next semester~!
β Drop deadline: 01/21 (Tue), 11:59 PM.
Have nice computing power and interactive collaboration with me and your teammates!
Student plan: $5/month (Cheaper than buying a textbook!)
30% In-class lab exercises and participation.
30% Homework
15% Midterm mini project
25% Final project competition
Extra credit opportunities
Grade | Percentage |
---|---|
A | [94, 100] |
A- | [90, 94) |
B+ | [87, 90) |
B | [84, 87) |
B- | [80, 84) |
C+ | [77, 80) |
C | [74, 77) |
C- | [70, 74) |
D+ | [65, 70) |
D | [60, 65) |
F | [0, 60) |
Graded as Complete/Incomplete and used as evidence of attendance and participation.
Allowed to have one incomplete lab exercise without any penalty.
Beyond that, 2% grade percentage will be taken off for each missing/incomplete exercise.
You will create a RStudio project in Posit Cloud saving all of your lab exercises. (Weβll go through know-how together)
The homework assignments are individual. Submit your own work.
β You may not share answers/code with your classmates.
You will have at least one week to complete your assignment.
β No make-up homework for any reason unless you have excused absence. π
You will be team up to do the midterm mini project.
More details about the mini project presentation will be released later.
You will be team up to do the final project.
Your project can be in either of the following categories:
Data analysis using statistical models or machine learning algorithms
Introduce a R or Python package not learned in class, including live demo
Introduce a data science tool (visualization, computing, etc) not learned in class, including live demo
Introduce a programming language not learned in class for doing data science, including live demo, Julia, SQL, MATLAB, SAS for example.
Web development: Shiny website or dashboard, including live demo
The final project presentation is on Thursday, 5/1, 2 PM and Monday, 5/5, 10:30 AM - 12:30 PM.
More information will be released later.
[Example] Data science is an interdisciplinary field that β¦ 1
[Example]
The code is modified from the GitHub repo https://github.com/chenghanyustats/slam
The code is generated from ChatGPT response to βPlease generate Python code for solving the math problem I attach,β Jan 14, 2025.
This course expects all students to follow University and College statements on academic integrity.
β K: I hope to learn more about programming. R: Iβm looking forward to learning more about what data science consists of rather than just learning a programming language.
π To make sure everyone is on the same page, first couple of weeks is about learning R and Python syntax. After spring break, we focus on modeling and machine learning methods.
β What do you think will be the most interesting part of the course?
π I love data visualization and web development.
β D: Do I need good coding skills to be able to succeed? G: How much of this class is about coding?
π Weβll learn basic syntax for doing data science step by step.
β What kind of time estimate do you believe most students should spend on reading + assignments for the course?
π Everyone is different. The more the better.
β What kind of projects will we be doing for this course?
π Any project related to data works. You propose one to me. We discuss it, then decide.
β Will this class help me better understand how to code proficiently?
π As you learn to speak a foreign language, you need to code a lot, frequently and constantly in order to be proficient in any programming language. No shortcut.
β Do you know of any internships or research positions offered through Marquette University that incorporate the skills learned in this Data Science course?
π Quite many. Northwestern Mutual, Direct Supply, for example. Iβll share intern info with you if I get any.