We use Edstem for all announcements and discussion.
Everyone must join this class's Ed Discussion through Canvas.
Double check that you are joining the correct Edstem! There are multiple concurrent course sections with the same name and course number taking place, e.g., online for OMSA and OMSCS, and campus for Atlanta-based students.
Students must always use Ed Discussion to communicate with course staff or for any class-related questions. Ed Discussion will be used for general posts, including private and public posts, threads, mega threads, Q&A, and announcements.
If course staff needs to communicate with specific students (i.e. members of a project team), the Ed Chat feature of Ed Discussion will be used. Students can benefit from this feature to communicate with other students. e.g., to discuss forming a project.
IMPORTANT: Everyone must ensure that the notification setting is on for both Ed Discussion and its Ed Chat feature to stay up to date with the class requirements and prevent losing points because of missing updates and announcements on Ed Discussion.
The fastest way to get help with homework assignments is to post your questions on Ed Discussion. That way, you can get help from our TAs and instructor can help, as well as your peers.
If you prefer that your question is addressed to only our TAs and the instructor, you can use the private post feature (i.e., check the "Individual Students(s) / Instructors(s)" radio box).
While we welcome everyone to share their experiences in tackling issues and helping each other out, you must not post your answers, as that may affect the learning experience of your fellow classmates.
For special cases such as failed submissions due to system errors, missing grades, failed file uploads, emergencies that prevent you from submitting, or personal issues, you can contact the staff using a private Ed Discussion post.
Office hours are held virtually starting week 2, except on Georgia Tech holidays (e.g., Thanksgiving, MLK day, spring break).
Each TA office hour is run by at least one TA, and is 1 hour long. See GT’s academic calendar for the full list of holidays (https://registrar.gatech.edu/calendar). We spread the office hours across weekdays, and across time of the day. TA office hours are held on Slack, where the TA running the office hour will be responsive. We will announce TA office hour times and how to join the appropriate Slack group.
You are always welcome to ask questions on Ed Discussion. Office hours supplement Ed Discussion, and do not replace it.
Polo's office hours are on Tuesdays from 3:30pm to 4:30pm, divided into 10-minutes slots. Visit Canvas to sign up for a slot. All questions are welcome, except homework assignment related questions, which are best addressed via Ed Discussion and TA office hours.
* Course Introduction [slides]
Fri, Jan 13
* Data Cleaning [slides]
* Data Science Buzzwords [slides]
* Class Project Overview [slides]
** Example project: Firebird - Predicting Fire Risks in Atlanta [2min | 20min]
* Code Back-up & Version Control [slides]
* Data Integration [slides]
* Data Analytics, Concepts and Tasks [slides]
* Visualization 101 [slides]
Fri, Feb 3
(Sat, 06:59 ET)
Fri, Feb 3
|5||6-10||* Data Visualization for Web (D3) [ d3 slides][ html; css slides][js slides]||Form project teams by
Fri, Feb 10
* Scalable Computing: Hadoop [slides]
* Scalable Computing: Pig [slides]
* Scalable Computing: Hive [slides]
* Scalable Computing: Spark [slides]
* Scalable Computing: HBase [slides]
Fri, Feb 24
Fri, Feb 24
* Classification [slides]
|Proposal Document due
Fri, Mar 3
Proposal Presentation Slides and Video due
Fri, Mar 3
|9||6-10||* Introduction to Clustering [slides]|
* Graph Analytics [slides] [slides]
Fri, Mar 17
(Sat, 07:59 ET)
Fri, Mar 17
* Time series: basics and linear forecasting [slides]
* Time series: non-linear forecasting, visualization [slides]
|Progress Report due
Fri, Mar 31
|13||Apr||3-7||* Project Final Discussion (In-person meeting with each group)|
|14||10-14||* Text Analytics [slides] [PCA Slide]||HW4 due
Fri, Apr 14
|15||17-21||* Ethics in ML [slides]
* Preparation for final report and poster presentation
|Poster Presentation Video due
Fri, Apr 21
Final Report due
Fri, Apr 21
* Course Review (11 Lessons Learned from Working with Tech Companies) and wrap up
|Poster Presentation Video grading starts
Tue, Apr 25
Poster Presentation Video grading due
Fri, Apr 28
Students have at least 3 weeks to complete each homework assignment. In the past, some students have waited until the last week to begin, and could not finish. It is critical to plan ahead and prepare for the significant time required to complete the homework assignments.
Basic linear algebra, probability and statistics knowledge is also expected.
The Office of Disability Services offers accommodations for students with disabilities. Please contact the office should you need help.
Graduate Student Resources and academic and personal support services : Office of the Dean of Students, Counseling Center, Health Serivces, Women's Resource Center, LGBTQIA Resource Center, Veteran's Resource Center, Georgia Tech Police.
All content and course materials can be accessed online. There is no textbook for this course.
All Georgia Tech students have FREE access to https://www.oreilly.com, where you can find a huge number of highly rated and classic books (e.g., the "animal" books) from O'Reilly and Pearson covering a wide variety of computer science topics, including some of those listed below. Just log in with your official GT email address, e.g., firstname.lastname@example.org.
None, but you should have taken courses similar to those listed in the next section, at Georgia Tech or at another school.
If you are an Analytics (OMS or campus) degree student, you should first take CSE 6040 and do very well in it; if necessary, please also first take CS 1301.
We thank Intel's support in curriculum development for the memory mapping module (scaling up algorithms with virtual memory).
We thank Amazon Educate for providing free cloud credit for Amazon Web Services. We are excited to be am AWS partner university and part of AWS Educate's private beta.
We thank Microsoft Azure's special grant for providing free cloud credit.
We thank Tableau for Teaching program's data visualization software.Many thanks to my colleagues for sharing their course materials: