The fastest way to get help with homework assignments is to post your questions on Piazza. That way, not only our TAs and instructor can help, your peers can too.
If you prefer that your question addresses to only our TAs and the instructor, you can use the private post feature (i.e., check the "Individual Students(s) / Instructors(s)" radio box).
While we welcome everyone to share their experiences in tackling issues and helping each other out, but please do not post your answers, as that may affect the learning experience of your fellow classmates.
For special cases such as failed submissions due to system errors, missing grades, failed file uploads, emergencies that prevent you from submitting, personal issues, you can contact the staff using a private Piazza post.
TAs plan to hold office hours starting week 2, except on Georgia Tech holidays (e.g., thanksgiving, MLK day, spring break). Each office hour session will be run by at least one TA, and is 1 hour long. See GT’s academic calendar for the full list of holidays (https://registrar.gatech.edu/calendar). We will spread the office hours across weekdays, and across time of the day. We will announce the office hour times.
We will hold office hours via Slack, where the TA running the office hour will be responsive. We will share information about how to join the appropriate Slack group.
Please note that you are always welcome to ask questions on Piazza. Office hours supplement Piazza, and do not replace it.
Polo's and Mahdi's office hours (click to sign up) are on Tuesdays 4:45 to 5:15 pm Atlanta time (i.e., after class), starting week 2, focusing on lectures and projects. For assignemnt related questions, please use Piazza and TA office hours.
* Course Introduction [slides]
* Analytics Building Blocks [slides]
* Data Science Buzzwords [slides]
Fri, Aug 27
* Data Collection
* SQLite [slides]
* Data Cleaning [slides]
* Class Project Overview [slides]
** Example project: Firebird - Predicting Fire Risks in Atlanta [2min | 20min]
* Code Back-up & Version Control [slides]
* Visualization 101 [slides]
* Fixing Common Visualization Issues[slides]
Fri, Sept 17
(Sat, 07:59 ET)
Fri, Sept 17
|5||20-24||* Data Visualization for Web (D3) [slides]||Form project teams by
Fri, Sept 24
* Scalable Computing: Hadoop [slides]
* Scalable Computing: Pig [slides]
* Scalable Computing: Hive [slides]
* Scalable Computing: Spark [slides]
* Scalable Computing: HBase [slides]
Fri, Oct 8
Fri, Oct 8
* Classification [slides]
* Visualization for Classification [slides]
|Proposal Document due
Fri, Oct 15
Proposal Presentation Slides and Video due
Fri, Oct 15
|9||18-22||* Introduction to Clustering [slides]|
|10||25-29|| * Graph Analytics [slides] [slides]
* Ensemble Method [slides]
* Scaling up Algorithms with Virtual Memory [slides]
Fri, Oct 29
Fri, Oct 29
|11||Nov||1-5||[Work on Project]||Progress Report due
Fri, Nov 5
* Time series: basics and linear forecasting [slides]
* Time series: non-linear forecasting, visualization [slides]
|13||15-19||* Text Analytics [slides]||HW4 due
Fri, Nov 19
(Sat, 06:59 ET)
|15||29-3||[Work on Project]
||Poster Presentation Video due
Fri, Dec 3
Final Report due
Fri, Dec 3
* Course Review and wrap up
|Poster Presentation Video grading starts
Tue, Dec 7
Poster Presentation Video grading due
Fri, Dec 10
Students have at least 3 weeks to complete each homework assignment. Some students waited until the last week, and could not finish. It is critical to plan ahead and prepare for the significant time needed.
Basic linear algebra, probability and statistics knowledge is also expected.
This semester is challenging due to the ongoing Covid-19 pandemic and a growing awareness of inequities. Please review the most up-todate information relates to specific services and guidelines for courses during this semester at TECH Moving Forward website and in the Academic Restart Frequently Asked Questions.
The Office of Disability Services offers accommodations for students with disabilities. Please contact the office should you need help.
Graduate Student Resources and academic and personal support services : Office of the Dean of Students, Counseling Center, Health Serivces, Women's Resource Center, LGBTQIA Resource Center, Veteran's Resource Center, Georgia Tech Police.
All content and course materials can be accessed online. There is no textbook for this course.
All Georgia Tech students have FREE access to https://www.oreilly.com, where you can find a huge number of highly rated and classic books (e.g., the "animal" books) from O'Reilly and Pearson covering a wide variety of computer science topics, including some of those listed below. Just log in with your official GT email address, e.g., email@example.com.
None, but you should have taken courses similar to those listed in the next section, at Georgia Tech or at another school.
If you are an Analytics (OMS or campus) degree student, you should first take CSE 6040 and do very well in it; if necessary, please also first take CS 1301.
We thank Intel's support in curriculum development for the memory mapping module (scaling up algorithms with virtual memory).
We thank Amazon Educate for providing free cloud credit for Amazon Web Services. We are excited to be am AWS partner university and part of AWS Educate's private beta.
We thank Microsoft Azure's special grant for providing free cloud credit.
We thank Tableau for Teaching program's data visualization software.Many thanks to my colleagues for sharing their course materials: