Courses Detail Information

STAT4710J – Data Science and Analytics using Python


Instructors:

Ailin Zhang;

Credits:

4

Pre-requisites:

ECE4010J

Description:

Data science integrates data, computation, and analytical thinking to transform the way we solve problems and make decisions. This course introduces the core components of data science, including question formulation, data collection and cleaning, visualization, statistical inference, predictive modeling, and decision making.
Emphasizing real-world, data-driven problem solving, the course provides hands-on experience with Python for data transformation, querying, and analysis. Students will learn algorithms for machine learning methods such as regression, classification, and clustering; explore principles for creating effective data visualizations; and understand key statistical concepts related to measurement error, uncertainty, and prediction.

Course Topics:

1. Jupyter notebook
2. Data Acquisition and Manipulation
3. Pandas
4. Regex
5. Web Scraping
6. SQL and Big Data
7. Modeling
8. Feature Engineering
9. Regression and Classification
10. Unsupervised learning