In this course, students will learn to apply machine learning concepts to analyze data and make predictions. Students will learn how to collect and wrangle data, to explore data using statistics and visualizations, to transform data for further modeling, to model data using machine learning algorithms to predict data patterns, and to evaluate these model-based predictions. Students will be expected to have prior experience with fundamentals of programming.
- Programming language review for data analytics
- Basic syntax, variables, control flow, loops, install and import libraries for data processing such as SciPy, NumPy, Pandas, Sci-Kit Learn, TensorFlow or other similar libraries and packages.
- Data And Features: using libraries such as NumPy and Pandas
- Represent data using lists, arrays for structured data
- Work with data frames using packages such as Pandas to represent diverse data
- Use Control Flow for filtering data and performing filtered computations
- Manipulate Data using functions and packages to process the data and perform computations
- Understand, determine and represent Features
- Perform Data Wrangling
- Exploring Data: - using libraries such as Matplotlib
- Visualize Data by creating plots using tools such as Matplotlib
- Perform high dimensionality visualizations
- Transforming Data: using libraries such as Sci-Kit learn
- Create Data Transformers and apply dimensionality reducing techniques as PCA
- Data Modeling: using libraries such as Keras, TensorFlow and scikit-learn
- Use machine learning techniques such as clustering, supervised learning, K-nearest neighbours, Regression to model the data
- Evaluating Data: Evaluate modeled data using evaluation techniques
- Create and apply confusion matrices
- Perform cross-validation using scoring metrics
- Implement and apply power tuning and pipelining to evaluate the data
Methods of Instruction
Lecture, seminars, demonstrations, and hands-on exercises/projects in the lab
Means of Assessment
*Some of these assessments may involve group work.
**In order to pass the course, students must, in addition to receiving an overall course grade of 50%, also achieve a grade of at least 50% on the combined weighted examination components (including quizzes, tests, exams).
- Install and use appropriate tools and libraries needed for Data Science
- Understand and process data and features
- Collect and Wrangle Data for further processing
- Explore Data using statistics and visualizations
- Transform Data to a structure suitable for data modeling
- Model Data using machine learning algorithms
- Evaluate model-based predictions
min grade C in CSIS 1190 OR currently active in the
PDD Data Analytics
PBD Computer and Information Systems
(Note: CSIS 1175 recommended)
No corequisite courses
No equivalency courses
Course Guidelines for previous years are viewable by selecting the version desired. If you took this course and do not see a listing for the starting semester/year of the course, consider the previous version as the applicable version.
Below shows how this course and its credits transfer within the BC transfer system.
A course is considered university-transferable (UT) if it transfers to at least one of the five research universities in British Columbia: University of British Columbia; University of British Columbia-Okanagan; Simon Fraser University; University of Victoria; and the University of Northern British Columbia.
For more information on transfer visit the BC Transfer Guide and BCCAT websites.
If your course prerequisites indicate that you need an assessment, please see our Assessment page for more information.