Machine Learning
Course objectives:
This comprehensive course offers participants a hands-on introduction to the world of machine learning. It covers essential topics such as data preprocessing, regression, classification, clustering, and dimensionality reduction. Participants will also explore advanced techniques such as neural networks and decision trees. The course is ideal for professionals aiming to transition into data science, as well as students and researchers interested in artificial intelligence.
Participant’s profile:
G12 students, data science enthusiasts, IT professionals, students in computer science, and researchers interested in AI and machine learning applications.
Requirements to the participants:
Basic computer skills.
Length of the course:
- Credit Hours: 3
- Total Hours Required: 20
- Delivery format: online/offline
- Contact hours: 3
- Self-study: 10 hours
- Final control: Project
Course content
1. Introduction to Machine Learning
| Classroom session | Topic for the classroom session | Sub topics | Classroom activities/forms of | Self-study tasks | hours |
|---|---|---|---|---|---|
| Overview of machine learning concepts | Types of machine learning: supervised vs. unsupervised Setting up the machine learning environment (Anaconda, Jupyter Notebook) |
|
|
|
5 |
2. Data Preprocessing
| Classroom session | Topic for the classroom session | Sub topics | Classroom activities/forms of | Self-study tasks | hours |
|---|---|---|---|---|---|
| Data cleaning and preparation |
|
|
|
|
5 |
3. Regression and Classification
| Classroom session | Topic for the classroom session | Sub topics | Classroom activities/forms of | Self-study tasks | hours |
|---|---|---|---|---|---|
| Linear regression model |
|
|
|
|
5 |
4. Clustering and Dimensionality Reduction
| Classroom session | Topic for the classroom session | Sub topics | Classroom activities/forms of | Self-study tasks | hours |
|---|---|---|---|---|---|
| k-means clustering |
|
|
|
|
5 |
5. Advanced Topics and Applications
| Classroom session | Topic for the classroom session | Sub topics | Classroom activities/forms of | Self-study tasks | hours |
|---|---|---|---|---|---|
| Introduction to neural networks | Overview of decision trees and random forests |
|
|
|
5 |
Final control
Assessment Components:
-
Final Exam (40%):
- Format: Written and practical components.
- Content: Covers all chapters, including machine learning concepts, data preprocessing, regression, clustering, and advanced topics.
- Skills Assessed: Application of concepts, implementation of algorithms, and interpretation of results.
-
Project (30%):
- Description: Develop a machine learning model to solve a real-world problem.
- Example: Predict housing prices using regression, or classify customer reviews using logistic regression.
- Evaluation Criteria: Problem understanding and approach, code quality and functionality, analysis and presentation of results.
-
Class Participation and Assignments (20%):
- Description: Ongoing assessment of participation in classroom activities and submission of weekly tasks.
- Evaluation Criteria: Consistency and engagement in class, accuracy and completeness of assignments.
-
Quizzes (10%):
- Description: Two quizzes during the course to assess understanding of key topics.
- Format: Multiple-choice, coding snippets, and short-answer questions.
- Content: Focuses on data preprocessing, regression, and clustering.
Learning Outcomes:
By the end of the course, students will be able to:
-
Machine Learning Fundamentals:
- Explain key concepts of machine learning and distinguish between supervised, unsupervised, and advanced learning methods.
-
Data Preprocessing:
- Perform data cleaning, feature scaling, normalization, and split datasets for training and testing.
-
Regression and Classification:
- Build and evaluate linear regression and logistic regression models.
- Apply and interpret evaluation metrics like accuracy, precision, recall, and RMSE.
-
Clustering and Dimensionality Reduction:
- Implement clustering algorithms like k-means and hierarchical clustering.
- Apply Principal Component Analysis (PCA) to reduce dimensionality and enhance model performance.
-
Advanced Techniques:
- Understand the basics of neural networks, decision trees, and random forests.
- Implement and analyze these models in Python for solving real-world problems.
-
Practical Implementation:
- Use Python libraries such as pandas, scikit-learn, and matplotlib to preprocess data, build models, and visualize results.
-
Critical Thinking:
- Analyze the strengths and limitations of different machine learning techniques.
- Evaluate and optimize models based on problem requirements and dataset characteristics.
-
Ethical Considerations:
- Understand the ethical implications of data handling and machine learning applications.