Thank you for sending your enquiry! One of our team members will contact you shortly.
Thank you for sending your booking! One of our team members will contact you shortly.
Course Outline
Python Fundamentals for Data Manipulation
- Installing Python and configuring the development environment
- Core language concepts: variables, data types, and control structures
- Writing and executing basic Python scripts
File Handling: CSV and Excel
- Reading and writing CSV files using the csv module and Pandas
- Managing Excel files using openpyxl/xlrd and Pandas
- Practical exercises: automating file conversions
Introduction to Pandas
- DataFrame essentials: creation, indexing, selection, and filtering
- Aggregation and grouping operations
- Standard cleaning operations: handling missing values, duplicates, and type conversions
Introduction to Polars
- Polars concepts and performance attributes compared to Pandas
- Basic DataFrame operations in Polars
- Use-case illustration: selecting Polars over Pandas
Advanced Data Transformation (Intermediate)
- Complex joins, window functions, and pivot operations in Pandas
- Efficient data processing patterns with Polars
- Chaining operations and optimising memory usage
Process Automation with Python
- Writing scripts to automate repetitive data tasks and ETL steps
- Scheduling scripts using OS schedulers or task schedulers
- Implementing logging, error handling, and notifications
Packaging Scripts and Best Practices
- Creating executables using PyInstaller or similar tools
- Project structuring, virtual environments, and dependency management
- Version control basics and documenting workflows
Hands-on Mini-Project
- End-to-end task: reading raw files, cleaning and transforming data, and generating outputs
- Automating the workflow and packaging it as a runnable script or executable
- Review and improvements based on peer feedback
Summary and Next Steps
Requirements
- Foundational familiarity with programming concepts or a readiness to learn
- Competence in using the command line or terminal for package installation
- Experience with spreadsheet applications (CSV/Excel)
Target Audience
- Data analysts and operations personnel looking to automate data tasks
- Analytical engineers seeking lightweight ETL scripting solutions
- Professionals interested in practical Python-based data workflows
14 Hours
Testimonials (2)
Hands-on exercises related to content really helps to understand more about each topic. Also, style of start class with lecture and continue with hands-on exercise is good and helpful to relate with the lecture that presented earlier.
Nazeera Mohamad - Ministry of Science, Technology and Innovation
Course - Introduction to Data Science and AI using Python
Examples/exercices perfectly adapted to our domain