Data Cleaning Training Course
Data cleansing, also known as data cleaning, involves identifying and correcting errors or inconsistencies within a dataset prior to analysis.
This instructor-led, live training (available online or onsite) is designed for data scientists, data analysts, and business analysts who wish to enhance their ability to clean and process data effectively.
By the end of this training, participants will be able to:
- Develop an effective data cleansing strategy.
- Implement useful tools for data cleansing.
- Get a result more efficiently.
- Learn and apply data cleansing best practices.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction
Overview of Data Cleaning
- Why is Data Cleaning Important?
Case Study: When Big Data Is Dirty
Developing A Thorough Data Cleaning Strategy
Common Data Cleaning Tools
- Drake
- OpenRefine
- Pandas (for Python)
- Dplyr (for R)
Achieving High Data Integrity
- Complete
- Correct
- Accurate
- Relevant
- Consistent
Automating the Data Cleaning Process
Monitoring Your Data Cleaning System
Summary and Conclusion
Requirements
- An understanding of data analytics concepts.
Audience
- Data Scientists
- Data Analysts
- Business Analysts
Need help picking the right course?
southafrica@nobleprog.co.za or +27 (0)10 005 5793
Data Cleaning Training Course - Enquiry
Testimonials (2)
Using Road Safety data when doing praticals
Maphahamiso Ralienyane - Road Safety Department
Course - Data Cleaning
It was insightful and I gained a lot of data analysis skills
Mamonyane Taoana - Road Safety Department
Course - Data Cleaning
Related Courses
ArcGIS for Spatial Analysis
14 HoursThis instructor-led, live training in Botswana (online or onsite) is aimed at field ecologists and conservation managers who wish to create data spatial projects in ArcGIS.
By the end of this training, participants will be able to:
- Output spatial data as visualizations.
- Conduct geostatics on actual data.
- Implement spatial data analysis, data processing, and mapping with ArcGIS.
- Analyze spatial data for projects in ArcGIS.
ArcMap in ArcGIS
14 HoursThis instructor-led, live training in Botswana (online or onsite) is aimed at intermediate-level environmental scientists and archaeologists who wish to learn how to use ArcMap in ArcGIS for collecting, organizing, managing, and analyzing geographic information.
By the end of this training, participants will be able to:
- Understand the fundamentals of ArcMap and ArcGIS.
- Collect, organize, manage, and analyze geographic information on the social and archaeological elements.
- Conduct spatial queries for impact evaluation.
ArcGIS from Basic to Advanced
35 HoursThis instructor-led, live training in Botswana (online or on-site) is tailored for GIS professionals and analysts ranging from beginner to advanced levels who wish to learn how to effectively use ArcGIS for data visualization, spatial analysis, and geospatial project management.
By the end of this training, participants will be able to:
- Navigate and utilise ArcGIS tools for geospatial data management.
- Create and customise maps with layers and attributes.
- Perform advanced spatial analysis and geoprocessing tasks.
- Automate workflows using ModelBuilder and Python.
ArcGIS Enterprise for Technical Support
14 HoursThis instructor-led, live training in Botswana (online or onsite) is aimed at beginner-level IT support personnel who wish to provide robust support for ArcGIS Enterprise, addressing any anomalies or failures effectively.
By the end of this training, participants will be able to:
- Understand the architecture and components of ArcGIS Enterprise.
- Learn to install, configure, and manage ArcGIS Enterprise.
- Gain skills in troubleshooting and resolving common issues.
- Develop proficiency in monitoring and maintaining ArcGIS Enterprise environments.
- Master the techniques for backup, recovery, and performance optimization.
ArcGIS Fundamentals
14 HoursThis instructor-led, live training in Botswana (online or onsite) is aimed at beginner-level professionals who wish to learn the fundamental concepts and tools of ArcGIS.
By the end of this training, participants will be able to:
- Understand the basic concepts of GIS and spatial data.
- Navigate the ArcGIS interface.
- Create and manage spatial data.
- Perform basic spatial analysis.
- Create maps and visualizations.
ArcGIS Pro for Spatial Analysis
14 HoursUpon completing this training, participants will be equipped with the following skills:
- A comprehensive understanding of GIS principles and spatial data types, leveraging the most recent version of ArcGIS Pro.
- The ability to navigate the user interface and assess its various applications.
- Proficiency in exploring ArcGIS* Pro and effectively utilising online resources.
- Competence in managing, manipulating, and analysing personal datasets.
- The capability to present and share data in innovative and meaningful ways.
Note: This course utilizes the latest version of ArcGIS Pro.
ArcGIS Professional Plus: Advanced GIS Data Management and Analysis
14 HoursArcGIS Professional Plus is an enhanced version of ArcGIS Pro, offering extended features for geospatial data analysis, 3D modeling, automation, and enterprise collaboration.
This instructor-led live training (online or onsite) is aimed at intermediate-level GIS professionals who wish to deepen their skills in spatial data analysis, automation, and sharing using ArcGIS Professional Plus tools.
By the end of this training, participants will be able to:
- Leverage ArcGIS Pro Plus tools for data visualization and analysis.
- Create 2D and 3D maps with advanced symbology and geoprocessing techniques.
- Automate workflows using ModelBuilder and Python scripting.
- Integrate ArcGIS with external data services and enterprise systems.
Format of the Course
- Interactive lecture and discussion.
- Lots of exercises and practice.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request a customized training for this course, please contact us to arrange.
Advanced ArcGIS Pro for Spatial Analysis
35 HoursThis instructor-led, live training in Botswana (online or onsite) is designed for advanced GIS professionals who wish to use ArcGIS Pro to enhance their spatial analysis capabilities, conduct comprehensive geostatistical analysis, and apply advanced 3D modelling techniques for more effective decision-making and problem-solving in real-world scenarios.
By the end of this training, participants will be able to:
- Develop advanced skills in spatial analysis techniques using ArcGIS Pro.
- Utilize Python scripting for automation and complex data processing.
- Apply spatial modeling for problem-solving in real-world scenarios.
- Conduct geostatistical analysis for advanced data interpretation.
- Integrate external data sources and leverage 3D spatial data analysis.
ArcGIS with Python Scripting
14 HoursThis instructor-led, live training in Botswana (online or onsite) is aimed at GIS analysts who wish to automate repetitive tasks in GIS processes.
By the end of this training, participants will be able to:
- Build GIS applications using Python and ArcGIS tools.
- Develop with the ArcGIS package ArcPy, using Python.
- Apply the ArcGIS modules for map automation using object classes in Python.
Advanced Power Systems and GIS Integrated Solutions
70 HoursIn the dynamic energy sector, integrating electrical transient analysis with accurate geographic data has become a strategic imperative. Currently, dependence on disjointed data sources exposes operations to considerable risks. This intensive 14-day programme, hosted in Melbourne, aims to bridge the divide between electrical engineering and geospatial management.
Advanced Geographic Information Systems (GIS)
21 HoursThis instructor-led, live training in Botswana (online or onsite) is tailored for intermediate-level geographers aiming to deepen their expertise in spatial analysis, data management, and GIS applications.
By the end of this training, participants will be able to:
- Apply advanced spatial analysis techniques to solve complex geographical problems.
- Manage large spatial databases and perform data quality control.
- Create dynamic and interactive maps and visualizations for various applications.
- Utilize programming and automation to streamline GIS workflows.
Google Maps API for Developers
14 HoursThis instructor-led, live training in Botswana (online or onsite) is designed for developers who wish to enhance their website or web application by adding geo-location functionality via Google Maps.
Upon completion of this training, participants will be able to:
- Embed a map into a new or existing website or web application.
- Display live data that synchronises with the current location.
- Create a mashup incorporating Google Maps.
- Apply best practices for CSS and JavaScript coding.
- Create a custom map for an application.
Python for ArcGIS and QGIS for Earth Sciences and Engineering Professionals
35 HoursThis instructor-led, live training in Botswana (online or onsite) is aimed at beginner-level earth sciences and engineering professionals who wish to use Python for geospatial analysis in both ArcGIS and QGIS environments.
By the end of this training, participants will be able to:
- Learn Python syntax and control structures for executing geospatial tasks efficiently.
- Use Pandas, Numpy, and Matplotlib for data analysis and visualization in GIS.
- Manipulate and analyze vector data with Geopandas, Arcpy, and PyQGIS libraries.
- Automate geospatial processes and workflows using Python scripting in ArcGIS and QGIS.
- Develop custom Python-based geoprocessing tools for ArcGIS and QGIS to streamline tasks.
QGIS for Geographic Information System
21 HoursA Geographic Information System (GIS) is a framework built to capture, store, manipulate, analyze, manage, and present spatial or geographic data. The term GIS is occasionally used to denote Geographic Information Science (GIScience), which refers to the academic field studying these systems, representing a significant area within the broader discipline of geoinformatics.
QGIS operates as GIS software, enabling users to analyze and edit spatial data, as well as compose and export graphical maps. It supports both raster and vector layers; vector data is stored as points, lines, or polygons. The software accommodates various raster image formats and can georeference images. In summary, it allows users to create, edit, visualize, analyze, and publish geospatial information across Windows, Mac, Linux, and BSD platforms.
In the first phase, this program introduces the QGIS interface for general use. The second phase covers PyQGIS, which comprises the Python libraries of QGIS that facilitate the integration of GIS functionalities into your Python code or application, allowing you to develop custom Python plugins around specific GIS capabilities.