Stratio Data Fabric Basics Training Course
Stratio Data Fabric is an enterprise data management platform engineered to empower organisations to integrate, manage, and govern large volumes of data across various systems and environments. It unifies technologies such as big data, artificial intelligence (AI), and machine learning (ML) to establish a cohesive platform that simplifies data access, storage, governance, and analytics.
This instructor-led, live training (available online or onsite) is designed for beginner-level data analysts who aim to utilise Stratio Data Fabric for data mining, database exploitation, and advanced analytical projects.
Upon completion of this training, participants will be able to:
- Acquire a thorough understanding of Stratio Data Fabric’s components, architecture, and functionalities.
- Design and execute data pipelines using Stratio’s Rocket analytical environment.
- Apply augmented data governance principles within Stratio to ensure compliance, quality control, and security.
- Conduct descriptive and prescriptive data analyses using the Intelligence module in Stratio.
- Confidently migrate existing Databricks projects to Stratio while ensuring smooth operational workflows and continuous data analysis.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to make arrangements.
Course Outline
Introduction
Overview of Stratio Data Fabric
- Introduction to the Stratio platform and its role in the data ecosystem
- Migration from Databricks to Stratio: Key concepts and advantages
- High-level architecture of Stratio and its integration with existing systems
Introduction to Augmented Data Governance
- Data governance principles in Stratio
- How Stratio ensures compliance and data quality
- Setting up governance rules, policies, and workflows
Introduction to Rocket Analytical Environment
- Overview of the Rocket module for analytical workflows
- Creating data pipelines in the Rocket environment
- Building, testing, and deploying a simple data pipeline using Rocket
Introduction to Intelligence Analytical Environment
- Exploring the Intelligence module for AI and ML workflows
- Implementing prescriptive and descriptive analysis
- Developing and deploying a basic machine learning model using Stratio
Querying and Transforming Information
- Introduction to Stratio's querying language and tools
- Performing data transformations for analysis
- Querying and manipulating datasets in the Stratio environment
Stored Procedures for Querying
- Understanding stored procedures within Stratio
- Use cases: When and how to implement stored procedures for complex querying
- Writing and executing stored procedures for data mining
Scripts of Examples Reviewed in Class
- Reviewing example scripts used in practical exercises
- Discussing optimization techniques and best practices
Presentations and Additional Bibliography
- Presentations on advanced topics in Stratio
- Sharing additional resources, including documentation and further reading
- Q&A session for clarifying any outstanding questions
Summary and Next Steps
Requirements
- Basic knowledge of Databricks
- Fundamental understanding of data mining, database exploitation, and analytical processes
Audience
- Data analysts
- Data scientists
- Database administrators
Need help picking the right course?
southafrica@nobleprog.co.za or +27 (0)10 005 5793
Stratio Data Fabric Basics Training Course - Enquiry
Testimonials (2)
The variety of the information shared and the clarity to explain terms in plain English.
Arisbe Mendoza - Fairtrade International
Course - GDPR Workshop
It's a hands-on session.
Vorraluck Sarechuer - Total Access Communication Public Company Limited (dtac)
Course - Talend Open Studio for ESB
Related Courses
Data Ethics
14 HoursData Ethics is the area dedicated to the responsible collection, utilisation, and decision-making regarding data, ensuring that human rights, privacy, transparency, and fairness are upheld.
This instructor-led, live training (available online or onsite) is designed for public sector professionals who have limited or no prior training in data ethics but manage or govern data. Participants will learn to understand ethical risks, evaluate real-world dilemmas, and apply principles of responsible data use in alignment with institutional values and public trust.
By the end of this training, participants will be able to:
- Define key concepts and frameworks in data ethics.
- Identify ethical risks and trade-offs in data collection, analysis, and deployment.
- Apply principles of transparency, consent, and fairness to real-world scenarios.
- Integrate ethical review into governance or operational workflows.
Format of the Course
- Interactive lecture and discussion.
- Hands-on analysis of real-world data ethics cases.
- Guided exercises focused on ethical evaluation and policy alignment.
Course Customization Options
- To request a customized training for this course based on your department's workflows or internal tools, please contact us to arrange.
Data Integrity and Availability
14 HoursData Integrity and Availability involves ensuring that data remains accurate, complete, consistent, and accessible when required, particularly within high-trust public sector environments.
This instructor-led, live training (delivered online or onsite) is designed for public sector professionals tasked with managing or safeguarding data—regardless of their technical background—who wish to ensure the reliability, consistency, and availability of critical datasets and systems under their control.
By the end of this training, participants will be able to:
- Define and distinguish the principles of integrity and availability throughout the data lifecycle.
- Detect and prevent data corruption, inconsistency, or unauthorized alterations.
- Design data environments that ensure high availability and business continuity.
- Implement policies and controls that promote long-term data reliability.
Format of the Course
- Interactive lecture and discussion.
- Hands-on evaluation of data risks and failure points.
- Guided exercises focused on policy development and incident prevention.
Course Customization Options
- To request a customized training for this course based on your department's workflows or internal tools, please contact us to arrange.
Data Policies and Standards
14 HoursData Policies and Standards represent the structured methodology for ensuring that government data is created, maintained, accessed, and utilized in a manner that is consistent, secure, and aligned with legal and ethical guidelines.
This instructor-led live training (available online or onsite) is designed for public sector professionals who are responsible for establishing or implementing data policies, regardless of their technical background. The course aims to help participants standardize, document, and enforce data practices across various departments or systems.
By the conclusion of this training, participants will be able to:
- Define and distinguish between data policies, standards, and procedures.
- Draft and assess data governance policies that align with national and international frameworks.
- Champion consistent and high-quality data practices across teams and departments.
- Establish a foundation for compliance, audit readiness, and trustworthy data systems.
Format of the Course
- Interactive lecture and discussion.
- Hands-on drafting of sample policies and standards.
- Guided evaluation of existing data workflows and controls.
Course Customization Options
- To request a customized training for this course based on your department's workflows or internal tools, please contact us to arrange.
EBX5 for Developers
21 HoursThis instructor-led, live training in Botswana (online or onsite) is designed for developers who wish to utilise EBX5 (TIBCO EBX) to implement a Master Data Management solution within their organisation.
By the end of this training, participants will be able to:
- Interpret requirements and architect an MDM solution.
- Enable the management and integration of master data.
- Integrate and transfer data across multiple systems.
- Import data into EBX5 using match and merge logic.
- Design, create, and document a data model that addresses their organisation's business requirements.
- Integrate EBX5 with third-party services.
GDPR Workshop
7 HoursThis one-day course provides a concise overview of the GDPR – General Data Protection Regulations, which came into effect on 25 May 2018. It is ideal for managers, department heads, and staff members who need to grasp the fundamental principles of the GDPR.
How to Audit GDPR Compliance
14 HoursDesigned primarily for auditors and administrative personnel responsible for verifying that control systems and IT environments adhere to current laws and regulations, this course offers a comprehensive overview of the General Data Protection Regulation (GDPR). It begins by elucidating core GDPR concepts and illustrating their impact on auditing practices. Participants will delve into the rights of data subjects, the obligations of data controllers and processors, and the enforcement mechanisms governing compliance. Additionally, the training incorporates ISACA's audit programme, empowering auditors to evaluate GDPR governance, response strategies, and supporting processes to effectively mitigate risks linked to non-compliance.
Oracle GoldenGate
14 HoursThis instructor-led live training, conducted in Botswana (online or onsite), is designed for system administrators and developers who wish to set up, deploy, and manage Oracle GoldenGate for data transformation.
By the end of this training, participants will be able to:
- Install and configure Oracle GoldenGate.
- Comprehend database replication using Oracle GoldenGate.
- Understand the Oracle GoldenGate architecture.
- Configure and execute database replication and migration tasks.
- Optimize Oracle GoldenGate performance and resolve technical issues.
PECB GDPR - Certified Data Protection Officer
35 HoursThe PECB Certified Data Protection Officer training course empowers you with the essential knowledge, skills, and competence to effectively execute the role of a Data Protection Officer within a GDPR compliance framework.
Why attend this course?
As data protection grows in value, organisations face increasing pressure to safeguard this information. Non-compliance with data protection regulations not only violates the fundamental rights and freedoms of individuals but also exposes organisations to significant risks that can damage their credibility, reputation, and financial standing. This is where your expertise as a Data Protection Officer becomes vital.
This PECB Certified Data Protection Officer training course equips you with the knowledge and skills needed to serve as a Data Protection Officer (DPO), helping organisations meet General Data Protection Regulation (GDPR) requirements.
Through practical exercises, you will master the DPO role, gaining the competence to inform, advise, and monitor GDPR compliance, as well as collaborate with supervisory authorities.
Upon completing the training, you may sit for the exam. If you pass, you can apply for the \"PECB Certified Data Protection Officer\" credential. This internationally recognised certificate demonstrates your professional capability and practical knowledge to advise controllers and processors on fulfilling their GDPR obligations.
Who should attend?
- Managers or consultants aiming to prepare and support organisations in planning, implementing, and maintaining a GDPR-based compliance programme
- DPOs and individuals responsible for maintaining GDPR conformance
- Members of information security, incident management, and business continuity teams
- Technical and compliance professionals preparing for a Data Protection Officer role
- Expert advisors involved in personal data security
Learning objectives
- Understand GDPR concepts and interpret its requirements
- Grasp the content and correlation between GDPR and other regulatory frameworks and applicable standards, such as ISO/IEC 27701 and ISO/IEC 29134
- Acquire the competence to perform the daily tasks and role of a Data Protection Officer within an organisation
- Develop the ability to inform, advise, and monitor GDPR compliance, and cooperate with supervisory authorities
Personal Data Protection Officer - Basic Level
21 HoursPurpose of the Training
- Familiarising participants with the structured and comprehensive aspects of personal data protection under Polish and European law.
- Imparting practical knowledge regarding the new regulations governing the processing of personal data.
- Highlighting key legal risks associated with the implementation of the GDPR.
- Providing practical preparation for independently performing the duties of a Personal Data Protection Officer.
Personal Data Protection Officer - Advanced Level
14 HoursPurpose of the Training
- Gaining practical knowledge on how to perform the tasks of the Inspector
- Gaining practical knowledge of how to audit and how to assess risk
- Providing practical knowledge about the new rules for the processing of personal data
Talend Administration Center (TAC)
14 HoursThis instructor-led, live training in Botswana (online or onsite) targets system administrators, data scientists, and business analysts who wish to set up Talend Administration Center to deploy and manage the organisation's roles and tasks.
By the end of this training, participants will be able to:
- Install and configure Talend Administration Center.
- Understand and implement Talend management fundamentals.
- Build, deploy, and run business projects or tasks in Talend.
- Monitor the security of datasets and develop business routines based on the TAC framework.
- Obtain a broader comprehension of big data applications.
Talend Big Data Integration
28 HoursThis instructor-led, live training in Botswana (online or onsite) is aimed at technical persons who wish to deploy Talend Open Studio for Big Data to simplifying the process of reading and crunching through Big Data.
By the end of this training, participants will be able to:
- Install and configure Talend Open Studio for Big Data.
- Connect with Big Data systems such as Cloudera, HortonWorks, MapR, Amazon EMR and Apache.
- Understand and set up Open Studio's big data components and connectors.
- Configure parameters to automatically generate MapReduce code.
- Use Open Studio's drag-and-drop interface to run Hadoop jobs.
- Prototype big data pipelines.
- Automate big data integration projects.
Talend Cloud
7 HoursThis instructor-led, live training in Botswana (online or onsite) is tailored for data administrators and developers who wish to manage, monitor, and execute data integration processes using Talend Cloud services.
By the end of this training, participants will be able to:
- Navigate the Talend Management Console to oversee users and assign roles within the platform.
- Assess data to identify and comprehend relevant datasets.
- Construct pipelines to process and track data, whether at rest or in motion.
- Prepare data for analysis to derive business-relevant insights.
Talend Data Stewardship
14 HoursThis instructor-led live training in Botswana (available online or onsite) is designed for data analysts at beginner to intermediate levels who wish to deepen their understanding and skills in managing and improving data quality using Talend Data Stewardship.
Upon completion of this training, participants will be able to:
- Develop a thorough understanding of the role data stewardship plays in maintaining data quality.
- Utilise Talend Data Stewardship to oversee data quality tasks.
- Create, assign, and manage tasks within Talend Data Stewardship, including the customisation of workflows.
- Leverage the tool’s reporting and monitoring features to track data quality and stewardship activities.
Talend Open Studio for ESB
21 HoursIn this instructor-led live training in Botswana, participants will learn how to use Talend Open Studio for ESB to create, connect, mediate, and manage services and their interactions.
By the end of this training, participants will be able to:
- Integrate, enhance, and deliver ESB technologies as single packages in a variety of deployment environments.
- Understand and utilize the most commonly used components of Talend Open Studio.
- Integrate any application, database, API, or Web services.
- Seamlessly integrate heterogeneous systems and applications.
- Embed existing Java code libraries to extend projects.
- Leverage community components and code to extend projects.
- Rapidly integrate systems, applications, and data sources within a drag-and-drop Eclipse environment.
- Reduce development time and maintenance costs by generating optimized, reusable code.