MarkLogic Data Hub Training Course
MarkLogic Data Hub serves as an open-source consolidated data repository, providing a suite of tools and libraries designed to accelerate enterprise data integration and delivery.
This instructor-led live training, available online or onsite, targets system administrators, database administrators, data architects, and developers looking to install, configure, and manage MarkLogic Data Hub to organize and manage data from various silos.
Upon completion of this training, participants will be equipped to customize, secure, track, and manage their enterprise data using the capabilities and tools of MarkLogic Data Hub.
Course Format
- Interactive lectures and discussions.
- Extensive exercises and practice sessions.
- Hands-on implementation in a live-lab environment.
Course Customization Options
- To request customized training for this course, please contact us to make arrangements.
Course Outline
Introduction
Overview of MarkLogic Data Hub Features and Architecture
Getting Started with MarkLogic Data Hub
Importing, Migrating, and Converting Existing Artifacts
Exploring MarkLogic Data Hub Concepts
Setting up Users, Roles, and Privileges
Deploying Security Configuration Using QuickStart and ml-gradle
Working with Data Ingestion and Flow Pipelines
Working with Steps, Mapping, and Modules
Configuring Project Steps and Flows
Understanding Key Semantic Data Modeling Concepts
Accessing Data Using JavaScript APIs and SPARQL
Managing Data on DHS Using Hub Central
Managing On-Premises Data, Projects, Flows, and Steps
Serving Data Out of MarkLogic Using REST and ODBC
Tracking the Data History and Data Lineage Origin
Replicating Existing Data Flow with a New Data Source
Using Smart Mastering with MarkLogic Data Hub
Troubleshooting
Summary and Conclusion
Requirements
- Experience with database management systems
- Familiarity with JavaScript, C, C++, or any other programming language
Audience
- System administrators
- Database administrators
- Data architects
- Developers
Open Training Courses require 5+ participants.
MarkLogic Data Hub Training Course - Booking
MarkLogic Data Hub Training Course - Enquiry
MarkLogic Data Hub - Consultancy Enquiry
Testimonials (2)
The variety of the information shared and the clarity to explain terms in plain English.
Arisbe Mendoza - Fairtrade International
Course - GDPR Workshop
It's a hands-on session.
Vorraluck Sarechuer - Total Access Communication Public Company Limited (dtac)
Course - Talend Open Studio for ESB
Upcoming Courses
Related Courses
Data Ethics
14 HoursData Ethics addresses the responsible collection, utilization, and decision-making processes regarding data, ensuring that human rights, privacy, transparency, and fairness are upheld.
This instructor-led live training, available both online and onsite, is designed for public sector professionals who have limited or no prior background in data ethics. It targets individuals who manage or govern data and wish to understand ethical risks, evaluate real-world dilemmas, and apply principles of responsible data use in alignment with institutional values and public trust.
By the end of this training, participants will be able to:
- Define key concepts and frameworks in data ethics.
- Identify ethical risks and trade-offs in data collection, analysis, and deployment.
- Apply principles of transparency, consent, and fairness to real-world scenarios.
- Integrate ethical review into governance or operational workflows.
Format of the Course
- Interactive lecture and discussion.
- Hands-on analysis of real-world data ethics cases.
- Guided exercises focused on ethical evaluation and policy alignment.
Course Customization Options
- To request a customized training for this course based on your department's workflows or internal tools, please contact us to arrange.
Data Integrity and Availability
14 HoursData Integrity and Availability focuses on the discipline of ensuring that data remains accurate, complete, consistent, and accessible when needed, particularly within high-trust public sector environments.
This instructor-led, live training (available online or onsite) is designed for public sector professionals responsible for managing or safeguarding data—regardless of their technical background—who wish to ensure the reliability, consistency, and availability of critical datasets and systems under their control.
By the end of this training, participants will be able to:
- Define and differentiate the principles of integrity and availability in the data lifecycle.
- Detect and prevent data corruption, inconsistency, or unauthorized alterations.
- Design data environments that ensure high availability and business continuity.
- Implement policies and controls that promote long-term data reliability.
Format of the Course
- Interactive lecture and discussion.
- Hands-on evaluation of data risks and failure points.
- Guided exercises focused on policy development and incident prevention.
Course Customization Options
- To request a customized training for this course based on your department's workflows or internal tools, please contact us to arrange.
Data Policies and Standards
14 HoursData Policies and Standards refers to the structured approach of ensuring that government data is created, maintained, accessed, and utilized in a manner that is consistent, secure, and aligned with legal and ethical guidelines.
This instructor-led, live training (available online or onsite) is designed for public sector professionals responsible for establishing or applying data policies, regardless of their technical background, who aim to standardize, document, and enforce data practices across departments or systems.
Upon completion of this training, participants will be capable of:
- Defining and distinguishing between data policies, standards, and procedures.
- Drafting and evaluating data governance policies in alignment with national and international frameworks.
- Promoting consistent and high-quality data practices across teams and departments.
- Building a foundation for compliance, audit readiness, and trustworthy data systems.
Course Format
- Interactive lectures and discussions.
- Hands-on drafting of sample policies and standards.
- Guided evaluation of existing data workflows and controls.
Course Customization Options
- To request customized training for this course based on your department's workflows or internal tools, please contact us to arrange.
Data Strategy
14 HoursA Data Strategy serves as the long-term blueprint for how an organization manages, utilizes, and invests in its data assets to fulfill its mission, enhance public services, and maintain accountability.
This instructor-led training session (available online or onsite) is designed for public sector professionals who have limited or emerging experience in data strategy. It targets those who shape or influence strategic decisions and aim to develop sustainable, mission-aligned data strategies throughout their organization or department.
Upon completing this training, participants will be equipped to:
- Identify the core components of a comprehensive data strategy.
- Align data initiatives with organizational goals and public value.
- Create roadmaps covering data governance, infrastructure, skills development, and innovation.
- Assess maturity levels and track progress toward becoming a data-driven organization.
Course Format
- Interactive lectures and group discussions.
- Practical development of strategy components and roadmaps.
- Guided analysis of public sector case studies and strategic frameworks.
Customization Options
- If you require a customized version of this course tailored to your department's workflows or internal tools, please contact us to arrange it.
EBX5 for Developers
21 HoursThis instructor-led, live training in Brazil (online or onsite) is designed for developers who wish to utilize EBX5 (TIBCO EBX) to establish a Master Data Management solution within their organization.
By the end of this training, participants will be able to:
- Interpret requirements and architect an MDM solution.
- Enable the management and integration of master data.
- Integrate and transfer data across multiple systems.
- Import data into EBX5 using match and merge logic.
- Design, create, and document a data model that addresses their organization's business requirements.
- Integrate EBX5 with third-party services.
GDPR Workshop
7 HoursAchieve mastery over the core principles of the General Data Protection Regulation through this intensive one-day workshop, tailored for managers, department heads, and compliance personnel. The program covers the fundamentals of GDPR, rights of data subjects, data protection principles, consent requirements, obligations regarding breach notification, and the concept of privacy by design. It offers practical frameworks for implementing GDPR compliance strategies throughout your organization, ensuring lawful data processing and fostering a culture of accountability in data protection.
How to Audit GDPR Compliance
14 HoursThis course is primarily designed for auditors and administrative professionals responsible for ensuring that their control systems and IT environments adhere to current laws and regulations. The program starts by providing a solid understanding of essential GDPR concepts and how they impact the work of auditors. Participants will examine the rights of data subjects, the obligations of data controllers and processors, and the principles of enforcement and compliance under the Regulation. Additionally, the training includes the audit framework provided by ISACA, which equips auditors to evaluate GDPR governance and response mechanisms, as well as supporting processes that help mitigate risks associated with non-compliance.
Oracle GoldenGate
14 HoursThis instructor-led live training in Brazil (available online or onsite) is designed for system administrators and developers who want to set up, deploy, and manage Oracle GoldenGate for data transformation.
By the end of this training, participants will be able to:
- Install and configure Oracle GoldenGate.
- Understand Oracle database replication using the Oracle GoldenGate tool.
- Comprehend the Oracle GoldenGate architecture.
- Configure and perform database replication and migration.
- Optimize Oracle GoldenGate performance and troubleshoot issues.
PECB GDPR - Certified Data Protection Officer
35 HoursThe PECB Certified Data Protection Officer training course empowers you with the essential knowledge, skills, and competence to effectively assume the role of a Data Protection Officer within a GDPR compliance initiative.
Why should you attend?
As data protection gains increasing importance, organizations face growing demands to safeguard this valuable information. Failure to comply with data protection regulations not only violates individuals' fundamental rights and freedoms but also exposes organizations to significant risks that can damage their credibility, reputation, and financial standing. This is where your expertise as a Data Protection Officer becomes crucial.
The PECB Certified Data Protection Officer training course is designed to help you acquire the necessary knowledge and skills to serve as a Data Protection Officer (DPO), thereby assisting organizations in meeting the requirements of the General Data Protection Regulation (GDPR).
Through practical exercises, you will master the DPO role, gaining the ability to inform, advise, and monitor GDPR compliance, as well as interact effectively with supervisory authorities.
Upon completing the training course, you may sit for the exam. If you pass, you can apply for the “PECB Certified Data Protection Officer” credential. This internationally recognized certificate validates your professional capabilities and practical knowledge to advise data controllers and processors on fulfilling their GDPR compliance obligations.
Who should attend?
- Managers or consultants looking to prepare and support an organization in planning, implementing, and maintaining a GDPR-based compliance program
- DPOs and individuals responsible for maintaining conformance with GDPR requirements
- Members of information security, incident management, and business continuity teams
- Technical and compliance professionals aiming to transition into a Data Protection Officer role
- Expert advisors involved in securing personal data
Learning objectives
- Understand GDPR concepts and interpret its requirements
- Understand the content and relationship between the General Data Protection Regulation and other regulatory frameworks and applicable standards, such as ISO/IEC 27701 and ISO/IEC 29134
- Gain the competence to perform the DPO role and daily tasks within an organization
- Develop the ability to inform, advise, and monitor GDPR compliance, and cooperate with supervisory authorities
Personal Data Protection Officer - Basic Level
21 HoursPurpose of the Training
- Familiarizing participants with the systematic and comprehensive aspects of personal data protection under Polish and European legislation
- Providing practical insights into the new rules governing personal data processing
- Highlighting key areas of legal risk associated with the implementation of the GDPR
- Preparing participants practically to independently perform the duties of a Personal Data Protection Officer
Personal Data Protection Officer - Advanced Level
14 HoursPurpose of the Training
- Gaining practical knowledge on how to perform the tasks of the Inspector
- Gaining practical knowledge of how to audit and how to assess risk
- Providing practical knowledge about the new rules for the processing of personal data
Talend Administration Center (TAC)
14 HoursThis instructor-led live training in Brazil (online or onsite) is designed for system administrators, data scientists, and business analysts who wish to set up Talend Administration Center to deploy and manage the organization's roles and tasks.
Upon completion of this training, participants will be able to:
- Install and configure Talend Administration Center.
- Grasp and apply the fundamentals of Talend management.
- Create, deploy, and execute business projects or tasks in Talend.
- Monitor dataset security and develop business routines based on the TAC framework.
- Gain a deeper understanding of big data applications.
Talend Big Data Integration
28 HoursThis instructor-led live training in Brazil (online or onsite) is designed for technical professionals who want to deploy Talend Open Studio for Big Data to streamline the process of reading and analyzing big data.
Upon completing this training, participants will be able to:
- Install and configure Talend Open Studio for Big Data.
- Connect to big data systems such as Cloudera, Hortonworks, MapR, Amazon EMR, and Apache.
- Understand and configure the big data components and connectors in Open Studio.
- Set parameters to automatically generate MapReduce code.
- Use Open Studio's drag-and-drop interface to execute Hadoop jobs.
- Prototype big data pipelines.
- Automate big data integration projects.
Talend Data Stewardship
14 HoursThis instructor-led, live training in Brazil (online or onsite) is designed for beginner to intermediate-level data analysts who wish to enhance their understanding and skills in managing and improving data quality using Talend Data Stewardship.
By the end of this training, participants will be able to:
- Gain a comprehensive understanding of the role of data stewardship in maintaining data quality.
- Utilize Talend Data Stewardship for managing data quality tasks.
- Create, assign, and manage tasks within Talend Data Stewardship, including workflow customization.
- Use the tool's reporting and monitoring capabilities to track data quality and stewardship efforts.
Talend Open Studio for ESB
21 HoursIn this instructor-led live training conducted in Brazil, participants will learn how to utilize Talend Open Studio for ESB to create, connect, mediate, and manage services and their interactions.
Upon completing this training, participants will be capable of:
- Integrating, enhancing, and deploying ESB technologies as unified packages across diverse deployment environments.
- Understanding and effectively using the most frequently utilized components of Talend Open Studio.
- Connecting any application, database, API, or web service.
- Seamlessly integrating heterogeneous systems and applications.
- Incorporating existing Java code libraries to extend project capabilities.
- Utilizing community-provided components and code to expand project functionality.
- Rapidly integrating systems, applications, and data sources within an intuitive drag-and-drop Eclipse-based environment.
- Reducing development time and maintenance costs through the generation of optimized, reusable code.