Course Outline

Hunyuan Multimodal Foundations and Lab Setup

  • Understanding Hunyuan's multimodal capabilities for image, 3D, and video use cases.
  • Identifying practical business scenarios for creative, product, and content teams.
  • Setting up the lab environment, sample assets, and model access.
  • Executing initial generation tasks and reviewing outputs.

Prompt Design and Workflow Patterns

  • Structuring prompts to achieve consistent multimodal results.
  • Using text prompts, reference images, and basic input settings.
  • Selecting appropriate workflows for image, video, or 3D generation.
  • Iterating on prompts based on output quality and business objectives.
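
As an illustration of the prompt-structuring idea above, here is a minimal Python sketch. It assumes a prompt can be treated as a few named parts (subject, style, composition, things to avoid) that are joined into one string. The `PromptSpec` class, its field names, and the `;`-separated output format are hypothetical and are not part of any Hunyuan API. The sketch only builds text and does not call a model.

```python
from dataclasses import dataclass, field, replace


@dataclass(frozen=True)
class PromptSpec:
    """Hypothetical container for the parts of a generation prompt."""
    subject: str
    style: str = ""
    composition: str = ""
    avoid: tuple = field(default_factory=tuple)

    def render(self) -> str:
        # Join only the parts that are set, always in the same order,
        # so that two prompts differ only where the spec differs.
        parts = [self.subject.strip()]
        if self.style:
            parts.append(f"style: {self.style.strip()}")
        if self.composition:
            parts.append(f"composition: {self.composition.strip()}")
        if self.avoid:
            parts.append("avoid: " + ", ".join(self.avoid))
        return "; ".join(parts)


base = PromptSpec(
    subject="ceramic coffee mug on a wooden table",
    style="soft studio lighting",
    composition="centered, three-quarter angle",
    avoid=("text", "logos"),
)

# Iterate by changing one field at a time and keeping the rest fixed.
variant = replace(base, style="flat vector illustration")

print(base.render())
print(variant.render())
```

Keeping the fixed parts in one place and varying a single field per iteration makes it easier to attribute a change in output quality to a specific change in the prompt.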

Image Generation and Review Labs

  • Creating marketing, product, and concept images from prompts.
  • Refining visual style, composition, and content consistency.
  • Reviewing outputs for utility, quality, and brand alignment.
  • Organizing image outputs for approval and downstream use.

Video Generation Labs

  • Producing short video outputs from prompts and prepared inputs.
  • Managing style, scene intent, and output variation.
  • Reviewing videos for clarity, continuity, and practical applicability.
  • Preparing video outputs for demonstration or content workflows.

3D Asset Creation Labs

  • Generating basic 3D assets from text or image inputs.
  • Evaluating geometry, texture quality, and asset usability.
  • Exporting assets for visualization, prototyping, or content pipelines.
  • Comparing scenarios where 3D generation is a better fit than image or video workflows.

Integration, Governance, and Next Steps

  • Delivering generated assets through simple applications, services, or APIs.
  • Connecting multimodal outputs to product, content, and review workflows.
  • Applying practical checks for quality, brand safety, copyright, and responsible use.
  • Planning pilot use cases and next steps for internal adoption.

Requirements

  • Foundational understanding of AI and generative AI concepts.
  • Experience with web applications, APIs, or standard developer tools.
  • Basic proficiency in Python or scripting.

Audience

  • Developers creating AI-powered product features.
  • Technical product managers and solution architects.
  • Innovation, media, and digital teams working with image, video, or 3D content.

Duration: 14 Hours
