AI Powered Computer Vision (AIARC-103)

Price: $2,495.00
Duration: 3 days
Certification: 
Exam: 
Continuing Education Credits:
Learning Credits:

AI Powered Computer Vision is a comprehensive 3-day course that prepares students for obtaining and organizing data for fine-tuning a pre-trained ViT (Vision Transformer)! Select your own ViT Image Classification model, and fine-tune it with datasets you’ve prepared, or gathered. This course offers a combination of lectures and hands-on labs, providing participants with a solid understanding of AI concepts and the skills to design and implement AI solutions.


Throughout the course, you will learn about ViT transformer-based architectures, the fundamentals of Python programming for AI models, and the deployment of open source Transformer models. You will begin with data preparation by gathering, cleaning, labeling, and organizing data for the purpose of fine-tuning a ViT.


You will gain practical experience through hands-on exercises with open source ViT frameworks, allowing you to work with fine-tuned models and run workloads on different models to understand their strengths and weaknesses. Additionally, the course covers model formats and provides in-depth exploration of AI programming environments like PyTorch.


Towards the end of the course, you will delve into advanced topics such as context extension through fine-tuning and quantization for a specific application target environment: Road Surface Image Classification. By the completion of the course, you will have the opportunity to earn an AI certification from Alta3 Research, further enhancing your credentials in the field of Artificial Intelligence. This course is ideal for Python Developers, DevSecOps Engineers, and Managers or Directors seeking a overview of AI and its practical application in the enterprise.


Upcoming Class Dates and Times

All Sunset Learning courses are guaranteed to run

Course Outline and Details

  • Python – PCEP Certification or Equivalent Experience
  • Familiarity with Linux
  • Familiarity with Natural Language Processing


  • Project Managers
  • Architects
  • Developers
  • Data Acquisition Specialists


  • Understand Visual Transformer Model architecture.
  • Describe the role of patches and patch embeddings in ViT.
  • Install and use ViT frameworks like Google ViT, Microsoft BEiT, and Swin.
  • Prepare Dataset for ViT.
  • Establish guidelines for obtaining RAW Data for ViT Fine-Tuning.
  • Clean and Curate Data for ViT Fine-Tuning.
  • Fine-Tune ViT Models with PyTorch.
  • Deploy and Maximize ViT Model Performance.
  • Perform End-to-End procedure for Fine-Tuning a Road Surface ViT.


The Visual Transformer Model

  • Lecture: What is Intelligence?
  • Lecture: Generative AI
  • Lecture: The Transformer Model
  • Lecture: Feed Forward Neural Networks

Computer Vision

  • Lecture: Introduction to Computer Vision
  • Lecture: NLP to ViT: Key Modifications
  • Lecture + Lab: Patch Embedding
  • Lecture + Lab: Positional Encoding in Vison Transformer
  • Lecture: CNN vs ViT – A Comparison

Pre-trained ViT

  • Lecture: Preparing A100 for Server Operations
  • Lecture: Selecting a Pre-Trained ViT Model
  • Lecture + Lab: Operating Google ViT Model for Face Recognition 
  • Lecture + Lab: Operating Microsoft BEiT Model for Scene Segmentation 

Data Curation for Road Surface ViT

  • Lecture: Curating Data for ViT
  • Lecture + Lab: Gathering Raw Data
  • Lecture + Lab: Data Cleaning and Preparation
  • Lecture + Lab: Data Labeling
  • Lecture + Lab: Data Organization
  • Lecture: Premade Datasets for Fine Tuning
  • Lecture + Lab: Obtain and Prepare Premade Datasets

Fine Tuning for Road Surface Image Classification

  • Lecture: Fine-Tuning a Pre-Trained ViT
  • Lecture: PyTorch
  • Lecture + Lab: Fine Tuning ViT with PyTorch
  • Lecture + Lab: Operating our Road Surface Image Classification ViT Model


Course Delivery Options

Train face-to-face with the live instructor.
Access to on-demand training content anytime, anywhere.
Attend the live class from the comfort of your home or office.
Interact with a live, remote instructor from a specialized, HD-equipped classroom near you. An SLI sales rep will confirm location availability prior to registration confirmation.