Comprehensive Flink

This Course Includes neXT LIVE 365

LEARN FOR 365 DAYS!

Sunset Learning Institute believes in a 365-day learning experience that begins immediately, regardless of when you attend your ILT course. At SLI, you get a range of learning opportunities, from instructor-led hands-on training, to self-directed, customizable learning paths based on your environment, your needs, and your level of experience. We provide the tools and options, and you decide what you need, when you need it, and how you want to learn it! 

  • Immediate access to supplemental learning assets that are INCLUDED with your purchase of the above instructor-led training course: 365 Days of Access to SLI’s Entire Big Data Video Reference Library (VRL), not just the 5-day class you sign up for (hundreds of searchable, on-demand learning bytes in 5-15-minute videos)
  • 365 Days of Unlimited Access to Delta Sessions - What’s Not Covered in Class! (Version Upgrades, Industry Updates, Etc.)
  • 365 Days of Unlimited 24x7 Access to SLI's Community - Collaborate with SLI Instructors and Other Members (Monitored Daily by SLI Instructors)
  • 365 Days of Unlimited Access to Interactive neXTpertise Sessions and other IT Resources with SLI Instructors (featured hot topics, exam prep, etc.)
  • Unlimited Access to Hosted Webinars and All Previously Recorded Sessions
  • Unlimited Access to your Digital Courseware

  • Benefits:
    • Training that fits your needs (from high intensity to small learning bytes)
    • Build immediate competency - start at time of purchase!
    • Gain know-how and close skills gaps with limited work disruption
    • Get quick answers to daily challenges - live interaction!

Overview

Apache Flink is an open-source platform for distributed stream and batch data processing.

Flink’s core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams. Flink builds batch processing on top of the streaming engine, overlaying native iteration support, managed memory, and program optimization. This 5-day course covers the technical aspects that developers, architects and DevOps engineers need to know to install, administer, develop for, manage and monitor this 4th generation, distributed data-flow project of the Apache Software Foundation (ASF).

By 4th generation, we mean that Flink is not just the culmination of the data-flow ideas and functions that developers previously had to assemble from predecessor Apache projects such as Spark and Kafka; it is a much more powerful and performant complement or successor to both of those projects. Flink, in fact, changes the very meaning of ’data flow’ and of ’infinite’ versus ’finite’ data sources.

In addition, Flink treats batch and micro-batch processing as special cases of true streaming. For software engineers who use imperative or functional languages, Flink supports Python, Java and Scala APIs. For developers who work with tabular data and SQL, Flink provides a complete SQL interface. As with the Flink programming APIs, Flink SQL can be used for batch, micro-batch and pure streaming processing. Flink allows the same programming paradigm to be used for data flow and data analysis across finite data sets, infinite data sets, heterogeneous data sets, batch, micro-batch and streaming data.
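
The idea that batch is a special case of streaming can be illustrated with a small sketch. This is plain Python, not the Flink API: the names `running_word_count`, `bounded_source` and `unbounded_source` are hypothetical and chosen only for illustration. The same operator is applied unchanged to a finite source and to an infinite one; only the way the results are read differs.

```python
from collections import Counter
from itertools import count, islice
from typing import Dict, Iterable, Iterator


def running_word_count(lines: Iterable[str]) -> Iterator[Dict[str, int]]:
    """Consume lines one at a time and emit the updated counts after each line."""
    totals: Counter = Counter()
    for line in lines:
        totals.update(line.lower().split())
        yield dict(totals)


def bounded_source() -> Iterator[str]:
    """A finite ('batch') source: it ends after two records."""
    yield "flink treats batch as streaming"
    yield "batch is a bounded stream"


def unbounded_source() -> Iterator[str]:
    """An infinite ('streaming') source: it never ends on its own."""
    for n in count():
        yield f"event {n % 2}"


# Batch: run the operator to the end of the input and keep the final result.
batch_result = list(running_word_count(bounded_source()))[-1]

# Streaming: the same operator, observed after the first four records only.
stream_snapshots = list(islice(running_word_count(unbounded_source()), 4))
```

In this sketch the batch job is simply the streaming operator run until its input is exhausted, while the streaming job is the same operator observed incrementally. A real Flink program would express this with Flink's own APIs, which this example does not attempt to reproduce.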

This course will present all essential concepts, libraries and techniques, in a complete hands-on environment, for understanding, creating and supporting Flink and Flink-ecosystem-based applications.

50% Lecture, 50% Lab

Target Audience

We expect the audience for this class to consist of two types of software engineers. The first are Java or Scala software engineers with minimal knowledge of Spark and Kafka who must quickly build rigorous, extensible, enterprise-level applications that rely on a distributed data flow topology.

The second are software engineers who have worked with Java, the Spark API and the Kafka API and who want to understand how Flink’s functionality and performance complement or supersede what Spark and Kafka offer. Companies like Alibaba, Capital One, Ericsson, Netflix and Uber consider Spark and Kafka to be 3rd generation and Flink 4th generation in their capabilities.

Prerequisites

Development experience with Linux, Java and Hadoop is a prerequisite. Knowledge of or experience with implementing EAI/EII patterns is assumed. Experience with a distributed data flow project such as NiFi is helpful. Experience with, or comprehensive conceptual knowledge of, Spark and/or Kafka is also helpful. Students new to Hadoop are advised to first take the course ’Advanced Hadoop.’ Students not familiar with EAI/EII patterns are referred to http://www.enterpriseintegrationpatterns.com/

Course Outline

  • Day 1: Introduction to Flink concepts, ecosystem, use cases
  • Day 2: Application development with Flink
  • Day 3: Extending Flink into the Flink ecosystem
  • Day 4: DevOps, installation options, deployment and monitoring
  • Day 5: Performance enhancement practices with Flink and Flink ecosystem
