Udemy Free Coupon - Apache Spark Streaming with Python and PySpark
Add Big Data Streaming to your Data Science and Machine Learning Python Projects
What will i learn?
Description
What will you learn from this lecture?
In this couse, you'll learn the following:
- An overview of the architecture of Apache Spark.
- How to develop Apache Spark 2.0 applications with PySpark using RDD transformations and actions and Spark SQL.
- How to work with Spark's primary abstraction, resilient distributed datasets(RDDs), to process and analyze large data sets.
- Advanced techniques to optimize and tune Apache Spark jobs by partitioning, caching and persisting RDDs.
- Analyzing structured and semi-structured data using Datasets and DataFrames, and develop a thorough understanding of Spark SQL.
- How to scale up Spark Streaming applications for both bandwidth and processing speed
- How to integrate Spark Streaming with cluster computing tools like Apache Kafka
- How to connect your Spark Stream to a data source like Amazon Web Services (AWS) Kinesis
- Best practices of working with Apache Spark in the field.
- Big data ecosystem overview.
Comments
Post a Comment