Registrer deg nå

Logg Inn

Mistet Passord

Mistet passordet ditt? Vennligst skriv inn E-postadressen din. Du vil motta en lenke og opprette et nytt passord via e-post.

Legg til innlegg

Du må logge inn for å legge til innlegget .

Legg til spørsmål

Du må logge inn for å stille et spørsmål.

Logg Inn

Registrer deg nå

Velkommen til Scholarsark.com! Registreringen din gir deg tilgang til å bruke flere funksjoner på denne plattformen. Du kan stille spørsmål, gi bidrag eller gi svar, se profiler til andre brukere og mye mer. Registrer deg nå!

A Big Data Hadoop and Spark project for absolute beginners

A Big Data Hadoop and Spark project for absolute beginners

Pris: $29.99

This course will prepare you for a real world Data Engineer role !

Get started with Big Data quickly leveraging free cloud cluster and solving a real world use case! Learn Hadoop, Hive , Big Data for ledere (both Python and Scala) fra bunnen av!

Learn to code Spark Scala & PySpark like a real world developer. Understand real world coding best practices, hogst, error handling , configuration management using both Scala and Python.

Prosjekt

A bank is launching a new credit card and wants to identify prospects it can target in its marketing campaign.

It has received prospect data from various internal and 3rd party sources. The data has various issues such as missing or unknown values in certain fields. The data needs to be cleansed before any kind of analysis can be done.

Since the data is in huge volume with billions of records, the bank has asked you to use Big Data Hadoop and Spark technology to cleanse, transform and analyze this data.

Hva du vil lære :

  • Stor Data, Hadoop concepts

  • How to create a free Hadoop and Spark cluster using Google Dataproc

  • Hadoop hands-onHDFS, Hive

  • Python basics

  • PySpark RDD – Microsofts populære kurs DASHBOARD-IN-A-DAY

  • PySpark SQL, DataFrame – Microsofts populære kurs DASHBOARD-IN-A-DAY

  • Project work using PySpark and Hive

  • Scala basics

  • Spark Scala DataFrame

  • Project work using Spark Scala

  • Spark Scala Real world coding framework and development using Winutil, Maven and IntelliJ.

  • Python Spark Hadoop Hive coding framework and development using PyCharm

  • Building a data pipeline using Hive , PostgreSQL, Big Data for ledere

  • Logging , error handling and unit testing of PySpark and Spark Scala applications

  • Spark Scala Structured Streaming

  • Applying spark transformation on data stored in AWS S3 using Glue and viewing data using Athena

Forutsetninger :

  • Some basic programming skills

  • Some knowledge of SQL queries

Legg igjen et svar