MuerBT磁力搜索 BT种子搜索利器 免费下载BT种子,超5000万条种子数据

[FreeCourseSite.com] Udemy - Data Engineering Essentials using SQL, Python, and PySpark

磁力链接/BT种子名称

[FreeCourseSite.com] Udemy - Data Engineering Essentials using SQL, Python, and PySpark

磁力链接/BT种子简介

种子哈希:ed79e89b5619ca524db7b7f719aca2abc2c9f74b
文件大小: 14.96G
已经下载:5325次
下载速度:极快
收录时间:2023-12-19
最近下载:2025-07-18

移花宫入口

移花宫.com邀月.com怜星.com花无缺.comyhgbt.icuyhgbt.top

磁力链接下载

magnet:?xt=urn:btih:ED79E89B5619CA524DB7B7F719ACA2ABC2C9F74B
推荐使用PIKPAK网盘下载资源,10TB超大空间,不限制资源,无限次数离线下载,视频在线观看

下载BT种子文件

磁力链接 迅雷下载 PIKPAK在线播放 世界之窗 91视频 含羞草 欲漫涩 逼哩逼哩 成人快手 51品茶 抖阴破解版 极乐禁地 91短视频 TikTok成人版 PornHub 草榴社区 哆哔涩漫 呦乐园 萝莉岛

最近搜索

涵菱 ?????? 榜一大哥 极品清秀 商k 老姨 小敏儿 不拿拿 调教妹妹 情深叉喔干 超大鸡吧 金子人 みづなれい 绝美女友 小点点点 撅着 插妈 炮友骑乘 熟女母子 chloe 18 走秀 旁边看 体检 痛苦 电影 一男多女 纯欲 little chloe tvp 学生足交 国产原创

文件列表

  • 45 - Recap of important Linux Commands for Data Engineering/005 Understanding PATH Environment Variable.mp4 139.2 MB
  • 51 - Submitting Python based Spark Applications/012 Submit Spark Applications with dependencies as jars.mp4 129.6 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/019 Schedule Hive Applications using Cron.mp4 118.7 MB
  • 45 - Recap of important Linux Commands for Data Engineering/002 Overview of SSH to connect to remote Servers.mp4 118.6 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/007 Setup Project using VS Code Remote Development.mp4 98.4 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/018 Recap of HDFS on Dataproc Cluster.mp4 90.6 MB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/006 Populate Data into Delta Lake Tables using Spark SQL.mp4 88.9 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/014 Overview of Blocks related to files in HDFS.mp4 87.9 MB
  • 51 - Submitting Python based Spark Applications/010 Deep Dive into Spark Deploy Modes.mp4 87.5 MB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/007 Populate Data for Additional Years into Delta NYSE Table.mp4 87.1 MB
  • 45 - Recap of important Linux Commands for Data Engineering/010 Listing Files and Folders using ls command.mp4 84.0 MB
  • 51 - Submitting Python based Spark Applications/011 Submit Spark Applications with dependencies as packages.mp4 79.6 MB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/008 Run Spark SQL Scripts using Spark SQL CLI.mp4 79.3 MB
  • 45 - Recap of important Linux Commands for Data Engineering/007 Copy Files and Folders in Linux using cp command.mp4 79.2 MB
  • 10 - Solutions for Basic SQL Queries/005 Solution for Exercise 2 to get Dormant Customers using Outer Join.mp4 79.2 MB
  • 16 - Troubleshooting and Debugging Python Issues/014 Debug VS Code Notebooks using Debug Feature.mp4 77.3 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/028 Demo on Spark Dynamic Allocation.mp4 76.2 MB
  • 24 - Pre-Defined Functions in Spark SQL/018 Overview of Numeric Functions in Spark SQL.mp4 76.0 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/002 Getting Started with Hive.mp4 75.2 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/025 Run Spark Application with out Adaptive Query Execution.mp4 73.5 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/005 Run Hive Commands using Scripts.mp4 73.1 MB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/007 Run Individual Spark SQL Commands.mp4 73.0 MB
  • 45 - Recap of important Linux Commands for Data Engineering/013 Troubleshooting issues in Linux using grep command.mp4 71.3 MB
  • 13 - Data Processing using Pandas Dataframe APIs/012 Write Pandas Dataframes to JSON Files.mp4 68.4 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/017 Overview of HDFS Namenode for HDFS File Metadata.mp4 67.8 MB
  • 45 - Recap of important Linux Commands for Data Engineering/012 Standard Directories in Linux.mp4 67.5 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/007 Overview of Multinode Hadoop Cluster.mp4 66.8 MB
  • 10 - Solutions for Basic SQL Queries/006 Solution for Exercise 3 to get Revenue Per Customer using Outer Join.mp4 65.3 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/015 Overview of Replication related to files in HDFS.mp4 65.0 MB
  • 45 - Recap of important Linux Commands for Data Engineering/014 Overview of Shell Scripts.mp4 64.8 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/017 Overriding Spark Executor Instances to tune the performance.mp4 64.7 MB
  • 45 - Recap of important Linux Commands for Data Engineering/009 Delete Files and Folders in Linux using rm command.mp4 64.6 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/027 Overview of Spark Dynamic Allocation.mp4 63.8 MB
  • 45 - Recap of important Linux Commands for Data Engineering/006 Creating Folders in Linux using mkdir.mp4 63.0 MB
  • 15 - Project 2 - Files to Database Loader/007 Overview of Deploying File to DB Loader Project.mp4 62.5 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/011 Setup Data Sets to understand HDFS Concepts.mp4 61.9 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/013 Determining Number of Blocks for each file.mp4 61.2 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/010 Overview of local storage of files.mp4 60.4 MB
  • 24 - Pre-Defined Functions in Spark SQL/031 Solutions for Exercises 7 and 8 on Pre-defined Functions in Spark SQL.mp4 59.6 MB
  • 24 - Pre-Defined Functions in Spark SQL/030 Solutions for Exercises 5 and 6 on Pre-defined Functions in Spark SQL.mp4 59.5 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/009 Getting Started with HDFS Commands to Manage Files.mp4 59.3 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/015 Overview of Jobs related to Spark Applications using Spark UI.mp4 59.2 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/018 Overview of Scheduling and Crontab.mp4 58.5 MB
  • 10 - Solutions for Basic SQL Queries/007 Solution for Exercise 4 to get Revenue Per Category.mp4 58.1 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/016 Physical Storage of HDFS File Blocks.mp4 57.8 MB
  • 17 - Performance Tuning of Python Applications/009 Review Pandas Data Frame API to load data into the target table.mp4 57.6 MB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/006 Create Target Table for NYSE Data using Delta Format.mp4 57.2 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/007 Getting Started Data Loader for NYSE Data using HIve.mp4 57.0 MB
  • 52 - Logging in Python based Spark Applications/008 Validate Logging of Spark Application using Cluster Mode.mp4 56.9 MB
  • 51 - Submitting Python based Spark Applications/013 Develop Shell Wrappers to submit Spark Applications.mp4 56.7 MB
  • 10 - Solutions for Basic SQL Queries/004 Solution for Exercise 1 to get Customer Order Count.mp4 56.3 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/030 Overview of number of Spark Partitions.mp4 56.2 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/013 Run Hive QL Commands using Script for NYSE Loader.mp4 55.0 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/016 Review Environment Properties and Disabling Dynamic Allocation.mp4 53.0 MB
  • 30 - Copy Query Results into Spark Metastore Tables/005 Design Pipeline using CTAS and INSERT in Spark SQL.mp4 53.0 MB
  • 17 - Performance Tuning of Python Applications/003 Ensure Postgres Database is setup for file to db loader Python Application.mp4 52.8 MB
  • 52 - Logging in Python based Spark Applications/002 Run Application without logging.mp4 52.4 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/015 Validate Hive Application to Convert NYSE Data.mp4 52.3 MB
  • 24 - Pre-Defined Functions in Spark SQL/014 Extract Information from Date or Time using Spark SQL.mp4 52.1 MB
  • 51 - Submitting Python based Spark Applications/004 Specify Paths using Environment Variables in Spark Applications.mp4 52.1 MB
  • 14 - Project 1 - File Format Converter using Python/021 Pass Data Sets as Run Time Arguments to File Format Converter.mp4 51.9 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/023 Overview of Lazy Evaluation.mp4 51.1 MB
  • 12 - Python Collections for Data Engineering/009 Sort Python lists using key.mp4 50.3 MB
  • 07 - SQL Troubleshooting and Debugging Guide/014 Identify and Troubleshoot Bugs in SQL Queries.mp4 49.3 MB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/007 Detailed outline of Spark SQL Topics in the course.mp4 49.2 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/012 Generate Test Data for Spark Performance Tuning.mp4 48.9 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/009 Create Partitioned Parquet Table for NYSE Data.mp4 48.8 MB
  • 51 - Submitting Python based Spark Applications/008 Review YARN Logs for Spark Applications in Cluster Mode.mp4 48.8 MB
  • 13 - Data Processing using Pandas Dataframe APIs/004 Filter Data in Pandas Dataframe using query.mp4 48.7 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/006 Programmatically Copy files into HDFS using Python.mp4 48.3 MB
  • 24 - Pre-Defined Functions in Spark SQL/007 Trimming Characters or Strings using Spark SQL.mp4 48.1 MB
  • 15 - Project 2 - Files to Database Loader/006 Write CSV Data from Files to Database Tables in Chunks.mp4 48.0 MB
  • 24 - Pre-Defined Functions in Spark SQL/028 Solutions for Exercises 1 and 2 on Pre-defined Functions in Spark SQL.mp4 47.5 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/003 Overview of Hadoop and Spark Cluster Types and Architecture.mp4 47.0 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/010 Determine Overall YARN Capacity.mp4 46.5 MB
  • 23 - Create Delta Tables using Spark SQL/013 Using Merge to Update and Insert into Delta Tables in Spark Metastore.mp4 46.5 MB
  • 21 - Setup Databricks Environment using GCP/012 Overview of Databricks CLI Commands.mp4 46.2 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/004 Overview of Integrating Hive Commands with Shell Scripts.mp4 45.6 MB
  • 39 - ELT Data Pipelines using Databricks/011 Validate Applications for ELT Pipeline using Databricks.mp4 45.3 MB
  • 22 - Basic Transformations using Spark SQL/009 Create Dataframe with Schema from JSON File using Pyspark.mp4 45.3 MB
  • 33 - Getting Started with Pyspark Data Frame APIs/003 Create Dataframe with Schema from JSON File using Pyspark.mp4 45.3 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/014 Redesign the Solution using HDFS to stage files.mp4 45.1 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/029 Running Spark Application using Dynamic Allocation.mp4 45.1 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/006 Override Run time Hive Configuration Properties and Variables.mp4 45.1 MB
  • 28 - Joins using Spark SQL Queries/004 Outer Joins using Spark SQL Queries.mp4 44.9 MB
  • 45 - Recap of important Linux Commands for Data Engineering/004 Overview of Environment Variables in Linux.mp4 44.7 MB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/010 Develop Shell Wrapper for Spark SQL Application.mp4 44.0 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/026 Run Spark Application using Adaptive Query Execution.mp4 43.7 MB
  • 14 - Project 1 - File Format Converter using Python/009 Modularize File Format Converter for Dataset.mp4 43.4 MB
  • 20 - Overview of Spark and Spark Architecture/004 Code Examples of Pandas, Dask and Pyspark.mp4 43.1 MB
  • 10 - Solutions for Basic SQL Queries/001 Solutions for Filtering and Aggregations.mp4 42.1 MB
  • 16 - Troubleshooting and Debugging Python Issues/006 Troubleshoot Module Related issues for Database Connectivity using Python.mp4 41.6 MB
  • 17 - Performance Tuning of Python Applications/014 Refactor File to Database Loader Application for Multiprocessing.mp4 41.6 MB
  • 31 - Ranking using Spark SQL Windowing Functions/006 Filter on Ranks using Spark SQL Windowing Functions.mp4 41.4 MB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/007 Process Data using pyspark Dataframe APIs.mp4 41.3 MB
  • 33 - Getting Started with Pyspark Data Frame APIs/002 Process Schema Details in JSON using Pyspark.mp4 41.1 MB
  • 15 - Project 2 - Files to Database Loader/004 Validate Pandas and SQL Integration.mp4 41.1 MB
  • 22 - Basic Transformations using Spark SQL/008 Process Schema Details in JSON using Pyspark.mp4 41.1 MB
  • 20 - Overview of Spark and Spark Architecture/013 Understand Spark Key Terms.mp4 40.6 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/003 Create Hadoop and Spark Cluster and Setup VS Code Workspace.mp4 40.4 MB
  • 24 - Pre-Defined Functions in Spark SQL/026 Word Count Query using Pre-defined Functions in Spark SQL.mp4 39.9 MB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/009 Validate Spark SQL Application for NYSE Data Conversion.mp4 39.8 MB
  • 17 - Performance Tuning of Python Applications/016 Validate File to DB Loader Application with Multiprocessing.mp4 39.8 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/019 Overview of Adaptive Query Execution.mp4 39.8 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/021 Overview of Shuffling - Part 2.mp4 39.7 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/011 Process NYSE Data and load into partitioned table.mp4 39.7 MB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/011 Evaluate Requirements against Partition Pruning.mp4 39.5 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/006 Setup SSH Connectivity and VS Code Workspace using Master Node.mp4 39.5 MB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/017 Redesign Partition Strategy to tune the performance.mp4 39.4 MB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/004 Overview of Python topics covered in the course.mp4 39.3 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/008 Review Important Properties of HDFS.mp4 39.2 MB
  • 15 - Project 2 - Files to Database Loader/005 Write CSV Data from File to Database Table.mp4 39.0 MB
  • 08 - Performance Tuning of SQL Queries/003 Generate Explain Plans for SQL Queries.mp4 38.9 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/008 Overview of Multinode Hadoop and Spark Cluster Topology.mp4 38.7 MB
  • 14 - Project 1 - File Format Converter using Python/019 Use Environment Variables in File Format Converter.mp4 38.6 MB
  • 20 - Overview of Spark and Spark Architecture/003 Setup Environment to explore Pandas, Dask and Pyspark.mp4 38.6 MB
  • 45 - Recap of important Linux Commands for Data Engineering/003 Overview of Profile in Linux Shell.mp4 38.5 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/014 Develop and Validate Shell Script for Word Count.mp4 38.4 MB
  • 52 - Logging in Python based Spark Applications/005 Changing the Log Message Format using logging.mp4 38.1 MB
  • 08 - Performance Tuning of SQL Queries/002 Overview of SQL Compilation Process and Explain Plans.mp4 37.9 MB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/003 Overview of SQL topics covered in the course.mp4 37.9 MB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/008 Develop Spark SQL Application for NYSE Data Conversion.mp4 37.8 MB
  • 22 - Basic Transformations using Spark SQL/003 Create Temporary Views using Spark SQL.mp4 37.7 MB
  • 07 - SQL Troubleshooting and Debugging Guide/007 Troubleshoot Database Credentials and Permissions Issues.mp4 37.6 MB
  • 18 - Getting Started with GCP/011 Install Google Cloud SDK on Windows.mp4 37.4 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/010 Composite Sorting using Spark Data Frame APIs.mp4 37.3 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/011 Overview of Spark History Server UI.mp4 37.0 MB
  • 21 - Setup Databricks Environment using GCP/014 Setup Data Sets in DBFS using Databricks CLI Commands.mp4 37.0 MB
  • 24 - Pre-Defined Functions in Spark SQL/008 Padding Characters to Strings using Spark SQL.mp4 36.8 MB
  • 17 - Performance Tuning of Python Applications/008 Performance Tuning using Chunksize in Pandas.mp4 36.8 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/012 Review Data Set with Nulls for Sorting using Spark Data Frame APIs.mp4 36.6 MB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/008 Detailed outline of Pyspark Topics in the course.mp4 36.5 MB
  • 37 - Ranking using Pyspark Data Frame APIs/007 Difference Between rank and dense_rank.mp4 36.4 MB
  • 39 - ELT Data Pipelines using Databricks/006 Create and Run Orchestrated Pipeline using Databricks Job.mp4 36.3 MB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/008 Overview of Spark Submit Command.mp4 36.3 MB
  • 51 - Submitting Python based Spark Applications/009 Overview of Execution Process of Spark Applications.mp4 36.2 MB
  • 24 - Pre-Defined Functions in Spark SQL/009 Reverse and Concatenate Strings using Spark SQL.mp4 36.2 MB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/005 Launch Spark SQL CLI with Delta Lake Packages.mp4 36.1 MB
  • 17 - Performance Tuning of Python Applications/015 Add Parallel Processing to file to db loader Python Application.mp4 36.0 MB
  • 14 - Project 1 - File Format Converter using Python/024 Exception Handling in File Format Converter Application.mp4 36.0 MB
  • 10 - Solutions for Basic SQL Queries/008 Solution for Exercise 5 to get Product Count Per Department.mp4 35.6 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/005 Create Folder and Copy Files into HDFS using Commands.mp4 35.5 MB
  • 20 - Overview of Spark and Spark Architecture/008 Overview of Spark Key Features and Platforms.mp4 35.5 MB
  • 24 - Pre-Defined Functions in Spark SQL/021 Replace Null Values with default values using nvl and coalesce in Spark SQL.mp4 35.5 MB
  • 27 - Aggregations using Spark SQL Queries/001 Perform Total Aggregations using Spark SQL Queries.mp4 35.4 MB
  • 24 - Pre-Defined Functions in Spark SQL/019 Data Type Conversion using Spark SQL.mp4 35.3 MB
  • 24 - Pre-Defined Functions in Spark SQL/024 Using CASE and WHEN for conditional logic in Spark SQL.mp4 35.1 MB
  • 39 - ELT Data Pipelines using Databricks/002 Pass Arguments to Databricks Python Notebooks.mp4 35.1 MB
  • 24 - Pre-Defined Functions in Spark SQL/017 Dealing with Unix Timestamp using Spark SQL.mp4 34.9 MB
  • 10 - Solutions for Basic SQL Queries/002 Solutions for Filtering and Aggregations.mp4 34.8 MB
  • 41 - Performance Tuning of Spark - Cluster Configuration/004 Understand the size of the data using dbutils.mp4 34.6 MB
  • 11 - Getting Started with Python/009 Getting help on Python Variables and Functions.mp4 34.5 MB
  • 12 - Python Collections for Data Engineering/003 Overview of Python Collections.mp4 34.4 MB
  • 16 - Troubleshooting and Debugging Python Issues/013 Overview of Debugging VS Code Notebooks using Debug Feature.mp4 34.3 MB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/010 Overview of Performance Tuning of Spark covered in the course.mp4 34.2 MB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/002 Review the Requirements and Datasets for NYSE Data.mp4 34.2 MB
  • 08 - Performance Tuning of SQL Queries/007 Interpret Explain Plans for Basic SQL Queries.mp4 34.2 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/020 Recap of Application Development Life Cycle using Hive.mp4 34.2 MB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/005 Create External Stage Table for NYSE CSV Files.mp4 34.1 MB
  • 14 - Project 1 - File Format Converter using Python/020 Pass JSON Array as argument to Python Applications.mp4 34.0 MB
  • 23 - Create Delta Tables using Spark SQL/005 Copy Data into Spark Metastore Managed Table.mp4 33.9 MB
  • 51 - Submitting Python based Spark Applications/002 Develop Pyspark Application for Daily Revenue.mp4 33.9 MB
  • 17 - Performance Tuning of Python Applications/007 Overview of Execution of file to db loader application.mp4 33.8 MB
  • 45 - Recap of important Linux Commands for Data Engineering/015 Running and Debugging Shell Scripts with Arguments.mp4 33.8 MB
  • 51 - Submitting Python based Spark Applications/003 Run Spark Application using spark-submit.mp4 33.5 MB
  • 22 - Basic Transformations using Spark SQL/002 Getting Started with Spark SQL Example using Databricks.mp4 33.3 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/005 Validate SSH Connectivity to the Dataproc Cluster.mp4 33.2 MB
  • 08 - Performance Tuning of SQL Queries/011 Add Required Indexes to tune performance of SQL Queries.mp4 32.7 MB
  • 24 - Pre-Defined Functions in Spark SQL/029 Solutions for Exercises 3 and 4 on Pre-defined Functions in Spark SQL.mp4 32.7 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/012 Develop Hive QL Script to Load NYSE Data.mp4 32.6 MB
  • 14 - Project 1 - File Format Converter using Python/018 Use Environment Variables in Python Applications.mp4 32.6 MB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/016 Add Condition for Partition Pruning.mp4 32.5 MB
  • 24 - Pre-Defined Functions in Spark SQL/013 Overview of trunc and date_trunc in Spark SQL.mp4 32.4 MB
  • 23 - Create Delta Tables using Spark SQL/003 Create Database and Review the Details.mp4 32.3 MB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/003 Spark SQL Metastore Architecture.mp4 32.1 MB
  • 02 - Getting Started with SQL for Data Engineering/003 Overview of Database Technologies and relevance of SQL.mp4 31.7 MB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/003 Overview of Spark Properties Files.mp4 31.7 MB
  • 39 - ELT Data Pipelines using Databricks/009 Review File Format Converter Pyspark Code.mp4 31.5 MB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/003 Overview of Data Processing using Conventional loops.mp4 31.5 MB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/004 Review Data Sets to explore Pyspark APIs.mp4 31.5 MB
  • 14 - Project 1 - File Format Converter using Python/008 Write Pandas Dataframe to JSON Files.mp4 31.5 MB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/010 Understand Join and Aggregation for Daily Product Revenue.mp4 31.5 MB
  • 05 - Writing Basic SQL Queries/010 Outer Joins using SQL Queries.mp4 31.5 MB
  • 45 - Recap of important Linux Commands for Data Engineering/016 Overview of Hadoop and Spark Executables.mp4 31.4 MB
  • 14 - Project 1 - File Format Converter using Python/010 Wrapper to Process all Data Sets.mp4 31.3 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/009 Review HDFS Properties on Dataproc Cluster using VS Code.mp4 31.3 MB
  • 20 - Overview of Spark and Spark Architecture/010 Overview of Spark Cluster using Databricks.mp4 31.2 MB
  • 14 - Project 1 - File Format Converter using Python/013 Add Core Logic to Python Application.mp4 31.2 MB
  • 21 - Setup Databricks Environment using GCP/010 Configure Databricks CLI on Mac or Windows.mp4 31.1 MB
  • 33 - Getting Started with Pyspark Data Frame APIs/006 Convert CSV to Parquet with Schema using Pyspark.mp4 31.1 MB
  • 22 - Basic Transformations using Spark SQL/012 Convert CSV to Parquet with Schema using Pyspark.mp4 31.1 MB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/007 Manage Spark Metastore Database Objects using Spark APIs.mp4 31.1 MB
  • 24 - Pre-Defined Functions in Spark SQL/005 Extract Substring using substr in Spark SQL.mp4 31.1 MB
  • 52 - Logging in Python based Spark Applications/004 Getting Started with logging using Python.mp4 30.9 MB
  • 28 - Joins using Spark SQL Queries/002 Inner Join using Spark SQL Queries.mp4 30.8 MB
  • 51 - Submitting Python based Spark Applications/006 Run Spark Application with Environment Variables in Cluster Mode.mp4 30.8 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/002 Overview of different Spark Platforms on Cloud.mp4 30.8 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/012 Overview of Distributed Storage of files in HDFS.mp4 30.5 MB
  • 16 - Troubleshooting and Debugging Python Issues/012 Overview of Unit Testing or Validation of Applications.mp4 30.4 MB
  • 39 - ELT Data Pipelines using Databricks/012 Build ELT Pipeline using Databricks Job in Workflows.mp4 30.3 MB
  • 13 - Data Processing using Pandas Dataframe APIs/005 Get Count by Status using Pandas Dataframe APIs.mp4 30.3 MB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/004 Copy Files into HDFS for NYSE Converter.mp4 30.2 MB
  • 04 - Setup Application Tables and Data in Postgres Database/001 Overview of Postgres Database Server and pgAdmin.mp4 30.2 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/010 Populate Data into Partitioned NYSE table from Stage Table.mp4 30.2 MB
  • 13 - Data Processing using Pandas Dataframe APIs/008 Performing Inner Join between Pandas Dataframes.mp4 30.1 MB
  • 09 - Exercises for Basic SQL Queries/001 Simple Exercises for Filtering and Aggregations.mp4 29.7 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/003 Apply Row Level transformations using Pyspark Data Frame APIs.mp4 29.7 MB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/002 Launching or Getting Started Pyspark CLI.mp4 29.6 MB
  • 13 - Data Processing using Pandas Dataframe APIs/001 Overview of Pandas for Data Processing.mp4 29.4 MB
  • 16 - Troubleshooting and Debugging Python Issues/017 Debug Python Application using VS Code with breakpoints.mp4 29.3 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/013 Develop Word Count Application using Spark.mp4 29.3 MB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/002 Side effects of using CSV Files in Data Lake.mp4 29.0 MB
  • 45 - Recap of important Linux Commands for Data Engineering/008 Move Files and Folders in Linux using mv command.mp4 28.9 MB
  • 28 - Joins using Spark SQL Queries/003 Concepts Behind Inner Joins in Spark SQL.mp4 28.8 MB
  • 13 - Data Processing using Pandas Dataframe APIs/010 Sort Data in Pandas Dataframes.mp4 28.8 MB
  • 17 - Performance Tuning of Python Applications/017 Understanding the concept of Multiprocessing in Python.mp4 28.7 MB
  • 39 - ELT Data Pipelines using Databricks/005 Run Databricks Jobs and Tasks with Parameters.mp4 28.7 MB
  • 04 - Setup Application Tables and Data in Postgres Database/002 Overview of Database Connection Details.mp4 28.6 MB
  • 42 - Performance Tuning while inferring schema from CSV or JSON files/004 Overview of overhead for inferring schema.mp4 28.5 MB
  • 42 - Performance Tuning while inferring schema from CSV or JSON files/005 Performance Tuning to infer schema of Spark Dataframe.mp4 28.5 MB
  • 41 - Performance Tuning of Spark - Cluster Configuration/002 Difference Between All Purpose and Jobs Clusters.mp4 28.4 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/010 Getting Started with Spark CLI using Python.mp4 28.3 MB
  • 12 - Python Collections for Data Engineering/001 Overview of File IO using Python.mp4 28.3 MB
  • 04 - Setup Application Tables and Data in Postgres Database/008 Overview of pgAdmin to write SQL Queries.mp4 28.2 MB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/002 Run Spark SQL Queries on Spark Data Frames.mp4 28.0 MB
  • 17 - Performance Tuning of Python Applications/005 Run and Validate File to DB Loader Application.mp4 28.0 MB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/006 Process Data in Spark Metastore Tables using Data Frame APIs.mp4 27.9 MB
  • 37 - Ranking using Pyspark Data Frame APIs/001 Introduction to Ranking using Spark Data Frame APIs.mp4 27.9 MB
  • 16 - Troubleshooting and Debugging Python Issues/016 Recap of running File Format Converter application.mp4 27.9 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/008 Design NYSE Data Loader Application.mp4 27.9 MB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/002 Getting Started with Data Sets and Spark SQL CLI.mp4 27.8 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/013 Dealing with Nulls while Sorting the Data in Spark Data Frames.mp4 27.7 MB
  • 22 - Basic Transformations using Spark SQL/006 Save Query Result to DBFS using Spark SQL.mp4 27.7 MB
  • 36 - Joining Data using Spark Data Frame APIs/005 Join and other Spark Data Frame APIs to process the data.mp4 27.6 MB
  • 26 - Filtering Data using Spark SQL Queries/002 Using IN, LIKE and BETWEEN in Spark SQL Queries.mp4 27.6 MB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/002 Overview of our support to Data Engineering Essentials course.mp4 27.6 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/005 Filtering Data with Multiple Conditions using Pyspark Data Frame APIs.mp4 27.4 MB
  • 17 - Performance Tuning of Python Applications/013 Invoking User Defined Functions using multiprocessing in Python.mp4 27.2 MB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/004 Restructure CSV Data to Columnar Format using Pyspark.mp4 27.1 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/017 Develop Shell Wrapper to run Hive Application.mp4 27.1 MB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/005 Generate Explain Plans on Spark Dataframes using explain function.mp4 27.0 MB
  • 18 - Getting Started with GCP/012 Initialize gcloud CLI using GCP Project.mp4 26.9 MB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/011 Overview of Writing Data in Data Frame to Delta Files.mp4 26.8 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/004 Overview of OVER and PARTITION BY Clause in SQL Queries.mp4 26.7 MB
  • 20 - Overview of Spark and Spark Architecture/007 Overview of Official Documentation of Apache Spark.mp4 26.7 MB
  • 30 - Copy Query Results into Spark Metastore Tables/006 Copy Query Results into Spark Meatstore Tables using MERGE.mp4 26.7 MB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/004 Overview of Spark Metastore Warehouse Directory.mp4 26.6 MB
  • 36 - Joining Data using Spark Data Frame APIs/009 Equivalent Spark SQL Queries for Joins.mp4 26.4 MB
  • 15 - Project 2 - Files to Database Loader/003 Run Queries from Notebook using SQL Magic.mp4 26.1 MB
  • 14 - Project 1 - File Format Converter using Python/023 Raising Exceptions in Python Applications.mp4 26.0 MB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/007 Overview of Performance Assessment of Spark Jobs.mp4 25.7 MB
  • 14 - Project 1 - File Format Converter using Python/002 Get File Names to be processed using glob.mp4 25.7 MB
  • 05 - Writing Basic SQL Queries/014 Outer Join with Additional Conditions in SQL Queries.mp4 25.5 MB
  • 24 - Pre-Defined Functions in Spark SQL/015 Convert Non Standard Dates or Timestamps to Standard Ones using Spark SQL.mp4 25.4 MB
  • 52 - Logging in Python based Spark Applications/007 Validate Logging of Spark Application using Client Mode.mp4 25.4 MB
  • 21 - Setup Databricks Environment using GCP/006 Overview of Databricks on GCP.mp4 25.2 MB
  • 08 - Performance Tuning of SQL Queries/004 Review Tables used for Performance Tuning of SQL Queries.mp4 25.0 MB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/006 Overview of Spark and Databricks Environment related topics.mp4 25.0 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/016 Deploy Hive Application in HDFS.mp4 24.9 MB
  • 36 - Joining Data using Spark Data Frame APIs/002 Create Data Frames to Join using Spark Data Frame APIs.mp4 24.8 MB
  • 08 - Performance Tuning of SQL Queries/009 Write SQL Queries for Customer Orders.mp4 24.7 MB
  • 24 - Pre-Defined Functions in Spark SQL/004 Case Conversion and Length of Strings using Spark SQL.mp4 24.7 MB
  • 12 - Python Collections for Data Engineering/016 Create Function to get Column Details from Schemas JSON File.mp4 24.6 MB
  • 11 - Getting Started with Python/002 Setup Notebook Environment in VS Code Workspace.mp4 24.4 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/007 Perform Aggregations by Key using Spark Data Frame APIs.mp4 24.3 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/020 Overview of Shuffling - Part 1.mp4 24.3 MB
  • 07 - SQL Troubleshooting and Debugging Guide/003 Validate and Setup Telnet on Mac or PC.mp4 24.0 MB
  • 22 - Basic Transformations using Spark SQL/005 Spark SQL Query to compute Daily Product Revenue.mp4 23.9 MB
  • 17 - Performance Tuning of Python Applications/002 Setup Database Loader Python Application.mp4 23.6 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/002 Overview of Row Level Transformations.mp4 23.5 MB
  • 03 - Setup Tools for Data Engineering Essentials/003 Setup Python 3.9 on Windows.mp4 23.5 MB
  • 33 - Getting Started with Pyspark Data Frame APIs/004 Transform Data using Spark APIs.mp4 23.5 MB
  • 22 - Basic Transformations using Spark SQL/010 Transform Data using Spark APIs.mp4 23.5 MB
  • 23 - Create Delta Tables using Spark SQL/010 Overview of Spark Metastore.mp4 23.5 MB
  • 45 - Recap of important Linux Commands for Data Engineering/011 Searching for files using find command in Linux.mp4 23.5 MB
  • 25 - Setup Spark Metastore Tables for Basic Transformations/003 Projecting Data using Spark SQL.mp4 23.4 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/003 Overview of Hive Architecture.mp4 23.4 MB
  • 05 - Writing Basic SQL Queries/003 Filtering Data using SQL Queries.mp4 23.4 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/018 Determine Maximum Capacity to submit a Spark Application.mp4 23.3 MB
  • 41 - Performance Tuning of Spark - Cluster Configuration/010 Review Running Job Details using Spark UI.mp4 22.7 MB
  • 24 - Pre-Defined Functions in Spark SQL/022 Conditional Logic on Null Values using nvl2 and case in Spark SQL.mp4 22.6 MB
  • 07 - SQL Troubleshooting and Debugging Guide/009 Troubleshooting Syntax Errors in SQL Queries.mp4 22.5 MB
  • 12 - Python Collections for Data Engineering/008 Get unique values from list using map and set.mp4 22.4 MB
  • 23 - Create Delta Tables using Spark SQL/011 Difference Between Managed and External Spark Metastore Tables.mp4 22.4 MB
  • 21 - Setup Databricks Environment using GCP/007 High level architecture of Databricks.mp4 22.4 MB
  • 37 - Ranking using Pyspark Data Frame APIs/004 Filter Based on Global Ranks using Spark Data Frame APIs.mp4 22.3 MB
  • 23 - Create Delta Tables using Spark SQL/004 Create and Review Managed Spark Metastore Table using Delta Format.mp4 22.3 MB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/015 Parameterize Spark SQL Solution for Partition Pruning.mp4 22.3 MB
  • 41 - Performance Tuning of Spark - Cluster Configuration/009 Setup Databricks Job Compute Clusters using Workflows.mp4 22.1 MB
  • 20 - Overview of Spark and Spark Architecture/005 Differences between Pandas, Dask and Pyspark.mp4 22.1 MB
  • 14 - Project 1 - File Format Converter using Python/017 Setting Environment Variables on Windows or Mac or Linux.mp4 22.1 MB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/003 Review Explain Plan for Spark Dataframe logic using Spark UI.mp4 22.1 MB
  • 32 - Processing JSON like Data using Spark SQL/007 Dealing with Array of Struct Type Columns using Spark SQL Queries.mp4 22.0 MB
  • 18 - Getting Started with GCP/008 Overview of GCP Credits.mp4 22.0 MB
  • 08 - Performance Tuning of SQL Queries/013 Interpreting the explain plan for SQL Queries using Indexes.mp4 22.0 MB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/009 Understand Filter and Broadcast of Orders Data.mp4 21.9 MB
  • 16 - Troubleshooting and Debugging Python Issues/003 Overview of Database Connectivity using Python Applications.mp4 21.9 MB
  • 32 - Processing JSON like Data using Spark SQL/009 Generate Array Type Columns from Regular Columns in Spark SQL.mp4 21.9 MB
  • 16 - Troubleshooting and Debugging Python Issues/007 Troubleshoot Credentials Related issues for Database Connectivity using Python.mp4 21.9 MB
  • 28 - Joins using Spark SQL Queries/007 Example - Filtering and Outer Joins along with GROUP BY in Spark SQL Queries.mp4 21.7 MB
  • 11 - Getting Started with Python/012 Loops and Conditions in Python.mp4 21.5 MB
  • 23 - Create Delta Tables using Spark SQL/012 Perform CRUD Operations on Delta Tables in Spark Metastore.mp4 21.3 MB
  • 05 - Writing Basic SQL Queries/005 Group By Aggregations using SQL Queries.mp4 21.1 MB
  • 12 - Python Collections for Data Engineering/007 Filter Data in Python Lists using filter and lambda.mp4 21.1 MB
  • 17 - Performance Tuning of Python Applications/001 Introduction to Performance of Python Applications.mp4 21.1 MB
  • 52 - Logging in Python based Spark Applications/003 Overview of Logging Concepts such as Log Levels.mp4 21.1 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/004 Filtering Data using Pyspark Data Frame APIs.mp4 21.0 MB
  • 13 - Data Processing using Pandas Dataframe APIs/007 Create Dataframes using dynamic column list on CSV Data.mp4 21.0 MB
  • 20 - Overview of Spark and Spark Architecture/006 Overview of Distributed Computing.mp4 20.9 MB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/005 Getting Started with Pyspark for Data Processing.mp4 20.9 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/006 Perform Aggregations by Key using Spark Data Frame APIs.mp4 20.7 MB
  • 11 - Getting Started with Python/011 Overview of Python Lists.mp4 20.7 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/006 Convert IP Address to Static for Dataproc Cluster.mp4 20.6 MB
  • 32 - Processing JSON like Data using Spark SQL/005 Projecting Data From Struct Type Fields in Spark SQL.mp4 20.5 MB
  • 36 - Joining Data using Spark Data Frame APIs/004 Inner Join using Spark Data Frame APIs.mp4 20.4 MB
  • 23 - Create Delta Tables using Spark SQL/007 Create and Review External Spark Metastore Table using Delta Format.mp4 20.4 MB
  • 23 - Create Delta Tables using Spark SQL/008 Insert Data into Spark Metastore External Table.mp4 20.3 MB
  • 27 - Aggregations using Spark SQL Queries/005 Filter Data based on Aggregate Results using HAVING in Spark SQL Queries.mp4 20.2 MB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/006 Overview of Spark Data Frames and their Characteristics.mp4 20.2 MB
  • 32 - Processing JSON like Data using Spark SQL/011 Processing Delimited Strings using Spark SQL Queries.mp4 20.0 MB
  • 23 - Create Delta Tables using Spark SQL/009 Validate Data in Spark Metastore External Table.mp4 20.0 MB
  • 24 - Pre-Defined Functions in Spark SQL/020 Overview of Handling Null Values using Spark SQL.mp4 19.9 MB
  • 07 - SQL Troubleshooting and Debugging Guide/002 Overview of Database Connectivity Issues.mp4 19.8 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/010 Filtering based on Global Ranks using Nested Queries and CTEs in SQL.mp4 19.8 MB
  • 24 - Pre-Defined Functions in Spark SQL/002 Validate Functions in Spark SQL.mp4 19.8 MB
  • 17 - Performance Tuning of Python Applications/006 Fix the error message in file to db loader application.mp4 19.7 MB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/003 Create Spark Metastore Tables using Data Frames.mp4 19.7 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/011 Getting Started with Spark CLI using Scala.mp4 19.7 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/005 Setup Multinode Hadoop and Spark Cluster using GCP Dataproc.mp4 19.6 MB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/009 Applying functions on Spark Data Frame Columns.mp4 19.6 MB
  • 25 - Setup Spark Metastore Tables for Basic Transformations/002 Prepare Spark Metastore Tables for Basic Transformations.mp4 19.4 MB
  • 05 - Writing Basic SQL Queries/001 Review Data Model Diagram.mp4 19.4 MB
  • 31 - Ranking using Spark SQL Windowing Functions/003 Compute Global Rank using Spark SQL Windowing Functions.mp4 19.4 MB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/004 Review Explain Plan for Spark SQL logic using Spark UI.mp4 19.2 MB
  • 12 - Python Collections for Data Engineering/011 Read JSON Strings to Python dicts or lists.mp4 19.1 MB
  • 04 - Setup Application Tables and Data in Postgres Database/007 Setup Application Tables and Data in Postgres Database.mp4 19.1 MB
  • 05 - Writing Basic SQL Queries/009 Inner Joins using SQL Queries.mp4 19.0 MB
  • 14 - Project 1 - File Format Converter using Python/005 Read CSV Data into Pandas Dataframe with Schema Dynamically.mp4 19.0 MB
  • 27 - Aggregations using Spark SQL Queries/004 Order of Execution of Spark SQL Queries.mp4 19.0 MB
  • 13 - Data Processing using Pandas Dataframe APIs/002 Overview of Reading CSV Data using Pandas.mp4 18.9 MB
  • 14 - Project 1 - File Format Converter using Python/015 Using Run Time Arguments in Python Applications.mp4 18.9 MB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/005 Overview of Getting Started with GCP related to the course.mp4 18.6 MB
  • 36 - Joining Data using Spark Data Frame APIs/007 Left Outer Join using Spark Data Frame APIs.mp4 18.6 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/014 Download VS Code Workspace and Delete Cluster.mp4 18.5 MB
  • 12 - Python Collections for Data Engineering/012 Read JSON Schemas from file to Python dicts.mp4 18.4 MB
  • 13 - Data Processing using Pandas Dataframe APIs/006 Get count by Month and Status using Pandas Dataframe APIs.mp4 18.4 MB
  • 05 - Writing Basic SQL Queries/006 Order of Execution of SQL Queries.mp4 18.3 MB
  • 32 - Processing JSON like Data using Spark SQL/006 Creating Spark Metastore Tables with Array of Struct Column.mp4 18.2 MB
  • 42 - Performance Tuning while inferring schema from CSV or JSON files/003 Overview of CSV or JSON Files.mp4 18.1 MB
  • 08 - Performance Tuning of SQL Queries/010 Performance Testing of SQL Queries using Stored Procedure.mp4 18.0 MB
  • 07 - SQL Troubleshooting and Debugging Guide/010 Troubleshooting Semantec Errors in SQL Queries.mp4 17.9 MB
  • 05 - Writing Basic SQL Queries/015 Explanation about Fix of SQL Queries with Filtering on Outer Join Results.mp4 17.9 MB
  • 13 - Data Processing using Pandas Dataframe APIs/003 Read Data from CSV Files to Pandas Dataframes.mp4 17.9 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/009 Sorting Data using Spark Data Frame APIs.mp4 17.9 MB
  • 39 - ELT Data Pipelines using Databricks/004 Create and Run First Databricks Job.mp4 17.7 MB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/004 Overview of Data Frame Concepts.mp4 17.7 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/024 Review the code of Word Count Application.mp4 17.7 MB
  • 24 - Pre-Defined Functions in Spark SQL/006 Extract Substrings from Delimited Strings using split in Spark SQL.mp4 17.7 MB
  • 02 - Getting Started with SQL for Data Engineering/002 Overview of Application Architecture and RDBMS.mp4 17.7 MB
  • 26 - Filtering Data using Spark SQL Queries/001 Filtering Data using Equal Condition in Spark SQL.mp4 17.6 MB
  • 02 - Getting Started with SQL for Data Engineering/005 Overview of Data Warehouse and Data Lake.mp4 17.5 MB
  • 41 - Performance Tuning of Spark - Cluster Configuration/008 Analyze Airlines Data using Spark SQL.mp4 17.5 MB
  • 32 - Processing JSON like Data using Spark SQL/003 Dealing with Array Type Columns using Spark SQL Queries.mp4 17.5 MB
  • 30 - Copy Query Results into Spark Metastore Tables/003 Copy Query Results into Spark Metastore Tables using CTAS.mp4 17.4 MB
  • 24 - Pre-Defined Functions in Spark SQL/012 Date Arithmetic using Spark SQL Functions.mp4 17.4 MB
  • 21 - Setup Databricks Environment using GCP/009 Overview of Databricks CLI and other clients.mp4 17.2 MB
  • 26 - Filtering Data using Spark SQL Queries/003 Filter Data using Boolean AND in Spark SQL Queries.mp4 17.1 MB
  • 20 - Overview of Spark and Spark Architecture/011 Overview of Executors in Spark Cluster.mp4 17.1 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/007 Review Multi Node Hadoop and Spark Clusters using Web Interfaces.mp4 17.1 MB
  • 21 - Setup Databricks Environment using GCP/011 Troubleshoot issues to configure Databricks CLI.mp4 17.0 MB
  • 15 - Project 2 - Files to Database Loader/002 Install Python Dependencies for Pandas and Database Integration.mp4 17.0 MB
  • 02 - Getting Started with SQL for Data Engineering/004 Overview of Purpose Built Databases.mp4 17.0 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/004 Overview of important HDFS Commands.mp4 16.9 MB
  • 11 - Getting Started with Python/010 Pre-Defined String Manipulation Functions.mp4 16.9 MB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/007 Interpreting Explain Plan for Spark SQL Query.mp4 16.9 MB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/008 Overview of Spark Architecture.mp4 16.9 MB
  • 21 - Setup Databricks Environment using GCP/002 Signing up for Databricks on GCP.mp4 16.9 MB
  • 39 - ELT Data Pipelines using Databricks/013 Run and Review Execution details of ELT Data Pipeline using Databricks Job.mp4 16.7 MB
  • 37 - Ranking using Pyspark Data Frame APIs/003 Compute Global Ranks using Spark Data Frame APIs.mp4 16.7 MB
  • 05 - Writing Basic SQL Queries/012 Overview of Database Views.mp4 16.7 MB
  • 22 - Basic Transformations using Spark SQL/001 Process Data in DBFS using Databricks Spark SQL.mp4 16.4 MB
  • 21 - Setup Databricks Environment using GCP/013 Setup Data Repository for Data Sets.mp4 16.4 MB
  • 29 - Sorting using Spark SQL Queries/001 Sorting Data using Spark SQL Queries.mp4 16.3 MB
  • 28 - Joins using Spark SQL Queries/008 Example - Filtering and Outer Joins along with GROUP BY in Spark SQL Queries.mp4 16.2 MB
  • 12 - Python Collections for Data Engineering/015 Sort Data in JSON Arrays using Python.mp4 16.2 MB
  • 13 - Data Processing using Pandas Dataframe APIs/009 Perform Aggregations on Join results.mp4 16.1 MB
  • 12 - Python Collections for Data Engineering/005 Overview of Lambda Functions in Python.mp4 15.9 MB
  • 03 - Setup Tools for Data Engineering Essentials/007 Install Postgres 14 on Windows 11.mp4 15.9 MB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/006 Read Orders and Order Items Data into Spark Data Frames.mp4 15.9 MB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/009 Detailed outline of ELT Data Pipelines on Databricks.mp4 15.9 MB
  • 17 - Performance Tuning of Python Applications/018 Performance Tuning Scenarios of Python Applications.mp4 15.8 MB
  • 52 - Logging in Python based Spark Applications/006 Add logging to Python based Spark Applications.mp4 15.8 MB
  • 32 - Processing JSON like Data using Spark SQL/010 Generate Array of Struct Type Columns from Regular Columns in Spark SQL.mp4 15.8 MB
  • 39 - ELT Data Pipelines using Databricks/014 Cleanup Databricks Environment on GCP.mp4 15.8 MB
  • 29 - Sorting using Spark SQL Queries/002 Dealing with Nulls while Sorting Data using Spark SQL Queries.mp4 15.8 MB
  • 11 - Getting Started with Python/001 Setup Visual Studio Workspace for Python Application Development.mp4 15.8 MB
  • 02 - Getting Started with SQL for Data Engineering/006 Usage of RDBMS and Data Warehouse technologies.mp4 15.7 MB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/004 Insert into Spark Metastore Tables using Data Frames.mp4 15.7 MB
  • 31 - Ranking using Spark SQL Windowing Functions/004 Compute Ranks Per Key using Spark SQL Windowing Functions.mp4 15.7 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/013 Difference between rank and dense rank using SQL.mp4 15.5 MB
  • 11 - Getting Started with Python/013 User Defined Functions in Python.mp4 15.3 MB
  • 14 - Project 1 - File Format Converter using Python/004 Get Data Set Names from File Names or Paths using regular expressions.mp4 15.3 MB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/018 Recap of Spark Performance Tuning Scenarios.mp4 15.2 MB
  • 36 - Joining Data using Spark Data Frame APIs/008 Right Outer Join using Spark Data Frame APIs.mp4 15.1 MB
  • 37 - Ranking using Pyspark Data Frame APIs/006 Filter Based on Ranks Per Partition using Spark Data Frame APIs.mp4 15.0 MB
  • 27 - Aggregations using Spark SQL Queries/003 GROUP BY Examples using Spark SQL Queries.mp4 14.9 MB
  • 21 - Setup Databricks Environment using GCP/008 Setup Databricks CLI on Mac or Windows.mp4 14.9 MB
  • 08 - Performance Tuning of SQL Queries/005 Review Data Storage Internals for Tables and Indexes.mp4 14.9 MB
  • 19 - Overview of Big Data and Data Lakes/007 Overview of Data Lake using Hadoop eco system.mp4 14.8 MB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/006 Run Operations on Partitioned Parquet Data.mp4 14.8 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/004 Setup Single Node Hadoop and Spark Cluster using Dataproc.mp4 14.7 MB
  • 03 - Setup Tools for Data Engineering Essentials/002 Setup VS Code on Windows.mp4 14.6 MB
  • 17 - Performance Tuning of Python Applications/012 Getting Started with Multiprocessing using Python.mp4 14.6 MB
  • 07 - SQL Troubleshooting and Debugging Guide/005 Troubleshoot Database Connectivity Issue with Correct Host Details.mp4 14.5 MB
  • 19 - Overview of Big Data and Data Lakes/005 Overview of Big Data.mp4 14.5 MB
  • 27 - Aggregations using Spark SQL Queries/002 Overview of Aggregations using GROUP BY in Spark SQL Queries.mp4 14.5 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/011 Develop Spark SQL Queries for Sorting Data.mp4 14.4 MB
  • 32 - Processing JSON like Data using Spark SQL/002 Creating Spark Metastore Tables with Array Type Columns.mp4 14.4 MB
  • 07 - SQL Troubleshooting and Debugging Guide/008 Overview of Compilation of SQL Queries.mp4 14.4 MB
  • 05 - Writing Basic SQL Queries/007 Rules and Restrictions to Group and Filter Data in SQL queries.mp4 14.3 MB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/001 Create Spark Data Frame using Pyspark Data Frame APIs.mp4 14.3 MB
  • 28 - Joins using Spark SQL Queries/006 Example - Outer Join along with GROUP BY using Spark SQL Queries.mp4 14.2 MB
  • 26 - Filtering Data using Spark SQL Queries/004 Filter Data using Boolean OR in Spark SQL Queries.mp4 14.2 MB
  • 30 - Copy Query Results into Spark Metastore Tables/004 Copy Query Results into Spark Metastore Tables using INSERT.mp4 14.2 MB
  • 11 - Getting Started with Python/005 Defining Functions in VS Code Notebooks.mp4 14.1 MB
  • 12 - Python Collections for Data Engineering/010 Overview of JSON Strings and Files.mp4 14.1 MB
  • 05 - Writing Basic SQL Queries/004 Total Aggregations using SQL Queries.mp4 14.1 MB
  • 17 - Performance Tuning of Python Applications/010 Overview of multi or batch insert into Database Tables.mp4 14.1 MB
  • 16 - Troubleshooting and Debugging Python Issues/009 Troubleshooting Compilation Errors in Python.mp4 14.0 MB
  • 08 - Performance Tuning of SQL Queries/006 Review key terms used in Explain Plans for SQL Queries.mp4 13.9 MB
  • 39 - ELT Data Pipelines using Databricks/003 Pass Arguments to Databricks SQL Notebooks.mp4 13.9 MB
  • 07 - SQL Troubleshooting and Debugging Guide/015 Develop Solution using Development Best Practices.mp4 13.8 MB
  • 51 - Submitting Python based Spark Applications/005 Run Spark Application with Environment Variables in Client Mode.mp4 13.8 MB
  • 33 - Getting Started with Pyspark Data Frame APIs/005 Get Schema Details for all Data Sets using Pyspark.mp4 13.7 MB
  • 22 - Basic Transformations using Spark SQL/011 Get Schema Details for all Data Sets using Pyspark.mp4 13.7 MB
  • 41 - Performance Tuning of Spark - Cluster Configuration/011 Review Completed Job Details using Spark UI.mp4 13.6 MB
  • 11 - Getting Started with Python/004 Overview of Cells in VS Code Notebook.mp4 13.6 MB
  • 03 - Setup Tools for Data Engineering Essentials/009 Getting Started with pgAdmin on Mac.mp4 13.6 MB
  • 16 - Troubleshooting and Debugging Python Issues/005 Troubleshoot Network Connectivity to the Database Server using telnet.mp4 13.6 MB
  • 28 - Joins using Spark SQL Queries/005 Example - Inner Join along with GROUP BY using Spark SQL Queries.mp4 13.5 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/008 Compute Ranks based on key using SQL.mp4 13.4 MB
  • 19 - Overview of Big Data and Data Lakes/008 Limitations of Hadoop eco system.mp4 13.4 MB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/005 Read Data from Spark Metastore Table to Data Frames.mp4 13.4 MB
  • 23 - Create Delta Tables using Spark SQL/006 Validate Data in Spark Metastore Managed Table.mp4 13.4 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/004 Review Quotas to setup Multinode Hadoop and Spark Cluster.mp4 13.4 MB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/001 Introduction to Data Engineering Essentials Course.mp4 13.2 MB
  • 12 - Python Collections for Data Engineering/014 Extract Details from Complex JSON Arrays using Python.mp4 13.2 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/022 Overview of Spark Application.mp4 13.1 MB
  • 51 - Submitting Python based Spark Applications/007 Review Spark Application Details using Spark UI.mp4 13.1 MB
  • 20 - Overview of Spark and Spark Architecture/001 Overview of Data Processing.mp4 13.1 MB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/001 Getting Started with Performance Tuning using Spark on Databricks.mp4 13.1 MB
  • 16 - Troubleshooting and Debugging Python Issues/018 Managing Breakpoints for Debugging in VS Code.mp4 13.0 MB
  • 03 - Setup Tools for Data Engineering Essentials/004 Configure Environment Variable PATH for Python on Windows.mp4 12.9 MB
  • 10 - Solutions for Basic SQL Queries/003 Validate Data and Review Data Model Diagram.mp4 12.9 MB
  • 07 - SQL Troubleshooting and Debugging Guide/013 Develop Initial Solution based on the requirement.mp4 12.9 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/002 Overview of CTAS to create tables based on Query Results.mp4 12.8 MB
  • 11 - Getting Started with Python/007 Constants and Variables in Python.mp4 12.8 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/013 Stopping the Cluster and Understanding the costs.mp4 12.7 MB
  • 11 - Getting Started with Python/006 Run the Code in VS Code Notebook Cell by Line.mp4 12.6 MB
  • 05 - Writing Basic SQL Queries/008 Filter Data based on Aggregated Results using Group By and Having.mp4 12.6 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/008 Perform Aggregations by Key using Spark Data Frame APIs.mp4 12.6 MB
  • 07 - SQL Troubleshooting and Debugging Guide/006 Current Databases and Users in Postgres Database Server.mp4 12.5 MB
  • 12 - Python Collections for Data Engineering/004 Getting Started with Processing Python Lists.mp4 12.5 MB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/002 Overview of Spark Catalyst Optimizer.mp4 12.5 MB
  • 41 - Performance Tuning of Spark - Cluster Configuration/007 Performance Tuning of Cluster using Auto Scaling.mp4 12.5 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/007 Compute Global Ranks using SQL.mp4 12.4 MB
  • 09 - Exercises for Basic SQL Queries/002 Exercises on Joins and Aggregations using SQL.mp4 12.3 MB
  • 18 - Getting Started with GCP/013 Reinitialize Google Cloud Shell with Project id.mp4 12.3 MB
  • 03 - Setup Tools for Data Engineering Essentials/006 Integrate VSCode with Python on Windows.mp4 12.2 MB
  • 14 - Project 1 - File Format Converter using Python/012 Install Dependencies for the Python Project using pip.mp4 12.1 MB
  • 18 - Getting Started with GCP/004 Overview of Google Cloud Platform or GCP.mp4 12.1 MB
  • 16 - Troubleshooting and Debugging Python Issues/010 Troubleshooting Run Time Errors in Python.mp4 12.1 MB
  • 08 - Performance Tuning of SQL Queries/008 Review the Common Application Scenarios for Performance Tuning.mp4 12.0 MB
  • 16 - Troubleshooting and Debugging Python Issues/004 Overview of Database Connectivity using Python.mp4 11.9 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/012 Getting Started with Spark CLI using SQL.mp4 11.9 MB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/007 Projecting Data in Spark Data Frames using Select.mp4 11.9 MB
  • 41 - Performance Tuning of Spark - Cluster Configuration/003 Setting up All Purpose Databricks Compute Clusters.mp4 11.8 MB
  • 26 - Filtering Data using Spark SQL Queries/005 Dealing with NULLS while Filtering Data in Spark SQL Queries.mp4 11.8 MB
  • 14 - Project 1 - File Format Converter using Python/006 Generate File Paths for Target JSON Files Dynamically.mp4 11.8 MB
  • 17 - Performance Tuning of Python Applications/004 Cleanup the tables to run file to db loader application.mp4 11.8 MB
  • 14 - Project 1 - File Format Converter using Python/003 Get Column Names using Schemas File.mp4 11.7 MB
  • 08 - Performance Tuning of SQL Queries/012 Guidelines on adding Indexes on Tables for SQL Queries.mp4 11.6 MB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/010 Using withColumn to apply transformations on Spark Data Frames.mp4 11.6 MB
  • 16 - Troubleshooting and Debugging Python Issues/011 Overview of Software Development Life Cycle.mp4 11.6 MB
  • 39 - ELT Data Pipelines using Databricks/001 Overview of Databricks Workflows.mp4 11.5 MB
  • 12 - Python Collections for Data Engineering/006 Usage of Lambda Functions.mp4 11.5 MB
  • 14 - Project 1 - File Format Converter using Python/011 Setup Project for File Format Converter using Python.mp4 11.5 MB
  • 08 - Performance Tuning of SQL Queries/014 Conclusion of Performance Tuning of SQL Queries.mp4 11.5 MB
  • 07 - SQL Troubleshooting and Debugging Guide/012 Development Best Practices with tips to troubleshoot SQL bugs.mp4 11.4 MB
  • 02 - Getting Started with SQL for Data Engineering/007 Differences and Similarities between RDBMS and Data Warehouse Technologies.mp4 11.3 MB
  • 31 - Ranking using Spark SQL Windowing Functions/005 Difference Between rank and dense_rank.mp4 11.3 MB
  • 05 - Writing Basic SQL Queries/011 Filter and Aggregate on Join Results using SQL.mp4 11.2 MB
  • 24 - Pre-Defined Functions in Spark SQL/016 Extract Information using Calendar Functions from Date or Timestamp using Spark.mp4 11.2 MB
  • 04 - Setup Application Tables and Data in Postgres Database/003 Overview of Connecting to External Databases using pgAdmin.mp4 11.2 MB
  • 32 - Processing JSON like Data using Spark SQL/001 Overview of JSON.mp4 11.1 MB
  • 37 - Ranking using Pyspark Data Frame APIs/005 Compute Ranks per Partition using Spark Data Frame APIs.mp4 11.1 MB
  • 32 - Processing JSON like Data using Spark SQL/008 Overview of Important Functions to Process JSON Data in Spark SQL.mp4 11.0 MB
  • 19 - Overview of Big Data and Data Lakes/011 Advantages of Modern Data Lakes on Cloud.mp4 11.0 MB
  • 20 - Overview of Spark and Spark Architecture/012 Overview of Spark Glossary.mp4 11.0 MB
  • 05 - Writing Basic SQL Queries/013 Overview of Common Table Expressions or CTEs.mp4 11.0 MB
  • 18 - Getting Started with GCP/010 Overview of Google Cloud Shell.mp4 10.8 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/009 Rules and Restrictions to Filter Data based on Ranks in SQL.mp4 10.8 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/003 Increase GCP VM Quotas for Mutlinode Hadoop and Spark Cluster.mp4 10.6 MB
  • 03 - Setup Tools for Data Engineering Essentials/005 Overview of learning Python using Python CLI.mp4 10.6 MB
  • 12 - Python Collections for Data Engineering/002 Read Data from CSV File into Python List.mp4 10.6 MB
  • 41 - Performance Tuning of Spark - Cluster Configuration/005 Create Multinode Databricks Cluster with Auto Scaling.mp4 10.5 MB
  • 14 - Project 1 - File Format Converter using Python/016 Overview of Environment Variables.mp4 10.5 MB
  • 24 - Pre-Defined Functions in Spark SQL/025 Aggregate using CASE and WHEN in GROUP BY in Spark SQL.mp4 10.4 MB
  • 42 - Performance Tuning while inferring schema from CSV or JSON files/002 Steps to convert CSV or JSON Files to Parquet or Delta Files.mp4 10.4 MB
  • 39 - ELT Data Pipelines using Databricks/007 Import ELT Data Pipeline Applications into Databricks Environment.mp4 10.4 MB
  • 39 - ELT Data Pipelines using Databricks/010 Review Databricks SQL Notebooks for Tables and Final Results.mp4 10.3 MB
  • 24 - Pre-Defined Functions in Spark SQL/023 Overview of Case and When in Spark SQL.mp4 10.2 MB
  • 20 - Overview of Spark and Spark Architecture/009 Overview of Spark Infrastructure.mp4 10.2 MB
  • 11 - Getting Started with Python/008 Overview of Python Data Types.mp4 10.2 MB
  • 21 - Setup Databricks Environment using GCP/003 Create Databricks Workspace on GCP.mp4 10.2 MB
  • 36 - Joining Data using Spark Data Frame APIs/006 Analyze Data for outer joins using Spark Data Frame APIs.mp4 10.2 MB
  • 14 - Project 1 - File Format Converter using Python/007 Recap of Writing Pandas Dataframe to JSON File.mp4 10.2 MB
  • 41 - Performance Tuning of Spark - Cluster Configuration/006 Overview of Auto Scaling of Databricks Clusters.mp4 10.1 MB
  • 28 - Joins using Spark SQL Queries/001 Overview of Joins in Spark SQL Queries.mp4 10.0 MB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/003 Design NYSE Converter Application using Spark SQL and Delta.mp4 10.0 MB
  • 08 - Performance Tuning of SQL Queries/001 Introduction to Performance Tuning of SQL Queries.mp4 10.0 MB
  • 13 - Data Processing using Pandas Dataframe APIs/011 Overview of Writing Pandas Dataframes to Files.mp4 9.8 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/031 Delete Multinode Hadoop and Spark Cluster.mp4 9.8 MB
  • 07 - SQL Troubleshooting and Debugging Guide/011 Overview of Bugs in SQL Queries.mp4 9.8 MB
  • 16 - Troubleshooting and Debugging Python Issues/015 Getting Started with Debugging of Python Programs using VS Code.mp4 9.7 MB
  • 19 - Overview of Big Data and Data Lakes/003 Technologies for Different Types of Databases.mp4 9.6 MB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/005 Advantages of Pyspark Data Frames.mp4 9.6 MB
  • 04 - Setup Application Tables and Data in Postgres Database/006 Register Server in pgAdmin using Application Database and User.mp4 9.6 MB
  • 24 - Pre-Defined Functions in Spark SQL/011 Overview of Standard Date and Timestamp in Spark SQL.mp4 9.6 MB
  • 19 - Overview of Big Data and Data Lakes/004 Volumes for Different Types of Databases.mp4 9.6 MB
  • 16 - Troubleshooting and Debugging Python Issues/008 Overview of Python process to run Python Applications.mp4 9.6 MB
  • 11 - Getting Started with Python/003 Overview of VS Code Notebook Environment.mp4 9.5 MB
  • 19 - Overview of Big Data and Data Lakes/002 Usecases for Different Types of Databases.mp4 9.5 MB
  • 04 - Setup Application Tables and Data in Postgres Database/004 Create Application Database and User in Postgres Database Server.mp4 9.4 MB
  • 04 - Setup Application Tables and Data in Postgres Database/005 Clone Data Sets from Git Repository for Database Scripts.mp4 9.3 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/006 Overview of Ranking in SQL.mp4 9.3 MB
  • 39 - ELT Data Pipelines using Databricks/008 Spark SQL Application to Cleanup Database and Datasets.mp4 9.3 MB
  • 14 - Project 1 - File Format Converter using Python/022 Exception Handling in Python Applications.mp4 9.3 MB
  • 24 - Pre-Defined Functions in Spark SQL/027 Exercises for Pre-defined functions in Spark SQL.mp4 9.3 MB
  • 19 - Overview of Big Data and Data Lakes/010 Implementation of Modern Data Lakes on Cloud.mp4 9.1 MB
  • 18 - Getting Started with GCP/003 Overview of Cloud Platforms.mp4 8.8 MB
  • 07 - SQL Troubleshooting and Debugging Guide/004 Validate Connectivity to Database Server using telnet.mp4 8.7 MB
  • 18 - Getting Started with GCP/014 Overview of Analytics Services on GCP.mp4 8.7 MB
  • 24 - Pre-Defined Functions in Spark SQL/001 Overview of Functions in Spark SQL.mp4 8.7 MB
  • 12 - Python Collections for Data Engineering/013 Overview of Processing JSON Data using Python.mp4 8.4 MB
  • 03 - Setup Tools for Data Engineering Essentials/001 Introduction to Setting up Tools for Data Engineering Essentials.mp4 8.4 MB
  • 17 - Performance Tuning of Python Applications/011 Develop application for multiprocessing.mp4 8.2 MB
  • 32 - Processing JSON like Data using Spark SQL/004 Creating Spark Metastore Tables with Struct Type Columns.mp4 8.1 MB
  • 18 - Getting Started with GCP/007 Sign up for GCP using Google Account.mp4 8.1 MB
  • 03 - Setup Tools for Data Engineering Essentials/008 Getting Started with pgAdmin on Windows.mp4 8.1 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/008 Setup Local Data Sets on Hadoop and Spark Cluster.mp4 8.1 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/012 Create Students table with Data for ranking using SQL.mp4 8.0 MB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/006 Generate Explain Plans on Spark SQL Queries using explain command.mp4 7.9 MB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/008 Using drop to drop columns from Spark Data Frame.mp4 7.9 MB
  • 16 - Troubleshooting and Debugging Python Issues/001 Introduction to Troubleshooting and Debugging Python issues.mp4 7.9 MB
  • 21 - Setup Databricks Environment using GCP/004 Getting Started with Databricks Clusters on GCP.mp4 7.8 MB
  • 21 - Setup Databricks Environment using GCP/005 Getting Started with Databricks Notebook.mp4 7.7 MB
  • 36 - Joining Data using Spark Data Frame APIs/003 Review Syntax for join using Spark Data Frame APIs.mp4 7.4 MB
  • 05 - Writing Basic SQL Queries/002 Define Problem Statement for SQL Queries.mp4 7.3 MB
  • 30 - Copy Query Results into Spark Metastore Tables/002 Query to Compute Daily Revenue using Spark SQL.mp4 7.0 MB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/001 Introduction to Setup Hadoop and Spark Cluster using Dataproc.mp4 6.9 MB
  • 19 - Overview of Big Data and Data Lakes/009 Overview of Modern Data Lakes on Cloud.mp4 6.9 MB
  • 16 - Troubleshooting and Debugging Python Issues/019 Conclusion to Troubleshooting and Debugging Python Issues.mp4 6.8 MB
  • 18 - Getting Started with GCP/009 Overview of GCP Project and Billing.mp4 6.6 MB
  • 02 - Getting Started with SQL for Data Engineering/001 Introduction to SQL for Data Engineering.mp4 6.4 MB
  • 19 - Overview of Big Data and Data Lakes/006 Evolution of Big Data Technologies.mp4 6.3 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/003 Create Tables for Cumulative Aggregations and Ranking.mp4 6.3 MB
  • 22 - Basic Transformations using Spark SQL/004 Exercise to create temporary views using Spark SQL.mp4 6.3 MB
  • 23 - Create Delta Tables using Spark SQL/014 Conclusion of Creating Delta Tables using Spark SQL.mp4 6.2 MB
  • 18 - Getting Started with GCP/006 Create New Google Account using Non Gmail Id.mp4 6.2 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/002 Start the Hadoop and Spark Cluster using Dataproc.mp4 6.2 MB
  • 18 - Getting Started with GCP/005 Overview of Signing for GCP Account.mp4 6.1 MB
  • 30 - Copy Query Results into Spark Metastore Tables/001 Overview of Copying Query Results into Spark Metastore Tables.mp4 6.1 MB
  • 37 - Ranking using Pyspark Data Frame APIs/002 Syntax for ranking using Spark Data Frame APIs.mp4 6.0 MB
  • 03 - Setup Tools for Data Engineering Essentials/010 Conclusion of Setting up Tools for Data Engineering Essentials.mp4 5.9 MB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/001 Overview of Basic Transformations using Pyspark Data Frame APIs.mp4 5.8 MB
  • 14 - Project 1 - File Format Converter using Python/014 Overview of Run-time Arguments and Environment Variables.mp4 5.8 MB
  • 31 - Ranking using Spark SQL Windowing Functions/002 Create Temporary View for ranking using Spark SQL Windowing Functions.mp4 5.8 MB
  • 45 - Recap of important Linux Commands for Data Engineering/001 Introduction to Linux Commands and Scripts for Data Engineers.mp4 5.7 MB
  • 23 - Create Delta Tables using Spark SQL/002 Overview of Supported Providers for Spark Metastore Tables.mp4 5.6 MB
  • 16 - Troubleshooting and Debugging Python Issues/002 Guidelines for Troubleshooting and Debugging Python related Issues.mp4 5.6 MB
  • 18 - Getting Started with GCP/002 Pre-requisite Skills to Sign up for course on GCP Data Analytics.mp4 5.6 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/005 Compute Total Aggregation using OVER and PARTITION BY in SQL Queries.mp4 5.5 MB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/002 Delete Single Node Hadoop and Spark Cluster using Dataproc.mp4 5.3 MB
  • 41 - Performance Tuning of Spark - Cluster Configuration/001 Introduction to Databricks Cluster Configuration.mp4 5.3 MB
  • 24 - Pre-Defined Functions in Spark SQL/003 Overview of String Manipulation Functions in Spark SQL.mp4 5.1 MB
  • 42 - Performance Tuning while inferring schema from CSV or JSON files/001 Overview of Inferring Schema using CSV or JSON Files.mp4 4.7 MB
  • 07 - SQL Troubleshooting and Debugging Guide/001 Introduction to SQL Troubleshooting and Debugging Guide.mp4 4.4 MB
  • 20 - Overview of Spark and Spark Architecture/002 Overview of Data Processing Libraries.mp4 4.4 MB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/001 Introduction to Cumulative Aggregations and Ranking in SQL Queries.mp4 4.4 MB
  • 24 - Pre-Defined Functions in Spark SQL/010 Overview of Date Manipulation Functions in Spark SQL.mp4 4.4 MB
  • 18 - Getting Started with GCP/015 Conclusion to Get Started with GCP for Data Engineering.mp4 4.4 MB
  • 18 - Getting Started with GCP/001 Introduction to Getting Started with GCP.mp4 4.4 MB
  • 19 - Overview of Big Data and Data Lakes/001 Different Types of Databases.mp4 4.3 MB
  • 23 - Create Delta Tables using Spark SQL/001 Introduction to Creating Delta Tables using Spark SQL.mp4 4.3 MB
  • 25 - Setup Spark Metastore Tables for Basic Transformations/001 Introduction to Basic Transformations using Spark SQL.mp4 4.3 MB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/001 Introduction to Integration of Spark SQL and Pyspark Data Frame APIs.mp4 4.2 MB
  • 22 - Basic Transformations using Spark SQL/007 Overview of Pyspark Examples on Databricks.mp4 4.1 MB
  • 33 - Getting Started with Pyspark Data Frame APIs/001 Overview of Pyspark Examples on Databricks.mp4 4.1 MB
  • 21 - Setup Databricks Environment using GCP/001 Overview of Databicks on GCP.mp4 3.6 MB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/001 Introduction to Mastering Hadoop HDFS Commands and Concepts.mp4 3.3 MB
  • 31 - Ranking using Spark SQL Windowing Functions/001 Ranking using Spark SQL Windowing Functions.mp4 3.1 MB
  • 52 - Logging in Python based Spark Applications/001 Introduction to Logging in Python baesd Spark Applications.mp4 3.1 MB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/001 Getting Started with Spark SQL on Hadoop and Spark Cluster.mp4 3.0 MB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/001 Introduction to Building Hive Applications.mp4 2.9 MB
  • 36 - Joining Data using Spark Data Frame APIs/001 Introduction to Joining Data using Spark Data Frame APIs.mp4 2.8 MB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/002 Introduction to Processing JSON like Data using Spark SQL.mp4 2.5 MB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/001 Introduction to Getting Started with Pyspark.mp4 2.2 MB
  • 51 - Submitting Python based Spark Applications/001 Introduction to Submitting Python based Spark Applications.mp4 2.1 MB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/temp/index_47675412.mpd 99.8 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/temp/index_47675346.mpd 83.4 kB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/temp/index_48380214.mpd 79.1 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/temp/index_48380352.mpd 78.5 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/temp/index_47675388.mpd 74.2 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/temp/index_47675414.mpd 66.4 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/009 Computing Overall Capacity of Multinode Hadoop and Spark Clusters.encrypted.m4a.part.frag.urls 65.6 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/009 Computing Overall Capacity of Multinode Hadoop and Spark Clusters.encrypted.mp4.part.frag.urls 65.6 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/temp/index_47675360.mpd 61.6 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/temp/index_47675372.mpd 59.7 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/temp/index_47675362.mpd 57.2 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/temp/index_47675386.mpd 53.6 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/011 Filtering based on Ranks per Partition using Nested Queries and CTEs in SQL.encrypted.m4a.part.frag.urls 52.5 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/011 Filtering based on Ranks per Partition using Nested Queries and CTEs in SQL.encrypted.mp4.part.frag.urls 52.5 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/temp/index_47675378.mpd 44.4 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/temp/index_47681948.mpd 34.6 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/003 Review the side effects of using CSV Files in Data Lake.encrypted.mp4.part.frag.urls 33.0 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/003 Review the side effects of using CSV Files in Data Lake.encrypted.m4a.part.frag.urls 33.0 kB
  • 45 - Recap of important Linux Commands for Data Engineering/002 Overview of SSH to connect to remote Servers_en.srt 32.8 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/temp/index_48380372.mpd 32.8 kB
  • 45 - Recap of important Linux Commands for Data Engineering/005 Understanding PATH Environment Variable_en.srt 29.1 kB
  • 51 - Submitting Python based Spark Applications/012 Submit Spark Applications with dependencies as jars_en.srt 25.3 kB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/006 Populate Data into Delta Lake Tables using Spark SQL_en.srt 24.0 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/007 Setup Project using VS Code Remote Development_en.srt 24.0 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/002 Getting Started with Hive_en.srt 23.6 kB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/007 Populate Data for Additional Years into Delta NYSE Table_en.srt 23.0 kB
  • 04 - Setup Application Tables and Data in Postgres Database/008 Overview of pgAdmin to write SQL Queries_en.srt 21.0 kB
  • 45 - Recap of important Linux Commands for Data Engineering/013 Troubleshooting issues in Linux using grep command_en.srt 20.8 kB
  • 51 - Submitting Python based Spark Applications/010 Deep Dive into Spark Deploy Modes_en.srt 20.5 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/013 Develop Word Count Application using Spark_en.srt 19.7 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/019 Schedule Hive Applications using Cron_en.srt 19.3 kB
  • 51 - Submitting Python based Spark Applications/011 Submit Spark Applications with dependencies as packages_en.srt 19.1 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/014 Redesign the Solution using HDFS to stage files_en.srt 18.9 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/009 Create Partitioned Parquet Table for NYSE Data_en.srt 18.7 kB
  • 24 - Pre-Defined Functions in Spark SQL/018 Overview of Numeric Functions in Spark SQL_en.srt 18.6 kB
  • 10 - Solutions for Basic SQL Queries/005 Solution for Exercise 2 to get Dormant Customers using Outer Join_en.srt 18.4 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/006 Override Run time Hive Configuration Properties and Variables_en.srt 18.1 kB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/003 Spark SQL Metastore Architecture_en.srt 17.7 kB
  • 16 - Troubleshooting and Debugging Python Issues/014 Debug VS Code Notebooks using Debug Feature_en.srt 17.6 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/028 Demo on Spark Dynamic Allocation_en.srt 17.5 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/021 Overview of Shuffling - Part 2_en.srt 17.3 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/018 Recap of HDFS on Dataproc Cluster_en.srt 17.2 kB
  • 45 - Recap of important Linux Commands for Data Engineering/014 Overview of Shell Scripts_en.srt 16.9 kB
  • 16 - Troubleshooting and Debugging Python Issues/013 Overview of Debugging VS Code Notebooks using Debug Feature_en.srt 16.8 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/007 Overview of Multinode Hadoop Cluster_en.srt 16.7 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/025 Run Spark Application with out Adaptive Query Execution_en.srt 16.7 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/010 Populate Data into Partitioned NYSE table from Stage Table_en.srt 16.5 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/013 Review Execution Details for Performance Tuning using Partitioning_en.srt 16.3 kB
  • 45 - Recap of important Linux Commands for Data Engineering/010 Listing Files and Folders using ls command_en.srt 16.2 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/006 Programmatically Copy files into HDFS using Python_en.srt 16.1 kB
  • 23 - Create Delta Tables using Spark SQL/013 Using Merge to Update and Insert into Delta Tables in Spark Metastore_en.srt 16.1 kB
  • 15 - Project 2 - Files to Database Loader/007 Overview of Deploying File to DB Loader Project_en.srt 16.1 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/008 Overview of Multinode Hadoop and Spark Cluster Topology_en.srt 15.9 kB
  • 39 - ELT Data Pipelines using Databricks/012 Build ELT Pipeline using Databricks Job in Workflows_en.srt 15.8 kB
  • 17 - Performance Tuning of Python Applications/003 Ensure Postgres Database is setup for file to db loader Python Application_en.srt 15.8 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/003 Overview of Hadoop and Spark Cluster Types and Architecture_en.srt 15.8 kB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/007 Run Individual Spark SQL Commands_en.srt 15.8 kB
  • 24 - Pre-Defined Functions in Spark SQL/031 Solutions for Exercises 7 and 8 on Pre-defined Functions in Spark SQL_en.srt 15.7 kB
  • 30 - Copy Query Results into Spark Metastore Tables/005 Design Pipeline using CTAS and INSERT in Spark SQL_en.srt 15.7 kB
  • 14 - Project 1 - File Format Converter using Python/019 Use Environment Variables in File Format Converter_en.srt 15.7 kB
  • 45 - Recap of important Linux Commands for Data Engineering/007 Copy Files and Folders in Linux using cp command_en.srt 15.7 kB
  • 14 - Project 1 - File Format Converter using Python/021 Pass Data Sets as Run Time Arguments to File Format Converter_en.srt 15.6 kB
  • 45 - Recap of important Linux Commands for Data Engineering/015 Running and Debugging Shell Scripts with Arguments_en.srt 15.4 kB
  • 28 - Joins using Spark SQL Queries/004 Outer Joins using Spark SQL Queries_en.srt 15.2 kB
  • 24 - Pre-Defined Functions in Spark SQL/030 Solutions for Exercises 5 and 6 on Pre-defined Functions in Spark SQL_en.srt 15.1 kB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/007 Process Data using pyspark Dataframe APIs_en.srt 15.0 kB
  • 24 - Pre-Defined Functions in Spark SQL/007 Trimming Characters or Strings using Spark SQL_en.srt 14.9 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/018 Overview of Scheduling and Crontab_en.srt 14.8 kB
  • 17 - Performance Tuning of Python Applications/009 Review Pandas Data Frame API to load data into the target table_en.srt 14.7 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/027 Overview of Spark Dynamic Allocation_en.srt 14.6 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/023 Overview of Lazy Evaluation_en.srt 14.5 kB
  • 45 - Recap of important Linux Commands for Data Engineering/006 Creating Folders in Linux using mkdir_en.srt 14.5 kB
  • 11 - Getting Started with Python/012 Loops and Conditions in Python_en.srt 14.4 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/010 Overview of local storage of files_en.srt 14.4 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/005 Run Hive Commands using Scripts_en.srt 14.3 kB
  • 07 - SQL Troubleshooting and Debugging Guide/014 Identify and Troubleshoot Bugs in SQL Queries_en.srt 14.2 kB
  • 12 - Python Collections for Data Engineering/003 Overview of Python Collections_en.srt 14.2 kB
  • 45 - Recap of important Linux Commands for Data Engineering/009 Delete Files and Folders in Linux using rm command_en.srt 14.2 kB
  • 37 - Ranking using Pyspark Data Frame APIs/007 Difference Between rank and dense_rank_en.srt 14.0 kB
  • 45 - Recap of important Linux Commands for Data Engineering/012 Standard Directories in Linux_en.srt 13.9 kB
  • 07 - SQL Troubleshooting and Debugging Guide/007 Troubleshoot Database Credentials and Permissions Issues_en.srt 13.9 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/015 Validate Hive Application to Convert NYSE Data_en.srt 13.7 kB
  • 15 - Project 2 - Files to Database Loader/003 Run Queries from Notebook using SQL Magic_en.srt 13.6 kB
  • 39 - ELT Data Pipelines using Databricks/011 Validate Applications for ELT Pipeline using Databricks_en.srt 13.4 kB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/002 Launching or Getting Started Pyspark CLI_en.srt 13.4 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/030 Overview of number of Spark Partitions_en.srt 13.4 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/003 Apply Row Level transformations using Pyspark Data Frame APIs_en.srt 13.4 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/005 Create Folder and Copy Files into HDFS using Commands_en.srt 13.4 kB
  • 08 - Performance Tuning of SQL Queries/004 Review Tables used for Performance Tuning of SQL Queries_en.srt 13.4 kB
  • 24 - Pre-Defined Functions in Spark SQL/017 Dealing with Unix Timestamp using Spark SQL_en.srt 13.3 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/014 Overview of Blocks related to files in HDFS_en.srt 13.3 kB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/006 Create Target Table for NYSE Data using Delta Format_en.srt 13.2 kB
  • 04 - Setup Application Tables and Data in Postgres Database/001 Overview of Postgres Database Server and pgAdmin_en.srt 13.2 kB
  • 12 - Python Collections for Data Engineering/001 Overview of File IO using Python_en.srt 13.1 kB
  • 10 - Solutions for Basic SQL Queries/006 Solution for Exercise 3 to get Revenue Per Customer using Outer Join_en.srt 13.0 kB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/008 Run Spark SQL Scripts using Spark SQL CLI_en.srt 12.9 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/016 Review Environment Properties and Disabling Dynamic Allocation_en.srt 12.9 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/015 Overview of Jobs related to Spark Applications using Spark UI_en.srt 12.9 kB
  • 12 - Python Collections for Data Engineering/009 Sort Python lists using key_en.srt 12.8 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/020 Overview of Shuffling - Part 1_en.srt 12.8 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/013 Run Hive QL Commands using Script for NYSE Loader_en.srt 12.8 kB
  • 05 - Writing Basic SQL Queries/005 Group By Aggregations using SQL Queries_en.srt 12.8 kB
  • 24 - Pre-Defined Functions in Spark SQL/028 Solutions for Exercises 1 and 2 on Pre-defined Functions in Spark SQL_en.srt 12.7 kB
  • 39 - ELT Data Pipelines using Databricks/004 Create and Run First Databricks Job_en.srt 12.7 kB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/005 Create External Stage Table for NYSE CSV Files_en.srt 12.7 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/008 Review Performance Details of Spark Operations on Parquet Files_en.srt 12.6 kB
  • 10 - Solutions for Basic SQL Queries/004 Solution for Exercise 1 to get Customer Order Count_en.srt 12.6 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/007 Getting Started Data Loader for NYSE Data using HIve_en.srt 12.6 kB
  • 17 - Performance Tuning of Python Applications/017 Understanding the concept of Multiprocessing in Python_en.srt 12.5 kB
  • 24 - Pre-Defined Functions in Spark SQL/014 Extract Information from Date or Time using Spark SQL_en.srt 12.5 kB
  • 05 - Writing Basic SQL Queries/014 Outer Join with Additional Conditions in SQL Queries_en.srt 12.5 kB
  • 10 - Solutions for Basic SQL Queries/008 Solution for Exercise 5 to get Product Count Per Department_en.srt 12.5 kB
  • 17 - Performance Tuning of Python Applications/007 Overview of Execution of file to db loader application_en.srt 12.4 kB
  • 15 - Project 2 - Files to Database Loader/006 Write CSV Data from Files to Database Tables in Chunks_en.srt 12.4 kB
  • 22 - Basic Transformations using Spark SQL/008 Process Schema Details in JSON using Pyspark_en.srt 12.4 kB
  • 33 - Getting Started with Pyspark Data Frame APIs/002 Process Schema Details in JSON using Pyspark_en.srt 12.4 kB
  • 13 - Data Processing using Pandas Dataframe APIs/012 Write Pandas Dataframes to JSON Files_en.srt 12.3 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/003 Overview of Hive Architecture_en.srt 12.3 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/009 Getting Started with HDFS Commands to Manage Files_en.srt 12.3 kB
  • 13 - Data Processing using Pandas Dataframe APIs/004 Filter Data in Pandas Dataframe using query_en.srt 12.2 kB
  • 04 - Setup Application Tables and Data in Postgres Database/007 Setup Application Tables and Data in Postgres Database_en.srt 12.2 kB
  • 45 - Recap of important Linux Commands for Data Engineering/016 Overview of Hadoop and Spark Executables_en.srt 12.2 kB
  • 30 - Copy Query Results into Spark Metastore Tables/006 Copy Query Results into Spark Meatstore Tables using MERGE_en.srt 12.2 kB
  • 51 - Submitting Python based Spark Applications/013 Develop Shell Wrappers to submit Spark Applications_en.srt 12.1 kB
  • 08 - Performance Tuning of SQL Queries/011 Add Required Indexes to tune performance of SQL Queries_en.srt 12.1 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/017 Overview of HDFS Namenode for HDFS File Metadata_en.srt 12.1 kB
  • 21 - Setup Databricks Environment using GCP/006 Overview of Databricks on GCP_en.srt 12.1 kB
  • 20 - Overview of Spark and Spark Architecture/013 Understand Spark Key Terms_en.srt 12.1 kB
  • 10 - Solutions for Basic SQL Queries/007 Solution for Exercise 4 to get Revenue Per Category_en.srt 12.0 kB
  • 45 - Recap of important Linux Commands for Data Engineering/003 Overview of Profile in Linux Shell_en.srt 12.0 kB
  • 16 - Troubleshooting and Debugging Python Issues/003 Overview of Database Connectivity using Python Applications_en.srt 12.0 kB
  • 23 - Create Delta Tables using Spark SQL/011 Difference Between Managed and External Spark Metastore Tables_en.srt 12.0 kB
  • 24 - Pre-Defined Functions in Spark SQL/008 Padding Characters to Strings using Spark SQL_en.srt 12.0 kB
  • 16 - Troubleshooting and Debugging Python Issues/017 Debug Python Application using VS Code with breakpoints_en.srt 11.9 kB
  • 07 - SQL Troubleshooting and Debugging Guide/002 Overview of Database Connectivity Issues_en.srt 11.9 kB
  • 51 - Submitting Python based Spark Applications/009 Overview of Execution Process of Spark Applications_en.srt 11.9 kB
  • 12 - Python Collections for Data Engineering/005 Overview of Lambda Functions in Python_en.srt 11.9 kB
  • 27 - Aggregations using Spark SQL Queries/001 Perform Total Aggregations using Spark SQL Queries_en.srt 11.8 kB
  • 39 - ELT Data Pipelines using Databricks/002 Pass Arguments to Databricks Python Notebooks_en.srt 11.8 kB
  • 07 - SQL Troubleshooting and Debugging Guide/005 Troubleshoot Database Connectivity Issue with Correct Host Details_en.srt 11.8 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/017 Develop Shell Wrapper to run Hive Application_en.srt 11.8 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/004 Overview of Integrating Hive Commands with Shell Scripts_en.srt 11.8 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/003 Create Hadoop and Spark Cluster and Setup VS Code Workspace_en.srt 11.8 kB
  • 22 - Basic Transformations using Spark SQL/003 Create Temporary Views using Spark SQL_en.srt 11.7 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/015 Overview of Replication related to files in HDFS_en.srt 11.7 kB
  • 11 - Getting Started with Python/011 Overview of Python Lists_en.srt 11.7 kB
  • 24 - Pre-Defined Functions in Spark SQL/015 Convert Non Standard Dates or Timestamps to Standard Ones using Spark SQL_en.srt 11.7 kB
  • 17 - Performance Tuning of Python Applications/016 Validate File to DB Loader Application with Multiprocessing_en.srt 11.7 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/017 Overriding Spark Executor Instances to tune the performance_en.srt 11.6 kB
  • 05 - Writing Basic SQL Queries/010 Outer Joins using SQL Queries_en.srt 11.6 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/013 Determining Number of Blocks for each file_en.srt 11.6 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/004 Overview of OVER and PARTITION BY Clause in SQL Queries_en.srt 11.6 kB
  • 39 - ELT Data Pipelines using Databricks/006 Create and Run Orchestrated Pipeline using Databricks Job_en.srt 11.5 kB
  • 51 - Submitting Python based Spark Applications/008 Review YARN Logs for Spark Applications in Cluster Mode_en.srt 11.5 kB
  • 20 - Overview of Spark and Spark Architecture/004 Code Examples of Pandas, Dask and Pyspark_en.srt 11.5 kB
  • 17 - Performance Tuning of Python Applications/008 Performance Tuning using Chunksize in Pandas_en.srt 11.5 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/019 Overview of Adaptive Query Execution_en.srt 11.4 kB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/003 Overview of Data Processing using Conventional loops_en.srt 11.4 kB
  • 13 - Data Processing using Pandas Dataframe APIs/010 Sort Data in Pandas Dataframes_en.srt 11.3 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/006 Setup SSH Connectivity and VS Code Workspace using Master Node_en.srt 11.3 kB
  • 31 - Ranking using Spark SQL Windowing Functions/006 Filter on Ranks using Spark SQL Windowing Functions_en.srt 11.2 kB
  • 16 - Troubleshooting and Debugging Python Issues/016 Recap of running File Format Converter application_en.srt 11.2 kB
  • 14 - Project 1 - File Format Converter using Python/010 Wrapper to Process all Data Sets_en.srt 11.2 kB
  • 11 - Getting Started with Python/004 Overview of Cells in VS Code Notebook_en.srt 11.1 kB
  • 04 - Setup Application Tables and Data in Postgres Database/002 Overview of Database Connection Details_en.srt 11.1 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/007 Perform Aggregations by Key using Spark Data Frame APIs_en.srt 11.1 kB
  • 11 - Getting Started with Python/010 Pre-Defined String Manipulation Functions_en.srt 11.1 kB
  • 12 - Python Collections for Data Engineering/011 Read JSON Strings to Python dicts or lists_en.srt 11.1 kB
  • 11 - Getting Started with Python/013 User Defined Functions in Python_en.srt 11.1 kB
  • 11 - Getting Started with Python/002 Setup Notebook Environment in VS Code Workspace_en.srt 11.0 kB
  • 17 - Performance Tuning of Python Applications/001 Introduction to Performance of Python Applications_en.srt 11.0 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/029 Running Spark Application using Dynamic Allocation_en.srt 11.0 kB
  • 20 - Overview of Spark and Spark Architecture/005 Differences between Pandas, Dask and Pyspark_en.srt 11.0 kB
  • 05 - Writing Basic SQL Queries/006 Order of Execution of SQL Queries_en.srt 11.0 kB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/010 Develop Shell Wrapper for Spark SQL Application_en.srt 10.9 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/010 Composite Sorting using Spark Data Frame APIs_en.srt 10.9 kB
  • 23 - Create Delta Tables using Spark SQL/003 Create Database and Review the Details_en.srt 10.9 kB
  • 27 - Aggregations using Spark SQL Queries/005 Filter Data based on Aggregate Results using HAVING in Spark SQL Queries_en.srt 10.9 kB
  • 24 - Pre-Defined Functions in Spark SQL/009 Reverse and Concatenate Strings using Spark SQL_en.srt 10.8 kB
  • 45 - Recap of important Linux Commands for Data Engineering/004 Overview of Environment Variables in Linux_en.srt 10.8 kB
  • 05 - Writing Basic SQL Queries/003 Filtering Data using SQL Queries_en.srt 10.8 kB
  • 28 - Joins using Spark SQL Queries/002 Inner Join using Spark SQL Queries_en.srt 10.7 kB
  • 36 - Joining Data using Spark Data Frame APIs/005 Join and other Spark Data Frame APIs to process the data_en.srt 10.7 kB
  • 11 - Getting Started with Python/001 Setup Visual Studio Workspace for Python Application Development_en.srt 10.7 kB
  • 27 - Aggregations using Spark SQL Queries/004 Order of Execution of Spark SQL Queries_en.srt 10.7 kB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/007 Detailed outline of Spark SQL Topics in the course_en.srt 10.7 kB
  • 45 - Recap of important Linux Commands for Data Engineering/008 Move Files and Folders in Linux using mv command_en.srt 10.7 kB
  • 36 - Joining Data using Spark Data Frame APIs/007 Left Outer Join using Spark Data Frame APIs_en.srt 10.6 kB
  • 11 - Getting Started with Python/009 Getting help on Python Variables and Functions_en.srt 10.6 kB
  • 12 - Python Collections for Data Engineering/012 Read JSON Schemas from file to Python dicts_en.srt 10.6 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/012 Solutions on Airlines Data for Performance Tuning using Partitioning_en.srt 10.6 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/009 Overview of Columnar File Formats in Spark and Databricks_en.srt 10.5 kB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/009 Applying functions on Spark Data Frame Columns_en.srt 10.5 kB
  • 41 - Performance Tuning of Spark - Cluster Configuration/004 Understand the size of the data using dbutils_en.srt 10.5 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/005 Validate SSH Connectivity to the Dataproc Cluster_en.srt 10.5 kB
  • 37 - Ranking using Pyspark Data Frame APIs/001 Introduction to Ranking using Spark Data Frame APIs_en.srt 10.4 kB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/009 Validate Spark SQL Application for NYSE Data Conversion_en.srt 10.4 kB
  • 19 - Overview of Big Data and Data Lakes/007 Overview of Data Lake using Hadoop eco system_en.srt 10.4 kB
  • 23 - Create Delta Tables using Spark SQL/012 Perform CRUD Operations on Delta Tables in Spark Metastore_en.srt 10.4 kB
  • 11 - Getting Started with Python/005 Defining Functions in VS Code Notebooks_en.srt 10.4 kB
  • 14 - Project 1 - File Format Converter using Python/002 Get File Names to be processed using glob_en.srt 10.4 kB
  • 42 - Performance Tuning while inferring schema from CSV or JSON files/004 Overview of overhead for inferring schema_en.srt 10.3 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/026 Run Spark Application using Adaptive Query Execution_en.srt 10.3 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/014 Get Airlines Data using Date Range without Partition Pruning_en.srt 10.3 kB
  • 21 - Setup Databricks Environment using GCP/012 Overview of Databricks CLI Commands_en.srt 10.2 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/016 Physical Storage of HDFS File Blocks_en.srt 10.1 kB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/003 Overview of SQL topics covered in the course_en.srt 10.1 kB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/004 Overview of Python topics covered in the course_en.srt 10.1 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/009 Sorting Data using Spark Data Frame APIs_en.srt 10.1 kB
  • 24 - Pre-Defined Functions in Spark SQL/026 Word Count Query using Pre-defined Functions in Spark SQL_en.srt 10.1 kB
  • 15 - Project 2 - Files to Database Loader/005 Write CSV Data from File to Database Table_en.srt 10.0 kB
  • 22 - Basic Transformations using Spark SQL/009 Create Dataframe with Schema from JSON File using Pyspark_en.srt 10.0 kB
  • 33 - Getting Started with Pyspark Data Frame APIs/003 Create Dataframe with Schema from JSON File using Pyspark_en.srt 10.0 kB
  • 24 - Pre-Defined Functions in Spark SQL/021 Replace Null Values with default values using nvl and coalesce in Spark SQL_en.srt 10.0 kB
  • 22 - Basic Transformations using Spark SQL/005 Spark SQL Query to compute Daily Product Revenue_en.srt 10.0 kB
  • 25 - Setup Spark Metastore Tables for Basic Transformations/003 Projecting Data using Spark SQL_en.srt 10.0 kB
  • 20 - Overview of Spark and Spark Architecture/010 Overview of Spark Cluster using Databricks_en.srt 10.0 kB
  • 52 - Logging in Python based Spark Applications/004 Getting Started with logging using Python_en.srt 10.0 kB
  • 05 - Writing Basic SQL Queries/012 Overview of Database Views_en.srt 10.0 kB
  • 11 - Getting Started with Python/006 Run the Code in VS Code Notebook Cell by Line_en.srt 9.9 kB
  • 17 - Performance Tuning of Python Applications/002 Setup Database Loader Python Application_en.srt 9.9 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/005 Filtering Data with Multiple Conditions using Pyspark Data Frame APIs_en.srt 9.9 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/013 Difference between rank and dense rank using SQL_en.srt 9.9 kB
  • 12 - Python Collections for Data Engineering/007 Filter Data in Python Lists using filter and lambda_en.srt 9.9 kB
  • 12 - Python Collections for Data Engineering/008 Get unique values from list using map and set_en.srt 9.9 kB
  • 17 - Performance Tuning of Python Applications/013 Invoking User Defined Functions using multiprocessing in Python_en.srt 9.9 kB
  • 21 - Setup Databricks Environment using GCP/008 Setup Databricks CLI on Mac or Windows_en.srt 9.8 kB
  • 11 - Getting Started with Python/007 Constants and Variables in Python_en.srt 9.8 kB
  • 24 - Pre-Defined Functions in Spark SQL/019 Data Type Conversion using Spark SQL_en.srt 9.8 kB
  • 17 - Performance Tuning of Python Applications/005 Run and Validate File to DB Loader Application_en.srt 9.8 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/008 Review Important Properties of HDFS_en.srt 9.7 kB
  • 36 - Joining Data using Spark Data Frame APIs/004 Inner Join using Spark Data Frame APIs_en.srt 9.7 kB
  • 21 - Setup Databricks Environment using GCP/010 Configure Databricks CLI on Mac or Windows_en.srt 9.7 kB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/004 Overview of Data Frame Concepts_en.srt 9.7 kB
  • 45 - Recap of important Linux Commands for Data Engineering/011 Searching for files using find command in Linux_en.srt 9.7 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/012 Overview of Distributed Storage of files in HDFS_en.srt 9.7 kB
  • 02 - Getting Started with SQL for Data Engineering/004 Overview of Purpose Built Databases_en.srt 9.6 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/002 Overview of different Spark Platforms on Cloud_en.srt 9.6 kB
  • 51 - Submitting Python based Spark Applications/004 Specify Paths using Environment Variables in Spark Applications_en.srt 9.6 kB
  • 14 - Project 1 - File Format Converter using Python/004 Get Data Set Names from File Names or Paths using regular expressions_en.srt 9.6 kB
  • 52 - Logging in Python based Spark Applications/005 Changing the Log Message Format using logging_en.srt 9.5 kB
  • 08 - Performance Tuning of SQL Queries/013 Interpreting the explain plan for SQL Queries using Indexes_en.srt 9.5 kB
  • 07 - SQL Troubleshooting and Debugging Guide/010 Troubleshooting Semantec Errors in SQL Queries_en.srt 9.5 kB
  • 19 - Overview of Big Data and Data Lakes/008 Limitations of Hadoop eco system_en.srt 9.5 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/011 Evaluate Requirements against Partition Pruning_en.srt 9.4 kB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/005 Getting Started with Pyspark for Data Processing_en.srt 9.4 kB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/009 Understand Filter and Broadcast of Orders Data_en.srt 9.4 kB
  • 14 - Project 1 - File Format Converter using Python/020 Pass JSON Array as argument to Python Applications_en.srt 9.4 kB
  • 07 - SQL Troubleshooting and Debugging Guide/008 Overview of Compilation of SQL Queries_en.srt 9.4 kB
  • 51 - Submitting Python based Spark Applications/003 Run Spark Application using spark-submit_en.srt 9.4 kB
  • 24 - Pre-Defined Functions in Spark SQL/022 Conditional Logic on Null Values using nvl2 and case in Spark SQL_en.srt 9.4 kB
  • 08 - Performance Tuning of SQL Queries/003 Generate Explain Plans for SQL Queries_en.srt 9.4 kB
  • 02 - Getting Started with SQL for Data Engineering/003 Overview of Database Technologies and relevance of SQL_en.srt 9.4 kB
  • 13 - Data Processing using Pandas Dataframe APIs/005 Get Count by Status using Pandas Dataframe APIs_en.srt 9.3 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/013 Stopping the Cluster and Understanding the costs_en.srt 9.3 kB
  • 12 - Python Collections for Data Engineering/016 Create Function to get Column Details from Schemas JSON File_en.srt 9.3 kB
  • 31 - Ranking using Spark SQL Windowing Functions/003 Compute Global Rank using Spark SQL Windowing Functions_en.srt 9.2 kB
  • 08 - Performance Tuning of SQL Queries/007 Interpret Explain Plans for Basic SQL Queries_en.srt 9.2 kB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/002 Run Spark SQL Queries on Spark Data Frames_en.srt 9.2 kB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/007 Manage Spark Metastore Database Objects using Spark APIs_en.srt 9.2 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/012 Review Data Set with Nulls for Sorting using Spark Data Frame APIs_en.srt 9.2 kB
  • 14 - Project 1 - File Format Converter using Python/009 Modularize File Format Converter for Dataset_en.srt 9.2 kB
  • 15 - Project 2 - Files to Database Loader/004 Validate Pandas and SQL Integration_en.srt 9.2 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/017 Redesign Partition Strategy to tune the performance_en.srt 9.1 kB
  • 21 - Setup Databricks Environment using GCP/007 High level architecture of Databricks_en.srt 9.1 kB
  • 10 - Solutions for Basic SQL Queries/002 Solutions for Filtering and Aggregations_en.srt 9.1 kB
  • 14 - Project 1 - File Format Converter using Python/015 Using Run Time Arguments in Python Applications_en.srt 9.1 kB
  • 05 - Writing Basic SQL Queries/004 Total Aggregations using SQL Queries_en.srt 9.1 kB
  • 52 - Logging in Python based Spark Applications/002 Run Application without logging_en.srt 9.1 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/002 Overview of Row Level Transformations_en.srt 9.1 kB
  • 36 - Joining Data using Spark Data Frame APIs/002 Create Data Frames to Join using Spark Data Frame APIs_en.srt 9.1 kB
  • 42 - Performance Tuning while inferring schema from CSV or JSON files/005 Performance Tuning to infer schema of Spark Dataframe_en.srt 9.1 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/008 Design NYSE Data Loader Application_en.srt 9.1 kB
  • 21 - Setup Databricks Environment using GCP/011 Troubleshoot issues to configure Databricks CLI_en.srt 9.0 kB
  • 37 - Ranking using Pyspark Data Frame APIs/003 Compute Global Ranks using Spark Data Frame APIs_en.srt 9.0 kB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/006 Overview of Spark Data Frames and their Characteristics_en.srt 9.0 kB
  • 08 - Performance Tuning of SQL Queries/005 Review Data Storage Internals for Tables and Indexes_en.srt 9.0 kB
  • 20 - Overview of Spark and Spark Architecture/001 Overview of Data Processing_en.srt 9.0 kB
  • 16 - Troubleshooting and Debugging Python Issues/006 Troubleshoot Module Related issues for Database Connectivity using Python_en.srt 9.0 kB
  • 17 - Performance Tuning of Python Applications/012 Getting Started with Multiprocessing using Python_en.srt 9.0 kB
  • 39 - ELT Data Pipelines using Databricks/005 Run Databricks Jobs and Tasks with Parameters_en.srt 9.0 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/011 Setup Data Sets to understand HDFS Concepts_en.srt 9.0 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/011 Process NYSE Data and load into partitioned table_en.srt 8.9 kB
  • 51 - Submitting Python based Spark Applications/002 Develop Pyspark Application for Daily Revenue_en.srt 8.9 kB
  • 08 - Performance Tuning of SQL Queries/009 Write SQL Queries for Customer Orders_en.srt 8.9 kB
  • 14 - Project 1 - File Format Converter using Python/018 Use Environment Variables in Python Applications_en.srt 8.9 kB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/006 Process Data in Spark Metastore Tables using Data Frame APIs_en.srt 8.9 kB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/007 Interpreting Explain Plan for Spark SQL Query_en.srt 8.8 kB
  • 39 - ELT Data Pipelines using Databricks/009 Review File Format Converter Pyspark Code_en.srt 8.8 kB
  • 05 - Writing Basic SQL Queries/009 Inner Joins using SQL Queries_en.srt 8.8 kB
  • 23 - Create Delta Tables using Spark SQL/004 Create and Review Managed Spark Metastore Table using Delta Format_en.srt 8.7 kB
  • 20 - Overview of Spark and Spark Architecture/008 Overview of Spark Key Features and Platforms_en.srt 8.7 kB
  • 39 - ELT Data Pipelines using Databricks/013 Run and Review Execution details of ELT Data Pipeline using Databricks Job_en.srt 8.7 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/012 Getting Started with Spark CLI using SQL_en.srt 8.7 kB
  • 16 - Troubleshooting and Debugging Python Issues/009 Troubleshooting Compilation Errors in Python_en.srt 8.7 kB
  • 32 - Processing JSON like Data using Spark SQL/003 Dealing with Array Type Columns using Spark SQL Queries_en.srt 8.7 kB
  • 36 - Joining Data using Spark Data Frame APIs/009 Equivalent Spark SQL Queries for Joins_en.srt 8.7 kB
  • 41 - Performance Tuning of Spark - Cluster Configuration/002 Difference Between All Purpose and Jobs Clusters_en.srt 8.7 kB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/002 Review the Requirements and Datasets for NYSE Data_en.srt 8.6 kB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/003 Create Spark Metastore Tables using Data Frames_en.srt 8.6 kB
  • 23 - Create Delta Tables using Spark SQL/005 Copy Data into Spark Metastore Managed Table_en.srt 8.6 kB
  • 24 - Pre-Defined Functions in Spark SQL/024 Using CASE and WHEN for conditional logic in Spark SQL_en.srt 8.6 kB
  • 24 - Pre-Defined Functions in Spark SQL/029 Solutions for Exercises 3 and 4 on Pre-defined Functions in Spark SQL_en.srt 8.6 kB
  • 21 - Setup Databricks Environment using GCP/014 Setup Data Sets in DBFS using Databricks CLI Commands_en.srt 8.6 kB
  • 05 - Writing Basic SQL Queries/007 Rules and Restrictions to Group and Filter Data in SQL queries_en.srt 8.5 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/007 Review Multi Node Hadoop and Spark Clusters using Web Interfaces_en.srt 8.5 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/010 Filtering based on Global Ranks using Nested Queries and CTEs in SQL_en.srt 8.5 kB
  • 41 - Performance Tuning of Spark - Cluster Configuration/003 Setting up All Purpose Databricks Compute Clusters_en.srt 8.5 kB
  • 05 - Writing Basic SQL Queries/001 Review Data Model Diagram_en.srt 8.5 kB
  • 20 - Overview of Spark and Spark Architecture/006 Overview of Distributed Computing_en.srt 8.5 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/011 Overview of Spark History Server UI_en.srt 8.5 kB
  • 28 - Joins using Spark SQL Queries/003 Concepts Behind Inner Joins in Spark SQL_en.srt 8.4 kB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/011 Overview of Writing Data in Data Frame to Delta Files_en.srt 8.4 kB
  • 13 - Data Processing using Pandas Dataframe APIs/007 Create Dataframes using dynamic column list on CSV Data_en.srt 8.4 kB
  • 23 - Create Delta Tables using Spark SQL/010 Overview of Spark Metastore_en.srt 8.4 kB
  • 27 - Aggregations using Spark SQL Queries/003 GROUP BY Examples using Spark SQL Queries_en.srt 8.4 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/016 Deploy Hive Application in HDFS_en.srt 8.4 kB
  • 14 - Project 1 - File Format Converter using Python/005 Read CSV Data into Pandas Dataframe with Schema Dynamically_en.srt 8.4 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/020 Recap of Application Development Life Cycle using Hive_en.srt 8.4 kB
  • 41 - Performance Tuning of Spark - Cluster Configuration/009 Setup Databricks Job Compute Clusters using Workflows_en.srt 8.4 kB
  • 13 - Data Processing using Pandas Dataframe APIs/009 Perform Aggregations on Join results_en.srt 8.3 kB
  • 22 - Basic Transformations using Spark SQL/002 Getting Started with Spark SQL Example using Databricks_en.srt 8.3 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/011 Filtering based on Ranks per Partition using Nested Queries and CTEs in SQL_en.srt 8.3 kB
  • 26 - Filtering Data using Spark SQL Queries/001 Filtering Data using Equal Condition in Spark SQL_en.srt 8.3 kB
  • 03 - Setup Tools for Data Engineering Essentials/003 Setup Python 3.9 on Windows_en.srt 8.3 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/012 Develop Hive QL Script to Load NYSE Data_en.srt 8.3 kB
  • 17 - Performance Tuning of Python Applications/010 Overview of multi or batch insert into Database Tables_en.srt 8.3 kB
  • 21 - Setup Databricks Environment using GCP/009 Overview of Databricks CLI and other clients_en.srt 8.3 kB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/005 Launch Spark SQL CLI with Delta Lake Packages_en.srt 8.3 kB
  • 10 - Solutions for Basic SQL Queries/001 Solutions for Filtering and Aggregations_en.srt 8.2 kB
  • 32 - Processing JSON like Data using Spark SQL/009 Generate Array Type Columns from Regular Columns in Spark SQL_en.srt 8.2 kB
  • 11 - Getting Started with Python/003 Overview of VS Code Notebook Environment_en.srt 8.2 kB
  • 30 - Copy Query Results into Spark Metastore Tables/003 Copy Query Results into Spark Metastore Tables using CTAS_en.srt 8.2 kB
  • 32 - Processing JSON like Data using Spark SQL/001 Overview of JSON_en.srt 8.2 kB
  • 21 - Setup Databricks Environment using GCP/003 Create Databricks Workspace on GCP_en.srt 8.2 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/004 Filtering Data using Pyspark Data Frame APIs_en.srt 8.2 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/014 Develop and Validate Shell Script for Word Count_en.srt 8.2 kB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/008 Detailed outline of Pyspark Topics in the course_en.srt 8.2 kB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/002 Getting Started with Data Sets and Spark SQL CLI_en.srt 8.1 kB
  • 03 - Setup Tools for Data Engineering Essentials/009 Getting Started with pgAdmin on Mac_en.srt 8.1 kB
  • 14 - Project 1 - File Format Converter using Python/017 Setting Environment Variables on Windows or Mac or Linux_en.srt 8.1 kB
  • 42 - Performance Tuning while inferring schema from CSV or JSON files/003 Overview of CSV or JSON Files_en.srt 8.1 kB
  • 07 - SQL Troubleshooting and Debugging Guide/015 Develop Solution using Development Best Practices_en.srt 8.1 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/004 Restructure CSV Data to Columnar Format using Pyspark_en.srt 8.1 kB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/004 Overview of Spark Metastore Warehouse Directory_en.srt 8.1 kB
  • 20 - Overview of Spark and Spark Architecture/003 Setup Environment to explore Pandas, Dask and Pyspark_en.srt 8.1 kB
  • 22 - Basic Transformations using Spark SQL/001 Process Data in DBFS using Databricks Spark SQL_en.srt 8.0 kB
  • 22 - Basic Transformations using Spark SQL/012 Convert CSV to Parquet with Schema using Pyspark_en.srt 8.0 kB
  • 33 - Getting Started with Pyspark Data Frame APIs/006 Convert CSV to Parquet with Schema using Pyspark_en.srt 8.0 kB
  • 07 - SQL Troubleshooting and Debugging Guide/006 Current Databases and Users in Postgres Database Server_en.srt 8.0 kB
  • 17 - Performance Tuning of Python Applications/018 Performance Tuning Scenarios of Python Applications_en.srt 8.0 kB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/006 Read Orders and Order Items Data into Spark Data Frames_en.srt 8.0 kB
  • 24 - Pre-Defined Functions in Spark SQL/013 Overview of trunc and date_trunc in Spark SQL_en.srt 7.9 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/016 Add Condition for Partition Pruning_en.srt 7.9 kB
  • 13 - Data Processing using Pandas Dataframe APIs/001 Overview of Pandas for Data Processing_en.srt 7.9 kB
  • 03 - Setup Tools for Data Engineering Essentials/007 Install Postgres 14 on Windows 11_en.srt 7.9 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/010 Determine Overall YARN Capacity_en.srt 7.8 kB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/010 Understand Join and Aggregation for Daily Product Revenue_en.srt 7.8 kB
  • 07 - SQL Troubleshooting and Debugging Guide/003 Validate and Setup Telnet on Mac or PC_en.srt 7.8 kB
  • 17 - Performance Tuning of Python Applications/014 Refactor File to Database Loader Application for Multiprocessing_en.srt 7.8 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/006 Perform Aggregations by Key using Spark Data Frame APIs_en.srt 7.8 kB
  • 13 - Data Processing using Pandas Dataframe APIs/002 Overview of Reading CSV Data using Pandas_en.srt 7.7 kB
  • 13 - Data Processing using Pandas Dataframe APIs/006 Get count by Month and Status using Pandas Dataframe APIs_en.srt 7.7 kB
  • 13 - Data Processing using Pandas Dataframe APIs/003 Read Data from CSV Files to Pandas Dataframes_en.srt 7.7 kB
  • 23 - Create Delta Tables using Spark SQL/007 Create and Review External Spark Metastore Table using Delta Format_en.srt 7.7 kB
  • 09 - Exercises for Basic SQL Queries/001 Simple Exercises for Filtering and Aggregations_en.srt 7.7 kB
  • 52 - Logging in Python based Spark Applications/008 Validate Logging of Spark Application using Cluster Mode_en.srt 7.7 kB
  • 32 - Processing JSON like Data using Spark SQL/007 Dealing with Array of Struct Type Columns using Spark SQL Queries_en.srt 7.7 kB
  • 17 - Performance Tuning of Python Applications/015 Add Parallel Processing to file to db loader Python Application_en.srt 7.7 kB
  • 26 - Filtering Data using Spark SQL Queries/002 Using IN, LIKE and BETWEEN in Spark SQL Queries_en.srt 7.7 kB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/008 Develop Spark SQL Application for NYSE Data Conversion_en.srt 7.7 kB
  • 24 - Pre-Defined Functions in Spark SQL/005 Extract Substring using substr in Spark SQL_en.srt 7.7 kB
  • 27 - Aggregations using Spark SQL Queries/002 Overview of Aggregations using GROUP BY in Spark SQL Queries_en.srt 7.7 kB
  • 14 - Project 1 - File Format Converter using Python/024 Exception Handling in File Format Converter Application_en.srt 7.6 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/014 Download VS Code Workspace and Delete Cluster_en.srt 7.6 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/009 Computing Overall Capacity of Multinode Hadoop and Spark Clusters_en.srt 7.6 kB
  • 12 - Python Collections for Data Engineering/015 Sort Data in JSON Arrays using Python_en.srt 7.6 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/002 Side effects of using CSV Files in Data Lake_en.srt 7.6 kB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/001 Create Spark Data Frame using Pyspark Data Frame APIs_en.srt 7.5 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/018 Recap of Spark Performance Tuning Scenarios_en.srt 7.5 kB
  • 32 - Processing JSON like Data using Spark SQL/002 Creating Spark Metastore Tables with Array Type Columns_en.srt 7.5 kB
  • 36 - Joining Data using Spark Data Frame APIs/008 Right Outer Join using Spark Data Frame APIs_en.srt 7.5 kB
  • 25 - Setup Spark Metastore Tables for Basic Transformations/002 Prepare Spark Metastore Tables for Basic Transformations_en.srt 7.5 kB
  • 18 - Getting Started with GCP/011 Install Google Cloud SDK on Windows_en.srt 7.5 kB
  • 07 - SQL Troubleshooting and Debugging Guide/004 Validate Connectivity to Database Server using telnet_en.srt 7.5 kB
  • 37 - Ranking using Pyspark Data Frame APIs/004 Filter Based on Global Ranks using Spark Data Frame APIs_en.srt 7.5 kB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/010 Overview of Performance Tuning of Spark covered in the course_en.srt 7.5 kB
  • 03 - Setup Tools for Data Engineering Essentials/006 Integrate VSCode with Python on Windows_en.srt 7.5 kB
  • 26 - Filtering Data using Spark SQL Queries/003 Filter Data using Boolean AND in Spark SQL Queries_en.srt 7.5 kB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/006 Overview of Spark and Databricks Environment related topics_en.srt 7.5 kB
  • 13 - Data Processing using Pandas Dataframe APIs/008 Performing Inner Join between Pandas Dataframes_en.srt 7.4 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/002 Overview of CTAS to create tables based on Query Results_en.srt 7.4 kB
  • 28 - Joins using Spark SQL Queries/001 Overview of Joins in Spark SQL Queries_en.srt 7.4 kB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/004 Insert into Spark Metastore Tables using Data Frames_en.srt 7.4 kB
  • 22 - Basic Transformations using Spark SQL/006 Save Query Result to DBFS using Spark SQL_en.srt 7.4 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/010 Getting Started with Spark CLI using Python_en.srt 7.4 kB
  • 32 - Processing JSON like Data using Spark SQL/011 Processing Delimited Strings using Spark SQL Queries_en.srt 7.4 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/013 Dealing with Nulls while Sorting the Data in Spark Data Frames_en.srt 7.4 kB
  • 21 - Setup Databricks Environment using GCP/002 Signing up for Databricks on GCP_en.srt 7.3 kB
  • 16 - Troubleshooting and Debugging Python Issues/004 Overview of Database Connectivity using Python_en.srt 7.3 kB
  • 19 - Overview of Big Data and Data Lakes/005 Overview of Big Data_en.srt 7.3 kB
  • 04 - Setup Application Tables and Data in Postgres Database/003 Overview of Connecting to External Databases using pgAdmin_en.srt 7.3 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/006 Run Operations on Partitioned Parquet Data_en.srt 7.3 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/010 Overview of Folder Structure for Partitioned Data in Spark_en.srt 7.3 kB
  • 51 - Submitting Python based Spark Applications/006 Run Spark Application with Environment Variables in Cluster Mode_en.srt 7.3 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/008 Compute Ranks based on key using SQL_en.srt 7.3 kB
  • 32 - Processing JSON like Data using Spark SQL/005 Projecting Data From Struct Type Fields in Spark SQL_en.srt 7.3 kB
  • 07 - SQL Troubleshooting and Debugging Guide/013 Develop Initial Solution based on the requirement_en.srt 7.3 kB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/004 Copy Files into HDFS for NYSE Converter_en.srt 7.2 kB
  • 52 - Logging in Python based Spark Applications/003 Overview of Logging Concepts such as Log Levels_en.srt 7.2 kB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/003 Overview of Spark Properties Files_en.srt 7.2 kB
  • 14 - Project 1 - File Format Converter using Python/003 Get Column Names using Schemas File_en.srt 7.2 kB
  • 26 - Filtering Data using Spark SQL Queries/004 Filter Data using Boolean OR in Spark SQL Queries_en.srt 7.2 kB
  • 22 - Basic Transformations using Spark SQL/010 Transform Data using Spark APIs_en.srt 7.1 kB
  • 33 - Getting Started with Pyspark Data Frame APIs/004 Transform Data using Spark APIs_en.srt 7.1 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/008 Perform Aggregations by Key using Spark Data Frame APIs_en.srt 7.1 kB
  • 07 - SQL Troubleshooting and Debugging Guide/009 Troubleshooting Syntax Errors in SQL Queries_en.srt 7.1 kB
  • 29 - Sorting using Spark SQL Queries/001 Sorting Data using Spark SQL Queries_en.srt 7.1 kB
  • 08 - Performance Tuning of SQL Queries/002 Overview of SQL Compilation Process and Explain Plans_en.srt 7.1 kB
  • 24 - Pre-Defined Functions in Spark SQL/012 Date Arithmetic using Spark SQL Functions_en.srt 7.1 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/007 Compute Global Ranks using SQL_en.srt 7.1 kB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/008 Overview of Spark Submit Command_en.srt 7.0 kB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/002 Overview of our support to Data Engineering Essentials course_en.srt 7.0 kB
  • 08 - Performance Tuning of SQL Queries/010 Performance Testing of SQL Queries using Stored Procedure_en.srt 7.0 kB
  • 24 - Pre-Defined Functions in Spark SQL/006 Extract Substrings from Delimited Strings using split in Spark SQL_en.srt 7.0 kB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/005 Read Data from Spark Metastore Table to Data Frames_en.srt 7.0 kB
  • 18 - Getting Started with GCP/003 Overview of Cloud Platforms_en.srt 7.0 kB
  • 31 - Ranking using Spark SQL Windowing Functions/005 Difference Between rank and dense_rank_en.srt 7.0 kB
  • 32 - Processing JSON like Data using Spark SQL/010 Generate Array of Struct Type Columns from Regular Columns in Spark SQL_en.srt 7.0 kB
  • 14 - Project 1 - File Format Converter using Python/008 Write Pandas Dataframe to JSON Files_en.srt 7.0 kB
  • 14 - Project 1 - File Format Converter using Python/011 Setup Project for File Format Converter using Python_en.srt 6.9 kB
  • 41 - Performance Tuning of Spark - Cluster Configuration/007 Performance Tuning of Cluster using Auto Scaling_en.srt 6.9 kB
  • 05 - Writing Basic SQL Queries/008 Filter Data based on Aggregated Results using Group By and Having_en.srt 6.9 kB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/004 Review Data Sets to explore Pyspark APIs_en.srt 6.9 kB
  • 14 - Project 1 - File Format Converter using Python/013 Add Core Logic to Python Application_en.srt 6.9 kB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/002 Overview of Spark Catalyst Optimizer_en.srt 6.9 kB
  • 11 - Getting Started with Python/008 Overview of Python Data Types_en.srt 6.8 kB
  • 14 - Project 1 - File Format Converter using Python/016 Overview of Environment Variables_en.srt 6.8 kB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/007 Projecting Data in Spark Data Frames using Select_en.srt 6.8 kB
  • 07 - SQL Troubleshooting and Debugging Guide/012 Development Best Practices with tips to troubleshoot SQL bugs_en.srt 6.8 kB
  • 39 - ELT Data Pipelines using Databricks/010 Review Databricks SQL Notebooks for Tables and Final Results_en.srt 6.8 kB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/005 Generate Explain Plans on Spark Dataframes using explain function_en.srt 6.8 kB
  • 03 - Setup Tools for Data Engineering Essentials/004 Configure Environment Variable PATH for Python on Windows_en.srt 6.7 kB
  • 14 - Project 1 - File Format Converter using Python/023 Raising Exceptions in Python Applications_en.srt 6.7 kB
  • 19 - Overview of Big Data and Data Lakes/011 Advantages of Modern Data Lakes on Cloud_en.srt 6.7 kB
  • 19 - Overview of Big Data and Data Lakes/002 Usecases for Different Types of Databases_en.srt 6.7 kB
  • 32 - Processing JSON like Data using Spark SQL/006 Creating Spark Metastore Tables with Array of Struct Column_en.srt 6.7 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/012 Generate Test Data for Spark Performance Tuning_en.srt 6.7 kB
  • 20 - Overview of Spark and Spark Architecture/007 Overview of Official Documentation of Apache Spark_en.srt 6.7 kB
  • 08 - Performance Tuning of SQL Queries/006 Review key terms used in Explain Plans for SQL Queries_en.srt 6.6 kB
  • 22 - Basic Transformations using Spark SQL/011 Get Schema Details for all Data Sets using Pyspark_en.srt 6.6 kB
  • 05 - Writing Basic SQL Queries/015 Explanation about Fix of SQL Queries with Filtering on Outer Join Results_en.srt 6.6 kB
  • 33 - Getting Started with Pyspark Data Frame APIs/005 Get Schema Details for all Data Sets using Pyspark_en.srt 6.6 kB
  • 16 - Troubleshooting and Debugging Python Issues/010 Troubleshooting Run Time Errors in Python_en.srt 6.6 kB
  • 12 - Python Collections for Data Engineering/010 Overview of JSON Strings and Files_en.srt 6.6 kB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/003 Review Explain Plan for Spark Dataframe logic using Spark UI_en.srt 6.6 kB
  • 30 - Copy Query Results into Spark Metastore Tables/004 Copy Query Results into Spark Metastore Tables using INSERT_en.srt 6.6 kB
  • 02 - Getting Started with SQL for Data Engineering/002 Overview of Application Architecture and RDBMS_en.srt 6.5 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/022 Overview of Spark Application_en.srt 6.5 kB
  • 05 - Writing Basic SQL Queries/013 Overview of Common Table Expressions or CTEs_en.srt 6.5 kB
  • 28 - Joins using Spark SQL Queries/008 Example - Filtering and Outer Joins along with GROUP BY in Spark SQL Queries_en.srt 6.5 kB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/003 Design NYSE Converter Application using Spark SQL and Delta_en.srt 6.5 kB
  • 32 - Processing JSON like Data using Spark SQL/008 Overview of Important Functions to Process JSON Data in Spark SQL_en.srt 6.5 kB
  • 24 - Pre-Defined Functions in Spark SQL/002 Validate Functions in Spark SQL_en.srt 6.4 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/009 Review HDFS Properties on Dataproc Cluster using VS Code_en.srt 6.4 kB
  • 14 - Project 1 - File Format Converter using Python/006 Generate File Paths for Target JSON Files Dynamically_en.srt 6.4 kB
  • 37 - Ranking using Pyspark Data Frame APIs/006 Filter Based on Ranks Per Partition using Spark Data Frame APIs_en.srt 6.4 kB
  • 24 - Pre-Defined Functions in Spark SQL/004 Case Conversion and Length of Strings using Spark SQL_en.srt 6.4 kB
  • 04 - Setup Application Tables and Data in Postgres Database/006 Register Server in pgAdmin using Application Database and User_en.srt 6.4 kB
  • 16 - Troubleshooting and Debugging Python Issues/005 Troubleshoot Network Connectivity to the Database Server using telnet_en.srt 6.4 kB
  • 51 - Submitting Python based Spark Applications/005 Run Spark Application with Environment Variables in Client Mode_en.srt 6.4 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/006 Convert IP Address to Static for Dataproc Cluster_en.srt 6.4 kB
  • 05 - Writing Basic SQL Queries/011 Filter and Aggregate on Join Results using SQL_en.srt 6.4 kB
  • 23 - Create Delta Tables using Spark SQL/008 Insert Data into Spark Metastore External Table_en.srt 6.4 kB
  • 39 - ELT Data Pipelines using Databricks/008 Spark SQL Application to Cleanup Database and Datasets_en.srt 6.3 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/007 Overview of Performance Assessment of Spark Jobs_en.srt 6.3 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/011 Getting Started with Spark CLI using Scala_en.srt 6.3 kB
  • 12 - Python Collections for Data Engineering/014 Extract Details from Complex JSON Arrays using Python_en.srt 6.3 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/005 Compute Size of restrucutured data using Parquet File Format_en.srt 6.3 kB
  • 21 - Setup Databricks Environment using GCP/005 Getting Started with Databricks Notebook_en.srt 6.3 kB
  • 37 - Ranking using Pyspark Data Frame APIs/005 Compute Ranks per Partition using Spark Data Frame APIs_en.srt 6.3 kB
  • 04 - Setup Application Tables and Data in Postgres Database/004 Create Application Database and User in Postgres Database Server_en.srt 6.3 kB
  • 29 - Sorting using Spark SQL Queries/002 Dealing with Nulls while Sorting Data using Spark SQL Queries_en.srt 6.3 kB
  • 20 - Overview of Spark and Spark Architecture/009 Overview of Spark Infrastructure_en.srt 6.2 kB
  • 12 - Python Collections for Data Engineering/002 Read Data from CSV File into Python List_en.srt 6.2 kB
  • 12 - Python Collections for Data Engineering/006 Usage of Lambda Functions_en.srt 6.2 kB
  • 31 - Ranking using Spark SQL Windowing Functions/004 Compute Ranks Per Key using Spark SQL Windowing Functions_en.srt 6.2 kB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/008 Overview of Spark Architecture_en.srt 6.2 kB
  • 41 - Performance Tuning of Spark - Cluster Configuration/011 Review Completed Job Details using Spark UI_en.srt 6.2 kB
  • 39 - ELT Data Pipelines using Databricks/014 Cleanup Databricks Environment on GCP_en.srt 6.2 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/003 Increase GCP VM Quotas for Mutlinode Hadoop and Spark Cluster_en.srt 6.1 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/005 Setup Multinode Hadoop and Spark Cluster using GCP Dataproc_en.srt 6.1 kB
  • 16 - Troubleshooting and Debugging Python Issues/011 Overview of Software Development Life Cycle_en.srt 6.1 kB
  • 19 - Overview of Big Data and Data Lakes/010 Implementation of Modern Data Lakes on Cloud_en.srt 6.0 kB
  • 19 - Overview of Big Data and Data Lakes/003 Technologies for Different Types of Databases_en.srt 6.0 kB
  • 23 - Create Delta Tables using Spark SQL/006 Validate Data in Spark Metastore Managed Table_en.srt 6.0 kB
  • 18 - Getting Started with GCP/008 Overview of GCP Credits_en.srt 6.0 kB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/005 Overview of Getting Started with GCP related to the course_en.srt 6.0 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/004 Setup Single Node Hadoop and Spark Cluster using Dataproc_en.srt 5.9 kB
  • 10 - Solutions for Basic SQL Queries/003 Validate Data and Review Data Model Diagram_en.srt 5.9 kB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/010 Using withColumn to apply transformations on Spark Data Frames_en.srt 5.9 kB
  • 26 - Filtering Data using Spark SQL Queries/005 Dealing with NULLS while Filtering Data in Spark SQL Queries_en.srt 5.9 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/015 Parameterize Spark SQL Solution for Partition Pruning_en.srt 5.9 kB
  • 24 - Pre-Defined Functions in Spark SQL/020 Overview of Handling Null Values using Spark SQL_en.srt 5.9 kB
  • 32 - Processing JSON like Data using Spark SQL/004 Creating Spark Metastore Tables with Struct Type Columns_en.srt 5.9 kB
  • 08 - Performance Tuning of SQL Queries/012 Guidelines on adding Indexes on Tables for SQL Queries_en.srt 5.9 kB
  • 18 - Getting Started with GCP/010 Overview of Google Cloud Shell_en.srt 5.8 kB
  • 28 - Joins using Spark SQL Queries/006 Example - Outer Join along with GROUP BY using Spark SQL Queries_en.srt 5.8 kB
  • 08 - Performance Tuning of SQL Queries/001 Introduction to Performance Tuning of SQL Queries_en.srt 5.8 kB
  • 02 - Getting Started with SQL for Data Engineering/007 Differences and Similarities between RDBMS and Data Warehouse Technologies_en.srt 5.8 kB
  • 24 - Pre-Defined Functions in Spark SQL/025 Aggregate using CASE and WHEN in GROUP BY in Spark SQL_en.srt 5.8 kB
  • 19 - Overview of Big Data and Data Lakes/004 Volumes for Different Types of Databases_en.srt 5.8 kB
  • 18 - Getting Started with GCP/004 Overview of Google Cloud Platform or GCP_en.srt 5.8 kB
  • 16 - Troubleshooting and Debugging Python Issues/012 Overview of Unit Testing or Validation of Applications_en.srt 5.7 kB
  • 14 - Project 1 - File Format Converter using Python/022 Exception Handling in Python Applications_en.srt 5.7 kB
  • 42 - Performance Tuning while inferring schema from CSV or JSON files/002 Steps to convert CSV or JSON Files to Parquet or Delta Files_en.srt 5.7 kB
  • 52 - Logging in Python based Spark Applications/007 Validate Logging of Spark Application using Client Mode_en.srt 5.6 kB
  • 16 - Troubleshooting and Debugging Python Issues/007 Troubleshoot Credentials Related issues for Database Connectivity using Python_en.srt 5.6 kB
  • 14 - Project 1 - File Format Converter using Python/012 Install Dependencies for the Python Project using pip_en.srt 5.6 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/024 Review the code of Word Count Application_en.srt 5.6 kB
  • 13 - Data Processing using Pandas Dataframe APIs/011 Overview of Writing Pandas Dataframes to Files_en.srt 5.5 kB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/005 Advantages of Pyspark Data Frames_en.srt 5.5 kB
  • 21 - Setup Databricks Environment using GCP/004 Getting Started with Databricks Clusters on GCP_en.srt 5.5 kB
  • 02 - Getting Started with SQL for Data Engineering/005 Overview of Data Warehouse and Data Lake_en.srt 5.5 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/001 Introduction to Setup Hadoop and Spark Cluster using Dataproc_en.srt 5.5 kB
  • 18 - Getting Started with GCP/007 Sign up for GCP using Google Account_en.srt 5.5 kB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/004 Review Explain Plan for Spark SQL logic using Spark UI_en.srt 5.5 kB
  • 28 - Joins using Spark SQL Queries/007 Example - Filtering and Outer Joins along with GROUP BY in Spark SQL Queries_en.srt 5.5 kB
  • 37 - Ranking using Pyspark Data Frame APIs/002 Syntax for ranking using Spark Data Frame APIs_en.srt 5.5 kB
  • 03 - Setup Tools for Data Engineering Essentials/002 Setup VS Code on Windows_en.srt 5.5 kB
  • 12 - Python Collections for Data Engineering/013 Overview of Processing JSON Data using Python_en.srt 5.4 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/018 Determine Maximum Capacity to submit a Spark Application_en.srt 5.4 kB
  • 23 - Create Delta Tables using Spark SQL/009 Validate Data in Spark Metastore External Table_en.srt 5.4 kB
  • 41 - Performance Tuning of Spark - Cluster Configuration/010 Review Running Job Details using Spark UI_en.srt 5.4 kB
  • 36 - Joining Data using Spark Data Frame APIs/003 Review Syntax for join using Spark Data Frame APIs_en.srt 5.4 kB
  • 36 - Joining Data using Spark Data Frame APIs/006 Analyze Data for outer joins using Spark Data Frame APIs_en.srt 5.4 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/006 Overview of Ranking in SQL_en.srt 5.3 kB
  • 07 - SQL Troubleshooting and Debugging Guide/011 Overview of Bugs in SQL Queries_en.srt 5.3 kB
  • 39 - ELT Data Pipelines using Databricks/001 Overview of Databricks Workflows_en.srt 5.3 kB
  • 41 - Performance Tuning of Spark - Cluster Configuration/005 Create Multinode Databricks Cluster with Auto Scaling_en.srt 5.3 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/031 Delete Multinode Hadoop and Spark Cluster_en.srt 5.3 kB
  • 28 - Joins using Spark SQL Queries/005 Example - Inner Join along with GROUP BY using Spark SQL Queries_en.srt 5.3 kB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/001 Getting Started with Performance Tuning using Spark on Databricks_en.srt 5.3 kB
  • 16 - Troubleshooting and Debugging Python Issues/018 Managing Breakpoints for Debugging in VS Code_en.srt 5.3 kB
  • 51 - Submitting Python based Spark Applications/007 Review Spark Application Details using Spark UI_en.srt 5.3 kB
  • 41 - Performance Tuning of Spark - Cluster Configuration/008 Analyze Airlines Data using Spark SQL_en.srt 5.3 kB
  • 12 - Python Collections for Data Engineering/004 Getting Started with Processing Python Lists_en.srt 5.3 kB
  • 24 - Pre-Defined Functions in Spark SQL/023 Overview of Case and When in Spark SQL_en.srt 5.2 kB
  • 18 - Getting Started with GCP/012 Initialize gcloud CLI using GCP Project_en.srt 5.2 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/001 Overview of Basic Transformations using Pyspark Data Frame APIs_en.srt 5.2 kB
  • 03 - Setup Tools for Data Engineering Essentials/005 Overview of learning Python using Python CLI_en.srt 5.2 kB
  • 41 - Performance Tuning of Spark - Cluster Configuration/006 Overview of Auto Scaling of Databricks Clusters_en.srt 5.1 kB
  • 39 - ELT Data Pipelines using Databricks/003 Pass Arguments to Databricks SQL Notebooks_en.srt 5.1 kB
  • 17 - Performance Tuning of Python Applications/011 Develop application for multiprocessing_en.srt 5.1 kB
  • 02 - Getting Started with SQL for Data Engineering/006 Usage of RDBMS and Data Warehouse technologies_en.srt 5.1 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/003 Review the side effects of using CSV Files in Data Lake_en.srt 5.0 kB
  • 24 - Pre-Defined Functions in Spark SQL/011 Overview of Standard Date and Timestamp in Spark SQL_en.srt 5.0 kB
  • 24 - Pre-Defined Functions in Spark SQL/001 Overview of Functions in Spark SQL_en.srt 5.0 kB
  • 08 - Performance Tuning of SQL Queries/008 Review the Common Application Scenarios for Performance Tuning_en.srt 5.0 kB
  • 35 - Basic Transformations using Pyspark Data Frame APIs/011 Develop Spark SQL Queries for Sorting Data_en.srt 5.0 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/009 Rules and Restrictions to Filter Data based on Ranks in SQL_en.srt 4.9 kB
  • 09 - Exercises for Basic SQL Queries/002 Exercises on Joins and Aggregations using SQL_en.srt 4.9 kB
  • 16 - Troubleshooting and Debugging Python Issues/001 Introduction to Troubleshooting and Debugging Python issues_en.srt 4.9 kB
  • 39 - ELT Data Pipelines using Databricks/007 Import ELT Data Pipeline Applications into Databricks Environment_en.srt 4.8 kB
  • 04 - Setup Application Tables and Data in Postgres Database/005 Clone Data Sets from Git Repository for Database Scripts_en.srt 4.8 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/012 Create Students table with Data for ranking using SQL_en.srt 4.8 kB
  • 18 - Getting Started with GCP/013 Reinitialize Google Cloud Shell with Project id_en.srt 4.7 kB
  • 17 - Performance Tuning of Python Applications/004 Cleanup the tables to run file to db loader application_en.srt 4.7 kB
  • 24 - Pre-Defined Functions in Spark SQL/016 Extract Information using Calendar Functions from Date or Timestamp using Spark_en.srt 4.5 kB
  • 16 - Troubleshooting and Debugging Python Issues/008 Overview of Python process to run Python Applications_en.srt 4.5 kB
  • 52 - Logging in Python based Spark Applications/006 Add logging to Python based Spark Applications_en.srt 4.5 kB
  • 20 - Overview of Spark and Spark Architecture/011 Overview of Executors in Spark Cluster_en.srt 4.5 kB
  • 19 - Overview of Big Data and Data Lakes/009 Overview of Modern Data Lakes on Cloud_en.srt 4.5 kB
  • 19 - Overview of Big Data and Data Lakes/006 Evolution of Big Data Technologies_en.srt 4.4 kB
  • 40 - Performance Tuning of Spark - Catalyst Optimizer/006 Generate Explain Plans on Spark SQL Queries using explain command_en.srt 4.4 kB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/008 Using drop to drop columns from Spark Data Frame_en.srt 4.3 kB
  • 16 - Troubleshooting and Debugging Python Issues/015 Getting Started with Debugging of Python Programs using VS Code_en.srt 4.3 kB
  • 23 - Create Delta Tables using Spark SQL/014 Conclusion of Creating Delta Tables using Spark SQL_en.srt 4.3 kB
  • 45 - Recap of important Linux Commands for Data Engineering/001 Introduction to Linux Commands and Scripts for Data Engineers_en.srt 4.3 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/004 Overview of important HDFS Commands_en.srt 4.3 kB
  • 08 - Performance Tuning of SQL Queries/014 Conclusion of Performance Tuning of SQL Queries_en.srt 4.2 kB
  • 14 - Project 1 - File Format Converter using Python/007 Recap of Writing Pandas Dataframe to JSON File_en.srt 4.2 kB
  • 03 - Setup Tools for Data Engineering Essentials/008 Getting Started with pgAdmin on Windows_en.srt 4.2 kB
  • 30 - Copy Query Results into Spark Metastore Tables/001 Overview of Copying Query Results into Spark Metastore Tables_en.srt 4.2 kB
  • 17 - Performance Tuning of Python Applications/006 Fix the error message in file to db loader application_en.srt 4.1 kB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/001 Introduction to Data Engineering Essentials Course_en.srt 3.9 kB
  • 23 - Create Delta Tables using Spark SQL/002 Overview of Supported Providers for Spark Metastore Tables_en.srt 3.9 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/002 Start the Hadoop and Spark Cluster using Dataproc_en.srt 3.9 kB
  • 03 - Setup Tools for Data Engineering Essentials/001 Introduction to Setting up Tools for Data Engineering Essentials_en.srt 3.9 kB
  • 18 - Getting Started with GCP/014 Overview of Analytics Services on GCP_en.srt 3.9 kB
  • 15 - Project 2 - Files to Database Loader/002 Install Python Dependencies for Pandas and Database Integration_en.srt 3.8 kB
  • 16 - Troubleshooting and Debugging Python Issues/019 Conclusion to Troubleshooting and Debugging Python Issues_en.srt 3.8 kB
  • 18 - Getting Started with GCP/006 Create New Google Account using Non Gmail Id_en.srt 3.7 kB
  • 44 - Setup Hadoop and Spark Cluster using Dataproc/008 Setup Local Data Sets on Hadoop and Spark Cluster_en.srt 3.7 kB
  • 30 - Copy Query Results into Spark Metastore Tables/002 Query to Compute Daily Revenue using Spark SQL_en.srt 3.7 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/002 Delete Single Node Hadoop and Spark Cluster using Dataproc_en.srt 3.7 kB
  • 18 - Getting Started with GCP/009 Overview of GCP Project and Billing_en.srt 3.7 kB
  • 01 - Introduction to Data Engineering Essentials using SQL, Python, and PySpark/009 Detailed outline of ELT Data Pipelines on Databricks_en.srt 3.6 kB
  • 46 - Mastering Hadoop HDFS Commands and Concepts/001 Introduction to Mastering Hadoop HDFS Commands and Concepts_en.srt 3.6 kB
  • 21 - Setup Databricks Environment using GCP/013 Setup Data Repository for Data Sets_en.srt 3.6 kB
  • 18 - Getting Started with GCP/002 Pre-requisite Skills to Sign up for course on GCP Data Analytics_en.srt 3.6 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/004 Review Quotas to setup Multinode Hadoop and Spark Cluster_en.srt 3.5 kB
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/001 Introduction to Performance Tuning of Spark Applications on Hadoop and Spark Clu_en.srt 3.5 kB
  • 19 - Overview of Big Data and Data Lakes/001 Different Types of Databases_en.srt 3.5 kB
  • 16 - Troubleshooting and Debugging Python Issues/002 Guidelines for Troubleshooting and Debugging Python related Issues_en.srt 3.4 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/003 Create Tables for Cumulative Aggregations and Ranking_en.srt 3.3 kB
  • 20 - Overview of Spark and Spark Architecture/002 Overview of Data Processing Libraries_en.srt 3.3 kB
  • 14 - Project 1 - File Format Converter using Python/014 Overview of Run-time Arguments and Environment Variables_en.srt 3.2 kB
  • 25 - Setup Spark Metastore Tables for Basic Transformations/001 Introduction to Basic Transformations using Spark SQL_en.srt 3.2 kB
  • 41 - Performance Tuning of Spark - Cluster Configuration/001 Introduction to Databricks Cluster Configuration_en.srt 3.2 kB
  • 24 - Pre-Defined Functions in Spark SQL/027 Exercises for Pre-defined functions in Spark SQL_en.srt 3.2 kB
  • 47 - Build Hive Applications in Hadoop and Spark Clusters/001 Introduction to Building Hive Applications_en.srt 3.2 kB
  • 24 - Pre-Defined Functions in Spark SQL/003 Overview of String Manipulation Functions in Spark SQL_en.srt 3.1 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/005 Compute Total Aggregation using OVER and PARTITION BY in SQL Queries_en.srt 3.1 kB
  • 07 - SQL Troubleshooting and Debugging Guide/001 Introduction to SQL Troubleshooting and Debugging Guide_en.srt 3.0 kB
  • 18 - Getting Started with GCP/005 Overview of Signing for GCP Account_en.srt 3.0 kB
  • 38 - Integration of Spark SQL and Pyspark Data Frame APIs/001 Introduction to Integration of Spark SQL and Pyspark Data Frame APIs_en.srt 3.0 kB
  • 20 - Overview of Spark and Spark Architecture/012 Overview of Spark Glossary_en.srt 3.0 kB
  • 31 - Ranking using Spark SQL Windowing Functions/002 Create Temporary View for ranking using Spark SQL Windowing Functions_en.srt 2.9 kB
  • 42 - Performance Tuning while inferring schema from CSV or JSON files/001 Overview of Inferring Schema using CSV or JSON Files_en.srt 2.9 kB
  • 23 - Create Delta Tables using Spark SQL/001 Introduction to Creating Delta Tables using Spark SQL_en.srt 2.9 kB
  • 36 - Joining Data using Spark Data Frame APIs/001 Introduction to Joining Data using Spark Data Frame APIs_en.srt 2.8 kB
  • 05 - Writing Basic SQL Queries/002 Define Problem Statement for SQL Queries_en.srt 2.8 kB
  • 48 - Getting Started with Spark SQL on Hadoop and Spark Cluster/001 Getting Started with Spark SQL on Hadoop and Spark Cluster_en.srt 2.8 kB
  • 24 - Pre-Defined Functions in Spark SQL/010 Overview of Date Manipulation Functions in Spark SQL_en.srt 2.8 kB
  • 02 - Getting Started with SQL for Data Engineering/001 Introduction to SQL for Data Engineering_en.srt 2.7 kB
  • 03 - Setup Tools for Data Engineering Essentials/010 Conclusion of Setting up Tools for Data Engineering Essentials_en.srt 2.6 kB
  • 50 - Getting Started with Pyspark on Hadoop and Spark Cluster/001 Introduction to Getting Started with Pyspark_en.srt 2.5 kB
  • 22 - Basic Transformations using Spark SQL/004 Exercise to create temporary views using Spark SQL_en.srt 2.5 kB
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/001 Introduction to Cumulative Aggregations and Ranking in SQL Queries_en.srt 2.4 kB
  • 14 - Project 1 - File Format Converter using Python/001 Project 1 - File Format Converter Handout.html 2.4 kB
  • 15 - Project 2 - Files to Database Loader/001 Project 2 - Files To Database Loader Handout.html 2.3 kB
  • 31 - Ranking using Spark SQL Windowing Functions/001 Ranking using Spark SQL Windowing Functions_en.srt 2.3 kB
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/001 Introduction to Performance Tuning while storing data in Data Lake_en.srt 2.2 kB
  • 49 - Build Real Time Applications using Spark SQL with Shell Wrapper/001 Introduction to Application Development Life Cycle of Spark SQL Applications_en.srt 2.1 kB
  • 34 - Create Spark Data Frames using Pyspark Data Frame APIs/002 Introduction to Processing JSON like Data using Spark SQL_en.srt 2.1 kB
  • 21 - Setup Databricks Environment using GCP/001 Overview of Databicks on GCP_en.srt 2.0 kB
  • 51 - Submitting Python based Spark Applications/001 Introduction to Submitting Python based Spark Applications_en.srt 1.9 kB
  • 18 - Getting Started with GCP/001 Introduction to Getting Started with GCP_en.srt 1.8 kB
  • 18 - Getting Started with GCP/015 Conclusion to Get Started with GCP for Data Engineering_en.srt 1.7 kB
  • 22 - Basic Transformations using Spark SQL/007 Overview of Pyspark Examples on Databricks_en.srt 1.7 kB
  • 33 - Getting Started with Pyspark Data Frame APIs/001 Overview of Pyspark Examples on Databricks_en.srt 1.7 kB
  • 52 - Logging in Python based Spark Applications/001 Introduction to Logging in Python baesd Spark Applications_en.srt 1.5 kB
  • 0. Websites you may like/[FreeCourseSite.com].url 127 Bytes
  • 11 - Getting Started with Python/0. Websites you may like/[FreeCourseSite.com].url 127 Bytes
  • 24 - Pre-Defined Functions in Spark SQL/0. Websites you may like/[FreeCourseSite.com].url 127 Bytes
  • 0. Websites you may like/[CourseClub.Me].url 122 Bytes
  • 11 - Getting Started with Python/0. Websites you may like/[CourseClub.Me].url 122 Bytes
  • 24 - Pre-Defined Functions in Spark SQL/0. Websites you may like/[CourseClub.Me].url 122 Bytes
  • 0. Websites you may like/[GigaCourse.Com].url 49 Bytes
  • 11 - Getting Started with Python/0. Websites you may like/[GigaCourse.Com].url 49 Bytes
  • 24 - Pre-Defined Functions in Spark SQL/0. Websites you may like/[GigaCourse.Com].url 49 Bytes
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/011 Filtering based on Ranks per Partition using Nested Queries and CTEs in SQL.encrypted.m4a.part 0 Bytes
  • 06 - Cumulative Aggregations and Ranking in SQL Queries/011 Filtering based on Ranks per Partition using Nested Queries and CTEs in SQL.encrypted.mp4.part 0 Bytes
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/003 Review the side effects of using CSV Files in Data Lake.encrypted.m4a.part 0 Bytes
  • 43 - Performance Tuning using Columnar File Format and Partitioning Strategy/003 Review the side effects of using CSV Files in Data Lake.encrypted.mp4.part 0 Bytes
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/009 Computing Overall Capacity of Multinode Hadoop and Spark Clusters.encrypted.m4a.part 0 Bytes
  • 53 - Performance Tuning of Spark Applications on Hadoop and Spark/009 Computing Overall Capacity of Multinode Hadoop and Spark Clusters.encrypted.mp4.part 0 Bytes

随机展示

相关说明

本站不存储任何资源内容,只收集BT种子元数据(例如文件名和文件大小)和磁力链接(BT种子标识符),并提供查询服务,是一个完全合法的搜索引擎系统。 网站不提供种子下载服务,用户可以通过第三方链接或磁力链接获取到相关的种子资源。本站也不对BT种子真实性及合法性负责,请用户注意甄别!