Structured API Overview - Ipynb
The document contains a series of code cells that demonstrate the use of PySpark to read and manipulate flight data from a JSON file. It includes defining a schema, selecting specific columns, and performing various transformations on the data. The output showcases the structure of the data and examples of how to query and display it.