{"id":61742,"date":"2021-06-14T10:00:01","date_gmt":"2021-06-14T18:00:01","guid":{"rendered":"https:\/\/devblogs.microsoft.com\/devops\/?p=61742"},"modified":"2021-05-21T10:02:46","modified_gmt":"2021-05-21T18:02:46","slug":"azurefunbytes-intro-to-azure-data-factory-with-kromerbigdata","status":"publish","type":"post","link":"https:\/\/devblogs.microsoft.com\/devops\/azurefunbytes-intro-to-azure-data-factory-with-kromerbigdata\/","title":{"rendered":"AzureFunBytes &#8211; Intro to Azure Data Factory with @KromerBigData"},"content":{"rendered":"<p>AzureFunBytes is a weekly opportunity to learn more about the fundamentals and foundations that make up Azure. It&#8217;s a chance for me to understand more about what people across the Azure organization do and how they do it. Every week we get together at 11 AM Pacific on <a href=\"https:\/\/cda.ms\/226\">Microsoft LearnTV<\/a> and learn more about Azure.<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/dev-to-uploads.s3.amazonaws.com\/uploads\/articles\/ane07pu4we1r5r1bga3i.gif\" alt=\"AzureFunBytes animation\" \/><\/p>\n<p>Data drives so many of our decisions. Whether it&#8217;s determining which products to have viewed first in our online retail store, or creating reports for business intelligence, we&#8217;ve got so much data! It&#8217;s time to figure out how to learn how to take that data and provide human-readable information that will help us continue to make the right decisions.<\/p>\n<p>This week on AzureFunBytes, I am joined by <a href=\"https:\/\/twitter.com\/KromerBigData\">Principal Program Manager, Mark Kromer<\/a> about how to store and process our big data with Azure Data Factory. Mark will discuss the <a href=\"https:\/\/cda.ms\/27y\">ETL (Extract, Transform, Load)<\/a> process that gets our data into <a href=\"https:\/\/cda.ms\/27w\">Azure Data Factory<\/a>. I ask Mark <a href=\"https:\/\/cda.ms\/27B\">how can we transfer the data we might have to Azure<\/a>? We look into how to create pipelines to automate the ingestion of our data from various data stores.<\/p>\n<p><iframe title=\"AzureFunBytes Episode 43 - Intro to @Azure Data Factory with @KromerBigData\" width=\"500\" height=\"281\" src=\"https:\/\/www.youtube.com\/embed\/j6lTgorn-9A?start=1&#038;feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<p><a href=\"https:\/\/youtu.be\/OuOE6OG0G8U?t=275\">00:04:35 &#8211; Intro to Mark<\/a><br \/>\n<a href=\"https:\/\/youtu.be\/OuOE6OG0G8U?t=585\">00:09:45 &#8211; Let&#8217;s meet Data Factory<\/a><br \/>\n<a href=\"https:\/\/youtu.be\/OuOE6OG0G8U?t=862\">00:14:48 &#8211; CI\/CD With Data Factory Pipelines<\/a><br \/>\n<a href=\"https:\/\/youtu.be\/OuOE6OG0G8U?t=1232\">00:20:32 &#8211; Azure Data Factory connector overview<\/a><br \/>\n<a href=\"https:\/\/youtu.be\/OuOE6OG0G8U?t=1917\">00:31:57 &#8211; Demo Time<\/a><\/p>\n<p>Our Agenda:<\/p>\n<ul>\n<li>Intro to Data Factory<\/li>\n<li>Differences between ADF &amp; Synapse<\/li>\n<li>Data Flows in ADF &amp; Synapse<\/li>\n<li>Data lake ETL patterns<\/li>\n<li>Build an ETL flow using taxi sample data (Demo)<\/li>\n<li>Q&amp;A<\/li>\n<\/ul>\n<p>From the Azure Documentation &#8220;<a href=\"https:\/\/cda.ms\/27r\">What is Azure Data Factory?<\/a>&#8220;<\/p>\n<p><img decoding=\"async\" src=\"https:\/\/dev-to-uploads.s3.amazonaws.com\/uploads\/articles\/9jbmy22v1lufk8rimbic.png\" alt=\"Overview of Data Factory\" \/><\/p>\n<p><a href=\"https:\/\/cda.ms\/27w\">Azure Data Factory<\/a> is the platform that solves such data scenarios. It is the <em>cloud-based ETL and data integration service that allows you to create data-driven workflows for orchestrating data movement and transforming data at scale<\/em>. Using Azure Data Factory, you can create and schedule data-driven workflows (called pipelines) that can ingest data from disparate data stores. You can build complex ETL processes that transform data visually with data flows or by using compute services such as <a href=\"https:\/\/cda.ms\/27D\">Azure HDInsight Hadoop<\/a>, <a href=\"https:\/\/cda.ms\/27F\">Azure Databricks<\/a>, and <a href=\"https:\/\/cda.ms\/27G\">Azure SQL Database<\/a>.<\/p>\n<p>Integrate all your data with Azure Data Factory\u2014a fully managed, serverless data integration service. Visually integrate data sources with more than 90 built-in, maintenance-free connectors at no added cost. Easily construct ETL and ELT processes code-free in an intuitive environment or write your own code. Then deliver integrated data to Azure Synapse Analytics to unlock business insights.<\/p>\n<hr \/>\n<p>Learn about Azure fundamentals with me!<\/p>\n<p>Live stream is available on Twitch, YouTube, and LearnTV at 11 AM PT \/ 2 PM ET Thursday. You can also find the recordings here as well:<\/p>\n<p><a href=\"https:\/\/twitch.tv\/azurefunbytes\">AzureFunBytes on Twitch<\/a><br \/>\n<a href=\"https:\/\/aka.ms\/jaygordononyoutube\">AzureFunBytes on YouTube<\/a><br \/>\n<a href=\"https:\/\/www.youtube.com\/channel\/UC-ikyViYMM69joIAv7dlMsA\">Azure DevOps YouTube Channel<\/a><br \/>\n<a href=\"https:\/\/twitter.com\/azurefunbytes\">Follow AzureFunBytes on Twitter<\/a><\/p>\n<p>Useful Docs:<\/p>\n<p><a href=\"https:\/\/cda.ms\/219\">Get $200 in free Azure Credit<\/a><br \/>\n<a href=\"https:\/\/cda.ms\/243\">Microsoft Learn: Introduction to Azure fundamentals<\/a><br \/>\n<a href=\"https:\/\/cda.ms\/27K\">Microsoft Learn: Integrate data with Azure Data Factory or Azure Synapse Pipeline<\/a><br \/>\n<a href=\"https:\/\/cda.ms\/27L\">Microsoft Learn: Data integration at scale with Azure Data Factory or Azure Synapse Pipeline<\/a><br \/>\n<a href=\"https:\/\/cda.ms\/27w\">Azure Data Factory<\/a><br \/>\n<a href=\"https:\/\/cda.ms\/27x\">Azure Data Factory documentation<\/a><br \/>\n<a href=\"https:\/\/cda.ms\/27J\">Azure Data Factory Tutorials<\/a><br \/>\n<a href=\"https:\/\/cda.ms\/27y\">Extract, transform, and load (ETL)<\/a><br \/>\n<a href=\"https:\/\/cda.ms\/27C\">Transferring data to and from Azure<\/a><br \/>\n<a href=\"https:\/\/cda.ms\/27H\">Big data architecture style<\/a><br \/>\n<a href=\"https:\/\/www.youtube.com\/channel\/UC2S0k7NeLcEm5_IhHUwpN0g\/videos\">Watch our snack-sized video tutorials here to learn more about building ETL with data flows<\/a> <a href=\"https:\/\/cda.ms\/27s\">Follow the Delta Lake tutorial here to build your own lake<\/a><br \/>\n<a href=\"https:\/\/cda.ms\/27t\">Branching and chaining activities in an Azure Data Factory pipeline using the Azure portal<\/a><br \/>\n<a href=\"https:\/\/github.com\/kromerm\/adfdataflowdocs\/tree\/master\/sampledata\">For access to the taxi medallion sample data to build these pipelines on your own, visit Mark&#8217;s sample data repo here and look for trip data and trip fare<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>This week on AzureFunBytes, I am joined by Principal Program Manager, Mark Kromer about how to store and process our big data with Azure Data Factory. Mark will discuss the ETL (Extract, Transform, Load) process that gets our data into Azure Data Factory. I ask Mark how can we transfer the data we might have to Azure? <\/p>\n","protected":false},"author":39313,"featured_media":61743,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[224],"tags":[],"class_list":["post-61742","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-azure"],"acf":[],"blog_post_summary":"<p>This week on AzureFunBytes, I am joined by Principal Program Manager, Mark Kromer about how to store and process our big data with Azure Data Factory. Mark will discuss the ETL (Extract, Transform, Load) process that gets our data into Azure Data Factory. I ask Mark how can we transfer the data we might have to Azure? <\/p>\n","_links":{"self":[{"href":"https:\/\/devblogs.microsoft.com\/devops\/wp-json\/wp\/v2\/posts\/61742","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/devblogs.microsoft.com\/devops\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/devblogs.microsoft.com\/devops\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/devops\/wp-json\/wp\/v2\/users\/39313"}],"replies":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/devops\/wp-json\/wp\/v2\/comments?post=61742"}],"version-history":[{"count":0,"href":"https:\/\/devblogs.microsoft.com\/devops\/wp-json\/wp\/v2\/posts\/61742\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/devops\/wp-json\/wp\/v2\/media\/61743"}],"wp:attachment":[{"href":"https:\/\/devblogs.microsoft.com\/devops\/wp-json\/wp\/v2\/media?parent=61742"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/devops\/wp-json\/wp\/v2\/categories?post=61742"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/devblogs.microsoft.com\/devops\/wp-json\/wp\/v2\/tags?post=61742"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}