{"id":50844,"date":"2016-01-07T13:00:26","date_gmt":"2016-01-07T11:00:26","guid":{"rendered":"http:\/\/www.javacodegeeks.com\/?p=50844"},"modified":"2016-01-06T23:08:59","modified_gmt":"2016-01-06T21:08:59","slug":"will-2016-apache-spark-kafka-drill","status":"publish","type":"post","link":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html","title":{"rendered":"What Will You Do in 2016? Apache Spark, Kafka, Drill and More"},"content":{"rendered":"<p>Let\u2019s have some fun.<\/p>\n<p>It\u2019s the start of a new year &#8212; we\u2019re on the threshold of something new &#8212; so let\u2019s look forward to what you\u2019re likely to be doing in 2016. Now I know the riskiness of making predictions \u2013 especially ones on record \u2013 but I\u2019m happy for you to drop back by in a year from now and see how my projections for 2016 pan out.<\/p>\n<h2>What will you do in 2016?<\/h2>\n<p>Before I make my 2016 predictions, first think more generally (and playfully) about the challenge of speculating on what the future will be. Do we work from data and models? From observation and hunch? And remember that the accuracy of describing the future lies in part on how long into the future the target is.<\/p>\n<p>It\u2019s tradition to project distantly future views of what people\u2019s lives will be. Sometimes they are accurate and more often hilariously wrong. Looking back at what the future was supposed to be is an entertainment I call \u201cremembering the future\u201d.<\/p>\n<p>For example, the year 2000 caught people\u2019s imagination for many years. I came across an essay published in the year 1900 in <i>Ladies Home Journal<\/i> with predictions for what our lives would be like in 2000. Among the predictions that were roughly correct were that motor cars would become numerous, photographs could be telegraphed from distant countries such that they could be printed in newspapers within an hour and that the US population would reach over 350 million including territories (the 2000 census put the US population at 282 million, a bit short). Much less accurate were predictions that there would be no more flies or mosquitos, through-traffic in cities would be underground or overhead such that cities would be \u201cfree from noise\u201d and that we would have ceased to use the letters C, X or Q.<\/p>\n<p>The future did not turn out as described, but partly because we often solve the same problems in a different way than predicted: today traffic is shunted through cities on freeways, but &#8212; alas &#8212; that does not eliminate noise. And instead of normalizing spelling by \u201cfiring\u201d certain consonants, we rely on automated spell-correct systems (sometimes with laughable results).<\/p>\n<h2>Back to Big Data<\/h2>\n<p>This idea of \u201cremembering the future\u201d was a theme picked up in a lively presentation by Ted Dunning on current and future big data trends at a Big Data Singapore meet-up during the week of the Strata Hadoop World conference in that city. Also touching on the idea of where big data systems are going in the near future was another presenter, Hadoop founder Doug Cutting.<\/p>\n<p><a href=\"http:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2016\/01\/apachespark-kafka-blog-img1.png\"><img decoding=\"async\" class=\"aligncenter size-full wp-image-50857\" src=\"http:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2016\/01\/apachespark-kafka-blog-img1.png\" alt=\"apachespark-kafka-blog-img1\" width=\"516\" height=\"484\" srcset=\"https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2016\/01\/apachespark-kafka-blog-img1.png 516w, https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2016\/01\/apachespark-kafka-blog-img1-300x281.png 300w\" sizes=\"(max-width: 516px) 100vw, 516px\" \/><\/a><\/p>\n<p>Doug talked about the evolution of the Hadoop ecosystem particularly with regard to analytics. Batch-based computation is giving way in many cases to in-memory micro-batched computational capabilities, hence the widespread and growing interest in <a href=\"https:\/\/www.mapr.com\/products\/apache-spark\">Apache Spark.<\/a><\/p>\n<p>Ted first entertained people with cultural trends that did not turn out as predicted before he described a successful, forward looking, big data project &#8211; an open source project from the 19th century that made good use ocean and wind data to build navigational charts for sailing. Jumping to the present day, Ted explained the current big data trend toward simplification in machine learning projects such that they deliver practical value. Ted also talked about the need for more streamlined ways to handle complex data to avoid having to build hundreds of tables (as was true with traditional relational systems) and he showed the advantage of leveraging the flexibility of SQL engine Apache Drill in these situations.<div style=\"display:inline-block; margin: 15px 0;\"> <div id=\"adngin-JavaCodeGeeks_incontent_video-0\" style=\"display:inline-block;\"><\/div> <\/div><\/p>\n<h2>Six Predictions for 2016<\/h2>\n<p>Inspired by others who have described big data trends, now I\u2019ll stick out my neck to make my own predictions (purely opinion) about what you\u2019ll be doing in 2016. After all, it\u2019s only one year in the future\u2026<\/p>\n<h4>Streaming Data<\/h4>\n<p>I feel confident that throughout 2016 there will be explosive interest in <a href=\"https:\/\/www.mapr.com\/products\/mapr-streams\">streaming data and streaming analytics<\/a>. Streaming data will be used by many more organizations than previously and in new ways. Increasing volume in IoT sensor data is just one of the sources of streaming data. Series of events \u2013 such as clickstream data from web traffic or machine log files \u2013 will increasingly be analyzed as streams, using near-real time processing with Apache Spark or actual real time analytics with a newer tool, <a href=\"https:\/\/www.mapr.com\/blog\/apache-flink-new-way-handle-streaming-data\">Apache Flink<\/a>.<\/p>\n<p>One of the big shifts will be a different way to think of the architecture that best supports these applications: <u>The message queue will become a central focus in designing these systems.<\/u> The messaging layer will be much more than just a safety buffer in the workflow of a streaming analytic program. Done right, the message queue becomes a re-playable, immutable persistent log that serves multiple masters such as real time analytics applications, databases or search documents. For these reasons, I predict a greatly increased use of the already popular messaging tool Apache Kafka as well as strong interest in new <a href=\"https:\/\/www.mapr.com\/products\/mapr-streams\">MapR Streams<\/a>, an integrated messaging technology that supports the Kafka API.<\/p>\n<h4>Shorter Time-to-Value<\/h4>\n<p>Businesses want practical ways to get to value faster, so I believe you are likely to try out <a href=\"https:\/\/www.mapr.com\/products\/apache-drill\">Apache Drill<\/a> some time in 2016 if your business has any need for <a href=\"https:\/\/www.mapr.com\/why-hadoop\/sql-hadoop\/sql-hadoop-details\">SQL<\/a>. With more frequent releases, Drill\u2019s capabilities continue to expand, but it\u2019s already a highly performant, highly scalable and extremely flexible query engine that uses standard SQL. This makes it equally attractive to people coming to big data from a traditional background as well as veterans of the Hadoop and NoSQL world who want a query engine that easily handles a wider variety of unstructured and nested data types, such as JSON and Parquet.<\/p>\n<p>Perhaps the characteristic of Drill that is most likely to make you want to try it is its ability to query data with little or no preparation \u2013 that can cut hours or days off the time needed to go from data to insights. With less time needed before you start your query, with Drill you\u2019re able to quickly build a second query based on what you\u2019ve learned from the first one. Faster development, faster insights, shorter time-to-value.<\/p>\n<p><a href=\"http:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2016\/01\/apacespark-kafka-blog-img2.png\"><img decoding=\"async\" class=\"aligncenter size-full wp-image-50858\" src=\"http:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2016\/01\/apacespark-kafka-blog-img2.png\" alt=\"apacespark-kafka-blog-img2\" width=\"566\" height=\"422\" srcset=\"https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2016\/01\/apacespark-kafka-blog-img2.png 566w, https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2016\/01\/apacespark-kafka-blog-img2-300x224.png 300w\" sizes=\"(max-width: 566px) 100vw, 566px\" \/><\/a><\/p>\n<h4>Centralization<\/h4>\n<p>Increasingly people will think of the big data platform as a central part of their overall organization rather than a special purpose project. <a href=\"https:\/\/www.mapr.com\/\">A big data platform<\/a> such as an <a href=\"https:\/\/www.mapr.com\/products\/apache-hadoop\">Hadoop<\/a> and <a href=\"https:\/\/www.mapr.com\/products\/mapr-db-in-hadoop-nosql\">NoSQL based system<\/a> will need to be easily connected to traditional technologies such as an enterprise data warehouse, relational data base or BI tools.<\/p>\n<p>A paradoxical aspect of centralization for a global organization is the need to globally distribute data. Different parts of your organization need access to unified data sets. As you break down unwanted silos within or between geographically distinct centers, you\u2019ll want to avoid propagation delays. There may be legal issues that require localization of data. For these reasons, I predict many organizations will want a system with secure and reliable ways to maintain multiple data centers that can be quickly synchronized.<\/p>\n<h4>Special Topics: Healthcare<\/h4>\n<p>I think the use of big data in the <a href=\"https:\/\/www.mapr.com\/solutions\/industry\/big-data-and-apache-hadoop-healthcare-and-life-sciences\">health care industry<\/a> is poised for rapid expansion in 2016. People are recognizing the power of using data to reduce fraud and improve healthcare delivery through the use of electronic patient histories, long-term maintenance records for machinery, and the flow of sensor information. Excellent data security and governance will of course be important for these <a href=\"https:\/\/www.mapr.com\/solutions\/enterprise-big-data-and-hadoop-use-cases\">use cases<\/a>.<\/p>\n<h4>Special Topics: Telecommunications<\/h4>\n<p>Another area that will increasingly stand out in the big data space in 2016 is <a href=\"https:\/\/www.mapr.com\/solutions\/industry\/big-data-and-apache-hadoop-telecommunications\">telecommunications<\/a>. Telecom companies already have excellent use cases for big data: offloading the strain of ETL to Hadoop while maintaining complex billing on enterprise warehouses; using anomaly detection on data to and from cell towers to discover and quickly respond to sudden usage shifts, and employing real time analytics to respond quickly users after a dropped call in order to improve experience and reduce churn.<\/p>\n<p>Expansions in streaming data architectures and technologies (described above) will benefit telecoms. But even if you are not working with a telecom yourself, this special case may still affect you. More non-phone applications are making use of telecom networks. Sensors in cars, for instance, often send data via a telecommunications network. Taking all that together, I predict you will likely be combining advanced telecommunications with big data in 2016.<\/p>\n<h4>The Best Prediction: You\u2019ll Surprise Me<\/h4>\n<p>And my best prediction for 2016 is that you will come up with some innovative way to put big data to use that has not yet occurred to me. Maybe it will solve a problem I\u2019m already aware of but in a novel way. Or maybe it will be something entirely new. Either way, by January 2017 I\u2019ll be \u201cremembering the future\u201d as I saw it and be surprised by something new, even if my other five predictions turn out to be accurate.<\/p>\n<h2>Other Resources<\/h2>\n<p>For related content by the author see these free resources:<\/p>\n<ul>\n<li><a href=\"https:\/\/www.mapr.com\/practical-machine-learning-new-look-anomaly-detection\">Practical Machine Learning: A New Look at Anomaly Detection <\/a><\/li>\n<li><a href=\"https:\/\/www.mapr.com\/real-world-hadoop\">Real World Hadoop<\/a><\/li>\n<li><a href=\"http:\/\/radar.oreilly.com\/2015\/09\/apache-drill-tracking-its-history-as-an-open-source-community.html\">Article on Apache Drill on the O\u2019Reilly Radar blog<\/a><\/li>\n<\/ul>\n<div class=\"attribution\">\n<table>\n<tbody>\n<tr>\n<td><span class=\"reference\">Reference: <\/span><\/td>\n<td><a href=\"https:\/\/www.mapr.com\/practical-machine-learning-new-look-anomaly-detection\">What Will You Do in 2016? Apache Spark, Kafka, Drill and More<\/a> from our <a href=\"http:\/\/www.javacodegeeks.com\/join-us\/jcg\/\">JCG partner<\/a> Ellen Friedman at the <a href=\"http:\/\/www.mapr.com\/blog\">Mapr<\/a> blog.<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Let\u2019s have some fun. It\u2019s the start of a new year &#8212; we\u2019re on the threshold of something new &#8212; so let\u2019s look forward to what you\u2019re likely to be doing in 2016. Now I know the riskiness of making predictions \u2013 especially ones on record \u2013 but I\u2019m happy for you to drop back &hellip;<\/p>\n","protected":false},"author":992,"featured_media":22307,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[8],"tags":[1088,940,1092,372],"class_list":["post-50844","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-enterprise-java","tag-apache-drill","tag-apache-kafka","tag-apache-spark","tag-big-data"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>What Will You Do in 2016? Apache Spark, Kafka, Drill and More - Java Code Geeks<\/title>\n<meta name=\"description\" content=\"Let\u2019s have some fun. It\u2019s the start of a new year -- we\u2019re on the threshold of something new -- so let\u2019s look forward to what you\u2019re likely to be doing in\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"What Will You Do in 2016? Apache Spark, Kafka, Drill and More - Java Code Geeks\" \/>\n<meta property=\"og:description\" content=\"Let\u2019s have some fun. It\u2019s the start of a new year -- we\u2019re on the threshold of something new -- so let\u2019s look forward to what you\u2019re likely to be doing in\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html\" \/>\n<meta property=\"og:site_name\" content=\"Java Code Geeks\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/javacodegeeks\" \/>\n<meta property=\"article:published_time\" content=\"2016-01-07T11:00:26+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2014\/03\/apache-spark-logo.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"150\" \/>\n\t<meta property=\"og:image:height\" content=\"150\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Ellen Friedman\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@javacodegeeks\" \/>\n<meta name=\"twitter:site\" content=\"@javacodegeeks\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Ellen Friedman\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"8 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html\"},\"author\":{\"name\":\"Ellen Friedman\",\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/#\\\/schema\\\/person\\\/daeeef3356f379f825a4056ba7249e71\"},\"headline\":\"What Will You Do in 2016? Apache Spark, Kafka, Drill and More\",\"datePublished\":\"2016-01-07T11:00:26+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html\"},\"wordCount\":1529,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.javacodegeeks.com\\\/wp-content\\\/uploads\\\/2014\\\/03\\\/apache-spark-logo.jpg\",\"keywords\":[\"Apache Drill\",\"Apache Kafka\",\"Apache Spark\",\"Big Data\"],\"articleSection\":[\"Enterprise Java\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html\",\"url\":\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html\",\"name\":\"What Will You Do in 2016? Apache Spark, Kafka, Drill and More - Java Code Geeks\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.javacodegeeks.com\\\/wp-content\\\/uploads\\\/2014\\\/03\\\/apache-spark-logo.jpg\",\"datePublished\":\"2016-01-07T11:00:26+00:00\",\"description\":\"Let\u2019s have some fun. It\u2019s the start of a new year -- we\u2019re on the threshold of something new -- so let\u2019s look forward to what you\u2019re likely to be doing in\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html#primaryimage\",\"url\":\"https:\\\/\\\/www.javacodegeeks.com\\\/wp-content\\\/uploads\\\/2014\\\/03\\\/apache-spark-logo.jpg\",\"contentUrl\":\"https:\\\/\\\/www.javacodegeeks.com\\\/wp-content\\\/uploads\\\/2014\\\/03\\\/apache-spark-logo.jpg\",\"width\":150,\"height\":150},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/2016\\\/01\\\/will-2016-apache-spark-kafka-drill.html#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.javacodegeeks.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Java\",\"item\":\"https:\\\/\\\/www.javacodegeeks.com\\\/category\\\/java\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Enterprise Java\",\"item\":\"https:\\\/\\\/www.javacodegeeks.com\\\/category\\\/java\\\/enterprise-java\"},{\"@type\":\"ListItem\",\"position\":4,\"name\":\"What Will You Do in 2016? Apache Spark, Kafka, Drill and More\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/#website\",\"url\":\"https:\\\/\\\/www.javacodegeeks.com\\\/\",\"name\":\"Java Code Geeks\",\"description\":\"Java Developers Resource Center\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/#organization\"},\"alternateName\":\"JCG\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.javacodegeeks.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/#organization\",\"name\":\"Exelixis Media P.C.\",\"url\":\"https:\\\/\\\/www.javacodegeeks.com\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.javacodegeeks.com\\\/wp-content\\\/uploads\\\/2022\\\/06\\\/exelixis-logo.png\",\"contentUrl\":\"https:\\\/\\\/www.javacodegeeks.com\\\/wp-content\\\/uploads\\\/2022\\\/06\\\/exelixis-logo.png\",\"width\":864,\"height\":246,\"caption\":\"Exelixis Media P.C.\"},\"image\":{\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/javacodegeeks\",\"https:\\\/\\\/x.com\\\/javacodegeeks\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.javacodegeeks.com\\\/#\\\/schema\\\/person\\\/daeeef3356f379f825a4056ba7249e71\",\"name\":\"Ellen Friedman\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/e679afb2129a47229636b15f8e19ec02a71baf08dc665b43d219f1ea8523ff28?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/e679afb2129a47229636b15f8e19ec02a71baf08dc665b43d219f1ea8523ff28?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/e679afb2129a47229636b15f8e19ec02a71baf08dc665b43d219f1ea8523ff28?s=96&d=mm&r=g\",\"caption\":\"Ellen Friedman\"},\"description\":\"She is a consultant and commentator on big data topics. Active in open source, she is committer for Apache Drill and Apache Mahout projects and co-author of many books on working with data in the Hadoop ecosystem. She has a PhD in biochemistry, years of experience as a research scientist and has written about a wide range of technical topics including biology, oceanography and the genetics of learning and memory.\",\"sameAs\":[\"https:\\\/\\\/www.mapr.com\\\/\"],\"url\":\"https:\\\/\\\/www.javacodegeeks.com\\\/author\\\/ellen-friedman\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"What Will You Do in 2016? Apache Spark, Kafka, Drill and More - Java Code Geeks","description":"Let\u2019s have some fun. It\u2019s the start of a new year -- we\u2019re on the threshold of something new -- so let\u2019s look forward to what you\u2019re likely to be doing in","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html","og_locale":"en_US","og_type":"article","og_title":"What Will You Do in 2016? Apache Spark, Kafka, Drill and More - Java Code Geeks","og_description":"Let\u2019s have some fun. It\u2019s the start of a new year -- we\u2019re on the threshold of something new -- so let\u2019s look forward to what you\u2019re likely to be doing in","og_url":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html","og_site_name":"Java Code Geeks","article_publisher":"https:\/\/www.facebook.com\/javacodegeeks","article_published_time":"2016-01-07T11:00:26+00:00","og_image":[{"width":150,"height":150,"url":"https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2014\/03\/apache-spark-logo.jpg","type":"image\/jpeg"}],"author":"Ellen Friedman","twitter_card":"summary_large_image","twitter_creator":"@javacodegeeks","twitter_site":"@javacodegeeks","twitter_misc":{"Written by":"Ellen Friedman","Est. reading time":"8 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html#article","isPartOf":{"@id":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html"},"author":{"name":"Ellen Friedman","@id":"https:\/\/www.javacodegeeks.com\/#\/schema\/person\/daeeef3356f379f825a4056ba7249e71"},"headline":"What Will You Do in 2016? Apache Spark, Kafka, Drill and More","datePublished":"2016-01-07T11:00:26+00:00","mainEntityOfPage":{"@id":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html"},"wordCount":1529,"commentCount":0,"publisher":{"@id":"https:\/\/www.javacodegeeks.com\/#organization"},"image":{"@id":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html#primaryimage"},"thumbnailUrl":"https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2014\/03\/apache-spark-logo.jpg","keywords":["Apache Drill","Apache Kafka","Apache Spark","Big Data"],"articleSection":["Enterprise Java"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html","url":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html","name":"What Will You Do in 2016? Apache Spark, Kafka, Drill and More - Java Code Geeks","isPartOf":{"@id":"https:\/\/www.javacodegeeks.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html#primaryimage"},"image":{"@id":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html#primaryimage"},"thumbnailUrl":"https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2014\/03\/apache-spark-logo.jpg","datePublished":"2016-01-07T11:00:26+00:00","description":"Let\u2019s have some fun. It\u2019s the start of a new year -- we\u2019re on the threshold of something new -- so let\u2019s look forward to what you\u2019re likely to be doing in","breadcrumb":{"@id":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html#primaryimage","url":"https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2014\/03\/apache-spark-logo.jpg","contentUrl":"https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2014\/03\/apache-spark-logo.jpg","width":150,"height":150},{"@type":"BreadcrumbList","@id":"https:\/\/www.javacodegeeks.com\/2016\/01\/will-2016-apache-spark-kafka-drill.html#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.javacodegeeks.com\/"},{"@type":"ListItem","position":2,"name":"Java","item":"https:\/\/www.javacodegeeks.com\/category\/java"},{"@type":"ListItem","position":3,"name":"Enterprise Java","item":"https:\/\/www.javacodegeeks.com\/category\/java\/enterprise-java"},{"@type":"ListItem","position":4,"name":"What Will You Do in 2016? Apache Spark, Kafka, Drill and More"}]},{"@type":"WebSite","@id":"https:\/\/www.javacodegeeks.com\/#website","url":"https:\/\/www.javacodegeeks.com\/","name":"Java Code Geeks","description":"Java Developers Resource Center","publisher":{"@id":"https:\/\/www.javacodegeeks.com\/#organization"},"alternateName":"JCG","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.javacodegeeks.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.javacodegeeks.com\/#organization","name":"Exelixis Media P.C.","url":"https:\/\/www.javacodegeeks.com\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.javacodegeeks.com\/#\/schema\/logo\/image\/","url":"https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2022\/06\/exelixis-logo.png","contentUrl":"https:\/\/www.javacodegeeks.com\/wp-content\/uploads\/2022\/06\/exelixis-logo.png","width":864,"height":246,"caption":"Exelixis Media P.C."},"image":{"@id":"https:\/\/www.javacodegeeks.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/javacodegeeks","https:\/\/x.com\/javacodegeeks"]},{"@type":"Person","@id":"https:\/\/www.javacodegeeks.com\/#\/schema\/person\/daeeef3356f379f825a4056ba7249e71","name":"Ellen Friedman","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/e679afb2129a47229636b15f8e19ec02a71baf08dc665b43d219f1ea8523ff28?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/e679afb2129a47229636b15f8e19ec02a71baf08dc665b43d219f1ea8523ff28?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e679afb2129a47229636b15f8e19ec02a71baf08dc665b43d219f1ea8523ff28?s=96&d=mm&r=g","caption":"Ellen Friedman"},"description":"She is a consultant and commentator on big data topics. Active in open source, she is committer for Apache Drill and Apache Mahout projects and co-author of many books on working with data in the Hadoop ecosystem. She has a PhD in biochemistry, years of experience as a research scientist and has written about a wide range of technical topics including biology, oceanography and the genetics of learning and memory.","sameAs":["https:\/\/www.mapr.com\/"],"url":"https:\/\/www.javacodegeeks.com\/author\/ellen-friedman"}]}},"_links":{"self":[{"href":"https:\/\/www.javacodegeeks.com\/wp-json\/wp\/v2\/posts\/50844","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.javacodegeeks.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.javacodegeeks.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.javacodegeeks.com\/wp-json\/wp\/v2\/users\/992"}],"replies":[{"embeddable":true,"href":"https:\/\/www.javacodegeeks.com\/wp-json\/wp\/v2\/comments?post=50844"}],"version-history":[{"count":0,"href":"https:\/\/www.javacodegeeks.com\/wp-json\/wp\/v2\/posts\/50844\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.javacodegeeks.com\/wp-json\/wp\/v2\/media\/22307"}],"wp:attachment":[{"href":"https:\/\/www.javacodegeeks.com\/wp-json\/wp\/v2\/media?parent=50844"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.javacodegeeks.com\/wp-json\/wp\/v2\/categories?post=50844"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.javacodegeeks.com\/wp-json\/wp\/v2\/tags?post=50844"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}