{"id":3568,"date":"2011-11-07T12:00:57","date_gmt":"2011-11-07T12:00:57","guid":{"rendered":"http:\/\/www.markwilson.co.uk\/blog\/?p=3568"},"modified":"2017-01-14T13:55:14","modified_gmt":"2017-01-14T13:55:14","slug":"sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination","status":"publish","type":"post","link":"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm","title":{"rendered":"SQL Server and Hadoop &#8211; unlikely bedfellows but a powerful combination"},"content":{"rendered":"<p><em>Big Data is hard to avoid \u2013 what does Microsoft\u2019s embrace of Hadoop mean for IT Managers?<\/em><\/p>\n<p>There are two words that seem particularly difficult to avoid at the moment: big data.\u00a0Infrastructure guys instinctivly shy away from data but such is its prevalence that big data is much more than just the latest IT buzzword and is becoming a major theme in our industry right now<\/p>\n<p>But what does \u201cbig data\u201d actually mean? It&#8217;s one of those phrases that, like cloud computing earlier, it is being &#8220;adopted&#8221; by vendors to mean whatever they want it to.<\/p>\n<p>The\u00a0<a href=\"http:\/\/www.mckinsey.com\/mgi\/publications\/big_data\/\">McKinsey Global Institute describes big data<\/a>\u00a0as \u201cthe next frontier for innovation, competition and productivity\u201d but, put simply, it\u2019s about analysing masses of unstructured (or semi-structured) data which, until recently, was considered too expensive to do anything with.<\/p>\n<p>That data comes from a variety of sources including sensors, social networks and digital media and it includes text, audio, video, click-streams, log files and more. Cynics who scoff at the description of \u201cbig\u201d data (what\u2019s next, \u201chuge\u201d data?) miss the point that it\u2019s not just about the volume of the data (typically many petabytes) but also the variety and frequency of that data. Some even refer to it as \u201cnano data\u201d because what we\u2019re actually looking at is massive sets of very small data.<\/p>\n<p>Processing big data typically involves distributed computer systems and one project that has come to the fore is\u00a0<a href=\"http:\/\/hadoop.apache.org\/\">Apache Hadoop<\/a>\u00a0\u2013 a framework for development of open-source software for reliable, scalable distributed computing.<\/p>\n<p>Over the last few weeks though, there have been some significant announcements from established IT players, not all of whom are known for embracing open source technology. This indicates a growing acceptance for big data solutions in general and specifically for solutions that include both open- and closed- source elements.<\/p>\n<p>When Microsoft released a\u00a0<a href=\"http:\/\/www.microsoft.com\/download\/en\/details.aspx?id=27584\">SQL Server-Hadoop (SQOOP) Connector<\/a>,there were questions about what this would mean for CIOs and IT Managers who may previously have viewed technologies like Hadoop as a little esoteric.<\/p>\n<p>The key to understanding what this would mean would be understanding the two main types of data: structured and unstructured. Structured data tends to be stored in a relational database management system (RDBMS), for example Microsoft SQL Server, IBM DB2, Oracle 11G or MySQL.<\/p>\n<p>By structuring the data with a schema, tables, keys and all manner of relationships it\u2019s possible to run queries (with a language like SQL) to analyse the data and techniques have developed over the years to optimise those queries. By contrast, unstructured data has no schema (at least not a formal one) and may be as simple as a set of files.\u00a0 Structured data offers maturity, stability and efficiency but unstructured data offers flexibility.<\/p>\n<p>Secondly, there needs to be an understanding of the term \u201cNoSQL\u201d.\u00a0 Commonly misinterpreted as an instruction (no to SQL), it really means\u00a0<em>not only<\/em>\u00a0SQL \u2013 i.e. there are some types of data that are not worth storing in an RDBMS.\u00a0 Rather than following the database model of extract, transform and load (ETL), with a NoSQL system the data arrives and the application knows how to interpret the data, providing a faster time to insight from data acquisition.<\/p>\n<p>Just as there are two main types of data, there are two main types of NoSQL system: key\/value stores (like MongoDB or Windows Azure Table Storage) can be thought of as NoSQL OLTP; Hadoop is more like NoSQL data warehousing and is particularly suited to storing and analysing massive data sets.<\/p>\n<p>One of the key elements towards understanding Hadoop is understanding how the various Hadoop components work together. There&#8217;s a degree of complexity so perhaps it&#8217;s best to summarise \u00a0by saying that the Hadoop stack consists of a highly distributed, fault tolerant, file system (HDFS) and the MapReduce framework for writing and executing distributed, fault tolerant, algorithms. Built on top of that are query languages (live Hive and Pig) and then we have the layer where Microsoft\u2019s SQOOP connector sits, connecting the two worlds of structured and unstructured data.<\/p>\n<p>The trouble is that SQOOP is just a bridge \u2013 and not a particularly efficient one either \u2013 working on SQL data in the unstructured world involves subdivision of the SQL database so that MapReduce can work correctly.<\/p>\n<p>Because most enterprises have both the structured and unstructured data, we really need tools that allow us to analyse and manage data in multiple environments \u2013 ideally without having to go back and forth. That\u2019s why there are \u00a0so many vendors jumping on the big data bandwagon but it seems that a SQOOP connector is not the only work Microsoft is doing in the big data space:<\/p>\n<ul>\n<li><a href=\"http:\/\/msdn.microsoft.com\/en-us\/library\/ee362541.aspx\">SQL Server 2008 R2 includes a complex event processing (CEP) capability called StreamInsight<\/a>. The principle is that streams of data can be monitored, managed and mined for particular events (instead of running queries across data, run the data through a set of queries looking for matches) and this can help organisations to respond quickly to new opportunities \u2013 maybe even adopting a predictive business model.<\/li>\n<li>The next version of SQL Server will include\u00a0<a href=\"http:\/\/blogs.msdn.com\/b\/sqlrsteamblog\/archive\/2011\/10\/13\/power-view-pass-and-mobile.aspx\">a new data analysis tool called Power View<\/a>\u00a0which will even be supported on competitive mobile operating systems (including iOS and Android).<\/li>\n<li><a href=\"http:\/\/msdn.microsoft.com\/en-us\/library\/windowsazure\/hh508997.aspx\">Windows Azure includes table storage<\/a>\u00a0\u2013 a key\/value pair storage solution with partitioning.<\/li>\n<li>Also on Azure,\u00a0<a href=\"http:\/\/www.microsoft.com\/en-us\/sqlazurelabs\/labs\/dataexplorer.aspx\">Microsoft is creating a new Data Explorer tool<\/a>\u00a0to create rich data sets that can be published as a service and\u00a0<a href=\"http:\/\/research.microsoft.com\/en-us\/news\/headlines\/daytona-071811.aspx\">an iterative MapReduce runtime codenamed \u201cDaytona\u201d<\/a>\u00a0for scaling data analytics across hundreds of processing cores.<\/li>\n<li><a href=\"http:\/\/blogs.technet.com\/b\/microsoft_blog\/archive\/2011\/10\/12\/microsoft-expands-data-platform-to-help-customers-manage-the-new-currency-of-the-cloud.aspx\">Microsoft is also creating new implementations of the Hadoop stack for Windows Azure and Windows Server<\/a>\u00a0(<a href=\"http:\/\/blogs.technet.com\/b\/dataplatforminsider\/archive\/2011\/10\/13\/microsoft-s-big-data-roadmap-amp-approach.aspx\">including a Hive ODBC driver and a Hive Add-in for Excel<\/a>) but it also has a competing\u00a0 technology called LINQ to HPC (formerly codenamed Dryad)\u00a0 that\u00a0<a href=\"http:\/\/blogs.technet.com\/b\/windowshpc\/archive\/2011\/10\/18\/preview-of-windows-azure-scheduler-and-the-hpc-pack-2008-r2-service-pack-3-releases-now-available.aspx\">allows a Windows High Performance Compute (HPC) cluster to not only perform parallel computing but also to integrate with Azure<\/a>\u00a0(the theory behind this is that big data jobs are typically I\/O-bound, rather than compute-bound).<\/li>\n<\/ul>\n<p>In our increasingly cloudy world, infrastructure and platforms are rapidly becoming commoditised. We need to focus on software that allows us to derive value from data to gain some business value. Consider that Microsoft is only one vendor, then think about what Oracle, IBM, Fujitsu and others are doing. If you weren\u2019t convinced before,\u00a0<a href=\"http:\/\/gigaom.com\/cloud\/autonomy-exec-says-hp-now-a-big-data-player\/\">maybe HP\u2019s Autonomy purchase is starting to make sense now<\/a>?<\/p>\n<p>Looking specifically at Microsoft\u2019s developments in the big data world, it therefore makes sense to see the company get closer to Hadoop. The world has spoken and the de facto solution for analysing large data sets seems to be HDFS\/MapReduce\/Hive (or similar).<\/p>\n<p>Maybe Hadoop\u2019s success comes down to HDFS and MapReduce being based on work from Google whilst Hive and Pig are supported by Facebook and Yahoo respectively (i.e. they are all from established Internet businesses).\u00a0 But, by embracing Hadoop (together with porting its tools to competitive platforms), Microsoft is better placed to support the entire enterprise with both their structured and unstructured needs.<\/p>\n<p>[<a href=\"http:\/\/www.cloudpro.co.uk\/paas\/2175\/sql-server-and-hadoop-unlikely-bedfellows-combine-powerfully\">This post was originally written as an article for Cloud Pro<\/a>.]<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Big Data is hard to avoid \u2013 what does Microsoft\u2019s embrace of Hadoop mean for IT Managers? There are two words that seem particularly difficult to avoid at the moment: big data.\u00a0Infrastructure guys instinctivly shy away from data but such is its prevalence that big data is much more than just the latest IT buzzword &hellip; <a href=\"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">SQL Server and Hadoop &#8211; unlikely bedfellows but a powerful combination<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[218],"tags":[278,456,320,83],"class_list":["post-3568","post","type-post","status-publish","format-standard","hentry","category-technology","tag-big-data","tag-cloud-pro","tag-hadoop","tag-sql-server"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.5 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>SQL Server and Hadoop - unlikely bedfellows but a powerful combination - markwilson.it<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm\" \/>\n<meta property=\"og:locale\" content=\"en_GB\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"SQL Server and Hadoop - unlikely bedfellows but a powerful combination - markwilson.it\" \/>\n<meta property=\"og:description\" content=\"Big Data is hard to avoid \u2013 what does Microsoft\u2019s embrace of Hadoop mean for IT Managers? There are two words that seem particularly difficult to avoid at the moment: big data.\u00a0Infrastructure guys instinctivly shy away from data but such is its prevalence that big data is much more than just the latest IT buzzword &hellip; Continue reading SQL Server and Hadoop &#8211; unlikely bedfellows but a powerful combination\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm\" \/>\n<meta property=\"og:site_name\" content=\"markwilson.it\" \/>\n<meta property=\"article:published_time\" content=\"2011-11-07T12:00:57+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2017-01-14T13:55:14+00:00\" \/>\n<meta name=\"author\" content=\"Mark Wilson\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@markwilsonit\" \/>\n<meta name=\"twitter:site\" content=\"@markwilsonit\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Mark Wilson\" \/>\n\t<meta name=\"twitter:label2\" content=\"Estimated reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/2011\\\/11\\\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/2011\\\/11\\\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm\"},\"author\":{\"name\":\"Mark Wilson\",\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/#\\\/schema\\\/person\\\/98f61365e7c39d6be942174b8c4de468\"},\"headline\":\"SQL Server and Hadoop &#8211; unlikely bedfellows but a powerful combination\",\"datePublished\":\"2011-11-07T12:00:57+00:00\",\"dateModified\":\"2017-01-14T13:55:14+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/2011\\\/11\\\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm\"},\"wordCount\":1204,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/#\\\/schema\\\/person\\\/98f61365e7c39d6be942174b8c4de468\"},\"keywords\":[\"Big data\",\"Cloud Pro\",\"Hadoop\",\"Microsoft SQL Server\"],\"articleSection\":[\"Technology\"],\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/2011\\\/11\\\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/2011\\\/11\\\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm\",\"url\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/2011\\\/11\\\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm\",\"name\":\"SQL Server and Hadoop - unlikely bedfellows but a powerful combination - markwilson.it\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/#website\"},\"datePublished\":\"2011-11-07T12:00:57+00:00\",\"dateModified\":\"2017-01-14T13:55:14+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/2011\\\/11\\\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm#breadcrumb\"},\"inLanguage\":\"en-GB\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/2011\\\/11\\\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/2011\\\/11\\\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"SQL Server and Hadoop &#8211; unlikely bedfellows but a powerful combination\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/#website\",\"url\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/\",\"name\":\"markwilson.it\",\"description\":\"get-info -class technology | write-output &gt; \\\/dev\\\/web\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/#\\\/schema\\\/person\\\/98f61365e7c39d6be942174b8c4de468\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-GB\"},{\"@type\":[\"Person\",\"Organization\"],\"@id\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/#\\\/schema\\\/person\\\/98f61365e7c39d6be942174b8c4de468\",\"name\":\"Mark Wilson\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-GB\",\"@id\":\"https:\\\/\\\/i0.wp.com\\\/www.markwilson.co.uk\\\/blog\\\/uploads\\\/image-4.png?fit=800%2C800&ssl=1\",\"url\":\"https:\\\/\\\/i0.wp.com\\\/www.markwilson.co.uk\\\/blog\\\/uploads\\\/image-4.png?fit=800%2C800&ssl=1\",\"contentUrl\":\"https:\\\/\\\/i0.wp.com\\\/www.markwilson.co.uk\\\/blog\\\/uploads\\\/image-4.png?fit=800%2C800&ssl=1\",\"width\":800,\"height\":800,\"caption\":\"Mark Wilson\"},\"logo\":{\"@id\":\"https:\\\/\\\/i0.wp.com\\\/www.markwilson.co.uk\\\/blog\\\/uploads\\\/image-4.png?fit=800%2C800&ssl=1\"},\"description\":\"A Chartered IT Professional, with recent experience in technology leadership, IT strategy and practice management roles, Mark Wilson is an Enterprise Architect in the Advisory and Management Group at risual. During a career spanning more than two decades, Mark has gained widespread recognition as an expert in his field including both industry and national press exposure. In addition to certifications from Microsoft, VMware, Red Hat, The Open Group and Axelos, Mark held a Microsoft Most Valuable Professional (MVP) award for three years and is now part of the MVP Reconnect programme. Mark is also well-known on social media and maintains an award-winning blog.\",\"sameAs\":[\"http:\\\/\\\/www.markwilson.co.uk\\\/\",\"https:\\\/\\\/www.instagram.com\\\/markwilsonuk\\\/\",\"https:\\\/\\\/www.linkedin.com\\\/in\\\/markawilson\\\/\",\"https:\\\/\\\/x.com\\\/markwilsonit\",\"https:\\\/\\\/www.youtube.com\\\/channel\\\/UCWHlZCoHRTocdvtrOJ2IL4A\"],\"url\":\"https:\\\/\\\/www.markwilson.co.uk\\\/blog\\\/author\\\/mark-wilson\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"SQL Server and Hadoop - unlikely bedfellows but a powerful combination - markwilson.it","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm","og_locale":"en_GB","og_type":"article","og_title":"SQL Server and Hadoop - unlikely bedfellows but a powerful combination - markwilson.it","og_description":"Big Data is hard to avoid \u2013 what does Microsoft\u2019s embrace of Hadoop mean for IT Managers? There are two words that seem particularly difficult to avoid at the moment: big data.\u00a0Infrastructure guys instinctivly shy away from data but such is its prevalence that big data is much more than just the latest IT buzzword &hellip; Continue reading SQL Server and Hadoop &#8211; unlikely bedfellows but a powerful combination","og_url":"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm","og_site_name":"markwilson.it","article_published_time":"2011-11-07T12:00:57+00:00","article_modified_time":"2017-01-14T13:55:14+00:00","author":"Mark Wilson","twitter_card":"summary_large_image","twitter_creator":"@markwilsonit","twitter_site":"@markwilsonit","twitter_misc":{"Written by":"Mark Wilson","Estimated reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm#article","isPartOf":{"@id":"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm"},"author":{"name":"Mark Wilson","@id":"https:\/\/www.markwilson.co.uk\/blog\/#\/schema\/person\/98f61365e7c39d6be942174b8c4de468"},"headline":"SQL Server and Hadoop &#8211; unlikely bedfellows but a powerful combination","datePublished":"2011-11-07T12:00:57+00:00","dateModified":"2017-01-14T13:55:14+00:00","mainEntityOfPage":{"@id":"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm"},"wordCount":1204,"commentCount":0,"publisher":{"@id":"https:\/\/www.markwilson.co.uk\/blog\/#\/schema\/person\/98f61365e7c39d6be942174b8c4de468"},"keywords":["Big data","Cloud Pro","Hadoop","Microsoft SQL Server"],"articleSection":["Technology"],"inLanguage":"en-GB","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm","url":"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm","name":"SQL Server and Hadoop - unlikely bedfellows but a powerful combination - markwilson.it","isPartOf":{"@id":"https:\/\/www.markwilson.co.uk\/blog\/#website"},"datePublished":"2011-11-07T12:00:57+00:00","dateModified":"2017-01-14T13:55:14+00:00","breadcrumb":{"@id":"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm#breadcrumb"},"inLanguage":"en-GB","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/sql-server-and-hadoop-unlikely-bedfellows-but-a-powerful-combination.htm#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.markwilson.co.uk\/blog"},{"@type":"ListItem","position":2,"name":"SQL Server and Hadoop &#8211; unlikely bedfellows but a powerful combination"}]},{"@type":"WebSite","@id":"https:\/\/www.markwilson.co.uk\/blog\/#website","url":"https:\/\/www.markwilson.co.uk\/blog\/","name":"markwilson.it","description":"get-info -class technology | write-output &gt; \/dev\/web","publisher":{"@id":"https:\/\/www.markwilson.co.uk\/blog\/#\/schema\/person\/98f61365e7c39d6be942174b8c4de468"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.markwilson.co.uk\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-GB"},{"@type":["Person","Organization"],"@id":"https:\/\/www.markwilson.co.uk\/blog\/#\/schema\/person\/98f61365e7c39d6be942174b8c4de468","name":"Mark Wilson","image":{"@type":"ImageObject","inLanguage":"en-GB","@id":"https:\/\/i0.wp.com\/www.markwilson.co.uk\/blog\/uploads\/image-4.png?fit=800%2C800&ssl=1","url":"https:\/\/i0.wp.com\/www.markwilson.co.uk\/blog\/uploads\/image-4.png?fit=800%2C800&ssl=1","contentUrl":"https:\/\/i0.wp.com\/www.markwilson.co.uk\/blog\/uploads\/image-4.png?fit=800%2C800&ssl=1","width":800,"height":800,"caption":"Mark Wilson"},"logo":{"@id":"https:\/\/i0.wp.com\/www.markwilson.co.uk\/blog\/uploads\/image-4.png?fit=800%2C800&ssl=1"},"description":"A Chartered IT Professional, with recent experience in technology leadership, IT strategy and practice management roles, Mark Wilson is an Enterprise Architect in the Advisory and Management Group at risual. During a career spanning more than two decades, Mark has gained widespread recognition as an expert in his field including both industry and national press exposure. In addition to certifications from Microsoft, VMware, Red Hat, The Open Group and Axelos, Mark held a Microsoft Most Valuable Professional (MVP) award for three years and is now part of the MVP Reconnect programme. Mark is also well-known on social media and maintains an award-winning blog.","sameAs":["http:\/\/www.markwilson.co.uk\/","https:\/\/www.instagram.com\/markwilsonuk\/","https:\/\/www.linkedin.com\/in\/markawilson\/","https:\/\/x.com\/markwilsonit","https:\/\/www.youtube.com\/channel\/UCWHlZCoHRTocdvtrOJ2IL4A"],"url":"https:\/\/www.markwilson.co.uk\/blog\/author\/mark-wilson"}]}},"jetpack_featured_media_url":"","jetpack-related-posts":[{"id":3112,"url":"https:\/\/www.markwilson.co.uk\/blog\/2011\/11\/more-on-nosql-hadoop-and-microsofts-entry-to-the-world-of-big-data.htm","url_meta":{"origin":3568,"position":0},"title":"More on NoSQL, Hadoop and Microsoft&#8217;s entry to the world of big data","author":"Mark Wilson","date":"Tuesday 8 November 2011","format":false,"excerpt":"Yesterday, my article on Microsoft's forays into the world of big data went up on Cloud Pro. It's been fun learning a bit about the subject (far more than is in that article - because big data is a big theme in my work at the moment) and I wanted\u2026","rel":"","context":"In &quot;Technology&quot;","block_context":{"text":"Technology","link":"https:\/\/www.markwilson.co.uk\/blog\/topic\/technology"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":3897,"url":"https:\/\/www.markwilson.co.uk\/blog\/2012\/05\/big-data-according-to-the-oracle.htm","url_meta":{"origin":3568,"position":1},"title":"Big data according to the Oracle","author":"Mark Wilson","date":"Wednesday 9 May 2012","format":false,"excerpt":"After many years of working mostly with Microsoft infrastructure products, the time came for me to increase my breadth of knowledge and, with that, comes the opportunity to take a look at what some of the other big players in our industry are up to. \u00a0Last year, I was invited\u2026","rel":"","context":"In &quot;Technology&quot;","block_context":{"text":"Technology","link":"https:\/\/www.markwilson.co.uk\/blog\/topic\/technology"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":7352,"url":"https:\/\/www.markwilson.co.uk\/blog\/2017\/12\/7352.htm","url_meta":{"origin":3568,"position":2},"title":"Microsoft SQL Server overview","author":"Mark Wilson","date":"Tuesday 19 December 2017","format":false,"excerpt":"I wrote this post a few months ago... and it crashed my blog. Gone. Needed to be restored from backup... ...hopefully this time I'll have more luck! One of the advantages of being in the MVP Reconnect programme is that I occasionally get invited to webcasts that open my eyes\u2026","rel":"","context":"In &quot;Technology&quot;","block_context":{"text":"Technology","link":"https:\/\/www.markwilson.co.uk\/blog\/topic\/technology"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":2922,"url":"https:\/\/www.markwilson.co.uk\/blog\/2011\/07\/can-we-process-big-data-in-the-cloud.htm","url_meta":{"origin":3568,"position":3},"title":"Can we process &#8220;Big Data&#8221; in the cloud?","author":"Mark Wilson","date":"Tuesday 26 July 2011","format":false,"excerpt":"I wrote last week about one of the presentations I saw at the recent Unvirtual conference and this post highlights another one of the lightning talks - this time on a subject that was truly new to me: Big Data. Tim Moreton (@timmoreton), from Acunu, spoke about using big data\u2026","rel":"","context":"In &quot;Technology&quot;","block_context":{"text":"Technology","link":"https:\/\/www.markwilson.co.uk\/blog\/topic\/technology"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":2700,"url":"https:\/\/www.markwilson.co.uk\/blog\/2011\/04\/azure-connect-the-missing-link-between-on-premise-and-cloud.htm","url_meta":{"origin":3568,"position":4},"title":"Azure Connect &#8211; the missing link between on-premise and cloud","author":"Mark Wilson","date":"Monday 18 April 2011","format":false,"excerpt":"Azure Connect offers a way to connect on-premise infrastructure with Windows Azure but it's lacking functionality that may hinder adoption. While Microsoft is one of the most dominant players in client-server computing, until recently, its position in the cloud seemed uncertain. \u00a0More recently, we've seen Microsoft lay out its stall\u2026","rel":"","context":"In &quot;Technology&quot;","block_context":{"text":"Technology","link":"https:\/\/www.markwilson.co.uk\/blog\/topic\/technology"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":452,"url":"https:\/\/www.markwilson.co.uk\/blog\/2005\/08\/sql-server-2005-overview.htm","url_meta":{"origin":3568,"position":5},"title":"SQL Server 2005 overview","author":"Mark Wilson","date":"Wednesday 3 August 2005","format":false,"excerpt":"As part of my recent quest to learn about SQL Server from an infrastructure perspective, I've been attending a number of events, one of which was Michael Platt's keynote at the May 2005 Microsoft Technical Roadshow - SQL Server 2005 for IT Pros.In my opinion, SQL Server 2005 (codenamed Yukon)\u2026","rel":"","context":"In \"Microsoft SQL Server\"","block_context":{"text":"Microsoft SQL Server","link":"https:\/\/www.markwilson.co.uk\/blog\/tag\/sql-server"},"img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/www.markwilson.co.uk\/blog\/wp-json\/wp\/v2\/posts\/3568","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.markwilson.co.uk\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.markwilson.co.uk\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.markwilson.co.uk\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.markwilson.co.uk\/blog\/wp-json\/wp\/v2\/comments?post=3568"}],"version-history":[{"count":5,"href":"https:\/\/www.markwilson.co.uk\/blog\/wp-json\/wp\/v2\/posts\/3568\/revisions"}],"predecessor-version":[{"id":6877,"href":"https:\/\/www.markwilson.co.uk\/blog\/wp-json\/wp\/v2\/posts\/3568\/revisions\/6877"}],"wp:attachment":[{"href":"https:\/\/www.markwilson.co.uk\/blog\/wp-json\/wp\/v2\/media?parent=3568"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.markwilson.co.uk\/blog\/wp-json\/wp\/v2\/categories?post=3568"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.markwilson.co.uk\/blog\/wp-json\/wp\/v2\/tags?post=3568"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}