“Big Data” Is Coming, “Big Data” Is Coming:

What Does It Mean To Analytic Professionals?

For my first blog posting, I decided to focus on one of the hottest topics in analytics right now: “Big Data”. Big Data is typically defined as very large, newer sources of data that don’t necessarily have a convenient or consistent structure. Think web logs, sensor data, RFID data, and other similar data streams. With such a wide range of new data streams coming online in rapid succession, Big Data has rapidly risen in importance and visibility.

As with any new topic getting a lot of attention, there are all sorts of claims related to how Big Data is going to fundamentally change everything. As a person who has “grown up” over the past few decades with roles both doing analytics and managing analytic projects, I don’t buy that. Certainly, Big Data is something that will play an increasing role in corporate analytics. It will provide some big benefits and lead to some terrific new analytics. But, from the view of an analyst in the trenches or analytics leader running the team, will Big Data fundamentally change what you do and why you do it? Let’s explore…

Analysts have been at the forefront of exploring new data sources for a long time. Who first started to analyze call detail records within telecom companies? Analysts. I was doing churn analysis against mainframe tapes at AT&T in my first job. This was at a time when reports or analysis on such data were far from standard. Who first started digging into retail point of sale (POS) data to figure out what nuggets it held? Analysts. Originally, the thought of tracking 10s to 100s of thousands of products across thousands of stores was considered a huge problem. Today, not so much. The analytical professionals who first dipped their toe into such sources were dealing with what at the time was an unthinkably large amount of data to try and analyze. Many people doubted it was possible and even questioned the value of such data. Sounds a bit like Big Data, doesn’t it?

Analysts have always sought out new, interesting data sources. They’ve also always pushed scalability to the limit. So, in my mind, Big Data isn’t really going to change much about what analysts are doing and why. Sure, the problems addressed will evolve due to the new data sources just as they always have. But, at the end of the day, analysts will simply be exploring new, unthinkably large data sets as they have always done.

One commonly accepted aspect of Big Data is that what qualifies as Big Data will change over time. As more capacity and scale is available, what is Big Data today won’t necessarily be Big Data tomorrow. Sounds a lot like how call detail records and POS data aren’t considered all that big anymore either.

What the Big Data trend will change are some of the tactics that analysts utilize to do their work. New tools such as MapReduce will be added alongside SAS and SQL to help deal more effectively with the flood of data. Complex filtering algorithms will be developed to help parse out the few meaningful pieces from a raw stream of Big Data. Modeling and forecasting processes will be updated to include Big Data inputs on top of the currently existing inputs.

Big Data will drive new & innovative analytics. It will force analysts to continue to get creative to work within scalability constraints. Big Data will only get bigger over time. But, analysts are prepared for this. Incorporating Big Data really isn’t different from what they’ve always done. It is simply the next generation of data sources to understand and put to use. Organizations need to turn their analytics professionals loose to do their thing and let them do what they do best. They are fully capable of taming Big Data if provided the opportunity and support to do so.

Related Posts Plugin for WordPress, Blogger...
  • Joshua

    Great post. The big data marketers make it sound like there has never been big datasets before.

    • http://twitter.com/randyzwitch Randy Zwitch

      Exactly.  Once a dataset can’t be opened with an Excel spreadsheet, it becomes “big data”.  That’s been around for quite some time, and analytics has always been there to figure out how to extract the knowledge.

      Today’s “Big Data” sounds like a data feed prior to ETL.  Not that crazy.

  • Sandeep Raut

    this is excellent article about big data.

  • Johnfurrier

    Great post on big data…analytics at the speed of business is the new differentiation for business..we are following this trend big time at siliconangle.com

  • http://hackathorn.myopenid.com/ hackathorn

    Excellent overview of Big Data and its impact on analysts into the future. One aspect that will complicate life for future analysts is the increasing complexity of data semantics. Within the EDW, we had some control over such semantics. However, when analyzing Twitter streams..for example… anything goes as long as it fits within 140 characters! The semantics is ever changing with random abbreviations, #labels, multi-cultural humor, and on and on… So Big Data is not only big in volume, it is also big in complexity!

  • http://www.bridgei2i.com Prithvijit Roy

    Nice article and i agree with your viewpoint that Big Data is not something new. I think this big bang around Big Data is happening today because of the increased awareness of the value of data and analytics in today’s world. While increasing complexity of data in today’s world can not be undermined, but many more people recognizes the value of Big data and analytics today as compared to a decade or two back. Hence we will see Big data and analytics resulting into much higher impact compared to ever before.

  • Doug Laney

    Great post Bill.  Reality is that Big Data is simply data that’s an order of magnitude bigger than what one is accustomed to….Grasshopper. :-)    Seriously, at Gartner (then META Group) around 2003 I established a framework for “bigness” in terms of both the challenges and opportunities presented. The three dimensions are data volume, data velocity and data variety.  So when we think about data girth, we also shouldn’t forget that data is flowing and demanded at increasing speeds, and also comes in an increasing array of formats. Vendors of course love this while IT organizations loathe it. 

  • Srinath Sridhar

    Good article. However what is missing is the analysis is the definition of “BIG Data” has two aspects.One is the data size which you have rightly pointed out is nothing new.However the bigger aspect is the unstructured nature of the data and converting this into structured form and making sense will be the bigger challenge

Content Protected Using Blog Protector By: PcDrome.