Public Sector Big Data: Five Ways Big Data Must Evolve in 2013

2012 will go down as a “Big” year for Big Data in the public sector

By Bob Gourley

Editor’s note: This guest post provides context on mission-focused data analytics in the federal space by one of the leaders of the federal Big Data movement, Ray Muslimani. -bg

2012 will go down as a “Big” year for Big Data in the public sector. Rhetoric and hype have been followed by tangible action on the part of both government and industry. The $200 million Big Data initiative unveiled by the White House in March 2012 injected R&D funding and credibility into efforts to develop tools and technologies to help solve the nation’s most pressing challenges.

On the industry side, the recently issued TechAmerica report, “Demystifying Big Data,” provides agencies with a roadmap for using Big Data to better serve citizens. It also offers a set of policy recommendations and practical steps agencies can take to get started with Big Data initiatives.

For all of the enthusiasm around Big Data this year, every indication is that 2013 will be the year when Big Data transforms the business of government. Below are five ways Big Data must evolve in 2013 to deliver on its promise.

Demystify Big Data
Government agencies warmed to the potential of Big Data throughout 2012, but more education is required to help decision makers wade through their options and justify further investments. Removing the ambiguities surrounding Big Data requires an emphasis in 2013 on education from both industry and government.

The TechAmerica Big Data report is a good example of how industry can play an active role in guiding agencies through Big Data initiatives. It also underscores that vendors can’t generate more Big Data RFPs through marketing slicks and sales tactics alone. That approach will not demystify Big Data; it will simply seed further doubt if providers of Big Data tools and solutions focus only on poking holes in competitors’ alternatives.

Industry and government should follow proven templates for education in 2013. For example, agencies can arrange “Big Data Days” modeled on the Industry Tech Days held today. Big Data industry days can help IT providers gain better insight into how each agency plans to approach its Big Data challenges in 2013 and give those agencies an opportunity to see a wide range of Big Data services.

The Big Data education process must also extend to contracting officers. Agencies need guidance on how RFPs can be constructed to address a service-based model.

Consumerize Big Data
While those within the public sector with the proper training and skills to analyze data have benefited from advanced Big Data tools, it has been far more difficult for everyday business users and decision makers to access the data in a useful way. Sluggish query responses, data quality issues, and a clunky user experience are undermining the benefits Big Data analytics can deliver and forcing users to become de facto “data scientists” to make sense of it all.

Underscoring this challenge is a 2012 MeriTalk survey, “The Big Data Gap,” which finds that just 60 percent of IT professionals say their agency is analyzing the data it collects and a modest 40 percent are using data to make strategic decisions. All of this despite the fact that 96 percent of those surveyed expect their agency’s stored data to grow over the next two years, by an average of 64 percent. The gap suggests that those who are not “data scientists” struggle to convert data into business decisions.

What if any government user could ask a question in natural language and receive the answer as a relevant visualization? For Big Data to evolve in 2013, we must consumerize the user experience by moving beyond spreadsheets and reports and placing the power of analytics in the hands of users at any level, regardless of analytics expertise.
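As a rough illustration of that idea, here is a minimal sketch that maps a plain-English question onto a grouped summary and a chart. The sample data, column names, and keyword-based question handling are hypothetical stand-ins for a real natural-language analytics layer, not any particular product.

```python
# Minimal sketch: turn a plain-English question into a chart.
# The data, columns, and naive keyword "parsing" are illustrative only.
import pandas as pd
import matplotlib.pyplot as plt

# Hypothetical agency spending figures standing in for a Big Data query result.
data = pd.DataFrame({
    "agency": ["DoD", "DHS", "HHS", "DoD", "DHS", "HHS"],
    "year":   [2011, 2011, 2011, 2012, 2012, 2012],
    "spend":  [120.5, 45.2, 88.0, 131.7, 49.9, 93.4],  # $B, illustrative
})

def answer(question, df):
    """Map a simple question to a grouped summary and render it as a bar chart."""
    q = question.lower()
    # Very naive intent detection: group by whichever column the question mentions.
    group_col = "agency" if "agency" in q else "year"
    summary = df.groupby(group_col)["spend"].sum()
    summary.plot(kind="bar", title=question)
    plt.ylabel("spend ($B)")
    plt.tight_layout()
    plt.show()

answer("Total spend by agency", data)
```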

Mobilize Big Data
IDC Government Insights predicts that in 2013, 35 percent of new Federal and state applications will be mobile. At the same time, 65 percent of Federal IT executives expect mobile device use to increase by 20 percent in 2013, according to The 2012-2013 Telework/Mobile IT Almanac.

Part of consumerizing Big Data means building it for any device so that users do not need to be tethered to their desktops to analyze data. Agency decision makers must be empowered to easily view and analyze data on tablets and smartphones, while the increase of teleworking in the public sector requires Big Data to be accessible from anywhere, at any time, and on any device.

There is promising innovation at work by both established Federal IT providers and upstarts taking a mobile-first path to Big Data, rather than the traditional approach of building BI dashboards for the desktop. The degree to which 2013 sees a shift in Big Data from the desktop to tablets and smartphones will depend on how forcefully solution providers embrace that mobile-first approach.

Act on Big Data
A tremendous amount of “thought” energy went into Big Data in 2012. For Big Data to evolve in a meaningful way in 2013, initiatives and studies must generate more action in the form of Big Data RFIs and RFPs.

Within the tight budget climate, agencies will not act on Big Data if vendor proposals require massive investments in IT infrastructure and staffing. To the extent possible, the financial and resource burden must shift from agency to vendor. For example, some vendors have developed “Big Data Clouds” that allow agencies to leverage a secure, scalable framework for storing and managing data, along with a toolset for performing consumer-grade search and analysis on that data.
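To make that service-based model concrete, here is a hedged sketch of what it might look like from the agency side, assuming the vendor exposes an S3-compatible storage endpoint. The endpoint URL, bucket name, credentials, and file names are placeholders, not a real service or dataset.

```python
# Sketch: pushing a dataset to a vendor-hosted, S3-compatible "Big Data Cloud"
# instead of standing up agency-owned storage infrastructure.
# Endpoint, bucket, credentials, and file names are hypothetical placeholders.
import boto3

s3 = boto3.client(
    "s3",
    endpoint_url="https://bigdata-cloud.example.gov",  # hypothetical vendor endpoint
    aws_access_key_id="AGENCY_KEY_ID",
    aws_secret_access_key="AGENCY_SECRET",
)

# Store a raw dataset in the vendor-managed bucket.
s3.upload_file("citizen_requests_2012.csv", "agency-raw-data", "2012/citizen_requests.csv")

# The vendor's toolset would handle indexing, search, and analysis;
# here we simply confirm the object was stored.
for obj in s3.list_objects_v2(Bucket="agency-raw-data").get("Contents", []):
    print(obj["Key"], obj["Size"])
```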

Open Big Data
Adoption of Big Data solutions has been accelerated by open source tools such as Hadoop, MapReduce, Hive, and HBase. While some agencies will find it tempting to withdraw to the comfort of proprietary Big Data tools that they can control in closed systems, that path undermines the value Big Data can ultimately deliver.
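As a small example of how these open source tools are typically used, here is a hedged sketch of a Hadoop Streaming mapper written in Python: it emits a count of one for each service request, and Hadoop groups the keys so a reducer (or the built-in aggregate reducer) can sum them. The tab-separated input layout and field names are assumptions for illustration, not a real agency dataset.

```python
#!/usr/bin/env python
# Sketch: Hadoop Streaming mapper counting service requests per office.
# Assumed input: tab-separated lines of the form  office<TAB>request_id<TAB>...
# Test locally:  cat requests.tsv | python map_requests.py | sort
import sys

def main():
    for line in sys.stdin:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 2:
            continue  # skip malformed records
        office = fields[0]
        # Emit key<TAB>1 pairs; Hadoop groups by key before the reduce step.
        print("%s\t1" % office)

if __name__ == "__main__":
    main()
```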

One could argue that as open source goes in 2013, so goes Big Data. If open source platforms and tools continue to address agency demands for security, scalability, and flexibility, the benefits from Big Data within and across agencies will increase exponentially. There are hundreds of thousands of viable open source technologies on the market today. Not all are suitable for agency requirements, but as agencies update and expand their uses of data, these tools offer limitless opportunities to innovate. Additionally, opting for open source instead of proprietary vendor solutions prevents an agency from being locked into a single vendor’s tool that it may at some point outgrow or find ill suited to its needs.

More Stories By Bob Gourley

Bob Gourley, former CTO of the Defense Intelligence Agency (DIA), is Founder and CTO of Crucial Point LLC, a technology research and advisory firm providing fact-based technology reviews in support of venture capital, private equity, and emerging technology firms. He has extensive industry experience in intelligence and security. He was awarded an intelligence community meritorious achievement award by AFCEA in 2008 and has been recognized as an InfoWorld Top 25 CTO and as one of the most fascinating communicators in Government IT by GovFresh.
