Welcome!

@BigDataExpo Authors: Karthick Viswanathan, Elizabeth White, Pat Romanski, Liz McMillan, Angsuman Dutta

Related Topics: @ThingsExpo, Machine Learning , @BigDataExpo

@ThingsExpo: Blog Post

Are You Thinking About Big Data When Doing IoT? – You Should Be | @ThingsExpo #ML #IoT #M2M #BigData

Based on all estimates by industry analysts and current trends, the IoT is growing at an incredible rate and is here to stay

Are You Thinking About Big Data When Doing IoT? - You Should Be

There is no denying the Internet of Things (IoT) is a hot topic. Gartner positions IoT as being at the peak of the ‘hype cycle.' From a size perspective, these ‘Things' can be anything, from a small sensor to a large appliance, and everything in between. The data transmitted by these devices, for the most part, tends to be small - tiny packets of information destined for consumption and analysis, bringing value to the business.

Is there hype? Yes. As with any new technology, there is always a level of hype involved. Are the data packets involved small? For the most part, yes (there are always exceptions). While both may be true, The Internet of Things is growing at breakneck speed. No matter which analyst you read, the growth predictions are staggering. Gartner predicts that we will hit over 20 billion (with a B) devices by 2020. IHS predicts even larger numbers, with 30 billion by 2020, and over 75 billion devices by 2025. No matter what, that's a lot of devices, and no matter how small the packets, multiplied by the number of devices, that's a lot of data.

It's not the things, it's the data
What I find interesting is that many times the focus of discussion when talking IoT are the devices, the sensors, the hardware itself. The latest Fitbit or smartwatch. The new smart appliances, the new smart or self-driving cars (which are amalgamations of many ‘things'). Yes, those technologies are interesting (okay, fascinating, I will admit, my inner geek loves getting down into the actual technologies), but when we are looking at the world of IoT, we should take a step back, look at the big picture. What value are these devices providing?

What I am about to say may sound like heresy to many. IoT is not about the devices. The devices are not the end goal. The devices are tools, mechanisms, conduits, conduits of information. They provide (and consume) information. Massive amounts of information. A former colleague of mine for years was always fond of saying, ‘Ed, It's all about the data.' In the burgeoning world of IoT that statement identifies the true business value of IoT. Information.

Watching out for potholes
Recently, Ford announced they were testing a pothole detector and alert system for cars. Living in New England, let me tell you, potholes are the bane of a car driver's existence. Many a car ends up in the repair shop during pothole season. Given that, the concept is intriguing. The manufacturer has cameras mounted on the vehicles. The cameras scan the roadway around the vehicle looking for signs of potholes. Image recognition allows it to make this determination. If a pothole is detected, the system will allow the car to avoid hitting the pothole, and thus potential damage to the vehicle.

Now some would say, ‘what does that have to do with big data?' The system is self-contained within the vehicle. To be useful, the system needs to react in near real-time to the situation. It doesn't have time to send all the data back to the cloud for analysis to determine if there is a pothole. Also, what if it loses network connection? All valid points. Let's take a step back, and look at the bigger picture.

  • How does the system recognize a pothole? Image recognition. What does image recognition need? Lots of data about what potholes look like. Machine learning algorithms help it determine if its seeing a pothole, and those algorithms need data to do that.
  • What will be the source of those pothole images? Wouldn't it be useful if images of any potholes the system encounters become part of the source data for the image recognition system to improve its detection? Wouldn't it be useful to provide that back to a central location to improve the algorithms and detection software, which could then be sent back to all the other vehicles to improve their capability?
  • What about all the cars without the system? Wouldn't it be nice if the pothole locations were flagged to the various GPS applications people use so they are aware of the pothole and its location?
  • What about the local public works department? Wouldn't it be nice if they were automatically notified about the new pothole identified so it could be repaired?

Ingestion considerations
Given the importance of the data to the success of any IoT implementation, ingesting that information is critical to the successful implementation.

  • Data Quality - In the world of data, quality has always been an important consideration. Data cleansing and scrubbing is standard practice already in many organizations. It has become critical for IoT implementations. Ingesting dirty data into even the best IoT implementation will bring it to a grinding halt.
  • Data Volume - As I have mentioned already, many times the data packets for an individual device/sensor are small. That being said, multiplied by the sheer number of devices, the volume can quickly overwhelm a network or storage environment if not planned for appropriately. These considerations also must take into account location
  • Data Timeliness - Besides volume, new and timely data is also a consideration. In the pothole example, if the last update was weeks ago, how valid is the location anymore?
  • Data Pedigree - Where did the data come from? Is it a valid source? The pedigree is less important when using internal systems, as the source is well known, but IoT systems, by their nature, frequently will be getting their data from devices and sources outside the normal perimeter. This requires extra effort to ensure you trust the information being consumed.

No technology negates the need for good design and planning
Based on all estimates by industry analysts and current trends, the Internet of Things is growing at an incredible rate and is here to stay. There is a big radar blip of data outside your data center that is not going anywhere. That data provides great value, but also many challenges that need to be taken into consideration. If you are doing IoT and are not looking at Big Data, you are missing an opportunity and business value. As many of my readers have heard me say frequently, no technology negates the need for good design and planning. The Internet of Things and the accompanying Big Data demands it if you are to be successful.

More Stories By Ed Featherston

Ed Featherston is VP, Principal Architect at Cloud Technology Partners. He brings 35 years of technology experience in designing, building, and implementing large complex solutions. He has significant expertise in systems integration, Internet/intranet, and cloud technologies. He has delivered projects in various industries, including financial services, pharmacy, government and retail.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


@BigDataExpo Stories
Cloud resources, although available in abundance, are inherently volatile. For transactional computing, like ERP and most enterprise software, this is a challenge as transactional integrity and data fidelity is paramount – making it a challenge to create cloud native applications while relying on RDBMS. In his session at 21st Cloud Expo, Claus Jepsen, Chief Architect and Head of Innovation Labs at Unit4, will explore that in order to create distributed and scalable solutions ensuring high availa...
Connecting to major cloud service providers is becoming central to doing business. But your cloud provider’s performance is only as good as your connectivity solution. Massive Networks will place you in the driver's seat by exposing how you can extend your LAN from any location to include any cloud platform through an advanced high-performance connection that is secure and dedicated to your business-critical data. In his session at 21st Cloud Expo, Paul Mako, CEO & CIO of Massive Networks, wil...
When shopping for a new data processing platform for IoT solutions, many development teams want to be able to test-drive options before making a choice. Yet when evaluating an IoT solution, it’s simply not feasible to do so at scale with physical devices. Building a sensor simulator is the next best choice; however, generating a realistic simulation at very high TPS with ease of configurability is a formidable challenge. When dealing with multiple application or transport protocols, you would be...
As businesses adopt functionalities in cloud computing, it’s imperative that IT operations consistently ensure cloud systems work correctly – all of the time, and to their best capabilities. In his session at @BigDataExpo, Bernd Harzog, CEO and founder of OpsDataStore, presented an industry answer to the common question, “Are you running IT operations as efficiently and as cost effectively as you need to?” He then expounded on the industry issues he frequently came up against as an analyst, and ...
SYS-CON Events announced today that App2Cloud will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct. 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. App2Cloud is an online Platform, specializing in migrating legacy applications to any Cloud Providers (AWS, Azure, Google Cloud).
IoT is at the core or many Digital Transformation initiatives with the goal of re-inventing a company's business model. We all agree that collecting relevant IoT data will result in massive amounts of data needing to be stored. However, with the rapid development of IoT devices and ongoing business model transformation, we are not able to predict the volume and growth of IoT data. And with the lack of IoT history, traditional methods of IT and infrastructure planning based on the past do not app...
Blockchain is a shared, secure record of exchange that establishes trust, accountability and transparency across supply chain networks. Supported by the Linux Foundation's open source, open-standards based Hyperledger Project, Blockchain has the potential to improve regulatory compliance, reduce cost and time for product recall as well as advance trade. Are you curious about Blockchain and how it can provide you with new opportunities for innovation and growth? In her session at 20th Cloud Exp...
To get the most out of their data, successful companies are not focusing on queries and data lakes, they are actively integrating analytics into their operations with a data-first application development approach. Real-time adjustments to improve revenues, reduce costs, or mitigate risk rely on applications that minimize latency on a variety of data sources. Jack Norris reviews best practices to show how companies develop, deploy, and dynamically update these applications and how this data-first...
Intelligent Automation is now one of the key business imperatives for CIOs and CISOs impacting all areas of business today. In his session at 21st Cloud Expo, Brian Boeggeman, VP Alliances & Partnerships at Ayehu, will talk about how business value is created and delivered through intelligent automation to today’s enterprises. The open ecosystem platform approach toward Intelligent Automation that Ayehu delivers to the market is core to enabling the creation of the self-driving enterprise.
Internet-of-Things discussions can end up either going down the consumer gadget rabbit hole or focused on the sort of data logging that industrial manufacturers have been doing forever. However, in fact, companies today are already using IoT data both to optimize their operational technology and to improve the experience of customer interactions in novel ways. In his session at @ThingsExpo, Gordon Haff, Red Hat Technology Evangelist, shared examples from a wide range of industries – including en...
You know you need the cloud, but you’re hesitant to simply dump everything at Amazon since you know that not all workloads are suitable for cloud. You know that you want the kind of ease of use and scalability that you get with public cloud, but your applications are architected in a way that makes the public cloud a non-starter. You’re looking at private cloud solutions based on hyperconverged infrastructure, but you’re concerned with the limits inherent in those technologies.
SYS-CON Events announced today that Massive Networks will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Massive Networks mission is simple. To help your business operate seamlessly with fast, reliable, and secure internet and network solutions. Improve your customer's experience with outstanding connections to your cloud.
SYS-CON Events announced today that SkyScale will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. SkyScale is a world-class provider of cloud-based, ultra-fast multi-GPU hardware platforms for lease to customers desiring the fastest performance available as a service anywhere in the world. SkyScale builds, configures, and manages dedicated systems strategically located in maximum-security...
SYS-CON Events announced today that Grape Up will exhibit at SYS-CON's 21st International Cloud Expo®, which will take place on Oct. 31 – Nov 2, 2017, at the Santa Clara Convention Center in Santa Clara, CA. Grape Up is a software company specializing in cloud native application development and professional services related to Cloud Foundry PaaS. With five expert teams that operate in various sectors of the market across the U.S. and Europe, Grape Up works with a variety of customers from emergi...
Detecting internal user threats in the Big Data eco-system is challenging and cumbersome. Many organizations monitor internal usage of the Big Data eco-system using a set of alerts. This is not a scalable process given the increase in the number of alerts with the accelerating growth in data volume and user base. Organizations are increasingly leveraging machine learning to monitor only those data elements that are sensitive and critical, autonomously establish monitoring policies, and to detect...
Because IoT devices are deployed in mission-critical environments more than ever before, it’s increasingly imperative they be truly smart. IoT sensors simply stockpiling data isn’t useful. IoT must be artificially and naturally intelligent in order to provide more value In his session at @ThingsExpo, John Crupi, Vice President and Engineering System Architect at Greenwave Systems, will discuss how IoT artificial intelligence (AI) can be carried out via edge analytics and machine learning techn...
Everything run by electricity will eventually be connected to the Internet. Get ahead of the Internet of Things revolution and join Akvelon expert and IoT industry leader, Sergey Grebnov, in his session at @ThingsExpo, for an educational dive into the world of managing your home, workplace and all the devices they contain with the power of machine-based AI and intelligent Bot services for a completely streamlined experience.
With tough new regulations coming to Europe on data privacy in May 2018, Calligo will explain why in reality the effect is global and transforms how you consider critical data. EU GDPR fundamentally rewrites the rules for cloud, Big Data and IoT. In his session at 21st Cloud Expo, Adam Ryan, Vice President and General Manager EMEA at Calligo, will examine the regulations and provide insight on how it affects technology, challenges the established rules and will usher in new levels of diligence a...
Existing Big Data solutions are mainly focused on the discovery and analysis of data. The solutions are scalable and highly available but tedious when swapping in and swapping out occurs in disarray and thrashing takes place. The resolution for thrashing through machine learning algorithms and support nomenclature is through simple techniques. Organizations that have been collecting large customer data are increasingly seeing the need to use the data for swapping in and out and thrashing occurs ...
An increasing number of companies are creating products that combine data with analytical capabilities. Running interactive queries on Big Data requires complex architectures to store and query data effectively, typically involving data streams, an choosing efficient file format/database and multiple independent systems that are tied together through custom-engineered pipelines. In his session at @BigDataExpo at @ThingsExpo, Tomer Levi, a senior software engineer at Intel’s Advanced Analytics ...