Welcome!

@DXWorldExpo Authors: Pat Romanski, Zakia Bouachraoui, Elizabeth White, Yeshim Deniz, Liz McMillan

Related Topics: @DXWorldExpo, Java IoT, Microservices Expo, Containers Expo Blog, @CloudExpo, SDN Journal

@DXWorldExpo: Article

Consolidating Big Data

How to make your data center more cost-effective while improving performance

Cloud computing has opened the doors to a vast array of online services. With the emergence of new cloud technologies, both public and private companies are seeing increases in performance gains, elasticity and convenience. However, maintaining a competitive advantage has become increasingly difficult. Service providers are taking a closer look at their data storage infrastructure for ways to improve performance and cut costs.

If the status quo remains, maintaining low-cost cloud services will become increasingly difficult. Service providers will incur higher costs, while consumers become burdened with storage capacity restrictions. Such obstacles are influencing service providers to find new ways to scale cost-effectively and increase performance in the data center.

Cost-Benefit Analysis
In response to the increase of online account activity, service providers are consolidating their data centers to a centralized environment. By doing so, they are able to cut costs while increasing efficiency, allowing data to be accessible from any location. Centralizing equipment enables providers the ability to deliver enhanced Internet connections, performance and reliability.

However, with these added benefits also come disadvantages. For instance, scalability becomes more expensive and difficult to achieve. Improving efficiency within a centralized data center requires the purchase of additional high-performance, specialized equipment, which increases costs and energy consumption, challenging endeavors to control at scale. In an economy where cost-cutting is becoming a necessity for large and small enterprises alike, these added expenses are unacceptable.

Characteristics of the Cloud
Solving performance problems, like data bottlenecks, is a growing concern for cloud providers who must oversee significantly more users and accompanying performance demands, than do enterprises. Although the average user of an enterprise system requires elevated performance, these systems generally manage fewer users who are able to access their data directly through the network. Moreover, enterprise system users are accessing, saving and sending comparatively relatively small files that require less storage capacity and performance.

Outside the internal enterprise network, however, it's a different story. Cloud systems are simultaneously being accessed by a multitude of users across the Internet, which itself becomes a performance bottleneck. The average cloud user stores relatively larger files than the average enterprise user placing greater strains on data center resources. The cloud provider's storage system not only has to scale to each user, but must also sustain performance across all users as well.

Best Practices
In response to growing storage demands, cloud providers are faced with profound business implications. Service providers need to scale quickly in order to meet the booming demand for more data storage. The following best practices can help optimize data center ROI in a period of significant IT cutbacks:

  • Opt for commodity components when possible: Low-energy hardware makes good business sense. Commodity hardware is not only cost-effective, but also energy-efficient, which significantly reduces both setup and operating costs in one move.
  • Seek out a distributed storage system: Distributed storage presents the best way to build at scale even though the data center trend has been moving toward centralization. Increased performance at the software level counterbalances the performance advantage of a centralized data storage approach.
  • Avoid bottlenecks: A single point of entry can easily lead to a performance bottleneck. Adding caches to relieve the bottleneck, as most data center infrastructures currently do, quickly adds cost and complexity to a system. On the other hand, a horizontally scalable system that distributes data among all nodes delivers a high level of redundancy.

Moving Forward
Currently, Big Data storage consists mainly of high performance, vertically scaled storage systems. Since these infrastructures can only scale to a single petabyte and are costly, they are not a sustainable solution. Moving to a horizontally scaled data storage model that distributes data evenly onto energy-efficient hardware can reduce costs and increase performance in the cloud. With these insights, cloud service providers can take the necessary steps to improve the efficiency, scalability and performance of their data storage centers.

More Stories By Stefan Bernbo

Stefan Bernbo is the founder and CEO of Compuverde. For 20 years, he has designed and built numerous enterprise scale data storage solutions designed to be cost effective for storing huge data sets. From 2004 to 2010 Stefan worked within this field for Storegate, the wide-reaching Internet based storage solution for consumer and business markets, with the highest possible availability and scalability requirements. Previously, Stefan has worked with system and software architecture on several projects with Swedish giant Ericsson, the world-leading provider of telecommunications equipment and services to mobile and fixed network operators.

Comments (0)

Share your thoughts on this story.

Add your comment
You must be signed in to add a comment. Sign-in | Register

In accordance with our Comment Policy, we encourage comments that are on topic, relevant and to-the-point. We will remove comments that include profanity, personal attacks, racial slurs, threats of violence, or other inappropriate material that violates our Terms and Conditions, and will block users who make repeated violations. We ask all readers to expect diversity of opinion and to treat one another with dignity and respect.


DXWorldEXPO Digital Transformation Stories
CloudEXPO New York 2018, colocated with DevOpsSUMMIT and DXWorldEXPO New York 2018 will be held November 12-13, 2018, in New York City and will bring together Cloud Computing, FinTech and Blockchain, Digital Transformation, Big Data, Internet of Things, DevOps, AI and Machine Learning to one location.
CloudEXPO | DevOpsSUMMIT | DXWorldEXPO are the world's most influential, independent events where Cloud Computing was coined and where technology buyers and vendors meet to experience and discuss the big picture of Digital Transformation and all of the strategies, tactics, and tools they need to realize their goals. Sponsors of DXWorldEXPO | CloudEXPO benefit from unmatched branding, profile building and lead generation opportunities.
ICC is a computer systems integrator and server manufacturing company focused on developing products and product appliances to meet a wide range of computational needs for many industries. Their solutions provide benefits across many environments, such as datacenter deployment, HPC, workstations, storage networks and standalone server installations. ICC has been in business for over 23 years and their phenomenal range of clients include multinational corporations, universities, and small busines...
Headquartered in Plainsboro, NJ, Synametrics Technologies has provided IT professionals and computer systems developers since 1997. Based on the success of their initial product offerings (WinSQL and DeltaCopy), the company continues to create and hone innovative products that help its customers get more from their computer applications, databases and infrastructure. To date, over one million users around the world have chosen Synametrics solutions to help power their accelerated business or per...
All in Mobile is a place where we continually maximize their impact by fostering understanding, empathy, insights, creativity and joy. They believe that a truly useful and desirable mobile app doesn't need the brightest idea or the most advanced technology. A great product begins with understanding people. It's easy to think that customers will love your app, but can you justify it? They make sure your final app is something that users truly want and need. The only way to do this is by ...
Digital Transformation and Disruption, Amazon Style - What You Can Learn. Chris Kocher is a co-founder of Grey Heron, a management and strategic marketing consulting firm. He has 25+ years in both strategic and hands-on operating experience helping executives and investors build revenues and shareholder value. He has consulted with over 130 companies on innovating with new business models, product strategies and monetization. Chris has held management positions at HP and Symantec in addition to ...
DXWorldEXPO LLC announced today that Nutanix has been named "Platinum Sponsor" of CloudEXPO | DevOpsSUMMIT | DXWorldEXPO New York, which will take place November 12-13, 2018 in New York City. Nutanix makes infrastructure invisible, elevating IT to focus on the applications and services that power their business. The Nutanix Enterprise Cloud Platform blends web-scale engineering and consumer-grade design to natively converge server, storage, virtualization and networking into a resilient, softwar...
DXWorldEXPO LLC announced today that Big Data Federation to Exhibit at the 22nd International CloudEXPO, colocated with DevOpsSUMMIT and DXWorldEXPO, November 12-13, 2018 in New York City. Big Data Federation, Inc. develops and applies artificial intelligence to predict financial and economic events that matter. The company uncovers patterns and precise drivers of performance and outcomes with the aid of machine-learning algorithms, big data, and fundamental analysis. Their products are deployed...
Dynatrace is an application performance management software company with products for the information technology departments and digital business owners of medium and large businesses. Building the Future of Monitoring with Artificial Intelligence. Today we can collect lots and lots of performance data. We build beautiful dashboards and even have fancy query languages to access and transform the data. Still performance data is a secret language only a couple of people understand. The more busine...
The challenges of aggregating data from consumer-oriented devices, such as wearable technologies and smart thermostats, are fairly well-understood. However, there are a new set of challenges for IoT devices that generate megabytes or gigabytes of data per second. Certainly, the infrastructure will have to change, as those volumes of data will likely overwhelm the available bandwidth for aggregating the data into a central repository. Ochandarena discusses a whole new way to think about your next...