New YCSB Benchmark from Thumbtack Technology Reveals a Nearly 10x Performance Advantage for the Native Flash Aerospike Database Over Other Leading NoSQL Databases

The advantage of flash technology for powering fast big data transactions is highlighted in a new, independent NoSQL database benchmark report from Thumbtack Technology. Aerospike today announced that the Thumbtack benchmark results revealed a nearly 10x performance advantage for the Aerospike database running on a combination of native flash and DRAM over two other NoSQL databases running on a mix of DRAM and RAM: Apache Cassandra and 10gen MongoDB. Aerospike is making reprints of the Thumbtack report, “Ultra-High Performance NoSQL Benchmarking: Analyzing Durability and Performance Tradeoffs,” available at www.aerospike.com/benchmark.

“Our goal with the study was to determine which key-value store is most appropriate in a real-world scenario. That meant not only using raw hardware and solid-state drives, but configuring each database to optimally handle the workload,” said Ben Engber, Thumbtack Technology CEO and co-author of the benchmark study. “Aerospike was the only vendor among those tested that specifically optimizes its platform around SSDs, so the Aerospike database’s superior performance was not unexpected.”

Benchmark Overview

Thumbtack evaluated the three NoSQL database and key-value store offerings using the Yahoo Cloud Serving Benchmark (YCSB) as the basis for the testing. To gauge their capacity and speed in handling big data while operating against disk, Thumbtack tested the databases’ ability to process 500 million records. A fourth database, the Couchbase Server, was dropped from this evaluation because it could not load all of the records.

In the Thumbtack benchmark measuring a read-heavy workload, Aerospike achieved a maximum throughput of more than 300,000 operations per second, nearly 10x faster than its nearest competitor, Cassandra. For the balanced workload benchmark, Aerospike was 5x faster than Cassandra. In both tests, MongoDB placed third. The Thumbtack report notes, “This was particularly impressive since both Cassandra and MongoDB were using weak consistency models (returning success after only one copy has been written) and Aerospike was using a strong consistency model.”

Across all four Thumbtack tests for latency, Aerospike maintained sub-millisecond latency even while processing 180,000 to 400,000 operations per second. Latency for Cassandra ranged from sub-millisecond to 8 milliseconds, and for MongoDB, it varied from 1 to 20 milliseconds; meanwhile neither database exceeded 40,000 operations per second.
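
To make the two reported metrics concrete, the following is a minimal Python sketch of how a YCSB-style harness might derive throughput and latency from a mix of reads and updates. It is not Thumbtack's harness or YCSB itself: the `run_workload` function is hypothetical, an in-memory dictionary stands in for a database, and the 95/5 and 50/50 read/update ratios are assumed values for the "read-heavy" and "balanced" workloads, which this announcement does not specify.

```python
import random
import time


def run_workload(store, keys, read_fraction, operations, seed=0):
    """Run a read/update mix against `store` and return summary statistics.

    `store` is a plain dict standing in for a database (an assumption made
    for illustration); a real benchmark would call a database client here.
    """
    rng = random.Random(seed)
    latencies = []
    reads = updates = 0
    started = time.perf_counter()
    for _ in range(operations):
        key = rng.choice(keys)
        op_start = time.perf_counter()
        if rng.random() < read_fraction:
            store.get(key)            # read operation
            reads += 1
        else:
            store[key] = "updated"    # update operation
            updates += 1
        latencies.append(time.perf_counter() - op_start)
    elapsed = time.perf_counter() - started
    latencies.sort()
    p95_index = int(0.95 * (len(latencies) - 1))
    return {
        "reads": reads,
        "updates": updates,
        # Throughput: completed operations divided by wall-clock time.
        "ops_per_second": operations / elapsed if elapsed > 0 else float("inf"),
        # Latency: 95th-percentile per-operation time, in milliseconds.
        "p95_latency_ms": latencies[p95_index] * 1000,
    }


keys = [f"user{i}" for i in range(1000)]
store = {key: "initial" for key in keys}

# Assumed mixes: 95% reads for "read-heavy", 50% reads for "balanced".
read_heavy = run_workload(store, keys, read_fraction=0.95, operations=10_000)
balanced = run_workload(store, keys, read_fraction=0.50, operations=10_000)

print("read-heavy:", read_heavy)
print("balanced:  ", balanced)
```

The absolute numbers printed by this sketch say nothing about any real database. It only shows why throughput (operations per second) and latency (time per operation) are reported separately: a system can sustain high throughput while individual operations remain slow, or the reverse.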

The Flash-based SSD Advantage

“From the start, we recognized that a database optimized for both flash and DRAM would be ideal for emerging classes of applications that live or die by their ability to reliably respond within milliseconds,” said Brian Bulkowski, Aerospike founder and CTO. “Today our customers rely on our native flash Aerospike database to achieve both the real-time responses and 100% uptime that this extremely high performance enables. Now the Thumbtack benchmark adds significant new validation of the value in optimizing databases and applications for SSDs.”

“Providing fast, reliable access to data in real time is simple to say, but it’s not easy to do,” said Patrick DeAngelis, [x+1] CTO. “However, running our Aerospike clusters on SSDs, we’ve seen the database scale to a few billion key values with no compromise to performance, and we’ve even seen response times under 1 millisecond, which is phenomenal. We are now rethinking the way we store and access our data and intelligence because of the options that Aerospike gives us.”

“We continue to push the boundaries of what you can do in a real-time environment, and Aerospike’s architecture, which is optimized for flash storage, means we can be a lot more efficient with the way we spread our data,” said Mazdak Rezvani, Chango CTO. “Already we’ve seen a 25% improvement in efficiency with Aerospike over our previous NoSQL database.”

Complementing Thumbtack’s NoSQL benchmark is the Aerospike Certification Tool (ACT) for benchmarking the ability of SSDs to support database transactions. ACT, which Aerospike has made available as open source software, can be downloaded at http://github.com/aerospike/act. In December 2012, Aerospike published the ACT benchmark results for several SSD models from Fusion-io, Intel, OCZ, and Samsung. These results can be viewed at http://www.aerospike.com/blog/act-ssd-benchmark.

About Thumbtack

Based in Brooklyn, New York, Thumbtack Technology is a leading software development services firm that specializes in building and integrating scalable applications and systems for Fortune 500 clients and start-ups. Thumbtack is business driven, focusing on key customer pain points, tackling the most difficult problems while scaling applications to an Internet audience. To learn more, visit http://www.thumbtack.net.

About Aerospike

Aerospike, Inc. offers the only real-time NoSQL database and key-value store that delivers predictable high performance for mission-critical, Web-scale applications. Aerospike’s flash-optimized, shared-nothing architecture scales linearly, consistently processing over 500k transactions per second per node with sub-millisecond latency. With automatic fail-over, replication, and cross data center synchronization, the Aerospike database reliably stores billions of objects and terabytes of data—while providing 100% uptime and 17x improvement in TCO over other NoSQL databases. Customers accelerating their business with Aerospike include adMarketplace, Bluekai, eXelate, Sony’s So-net, and The Trade Desk. For more information, visit http://Aerospike.com.

Aerospike is a registered trademark of Aerospike, Inc., in the United States and/or other countries. All other trademarks and registered trademarks are the properties of their respective owners.

Copyright © 2009 Business Wire. All rights reserved. Republication or redistribution of Business Wire content is expressly prohibited without the prior written consent of Business Wire. Business Wire shall not be liable for any errors or delays in the content, or for any actions taken in reliance thereon.
