Click here to close now.


@BigDataExpo Authors: Esmeralda Swartz, Yeshim Deniz, Carmen Gonzalez, Pat Romanski, Liz McMillan

Blog Feed Post

vCloud Automation Center – vCAC 5.1 – Amazon EC2 Configuration

Usually most people go straight for connecting vCAC to vCenter, but I have decided to connect to Amazon EC2 first. I’m doing this for a few reasons, but mainly because anyone reading this has access to EC2. All you really need is any computer with a Desktop Virtualization tool like VMware workstation and you can test vCAC with Amazon EC2. If you don’t have an Amazon AWWS account go to and sign-up.

Signing up for Amazon AWS is free and what’s even better is you can also provision “Micro.Instances” for free for an entire year as long as you stay within these guidelines. The basics are this:

  • 750 Hours of Linux/Windows Micro Instance Usage per month. (613Mb Memory). This is enough to run a single micro instance for the whole month.
  • 750 Hours of Elastic Load Balancing plus 15GB of data processing
  • 30GB of Elastic Block Storage
  • 5GB of S3 Storage with 20,000 Get requests and 2,000 Put requests
  • And some other goodies…..

You can run more than one micro instance at a time as long as the consecutive run time of your machines doesn’t go over 750 hours a month. Once you provision an instance it automatically counts as 15 minutes used. I don’t bother trying to calculate by the 15 minutes so the way I look at it is I can perform 750 provisioning tests per month if each test is less than an hour.

Backgroud information

Before we begin the configuration there are a few things we need in place. If you don’t already have vCAC installed and the foundation laid check out these posts to get going:

What were going to configure

In order to configure EC2 integration we are going to setup some additional components of vCAC as outlined below:

  • Credentials -Credentials will be utilized by out endpoints to authenticate us to the infrastructure element managers that we are going to communicate with.
  • End Point – Endpoints are how we manage connections from vCAC to other infrastructure elements in the environment. There are endpoints that allow us to communicate with EC2, vCenter, vCloud Director, vCenter Orchestrator, Hyper-V, NetApp Filers, as well as Physical Servers such as HP iLO, Dell iDrac, and Cisco UCS.
  • Enterprise Group – Although we already created an Enterprise Group we are going to add Compute Resources to the group in this exercise. FOr more information on what Enterprise Groups are see my earlier article “vCloud Automation Center – Laying the foundation“.
  • Reservations – A resource reservation is how we provide available resources to our provisioning groups. Resource Reservation are a one to one mapping to provisioning groups. Resource reservation will get created for any type of resources you want to make available to your groups. we will discuss these in more detail in another article.
  • Global Blueprints – A Blueprint is really a service definition that details what the consumer can request and all the policies and configuration of that service. We will create an Amazon Ec2 Blueprint that a consumer can request through the service catalog in this example. I will cover Blueprints in greater detail in another article.

      Configuring vCAC to provision to Amazon EC2

      Crating Credentials

      1.) The first thing we need to do is log into the vCAC console at “http://[host]/dcac“, then go to the “vCAC Administrator” menu on the “Left” and select “Credentials“.
      2.) On the “Credentials” page select “New Credentials” in the “Upper Right” corner.


      3.) Give your “Credential” a “Name” and “Description“. We then need to get your Amazon AWS “Access Key ID” and “Secret Access Key” which are covered in the following steps. The “Access Key ID” will be your “Username” and the “Secret Access Key” will be used as the “Password“.


      Getting your AWS Access Key ID and Secret Access Key

      4.)Login to your Amazon AWS account at ““. At the top “Right” corner “Hover” over “My Account/Console” and then select “Security Credentials


      5.) Scroll Down the page until you set to the section labeled “Access Credentials” and you will see your “Access Key ID” displayed. Copy and paste this in the “Credentials” “Username” field.


      6.) Next “Click” “Show” to display your “Secret Access Key“. Copy and paste this into the “Credentials” “Password” Fields.


      7.) Once you have input your “Username” and “Password” click the “Green” check on the “Left” hand side.


      Creating an EndPoint”

      8.) Next go to “vCAC Administrator” menu and “Click” “Endpoints” Once the “EndPoints” page displays “Hover” over “New EndPoint” and select “Amazon EC2“.


      9.) Give your “Endpoint” and “Name” and then “click” the selection box next to “Credentials“. Select the “Amazon EC2” “Credentials” you just created and “Click” “Ok“., then “Click” “Ok” on the “New Endpoint” Screen.


      10.) You will now see your newly crated Endpoint listed on the Endpoints screen. At this point vCAC executes a workflows that connects to Amazon AWS and validates your Credentials. If your credentials are validated the workflow will proceed to do a Data Discovery. The discovery will detect the available Amazon EC2 resources available for use. Once the discovery if finished the Amazon EC2 resources will become available within the “Enterprise Group” for selection.


      Adding Compute Resources to an Enterprise Group

      11.) Next let’s go to the “vCAC Administrators” menu and select “Enterprise Groups“. Once on the “Enterprise Groups” page “Hover” over the “Enterprise Group” we created and “select” “Edit


      12.) In the “Enterprise Group” we now see the “Amazon Regions” that are available. Select the “Amazon Region” that you would like to use and “Click” “Ok“.


      13.) Next if you go to the “Enterprise Administrators” Menu on the left and select “Compute Resources” you will see a “Compute Resource” for each “Amazon Region” you selected. Once the “Compute Resource” is available we can create a “Resource Reservation” to assign to our “Provisioning Group“.


      Creating a Reservation

      14.)On the “Enterprise Administrators” menu select “Reservations” and then “Hover” over “New Reservation” in the upper right corner and select “Cloud



      15.)On the “New Reservation – Cloud” page select the “Drop Down” dialog next to “Compute Resource” and select the “Amazon EC2” “Compute Resource


      16.) vCAC will “auto-generate” a “Name” for the “Reservation” however you can change the name if you like. The select the “Drop Down” dialog next to “Provisioning Group” and “Select” the “Provisioning Group” we created.


      17.) Next if you like you can set a “Machine Quota” to limit the number of machines that can be provisioned on to this “Amazon AWS Reservation“. You must set a “Priority” for the “Reservation” which is used to assist in making placement decisions if you have multiple reservations. I will talk more about this in another post. Once you have set your “Priority” “click” the “Resources” tab above.


      18.)”Amazon AWS” utilized “Key Pairs” for enhanced security of machine management tasks. You ave a few options within vCAC. You can let vCAC “Auto-generate a key pair per Provisioning Group“, “Auto-Generate a key pair per Machine“, or you can use a “Specific key pair” that you have already created through the “Amazon AWS” console. I’m going to use the “Auto-Generated per Provisioning Group” option in this example.


      19.) Next we need to select the “Locations” within the “Selected AWS Region” that we want to make available for use. I’m going to select them all. Then we need to select the “Security Group” we would like to make our machine part of. The “Security Group” can be looked at as a firewall rules for your machine. I’m going to select my “Default” “Security Group“. Optionally you can select a “Load Balancer” to attach the machine to as well. I will cover this in a later article. When you are finished “Click” “Alerts” above.


      20.) Here you can optionally enable “Alerts” that will send notifications if the “Reservation” is nearing capacity. Set the “Quota Threshold” for your alert, the email addresses to be notified, and the “Reminder Frequency” and click “Ok


      21.) You will now see your newly created “Reservation” listed on the “Reservations” screen. Now select “Global Blueprints” located under the “Enterprise Administrators” menu.


      Creating a Blueprint

      22.) Once you are on the “Global Blueprints” page “Hover” over “New Blueprint” and select “Cloud


      23.) Once on the “Blueprint Information” tab give your “Blueprint” a “Name“, and optionally change the “Display Icon“. Next assign it to a “Group(s)” and then optionally override the “Prefix” associated with this “Blueprint“. Then you can optionally set the max number of machines a user can request for this blueprint and a daily cost if you wish. Once complete select the “Build Information” tab above.


      24.) On the “Build Information” tab change the “Blueprint Type” to “Server”


      25.) Then next to “Amazon Machine Image” click the “Selection” box.


      26.) Once the dialog box appears you can filer the results at the top to narrow the result for the AMI you would like to use. If you selected multiple regions for use make sure the AMI is in the Region you want to use. Select the “AMI” you would like to use and click “Ok


      27.) “Optionally” you can “override” the “key Pair” setting that we configured in the “Reservation“.


      28.) “Optionally” you can “Enable” network options for the “Bluepeint“. The will allow the requester to select the “Security Group” they would like to apply to the machine if more than one was selected in the “Reservation“.


      29.) Next select the “Instance Types” you would like the requester to be able to choose from.


      30.) Then select the “Security” tab above.


      Making a Request

      31.) “Hover” over the newly created “Blueprint” on the “Global Blueprints” page and select “Request machine” to test our configuration. You can also go to the “Self Service” menu and select “Request Machine


      32.)On the “Confirm Machine Request” page click the “Drop Down” next to “Instance Type” and select the type of “Instance” you would like to request.


      33.) Then click the “Drop Down” next to “Provision Into” and select “Non-VPC Location” because we do not have a “VPC” configured.


      34.) Next select the “Drop Down” next to “Location” and select a location to provision to.


      35.) Next click the “Storage” tab above.


      36.) Optionally you can add “EBS Storage” volumes to your “Request“. Click the “Network” tab above.


      37.) “Optionally” if you added more than one “Security Group” to your “Reservation” and “Enabled” “Network Options” in the “Blueprint” you can select a different “Security Group” for your machine. Click “Ok” when finished.


      38.) Next under the “Self-Service” menu select “My Machines” to track the status of your request.


      39.) Your newly “Requested” machine will appear under “My Machines” and the status will show “Requested“. Note: If you machine does not show up click refresh as it can take a few seconds for it to appear.


      40.) If you continue to “Refresh” the page you will see the requests updated “Status“. The next “Status” your “Request” will go to is “CloudProvisioning“.


      41.) After your request goes to “CloudProvisioning” If you login to your “AWS Console” and go to “AWS Management Console“, then “EC2“, and then “Instances” you will see your newly provisioned machine in the “Pending State


      42.) Once finished the machine state in “vCAC” will go to “MachineProvisioned“, Then “Turning On“, and finally “On




      43.) You will now see your machine “Running” in the “AWS Console“.


      44.) In “vCAC” if you “Hover” over your newly created machine you will see the “Machine Options Menu” select “Edit


      45.) On the “Machine Information” tab near the bottom you will see “Admin Password“. Here you can show the “Local Password” for your newly provisioned “Amazon AWS Instance” Click the “Storage” tab above. Note: It can take Amazon 30+ minutes to make the password available even through the AWS Console. Once it is available from Amazon, it will not be available in vCAC until vCAC performs a data collection.


      46.) On the “Storage” tab you can add “EBS” storage “post-proviosioning” if you would like. Click on the “Network” tab above.


      47.) On the “Network” tab you can assign an “Elastic IP Address” if you have made them available through “Amazon AWS“. You can also change the “Security Group” and assign the machine to a “Load Balancer” Click “Ok” when you are done. More on these option soon.


      There are a few important things to note. If you add additional services such as Elastic IP Address, Elastic Block Storage, Elastic Load Balancers, Sucurity Groups, etc through the Amazon AWS Console they will not appear as available in vCAC until after the next Inventory Data Collection. You can perform a manual data collection as well as change the data collection frequency by doing the following:

      1. Go to “Enterprise Administrator” menu and select “Compute Resources
      2. Hover over the “Compute Resource” and select “Data Collection
      3. Under the “Inventory” section you can set the “Frequency” in hours as well as manually “Request” a “Data Collection“.
      4. If you “Request” a “Data Collection” you can select “Refresh” at the bottom of the page to get the status of the collection.

Read the original blog entry...

More Stories By Sidney Smith

Sid Smith, founder of DailyHypervisor is considered to be a cloud expert in the IT field with over 10 years experience in Virtualization, Automation, and Cloud technologies. Sid Smith started in the industry designing and implementing large scale enterprise server and desktop virtualization environments for fortune 100 and 500 companies. He later went on to become a key employee at DynamicOps the well know creators of Cloud Automation Center. In July 2012 DynamicOps was acquired by VMware who has adopted Cloud Automation Center as a center piece for it’s vCloud Suite of products. Sid has helped dozens of fortune 100 and 500 enterprises successfully adopt both private and public cloud strategies as part of their IT offerings. The result of which was large operational and capital savings for his customers. Sid continues to help large enterprise customers reach their hybrid cloud strategies at VMware. On DailyHypervisor you will find exclusive content that will help you learn how to adopt a successful cloud strategy through the use of VMware Cloud Automation Center, Open Stack, and other industry recognized cloud solutions.

@BigDataExpo Stories
Too often with compelling new technologies market participants become overly enamored with that attractiveness of the technology and neglect underlying business drivers. This tendency, what some call the “newest shiny object syndrome,” is understandable given that virtually all of us are heavily engaged in technology. But it is also mistaken. Without concrete business cases driving its deployment, IoT, like many other technologies before it, will fade into obscurity.
There are many considerations when moving applications from on-premise to cloud. It is critical to understand the benefits and also challenges of this migration. A successful migration will result in lower Total Cost of Ownership, yet offer the same or higher level of robustness. Migration to cloud shifts computing resources from your data center, which can yield significant advantages provided that the cloud vendor an offer enterprise-grade quality for your application.
Today air travel is a minefield of delays, hassles and customer disappointment. Airlines struggle to revitalize the experience. GE and M2Mi will demonstrate practical examples of how IoT solutions are helping airlines bring back personalization, reduce trip time and improve reliability. In their session at @ThingsExpo, Shyam Varan Nath, Principal Architect with GE, and Dr. Sarah Cooper, M2Mi's VP Business Development and Engineering, will explore the IoT cloud-based platform technologies driv...
The web app is agile. The REST API is agile. The testing and planning are agile. But alas, data infrastructures certainly are not. Once an application matures, changing the shape or indexing scheme of data often forces at best a top down planning exercise and at worst includes schema changes that force downtime. The time has come for a new approach that fundamentally advances the agility of distributed data infrastructures. Come learn about a new solution to the problems faced by software organ...
Achim Weiss is Chief Executive Officer and co-founder of ProfitBricks. In 1995, he broke off his studies to co-found the web hosting company "Schlund+Partner." The company "Schlund+Partner" later became the 1&1 web hosting product line. From 1995 to 2008, he was the technical director for several important projects: the largest web hosting platform in the world, the second largest DSL platform, a video on-demand delivery network, the largest eMail backend in Europe, and a universal billing syste...
Electric power utilities face relentless pressure on their financial performance, and reducing distribution grid losses is one of the last untapped opportunities to meet their business goals. Combining IoT-enabled sensors and cloud-based data analytics, utilities now are able to find, quantify and reduce losses faster – and with a smaller IT footprint. Solutions exist using Internet-enabled sensors deployed temporarily at strategic locations within the distribution grid to measure actual line lo...
The buzz continues for cloud, data analytics and the Internet of Things (IoT) and their collective impact across all industries. But a new conversation is emerging - how do companies use industry disruption and technology enablers to lead in markets undergoing change, uncertainty and ambiguity? Organizations of all sizes need to evolve and transform, often under massive pressure, as industry lines blur and merge and traditional business models are assaulted and turned upside down. In this new da...
Cloud computing delivers on-demand resources that provide businesses with flexibility and cost-savings. The challenge in moving workloads to the cloud has been the cost and complexity of ensuring the initial and ongoing security and regulatory (PCI, HIPAA, FFIEC) compliance across private and public clouds. Manual security compliance is slow, prone to human error, and represents over 50% of the cost of managing cloud applications. Determining how to automate cloud security compliance is critical...
The Internet of Everything is re-shaping technology trends–moving away from “request/response” architecture to an “always-on” Streaming Web where data is in constant motion and secure, reliable communication is an absolute necessity. As more and more THINGS go online, the challenges that developers will need to address will only increase exponentially. In his session at @ThingsExpo, Todd Greene, Founder & CEO of PubNub, will explore the current state of IoT connectivity and review key trends an...
The Internet of Things (IoT) is growing rapidly by extending current technologies, products and networks. By 2020, Cisco estimates there will be 50 billion connected devices. Gartner has forecast revenues of over $300 billion, just to IoT suppliers. Now is the time to figure out how you’ll make money – not just create innovative products. With hundreds of new products and companies jumping into the IoT fray every month, there’s no shortage of innovation. Despite this, McKinsey/VisionMobile data...
You have your devices and your data, but what about the rest of your Internet of Things story? Two popular classes of technologies that nicely handle the Big Data analytics for Internet of Things are Apache Hadoop and NoSQL. Hadoop is designed for parallelizing analytical work across many servers and is ideal for the massive data volumes you create with IoT devices. NoSQL databases such as Apache HBase are ideal for storing and retrieving IoT data as “time series data.”
Mobile, social, Big Data, and cloud have fundamentally changed the way we live. “Anytime, anywhere” access to data and information is no longer a luxury; it’s a requirement, in both our personal and professional lives. For IT organizations, this means pressure has never been greater to deliver meaningful services to the business and customers.
In recent years, at least 40% of companies using cloud applications have experienced data loss. One of the best prevention against cloud data loss is backing up your cloud data. In his General Session at 17th Cloud Expo, Bryan Forrester, Senior Vice President of Sales at eFolder, will present how organizations can use eFolder Cloudfinder to automate backups of cloud application data. He will also demonstrate how easy it is to search and restore cloud application data using Cloudfinder.
The IoT market is on track to hit $7.1 trillion in 2020. The reality is that only a handful of companies are ready for this massive demand. There are a lot of barriers, paint points, traps, and hidden roadblocks. How can we deal with these issues and challenges? The paradigm has changed. Old-style ad-hoc trial-and-error ways will certainly lead you to the dead end. What is mandatory is an overarching and adaptive approach to effectively handle the rapid changes and exponential growth.
The enterprise is being consumerized, and the consumer is being enterprised. Moore's Law does not matter anymore, the future belongs to business virtualization powered by invisible service architecture, powered by hyperscale and hyperconvergence, and facilitated by vertical streaming and horizontal scaling and consolidation. Both buyers and sellers want instant results, and from paperwork to paperless to mindless is the ultimate goal for any seamless transaction. The sweetest sweet spot in innov...
The IoT is upon us, but today’s databases, built on 30-year-old math, require multiple platforms to create a single solution. Data demands of the IoT require Big Data systems that can handle ingest, transactions and analytics concurrently adapting to varied situations as they occur, with speed at scale. In his session at @ThingsExpo, Chad Jones, chief strategy officer at Deep Information Sciences, will look differently at IoT data so enterprises can fully leverage their IoT potential. He’ll sha...
There will be 20 billion IoT devices connected to the Internet soon. What if we could control these devices with our voice, mind, or gestures? What if we could teach these devices how to talk to each other? What if these devices could learn how to interact with us (and each other) to make our lives better? What if Jarvis was real? How can I gain these super powers? In his session at 17th Cloud Expo, Chris Matthieu, co-founder and CTO of Octoblu, will show you!
SYS-CON Events announced today that ProfitBricks, the provider of painless cloud infrastructure, will exhibit at SYS-CON's 17th International Cloud Expo®, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. ProfitBricks is the IaaS provider that offers a painless cloud experience for all IT users, with no learning curve. ProfitBricks boasts flexible cloud servers and networking, an integrated Data Center Designer tool for visual control over the...
Recently announced Azure Data Lake addresses the big data 3V challenges; volume, velocity and variety. It is one more storage feature in addition to blobs and SQL Azure database. Azure Data Lake (should have been Azure Data Ocean IMHO) is really omnipotent. Just look at the key capabilities of Azure Data Lake:
SYS-CON Events announced today that IBM Cloud Data Services has been named “Bronze Sponsor” of SYS-CON's 17th Cloud Expo, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. IBM Cloud Data Services offers a portfolio of integrated, best-of-breed cloud data services for developers focused on mobile computing and analytics use cases.

Tweets by @BigDataExpo

@BigDataExpo Blogs
Today’s modern day industrial revolution is being shaped by ubiquitous connectivity, machine to machine (M2M) communications, the Internet of Things (IoT), open APIs leading to a surge in new applications and services, partnerships and eventual marketplaces. IoT has the potential to transform industry and society much like advances in steam technology, transportation, mass production and communications ushered in the industrial revolution in the 18th and 19th centuries.
Today air travel is a minefield of delays, hassles and customer disappointment. Airlines struggle to revitalize the experience. GE and M2Mi will demonstrate practical examples of how IoT solutions are helping airlines bring back personalization, reduce trip time and improve reliability. In their session at @ThingsExpo, Shyam Varan Nath, Principal Architect with GE, and Dr. Sarah Cooper, M2Mi's VP Business Development and Engineering, will explore the IoT cloud-based platform technologies driving this change including privacy controls, data transparency and integration of real time context w...
Today’s connected world is moving from devices towards things, what this means is that by using increasingly low cost sensors embedded in devices we can create many new use cases. These span across use cases in cities, vehicles, home, offices, factories, retail environments, worksites, health, logistics, and health. These use cases rely on ubiquitous connectivity and generate massive amounts of data at scale. These technologies enable new business opportunities, ways to optimize and automate, along with new ways to engage with users.
I was recently watching one of my favorite science fiction TV shows (I’ll confess, ‘Dr. Who’). In classic dystopian fashion, there was a scene in which a young boy is running for his life across some barren ground in a war-ravaged world. One of his compatriots calls out to him to freeze, not to move another inch. The compatriot warns the young boy that he’s in a field of hand mines (no, that is not a typo, he did say hand mines). Slowly, dull gray hands with eyes in the palm start emerging from the ground around the boy and the compatriot. Suddenly, one of the hands grabs the compatriot and pu...
Recently announced Azure Data Lake addresses the big data 3V challenges; volume, velocity and variety. It is one more storage feature in addition to blobs and SQL Azure database. Azure Data Lake (should have been Azure Data Ocean IMHO) is really omnipotent. Just look at the key capabilities of Azure Data Lake:
DevOps Summit at Cloud Expo 2014 Silicon Valley was a terrific event for us. The Qubell booth was crowded on all three days. We ran demos every 30 minutes with folks lining up to get a seat and usually standing around. It was great to meet and talk to over 500 people! My keynote was well received and so was Stan's joint presentation with RingCentral on Devops for BigData. I also participated in two Power Panels – ‘Women in Technology’ and ‘Why DevOps Is Even More Important than You Think,’ both featuring brilliant colleagues and moderators and it was a blast to be a part of.
It’s not hard to find technology trade press commentary on the subject of Big Data. Variously defined (in non-technical terms) as the cluttered old shoebox of all data – and again (in more technical terms) as that amount of data that does not comfortably fit into a standard relational database for storage, processing and analytics within the normal constraints of processing, memory and data transport technologies – we can say that Big Data is an oft mentioned and sometimes misunderstood subject.
“Cloud computing is a model for enabling ubiquitous, convenient, on-demand network access to a shared pool of configurable computing resources (e.g., networks, servers, storage, applications and services) that can be rapidly provisioned and released with minimal management.” While this definition is broadly accepted and has, in fact, been my adopted standard for years, it only describes technical aspects of cloud computing. The amalgamation of technologies used to deliver cloud services is not even half the story. Above all else, the successful employment requires a tight linkage to the econ...
Developing software for the Internet of Things (IoT) comes with its own set of challenges. Security, privacy, and unified standards are a few key issues. In addition, each IoT product is comprised of at least three separate application components: the software embedded in the device, the backend big-data service, and the mobile application for the end user's controls. Each component is developed by a different team, using different technologies and practices, and deployed to a different stack/target - this makes the integration of these separate pipelines and the coordination of software upd...
All we need to do is have our teams self-organize, and behold! Emergent design and/or architecture springs up out of the nothingness! If only it were that easy, right? I follow in the footsteps of so many people who have long wondered at the meanings of such simple words, as though they were dogma from on high. Emerge? Self-organizing? Profound, to be sure. But what do we really make of this sentence?
Too many multinational corporations delete little, if any, data even though at its creation, more than 70 percent of this data is useless for business, regulatory or legal reasons.[1] The problem is hoarding, and what businesses need is their own “Hoarders” reality show about people whose lives are driven by their stuff[2] (corporations are legally people, after all). The goal of such an intervention (and this article)? Turning hoarders into collectors.
Disaster recovery (DR) has traditionally been a major challenge for IT departments. Even with the advent of server virtualization and other technologies that have simplified DR implementation and some aspects of on-going management, it is still a complex and (often extremely) costly undertaking. For those applications that do not require high availability, but are still mission- and business-critical, the decision as to which [applications] to spend money on for true disaster recovery can be a struggle.
SCOPE is an acronym for Structured Computations Optimized for Parallel Execution, a declarative language for working with large-scale data. It is still under development at Microsoft. If you know SQL then working with SCOPE will be quite easy as SCOPE builds on SQL. The execution environment is different from that RDBMS oriented data. Data is still modeled as rows. Every row has typed columns and eveyr rowset has a well-defined schema. There is a SCOPe compiler that comes up with optimized execution plan and a runtime execution plan.
If you’re running Big Data applications, you’re going to want to look at some kind of distributed processing system. Hadoop is one of the best-known clustering systems, but how are you going to process all your data in a reasonable time frame? MapReduce has become a standard, perhaps the standard, for distributed file systems. While it’s a great system already, it’s really geared toward batch use, with jobs needing to queue for later output. This can severely hamper your flexibility. What if you want to explore some of your data? If it’s going to take all night, forget about it.
Organizations already struggle with the simple collection of data resulting from the proliferation of IoT, lacking the right infrastructure to manage it. They can't only rely on the cloud to collect and utilize this data because many applications still require dedicated infrastructure for security, redundancy, performance, etc. In his session at 17th Cloud Expo, Emil Sayegh, CEO of Codero Hosting, will discuss how in order to resolve the inherent issues, companies need to combine dedicated and cloud solutions through hybrid hosting – a sustainable solution for the data required to manage I...

About @BigDataExpo
Big Data focuses on how to use your own enterprise data – processed in the Cloud – most effectively to drive value for your business.