There is a lot of buzz about Hadoop, NoSQL, NewSQL, and columnar MPP databases. But where is the actual value for businesses? Businesses need to have actionable information derived from their data that they collect on a regular basis. We know how to collect data and store it in databases of various kinds. We have seen the evolutions of SQL databases over the last five decades, and databases have gotten sophisticated in terms of processing structured data.
With the recent explosion of social media, and with it the proliferation of unstructured data, new technologies have emerged, such as MapReduce. So now we have the data -- but the real question is where does the business value actually get delivered? The answer is simple.
But both these standards do not help in benchmarking some of these disruptive DBMS players.
Apart from TPC there are other vertical specific standards that exists. One that comes to mind is STAC-M3, commonly used for the financial and securities industry. http://www.stacresearch.com/node/10219
I will be interested to find out if there are other evolving DBMS standards that provide benchmarks on some of these NoSQL technologies.
you are absolutely right, BI layer is not an integral part of the NoSQL DBMS layer. And that is actually the point I have tried to make through my blog. While NoSQL and NewSQL DBMS's address a problem they do not help customers with the actual insights. Customers still need to implement a BI layer on top of this "plumbing" layer to be able to extract menaingful insights!
NewSQL is databases that have been designed from the outset to have a SQL interface (rather than being added on later) but which do not necessarily have a relational storage engine underneath.
NewSQL is a new database access language. It is easier to learn than SQL, elegant, consistent, and well defined. It is not a extension or subset of SQL, and not a Object database language.
Example NewSQL databases :
- VoltDB, This is essentially a standard relational database with all the unnecessary gubbins that has accumulated over 40 years of relational development thrown out so that this is a much leaner and meaner machine than its traditional counterparts. As a result it performs better and has a much smaller footprint than the merchant databases.
- Xeround. This is a cloud-based offering with elastic scaling and NoSQL roots. Like all the other NewSQL databases it is focused on transaction processing. Apart from its inherent capabilities, the other major differentiator for Xeround is that it looks like MySQL, which makes it easy for existing MySQL users to port into the cloud.
- NuoDB uses a distributed object architecture in a peer-to-peer environment that the company likens to BitTorrent. When you update a record you append the changes to the existing data rather than replacing it so you always have a historic data view of your database.
- JustOneDB is fully ACID compliant and is designed to support SQL. In its case, the environment looks just like PostgreSQL and it runs in the cloud on Heroku. However, it uses a completely different storage architecture that the company refers to as tunnel storage.
NewSQL is a set of DBMS players promising to deliver the scalability and flexibility promised by NoSQL while retaining the support for SQL queries and/or ACID (atomicity, consistency, isolation and durability) transactions.
Although fundamentally both NoSQL and NewSQL are trying to solve the same use cases of providing scalability and performance on top of traditional DBMS's like MySQL.
Some examples of NewSQL: VoltDB, Clustrix, GenieDB.
The blogs and comments posted on EnterpriseEfficiency.com do not reflect the views of TechWeb, EnterpriseEfficiency.com, or its sponsors. EnterpriseEfficiency.com, TechWeb, and its sponsors do not assume responsibility for any comments, claims, or opinions made by authors and bloggers. They are no substitute for your own research and should not be relied upon for trading or any other purpose.
4/29/2014 - Join Dell and Intel for an interactive discussion about implementing, refining and improving your virtual environment. Specifically we’ll discuss pain points virtualization can solve and those that it can create and how to prevent them.
Enterprise Efficiency is looking for engaged readers to moderate the message boards on this site. Engage in high-IQ conversations with IT industry leaders; earn kudos and perks. Interested? E-mail: firstname.lastname@example.org
Dell's Efficiency Modeling Tool The major problem facing the CIO is how to measure the effectiveness of the IT department. Learn how Dell’s Efficiency Modeling Tool gives the CIO two clear, powerful numbers: Efficiency Quotient and Impact Quotient. These numbers can be transforma¬tive not only to the department, but to the entire enterprise. Read the full report
Now that TGen has broken new ground in genomic research by using Dell's storage, cloud, and high-performance computing solutions, the company discusses what will come next for it and for personalized medicine.
The Translational Genomics Research Institute wanted to save lives, but its efforts were hobbled by immense computing challenges related to collecting, processing, sharing, and storing enormous amounts of data.
Office and personal productivity tools come in a first-class and coach flavor set, but what makes the difference is primarily little things that most users won't encounter. What's the big issue in using something other than Office, and can you get around it?
We really don't want an "Internet of Everything" but even building an Internet of Everythinguseful means setting some ground rules to insure there's value in the process and that costs and risks are minimized.
Google's Chrome OS has a lot of potential value and a lot of recent press, but it still needs something to make it more than a thin client. It needs cloud integration, it needs extended APIs via web services, and it needs to suck it up and support a hard drive.
On a recent African trip I saw examples of the value of the cloud in developing nations, for educational and community development programs. We could build on this, but not only in developing economies, because these same programs are often under-supported even in first-world countries.
VMware's debate with Cisco on SDN might finally create a fusion between an SDN view that's all about software and another that's all about network equipment. That would be good for every enterprise considering the cloud and SDN.
Wearing a bulky, oversized watch is good training for the next phase in wristwatches: the Internet-enabled, connected watch. Why the smartphone-tethered connected watch makes sense, plus Ivan demos an entirely new concept for the "smart watch."
Cloud storage costs are determined primarily by the rate at which files are changed and the possibility of concurrent access/update. If you can structure your storage use to optimize these factors you can cut costs, perhaps to zero.
The Internet has evolved into a machine for drumming up a chorus of "Happy Birthday" messages, from family, friends, friends of friends who you added on Facebook, random people that you circled on G+, and increasingly, automated bots. Enough already.