We're Not Thinking Big Enough With Storage

Joe Stanganelli, Founder and Principal, Beacon Hill Law | 4/26/2012 | 14 comments

Joe Stanganelli
Chances are that what you would consider big, Martin Leach would consider very, very small.

Delivering his keynote at the Bio-IT World Conference this week here in Boston, Leach challenged attendees to reconsider their concepts of "BIG!" -- the word emblazoned in ultra-large, all caps font, replete with exclamation point, on his first slide -- in this, our technology-driven, buzzword-infused world of "big-data" and "big analytics."

Leach told the audience that the Broad Institute (at which Leach serves as CIO) has a mind-boggling 10 petabytes of data on spinning disks (so many that any given one is expected to fail every 24 hours to 36 hours). This makes the Broad Institute's datacenter the biggest genomics datacenter in the world.

Yet, as recently as 20 years ago, Leach relates, a CIO might have had his mind similarly boggled by a datacenter housing no more than 16 gigabytes of data. Today, you can store 16 gigabytes of data on a reasonably priced thumb drive -- but in the early 1990s, 16GB was "BIG!"

To offer a comparison, Leach raises the example of the 1,000 Genomes Project -- a self-explanatory genome sequencing project that might seem ambitious ("BIG!" even), considering that humanity's first genome map was completed only nine years ago this month.

"If you can do 1,000 genomes, why can't you do a million?" Leach asks (a fair question, especially given the dramatically falling cost of genome sequencing). "If you can do a million genomes, why can't you do a billion?"

As with cost, data storage issues do not present much of an obstacle any more -- or, at least, not any time soon. Rather, storage is an analytics red herring.

"Looking into the future... I don't think it's a big data problem," says Leach. He predicts that in about 23 years, 16,777,216,000 gigabytes (16,000 petabytes, or just shy of 16 exabytes) will be able to fit onto a single $50 hard drive unit on a typical home PC.

"A few years from now, it won't really be all that big -- ten petabytes," says Leach nonchalantly.

"Really, the big question here," he continues, failing to clarify whether the pun is intended, "is 'How do we make sense of it?' "

Leach identifies the truly serious data management problems facing the biomedical and healthcare fields as ones of data movement, data indexing, and data accessibility. "Why don't we have a 'Google search' for data?" Leach laments. "How can you look inside that data? How can you integrate that data? How can you do it in a frictionless way?"

There is some hope on the horizon. Leach's preceding presenter, Jill Mesirov (also of the Broad Institute), introduced GenomeSpace -- an integrated, open-source, cloud-based data management infrastructure for genomics researchers -- to Bio-IT World attendees. Still, with GenomeSpace only in beta, Leach believes the data management problems are far from solved, requiring a great deal more investment -- particularly for already cash-strapped research organizations.

Leach also identifies the need for more data scientists, predicting tremendous job growth in the area. Massachusetts -- site of the conference and home to some of the world's finest hospitals, biotech companies, and research facilities -- will gain an additional 50,000 such jobs by 2018, Leach points out.

Whether that will be enough, however, remains to be seen. 50,000 may seem like a big number of jobs now -- but will it still be big in six years?

View Comments: Newest First | Oldest First | Threaded View
Page 1 / 2   >   >>
megadl   We're Not Thinking Big Enough With Storage   5/7/2012 10:11:10 AM
Data storage growth
Don't you think that at some point in time this growth of data will form a curve relative to the growth of data storage? The current trend though is that if there's more storage, people tend to generate and store more data however... even though the development of new technology grows exponentially.. I'm not very sure that data usage grows at the same rate.
batye   We're Not Thinking Big Enough With Storage   4/29/2012 8:50:05 AM
Re: We're Not Thinking Big Enough With Storage
me too, as I do think we would see some changes
batye   We're Not Thinking Big Enough With Storage   4/29/2012 8:47:58 AM
Re: We're Not Thinking Big Enough With Storage
thank you Rich... :)
tekedge   We're Not Thinking Big Enough With Storage   4/28/2012 9:02:15 AM
Re: We're Not Thinking Big Enough With Storage
@David - Sounds interesting. I will be looking out for that.
batye   We're Not Thinking Big Enough With Storage   4/27/2012 10:24:52 PM
Re: We're Not Thinking Big Enough With Storage
lol :) interesting
Technocrat   We're Not Thinking Big Enough With Storage   4/27/2012 10:09:23 PM
Re: We're Not Thinking Big Enough With Storage
@ Rich That was awesome !  I can't stop laughing - Ode to Batye !  : ) 
batye   We're Not Thinking Big Enough With Storage   4/27/2012 9:19:34 PM
Re: We're Not Thinking Big Enough With Storage
interesting I would like to know more...
David Wagner   We're Not Thinking Big Enough With Storage   4/27/2012 6:11:44 PM
Re: Data Mining
@Gigi- Definitley we need more. Do you think schools can keep up with the demand? I'm hard pressed to imagine that they can. I know I'm going to start telling my children they need to be data scientists when they grow up though.
David Wagner   We're Not Thinking Big Enough With Storage   4/27/2012 6:10:02 PM
Re: We're Not Thinking Big Enough With Storage
@tekedge- Joe is going to have a followup to this story on Monday. Among things that someone brings up is the constant failure of spinning disks. Look for it. I think you'll find it interesting.
tekedge   We're Not Thinking Big Enough With Storage   4/27/2012 9:49:38 AM
We're Not Thinking Big Enough With Storage
@Joe - A very interesting read. A failure of spinning disks every 24 - 36 hrs must be really crazy !!. I am sure the data center operation folks must be on ther toes all the time. How about the power consumption for hosting all these servers and disks?
Page 1 / 2   >   >>


The blogs and comments posted on EnterpriseEfficiency.com do not reflect the views of TechWeb, EnterpriseEfficiency.com, or its sponsors. EnterpriseEfficiency.com, TechWeb, and its sponsors do not assume responsibility for any comments, claims, or opinions made by authors and bloggers. They are no substitute for your own research and should not be relied upon for trading or any other purpose.

More Blogs from Joe Stanganelli
Joe Stanganelli   11/20/2013   58 comments
The Internet may be global, and we may call what we see in our browsers the world wide web, but about 70 percent of the world doesn't have Internet access -- the part that's covered by water.
Joe Stanganelli   10/10/2013   62 comments
"Passwords are dead," a Google information security manager decreed at last month's TechCrunch Disrupt. Other pundits have come to the same conclusion. However, these reports are greatly ...
Joe Stanganelli   9/11/2013   83 comments
Nietzsche said, "That which does not kill me can only make me stronger." Scientists have recently discovered that this may be literally true in the case of plastics, and it could be a real ...
Joe Stanganelli   4/24/2013   28 comments
Big-data is a perennial concern at Boston's annual Bio-IT World Expo because of the sheer volume of information the life sciences industry must contend with. The pain points expressed at ...
Joe Stanganelli   4/30/2012   28 comments
Last week, I wrote an article about how keynote speaker Martin Leach presented a convincing argument to Bio-IT World Conference 2012 attendees here in Boston as to why the biggest obstacle ...
Latest Archived Broadcast
In this episode, you'll learn how to stretch the limits of your private cloud -- and how to recognize the limits that can't be exceeded.
On-demand Video with Chat
IT has to deploy Server 2012 in a way that fits the architecture of its application delivery system.
E2 IT Migration Zones
IT Migration Zone - UK
Why PowerShell Is Important
Reduce the Windows 8 Footprint for VDI
Rethinking Storage Management
IT Migration Zone - FR
SQL Server : 240 To de mémoire flash pour votre data warehouse
Quand Office vient booster les revenus Cloud et Android de Microsoft
Windows Phone : Nokia veut davantage d'applications (et les utilisateurs aussi)
IT Migration Zone - DE
Cloud Computing: Warum Unternehmen trotz NSA auf die „private“ Wolke setzen sollten
Cloud Computing bleibt Wachstumsmarkt – Windows Azure ist Vorreiter
Like Us on Facebook
Twitter Feed
Enterprise Efficiency Twitter Feed
Site Moderators Wanted
Enterprise Efficiency is looking for engaged readers to moderate the message boards on this site. Engage in high-IQ conversations with IT industry leaders; earn kudos and perks. Interested? E-mail:
[email protected]
Informed CIO: Dollars & Sense: Virtual Desktop Infrastructure
Cut through the VDI hype and get the full picture -- including ROI and the impact on your Data Center -- to make an informed decision about your virtual desktop infrastructure deployments.

Read the full report
Virtualization Management: Time To Get Serious
Welcome to the backside of the virtualization wave. Discover the state of virtualization management and where analysts are predicting it is heading

Read the full report
PUBLIC SECTOR RESOURCES
WHITE PAPERS
A Video Case Study – Translational Genomics Research Institute
e2 Storage Video


On the Case
TGen IT: Where We're Going Next

7|11|12   |   08:12   |   10 comments


Now that TGen has broken new ground in genomic research by using Dell's storage, cloud, and high-performance computing solutions, the company discusses what will come next for it and for personalized medicine.
On the Case
Better Care Through Better Communications

6|6|12   |   02:24   |   11 comments


The achievements of the TGen/Dell project could improve how all people receive healthcare, because they are creating ways to improve end-to-end communication of medical data.
On the Case
TGen IT: Where We Are Now

5|15|12   |   06:58   |   6 comments


TGen is breaking new ground in genomic research by using Dell's storage, cloud, and high-performance computing solutions.
On the Case
TGen IT: Where We Were

4|27|12   |   06:45   |   10 comments


The Translational Genomics Research Institute wanted to save lives, but its efforts were hobbled by immense computing challenges related to collecting, processing, sharing, and storing enormous amounts of data.
On the Case
1,200% Faster

4|18|12   |   02:27   |   12 comments


Through their partnership, Dell and TGen have increased the speed of TGen’s medical research by 1,200 percent.
On the Case
IT May Improve Children's Chances of Survival

4|17|12   |   02:12   |   8 comments


IT is helping medical researchers reach breakthroughs in a way and pace never seen before.
On the Case
Medical Advances in the Cloud

4|10|12   |   1:25   |   5 comments


TGen and Dell are pushing the boundaries of computing, and harnessing the power of the cloud to improve healthcare.
On the Case
TGen: Living the Mission

4|9|12   |   2:25   |   3 comments


TGen's CIO puts the organizational mission at the heart of everything the IT staff does.
On the Case
TGen Speeding Up Biomedical Research to Save More Lives

4|5|12   |   1:59   |   6 comments


The Translational Genomics Research Institute is revamping its computing to improve speed, storage, and collaboration – and, most importantly, to save lives.
On the Case
Computing Power Helping to Save Children's Lives

3|28|12   |   2:13   |   3 comments


The Translational Genomics Institute’s partnership with Dell is enabling them to treat kids with neuroblastoma more quickly and save more lives.
Tom Nolle
How Deep Is My Storage Hierarchy?

7|3|12   |   2:13   |   5 comments


At the GigaOM Structure conference, a startup announced a cloud and virtualization storage optimizing approach that shows there's still a lot of thinking to be done on the way storage joins the virtual world.
E2 Interview
What Other Industries Can Learn From Financial Services

6|13|12   |   02:08   |   3 comments


We asked CIO Steve Rubinow what CIOs in other industries can learn from the financial services industry about datacenter efficiency, security, and green computing.
E2 Interview
Removing Big-Data Flow Bottlenecks

6|12|12   |   02:55   |   No comments


We ask CIO Steve Rubinow what pieces of financial services infrastructure need to perform better to get traders info faster.
E2 Interview
Getting Traders the Data They Need

6|11|12   |   02:04   |   1 comment


We ask CIO Steve Rubinow: What do stock market traders need to know, how fast do they need it, and how can CIOs get it to them?
E2 Interview
Can IT Help Fix the Global Economy?

6|8|12   |   02:32   |   2 comments


We ask CIO Steve Rubinow whether today's IT can help repair the global economy (and if IT played any role in the economy's collapse).
E2 Interview
More Competitive Business via Datacenter Strategy

5|4|12   |   2:46   |   1 comment


Businesses need to be competitive, yet efficient, and both goals affect datacenter design.
E2 Interview
The Recipe for Greater Efficiency

5|3|12   |   3:14   |   2 comments


Intel supplies the best ingredients to drive greater datacenter efficiency and support new compute, storage, and networking needs.
E2 Interview
Datacenters Enabling Business Transformation

5|1|12   |   06:37   |   1 comment


Dell’s Gaurav Chand says that for the first time ever datacenter technology is truly enabling all kinds of organizations to transform their business and achieve new objectives.
Tom Nolle
Cloud Data: Big AND Persistent!

3|28|12   |   2:11   |   10 comments


We always hear about "Big" data, but a real issue in cloud storage is not just bigness but also persistence. A large data model is less complicated than a big application repository that somehow needs to be accessed. The Hadoop send-program-to-data model may be the answer.
Tom Nolle
Project Lightning Streamlines Storage

2|16|12   |   2:09   |   2 comments


EMC's Project Lightning has matured into a product set, and it's important, less because it has new features or capabilities in storage technology and management, than because it may package the state of the art in a way more businesses can deploy.
Tom Nolle
Big Data Appliance Is Big News

1|12|12   |   2:18   |   No comments


Oracle's release of a Hadoop appliance for Big Data may be a signal that we're shifting to database appliances.
Tom Nolle
Myopia Can Hurt Storage Policy

12|22|11   |   2:08   |   No comments


We're at the beginning of a cloud-driven revolution in storage, but Oracle's quarter shows that enterprises are hunkering down on old concepts because they're afraid of the costs in the near term.
Sara Peters
An Untrained User & a Mobile Medical Device

12|19|11   |   2:43   |   11 comments


Untrained end users, clueless central IT staff, and expensive mobile devices are a worrisome combination for healthcare CIOs.
Tom Nolle
Too Many Labels on 'Big Data'?

12|9|11   |   2:12   |   3 comments


However you label it, structured and unstructured information are different and will likely always require different tools.
Sara Peters
E2 Debuts New Storage Section

12|8|11   |   1:51   |   1 comment


Need strategic guidance on everything from SSDs to 100 percent virtualized datacenters? Look no further.