Piranha Brings Affordable Big-Data to Government

Curtis Franklin Jr., Executive Editor | 11/30/2012 | 10 comments

Curtis Franklin Jr.
Are you ready to harness the power of a piranha? New software with big-data teeth could change the way your agency looks at text.

Oak Ridge National Labs' Computation Data Analytics Group (CDA) has spent the last nine years working on a system that looks not only at text data but at the context around the text. It tries, in a sense, to "read between the lines" of the data to help with detailed analysis of the contents. Doing this with massive amounts of data is hard, and the CDA has pulled out all the tech stops in order to make the analysis happen in a reasonable amount of time. In a statement about Piranha on the CDA website, they write:

We have pioneered a software agent approach to text analysis that uses a large number of agents distributed over very large computer clusters. This method works much faster than traditional approaches and provides the capability to cluster massive amounts of textual information in relatively short amounts of time, due to the scalability of the agent architecture.

Those are fine words, but what do they really mean for IT executives in government agencies and departments? On one level, they mean that the ability to make some sense out of giant mounds of text files, whether they're in the form of email messages, tweets, or scanned and OCR'd historical documents, has just gotten a bit closer. That's going to be good news for anyone trying to figure out how to make the "wish list" a reality for departments dealing with law enforcement, public health, or natural resource management (to name three quickly obvious areas).

Piranha could also be good news for someone who's trying to figure out what to do with a room full of slightly out-of-date workstations or servers. Make no mistake, Piranha is designed to run best on a high-performance cluster of the sort that a place like, say, Oak Ridge National Labs probably has sitting around waiting for a task. If you're fresh out of high-performance clusters, though, you can run the Piranha server on a cluster made up of just about anything that will support the Java Runtime Environment (JRE), version 1.4.2 or higher. That simple requirement covers an awful lot of computing power in relatively modest chunks.

The modest hardware requirements and very rich set of functions make Piranha perfect for departments and agencies with more brainpower than hardware capex in the budget. You should understand going in that Piranha is not a simple system: It's powerful but has an initial learning curve that approaches vertical. Once scaled, though, there are few big-data problems involving text that Piranha can't help you solve in a very cost-effective manner.

Big-data presents a problem set that many government agencies must deal with. The payoff, though, can be high. In the strict-budget environment that most government entities must work in today, a tool like Piranha can be the perfect way to take a bite out of the large problem of big-data.

View Comments: Newest First | Oldest First | Threaded View
batye   Piranha Brings Affordable Big-Data to Government   12/2/2012 3:00:56 AM
Re: Piranha Brings Affordable Big-Data
I think we would see soon gov. not being able to cope with the ammount of data...
MDMConsult   Piranha Brings Affordable Big-Data to Government   12/1/2012 12:19:56 AM
Re: Piranha Brings Affordable Big-Data
CIOs should be aware of cause, confusion and work to meet the challenges the public sector. There is much more pressure for government today to developing big data strategy, resolving issues for government and being further concerned with IT able to support big data's growth. 
zerox203   Piranha Brings Affordable Big-Data to Government   11/30/2012 10:53:20 PM
Re: why those?
@Sara,

I think in some way this highlights an important issue with how we perceive government effeciency. Those are three great examples Curt provided, because (as he pointed out) they deal with huge volumes of text. There's another reason, though - they're three public-facing departments. We know what they're called and what they do - so they're the ones we list. Someone in the know could probably rattle off a list of dozens... That is, someone like Oak Ridge. I bet a lot of businesses gunning for gov't contracts would be thrilled to have that viel peeled back just a little more.. to know what operations look like at every level, and make just the right kind of software.
 
zerox203   Piranha Brings Affordable Big-Data to Government   11/30/2012 9:46:28 PM
Re: Piranha Brings Affordable Big-Data
I feel as if there's two forms the government takes on in the public's mind - the super-elite, top-secret government that runs area 51 (complete with aliens, of course), and keeps tons of shady, assasin-laden business secret from us, and the other government. The one that can't run a data center right, the one that loses all your paperwork five minutes after you file it. One is run by people who can't get careers in the private sector (detractors claim), and the other  is the highest career one can aspire to.

The existance and popularity of companies who cater their consulting and other service to the government lead me to believe there's more of the second type of government than the first. Needing a tool to parse through large volumes of text seems more on the bureacratically benign side of things to me as well. Not to say that it's a sign of incompetence... just a sign that they're struggling with the same big-data problems as the rest of us, nothing more and nothing less.
 
CurtisFranklin   Piranha Brings Affordable Big-Data to Government   11/30/2012 8:21:58 PM
Re: why those?
@Dave, while I haven't run tests, the sense I get is that even lower-powered clusters can provide performance that's far beyond that of "brute force" techniques using other software (or no software at all). While it might not be lightning fast, it's still dramatically faster than the alternatives.
CurtisFranklin   Piranha Brings Affordable Big-Data to Government   11/30/2012 8:20:13 PM
Re: why those?
Absolutely right, @Dave. One of the first things it does it to help provide structure to that unstructured data, and that alone can be a huge help.
CurtisFranklin   Piranha Brings Affordable Big-Data to Government   11/30/2012 8:18:45 PM
Re: why those?
@Sara, the reason I singled those out was that they typically deal with large quantities of data in text form. I suspect lots of other departments will climb on board, but these are three obvious starting points.
David Wagner   Piranha Brings Affordable Big-Data to Government   11/30/2012 7:06:01 PM
Re: why those?
Also, Curt, you mention that the system works best on high performance stacks, but it will work on less impressive stacks. When on the lower end stacks, does it do the job fast enough to make it worthwhile?
David Wagner   Piranha Brings Affordable Big-Data to Government   11/30/2012 7:05:01 PM
Re: why those?
Good question, Sara. But i'll definitely agree. It looks like a greta way to make sense out of a lot of unstructured data.
Sara Peters   Piranha Brings Affordable Big-Data to Government   11/30/2012 6:51:48 PM
why those?
Curt you say "That's going to be good news for anyone trying to figure out how to make the "wish list" a reality for departments dealing with law enforcement, public health, or natural resource management (to name three quickly obvious areas)."  What makes you single out those three in particular?


The blogs and comments posted on EnterpriseEfficiency.com do not reflect the views of TechWeb, EnterpriseEfficiency.com, or its sponsors. EnterpriseEfficiency.com, TechWeb, and its sponsors do not assume responsibility for any comments, claims, or opinions made by authors and bloggers. They are no substitute for your own research and should not be relied upon for trading or any other purpose.

More Blogs from Curtis Franklin Jr.
Curtis Franklin Jr.   5/22/2013   5 comments
It's the dawn of a new age in helpful machines. It's the first step toward Skynet. Smarter, more intelligent computers are here. How will you use them?
Curtis Franklin Jr.   5/21/2013   10 comments
If you're old enough, you will remember when "Service with a smile" was a common company slogan. In the enterprise datacenter, it just might be the new mission statement.
Curtis Franklin Jr.   5/20/2013   1 comment
Two weeks on the road, two major conferences, lots of info -- and three key lessons for CIOs.
Curtis Franklin Jr.   5/15/2013   17 comments
Remember the old highway safety slogan, "Speed Kills"? In today's business environment, it's lack of speed that's fatal.
Curtis Franklin Jr.   5/14/2013   1 comment
Between webcasts, con calls, and Internet chats, you need a pretty good reason to actually get on a plane and go to a meeting. Last week's Interop gave me a lot of great reasons to be in ...
Latest Archived Broadcast
Data visualization can make complex data easier to grasp. Our expert guest will talk about the hows, whys, and whats of bringing the big picture to your enterprise.
May 28th 2pm EDT Tuesday
On-demand Video with Chat
NBA CIO Michael Gliedman will tell us why the NBA decided to create NBA.com/stats
6/18/2013 -   Please join us for the "IT Convergence Strategies: Why, When and How " to learn more about: • 5 truths about infrastructure convergence today that go beyond the hype • How to exploit the 4 phases of convergence maximum efficiency and agility • Key milestones to plan for on the convergence journey • Why integrated management is a critical component of convergence plans • The importance of an open, modular approach, such as Dell’s active infrastructure, to building a converged data center
E2 IT Migration Zones
IT Migration Zone - UK
Get Modern Apps on the Windows 8 Desktop
Application Audits Simplify Migration
Hardware Refresh Cycles Are Outdated
IT Migration Zone - FR
Windows Blue attendu en juin
Comment profiter d’une nouvelle expérience User Virtualization
S’équiper ou non d’un logiciel anti-virus ?
IT Migration Zone - DE
Leap Motion zeigt Gestensteuerung für Windows 8
Microsofts Surface Pro kommt nach Deutschland
Like Us on Facebook
Twitter Feed
Enterprise Efficiency Twitter Feed
Dell IT Insights
Dell Market Response Twitter Feed
E2 Linked-in Group Ad
Site Moderators Wanted
Enterprise Efficiency is looking for engaged readers to moderate the message boards on this site. Engage in high-IQ conversations with IT industry leaders; earn kudos and perks. Interested? E-mail:
moderators@enterpriseefficiency.com
Dell's Efficiency Modeling Tool
The major problem facing the CIO is how to measure the effectiveness of the IT department. Learn how Dell’s Efficiency Modeling Tool gives the CIO two clear, powerful numbers: Efficiency Quotient and Impact Quotient. These numbers can be transforma¬tive not only to the department, but to the entire enterprise.

Read the full report
The State of Enterprise Efficiency in the Virtual Era: Virtualization – Smart Approaches to Maximize Gains
Virtualization is a presence in nearly all enterprise data centers. But not all companies are using it to its best effect. Learn the common characteristics of success, what barriers companies face, and how to get the most from your efforts.

Read the full report
Informed CIO: Dollars & Sense: Virtual Desktop Infrastructure
Cut through the VDI hype and get the full picture -- including ROI and the impact on your Data Center -- to make an informed decision about your virtual desktop infrastructure deployments.

Read the full report
SPONSORED BY DELL
BRIEFINGS
CASE STUDIES
EBOOKS
PUBLIC SECTOR RESOURCES
VIDEOS
WHITE PAPERS
A Video Case Study – Translational Genomics Research Institute
e2 Video
On the Case
TGen IT: Where We're Going Next

7|11|12   |   08:12   |   10 comments


Now that TGen has broken new ground in genomic research by using Dell's storage, cloud, and high-performance computing solutions, the company discusses what will come next for it and for personalized medicine.
On the Case
Better Care Through Better Communications

6|6|12   |   02:24   |   12 comments


The achievements of the TGen/Dell project could improve how all people receive healthcare, because they are creating ways to improve end-to-end communication of medical data.
On the Case
TGen IT: Where We Are Now

5|15|12   |   06:58   |   5 comments


TGen is breaking new ground in genomic research by using Dell's storage, cloud, and high-performance computing solutions.
On the Case
TGen IT: Where We Were

4|27|12   |   06:45   |   10 comments


The Translational Genomics Research Institute wanted to save lives, but its efforts were hobbled by immense computing challenges related to collecting, processing, sharing, and storing enormous amounts of data.
On the Case
1,200% Faster

4|18|12   |   02:27   |   12 comments


Through their partnership, Dell and TGen have increased the speed of TGen’s medical research by 1,200 percent.
On the Case
IT May Improve Children's Chances of Survival

4|17|12   |   02:12   |   8 comments


IT is helping medical researchers reach breakthroughs in a way and pace never seen before.
On the Case
Medical Advances in the Cloud

4|10|12   |   1:25   |   5 comments


TGen and Dell are pushing the boundaries of computing, and harnessing the power of the cloud to improve healthcare.
On the Case
TGen: Living the Mission

4|9|12   |   2:25   |   3 comments


TGen's CIO puts the organizational mission at the heart of everything the IT staff does.
On the Case
TGen Speeding Up Biomedical Research to Save More Lives

4|5|12   |   1:59   |   8 comments


The Translational Genomics Research Institute is revamping its computing to improve speed, storage, and collaboration – and, most importantly, to save lives.
On the Case
Computing Power Helping to Save Children's Lives

3|28|12   |   2:13   |   3 comments


The Translational Genomics Institute’s partnership with Dell is enabling them to treat kids with neuroblastoma more quickly and save more lives.
Tom Nolle
VMWare & the Bicameral Model of MDM

5|22|13   |   2:14   |   No comments


VMware has a new solution to the MDM problem, two virtual phones inside a real phone, at least for Android phones. Currently limited to two models, the idea could expand and provide a way of letting companies harmonize their need to manage corporate use of phones while preserving BYOD.
Ivan Schneider
Clash of the Tableau 8: Release the Kraken!

5|17|13   |   2:42   |   No comments


Tableau 8 has some great data visualization and presentation capabilities, but it's best paired with a strong data analysis framework.
Tom Nolle
Using Virtualization – for Real!

5|13|13   |   2:10   |   2 comments


There's a lot of hype about virtualization of networks, NaaS, and SDN, but there's a couple of proven applications that enterprises could adopt right now and potentially save money and improve operations.
Tom Nolle
Is UC Becoming Oxymoronic or Just Moronic?

5|9|13   |   2:12   |   No comments


Skype/Outlook UC integration means we're going to have competition and fragmentation of UC client architectures, but is that bad? Modern devices can support IM, email, voice, and video clients, so maybe it's the back end of UC we need to be worried about.
E2 Editors
Windows vs. Integrated Circuit CPUs

4|17|13   |   4:45   |   5 comments


The editors make their predictions about what will win the next match-up in the E2 Tournament of IT Revolutionaries.
E2 Editors
Radio vs. Public Internet Access

4|17|13   |   4:34   |   14 comments


The editors make their predictions about what will win the next match-up in the E2 Tournament of IT Revolutionaries.
E2 Editors
Mainframes vs. Servers

4|17|13   |   4:34   |   16 comments


The editors make their predictions about what will win the next match-up in the E2 Tournament of IT Revolutionaries.
E2 Editors
TCP/IP vs. Printing Press

4|17|13   |   3:07   |   5 comments


The editors make their predictions about what will win the next match-up in the E2 Tournament of IT Revolutionaries.
E2 Editors
BYOD vs. E-Commerce

4|12|13   |   3:12   |   11 comments


The editors make their predictions about what will win the next match-up in the E2 Tournament of IT Revolutionaries.
E2 Editors
Telecommuting vs. Outsourcing

4|12|13   |   4:19   |   7 comments


The editors make their predictions about what will win the next match-up in the E2 Tournament of IT Revolutionaries.
E2 Editors
Personal Computer vs. Mobile Devices

4|12|13   |   4:28   |   20 comments


The editors make their predictions about what will win the next match-up in the E2 Tournament of IT Revolutionaries.
E2 Editors
Smartphones vs. Productivity Software

4|12|13   |   3:09   |   13 comments


The editors make their predictions about what will win the next match-up in the E2 Tournament of IT Revolutionaries.
Tom Nolle
There's More to Mobility Than the Mobile Worker

4|9|13   |   2:03   |   5 comments


Workers are now used to portable device support throughout their everyday lives. We should be looking at the policy of providing fixed-desk devices to support stationary workers. Could portable support be smarter?
Ivan Schneider
From Kim Jong-Un's Trackball to Nuance Voice Ads

4|5|13   |   3:21   |   9 comments


Input devices run the gamut, from the humble Missile Command-style trackball to advanced speech recognition. Unfortunately, these input devices can be used for evil as well as good. Case in point: mobile ads that want you to talk to them.
Tom Nolle
Data/Storage Wish List for Enterprises

4|3|13   |   2:19   |   1 comment


Enterprises want three things in storage systems: First is some speech-recognition way of capturing videoconference data for indexing; second is semantic/AI analysis of emails and IM for content indexing; third is a better system for managing hierarchical layers of storage.