Emory Creel

Large Dataset / Large Database Analysis

Information

Large Dataset / Large Database Analysis

This group is for those interested in working with large datasets. Any hardware, software, or analytical issues related to this topic are welcome.

Members: 34
Latest Activity: Sep 22

Discussion Forum

Phillip Middleton

Dimension Reduction and Parameterization: Grouping, Curve Fitting, and What is 'Good Enough'? 1 Reply

Started by Phillip Middleton. Last reply by Jan Theodore Galkowski Sep 22.

Phillip Middleton

Personal Supercomputing and data analysis.

Started by Phillip Middleton Sep 22.

Emory Creel

cluster servers

Started by Emory Creel Apr. 17, 2008.

Comment Wall (2 comments)

Add a Comment

You need to be a member of Large Dataset / Large Database Analysis to add comments!

2 Comments

Jan Theodore Galkowski Comment by Jan Theodore Galkowski on June 28, 2009 at 2:28pm
When these kinds of datasets are attempted, what do you do? Do you put data into huge relational databases, accepting the startup cost of designing a good schema? Do you work with flat files and UNIX sort? Do you sample from your dataset of 20 billion rows, taking a random gather of two million? Do you ever use tests of statistical significance with giant dataset? Chi-square? How do you control the spurious result problems?
Aldo Taranto Comment by Aldo Taranto on July 3, 2008 at 7:35pm
Hello everyone,

I have done some research on Stochastic Optimization and Stochastic Queuing Theory in their application to Optimal Memory Allocation and Optimal Storage Allocation in Grid Computing and Massively Parallel Super Computers. This has vast implications for our exponentially growing usage of Data Centres and Databases.

Anyone wanting to exploit this commercially and create employment for other Analysts, feel free to contact me at Aldo.Taranto@mathemetrica.com.
Best regards,

______________
Aldo Taranto
Director
www.mathemetrica.com
 

Members (34)

Phillip Middleton Jan Theodore Galkowski Emory Creel Vincent Granville Dr. Diego Kuonen, CStat CSci Vijay Mark Richards Mallikarjun Mukunda Paco Nathan Rishi Yadav Aldo Taranto Ralf Klinkenberg UMESHKUMAR SHAH AJAY OHRI Joe Jurczyk Amber Yukinori Sugiura Amish yuxiawang Peter Ruddock Lin Wang Rick Dolata Ben Hinchliffe Gary D. Miner, Ph.D. Navin singh Clinton McLeod Yi-Chun Tsai Neal J. Verzwyvelt Gurumurthi V. Ramanan Subhransu
 
 

Advertisement

Featured

 

© 2009   Created by Vincent Granville

Badges  |  Report an Issue  |  Privacy  |  Terms of Service