![]() |
|
University of ChicagoDepartment of Computer ScienceDistributed Systems Laboratory (DSL)1100 E. 58th Street, Ryerson HallChicago, IL 60637Cellular: 1-847-722-0876Email: iraicu@cs.uchicago.edu |
![]() |
I have just successfully defended my dissertation in the Distributed Systems Laboratory in the Computer Science Department at University of Chicago in Chicago, Illinois. My research advisor is Dr. Ian Foster and my research revolves around resource management in large-scale distributed systems, such as grids and supercomputers.
I have defined a new paradigm Many-Tasks Computing (MTC) which aims to bridge the gap between high throughput computing (HTC) and high performance computing (HPC). MTC is reminiscent to HTC, but it differs in the emphasis of using many computing resources over short periods of time to accomplish many computational tasks (i.e. including both dependent and independent tasks), where the primary metrics are measured in seconds, as opposed to operations per month. MTC denotes high-performance computations comprising multiple distinct activities, coupled via file system operations. There are many challenges to enable support for MTC across clusters, Grids, and supercomputers, including scalable resource management and storage solutions, as well as having well defined standards on how applications are to interact with the new or improved middleware.
My dissertation work has focused on defining, and exploring both the theory and practical aspects of realizing MTC across a wide range of systems. Many MTC applications are often data intensive and require many distributed resources; for these applications, data locality is crucial to high throughput and performance. Throughout my dissertation, harnessing data locality materialized with the concept of data diffusion, in which I acquired compute and storage resources dynamically through dynamic resource provisioning, replicated data in response to demand, and scheduled computations close to data through streamlined dispatching. These theoretical concepts have been implemented through my Falkon framework and have shown performance and scalability improvements of several orders of magnitude across many diverse workloads and applications (i.e. astronomy, medicine, biology, chemistry, molecular dynamics, economics, and data analytics). Furthermore, Falkons largest application runs have been composed of millions of tasks and executed on 160K processors on the IBM Blue Gene/P. Micro-benchmarks have also been run with billions of tasks with throughputs exceeding 15K tasks/sec and aggregate I/O throughputs of 170Gb/s.
My work has been supported in part by the NASA Ames Research Center GSRP Grant Number NNA06CB89H and by the Mathematical, Information, and Computational Sciences Division subprogram of the Office of Advanced Scientific Computing Research, Office of Science, U.S. Dept. of Energy, under Contract DE-AC02-06CH11357.
News
Wikipedia: "Many-task computing", January 15th, 2009
Historical Log Plot: "Plot of Falkon historical logs from December 2007 to October 2008".
Site Table of Contents
09/2003 - 06/2005: University of Chicago, Department of Computer Science
08/2002 - 05/2003: Purdue University, Department of Computer Science
06/2002 - 08/2002: University of Michigan, Department of Computer and Information Science
08/2000 - 12/2001: Wayne State University, Department of Computer Science
For a detailed description of each project, and my contributions to each, please see the main project page.
12/2006 - Present: Falkon: a Fast and Light-weight tasK executiON framework
12/2006 - Present: Swift: Fast, Reliable, Loosely Coupled Parallel Computation
06/2005 - 03/2009: AstroPortal: A Science Gateway for Large-scale Astronomy Data Analysis
01/2006 - 06/2008: ServMark: an Architecture for Testing Grid Services
06/2006 - 12/2007: HOC-SA: Higher-Order Components-Service Architecture
09/2003 - 12/2006: DI-GRUBER: A Distributed Grid Resource Broker
09/2003 - 05/2005: DiPerF: an automated DIstributed PERformance testing Framework
This is a list of news articles that have been published that refernced my work.
Below is a short list of selected and recent publications and proposals that are at the core of my dissertation. For a complete list of publications, theses, proposals, and technical reports, please see the main publication page.
Ioan Raicu, Ian Foster, Yong Zhao. Many-Task Computing for Grids and Supercomputers, Invited Paper, IEEE Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS08), 2008, co-located with IEEE/ACM Supercomputing 2008.
Ian Foster, Yong Zhao, Ioan Raicu, Shiyong Lu. Cloud Computing and Grid Computing 360-Degree Compared, to appear at IEEE Grid Computing Environments (GCE08) 2008, co-located with IEEE/ACM Supercomputing 2008.
Zhao Zhang, Allan Espinosa, Kamil Iskra, Ioan Raicu, Ian Foster, Michael Wilde. Design and Evaluation of a Collective I/O Model for Loosely-coupled Petascale Programming, to appear at IEEE Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS08) 2008, co-located with IEEE/ACM Supercomputing 2008.
Ioan Raicu, Zhao Zhang, Mike Wilde, Ian Foster, Pete Beckman, Kamil Iskra, Ben Clifford. Toward Loosely Coupled Programming on Petascale Systems, to appear at IEEE/ACM Supercomputing 2008.
Ioan Raicu, Zhao Zhang, Mike Wilde, Ian Foster. Enabling Loosely-Coupled Serial Job Execution on the IBM BlueGene/P Supercomputer and the SiCortex SC5832, Technical Report, Department of Computer Science, University of Chicago, April 2008.
Ioan Raicu, Ian Foster. Harnessing Grid Resources to Enable the Dynamic Analysis of Large Astronomy Datasets: Year 2 Status and Year 3 Proposal, GSRP, Ames Research Center, NASA, March 2008 -- Award funded 10/1/08 - 9/30/09.
Quan T. Pham, Atilla S. Balkir, Jing Tie, Ian Foster, Mike Wilde, Ioan Raicu. Data Intensive Scalable Computing on TeraGrid: A Comparison of MapReduce and Swift, Poster Presentation, TeraGrid Conference 2008.
Ioan Raicu, Yong Zhao, Ian Foster, Mike Wilde, Zhao Zhang, Ben Clifford, Mihael Hategan, Sarah Kenny. Managing and Executing Loosely Coupled Large Scale Applications on Clusters, Grids, and Supercomputers, Extended Abstract, GlobusWorld08, part of Open Source Grid and Cluster Conference 2008.
Yong Zhao, Ioan Raicu, Ian Foster. Scientific Workflow Systems for 21st Century e-Science, New Bottle or New Wine?, Invited Paper, IEEE Workshop on Scientific Workflows 2008, co-located with IEEE International Conference on Services Computing (SCC) 2008.
Ioan Raicu, Yong Zhao, Ian Foster, Alex Szalay. "Accelerating Large-scale Data Exploration through Data Diffusion", International Workshop on Data-Aware Distributed Computing 2008, co-locate with ACM/IEEE International Symposium High Performance Distributed Computing (HPDC) 2008.
Ioan Raicu, Ian Foster. Harnessing Grid Resources to Enable the Dynamic Analysis of Large Astronomy Datasets: Year 2 Status and Year 3 Proposal, GSRP, Ames Research Center, NASA, February 2008.
Ioan Raicu, Ian Foster. Harnessing Grid Resources to Enable the Dynamic Analysis of Large Astronomy Datasets: Year 1 Final Report, GSRP, Ames Research Center, NASA, February 2008.
Yong Zhao, Ioan Raicu, Ian Foster, Mihael Hategan, Veronika Nefedova, Mike Wilde. Realizing Fast, Scalable and Reliable Scientific Computations in Grid Environments, to appear as a book chapter in Grid Computing Research Progress, ISBN: 978-1-60456-404-4, Nova Publisher 2008.
Ioan Raicu. Harnessing Grid Resources with Data-Centric Task Farms, University of Chicago, Computer Science Department, PhD Proposal, December 2007, Chicago, Illinois.
Ioan Raicu, Yong Zhao, Catalin Dumitrescu, Ian Foster and Mike Wilde. Falkon: A Proposal for Project Globus Incubation, Globus Incubation Management Project, 2007 Proposal accepted 11/10/07.
Ioan Raicu, Ian Foster. Harnessing Grid Resources to Enable the Dynamic Analysis of Large Astronomy Datasets: Year 1 Status and Year 2 Proposal, GSRP, Ames Research Center, NASA, February 2007 -- Award funded 10/1/07 - 9/30/08.
Ioan Raicu, Yong Zhao, Ian Foster, Alex Szalay. A Data Diffusion Approach to Large Scale Scientific Exploration, Microsoft Research eScience Workshop 2007.
Ioan Raicu, Yong Zhao, Catalin Dumitrescu, Ian Foster, Mike Wilde. Falkon: a Fast and Light-weight tasK executiON framework, IEEE/ACM SuperComputing 2007.
Ioan Raicu, Catalin Dumitrescu, Ian Foster. Dynamic Resource Provisioning in Grid Environments, TeraGrid Conference 2007.
Yong Zhao, Mihael Hategan, Ben Clifford, Ian Foster, Gregor von Laszewski, Ioan Raicu, Tiberiu Stef-Praun, Mike Wilde. Swift: Fast, Reliable, Loosely Coupled Parallel Computation, IEEE Workshop on Scientific Workflows 2007.
Ioan Raicu, Ian Foster. Harnessing Grid Resources to Enable the Dynamic Analysis of Large Astronomy Datasets, GSRP, Ames Research Center, NASA, February 2006 -- Award funded 10/1/06 - 9/30/07.
Ioan Raicu, Ian Foster, Alex Szalay. Harnessing Grid Resources to Enable the Dynamic Analysis of Large Astronomy Datasets, poster presentation, IEEE/ACM SuperComputing 2006.
Ioan Raicu, Ian Foster, Alex Szalay, Gabriela Turcu. AstroPortal: A Science Gateway for Large-scale Astronomy Data Analysis, TeraGrid Conference 2006, June 2006.
Alex Szalay, Julian Bunn, Jim Gray, Ian Foster, Ioan Raicu. The Importance of Data Locality in Distributed Computing Applications, NSF Workflow Workshop 2006.
Below is a short list of selected and recent presentations that are at the core of my dissertation. For a complete list of presentations, please see the main presentation page.
Below is a list of the conferences, workshops, journals, and books for which I have been either a reviewer for or in the program committee.
International Conference on Complex, Intelligent and Software Intensive Systems (CISIS), 2010
IEEE International Conference on Networking and Services (ICNS), 2010
IEEE International Workshop on High Performance Distributed Data Management (HPDDM),
IEEE Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS), 2009
Distributed Systems Laboratory Workshop (DSLW), 2009
IEEE International Workshop on Cloud Computing (CC), 2009
IEEE International Conference on Networks (ICN), 2010
IEEE International Conference on Computational Science and Engineering (CSE-09), 2009
IEEE International Workshop on Real-Time Service-Oriented Architecture and Applications (RTSOAA), 2009
Journal of Concurrency and Computation: Practice and Experience, 2009
ISCA International Journal of Computers and their Applications (IJCA), 2009
IEEE/ACM International Symposium on Collaborative Technologies and Systems (CTS), 2009
IEEE Workshop on Data-Aware Distributed Computing (DADC), 2009
IEEE International Workshop on Scientific Workflows (SWF), 2009
Megajobs: How to Run One Million Jobs, BOF at IEEE/ACM Supercomputing 2008
IEEE/ACM Workshop on Grid Computing Portals and Science Gateways (GCE08) [committee member & reviewer]
IEEE International Conference on Internet and Web Applications and Services (ICIW 2009) [committee member & reviewer]
IEEE/ACM Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS), co-located with IEEE/ACM Supercomputing 2008 Conference [co-chair, organizing committee, committee member & reviewer]
TeraGrid Conference (TG09) [organizing committee, committee member & reviewer]
IEEE International Conference on Networks (ICN 2009) [committee member & reviewer]
IEEE International Conference on Networking and Services (ICNS 2009) [committee member & reviewer]
Distributed Systems Laboratory Workshop (DSLW08) [chair, organizing committee]
IEEE International Conference on Internet and Web Applications and Services (ICIW08) [committee member & reviewer]
Sixth Annual Conference on Communication Networks and Services Research (CNSR08) [reviewer]
The Handbook of Technology Management (book to appear in 2008) [reviewer]
TeraGrid Conference (TG08) [reviewer]
Distributed Systems Laboratory Workshop (DSLW07) [chair, organizing committee]
ACM/IET/ICST International Workshop on Performance and Analysis of Wireless Networks (PAWN08) [committee member & reviewer]
IEEE International Conference on Advanced Engineering Computing and Applications in Sciences (ADVCOMP08) [committee member & reviewer]
IEEE International Conference on Systems and Networks Communications (ICNSC08) [committee member & reviewer]
IEEE International Conference on Networking and Services (ICNS08) [committee member & reviewer]
IEEE International Conference on Networking (ICN08) [committee member & reviewer]
IEEE Internet Computing, Special Issue on Virtual Organizations, 2007 [reviewer]
IEEE/ACM Workshop on Grid Computing Portals and Science Gateways (GCE07) [committee member & reviewer]
IEEE/ACM Grid Conference (SC07) [reviewer]
IEEE Internet Computing (IC07) [reviewer]
The Handbook of Computer Networks (2007) [reviewer]
IEEE/ACM SuperComputing (SC06) [reviewer]
Distributed Systems Laboratory Workshop (DSLW06) [chair, organizing committee]
IEEE Transactions on Computers (TC06) [reviewer]
Journal of Concurrency and Computation: Practice and Experience 2006 [reviewer]
IEEE Communication Letters (CL05) [reviewer]
High Performance Computing Symposium (HPCC05) [reviewer]
IEEE Intelligent Sensing and Information Processing (ICISIP05) [committee member & reviewer]
ARC Research Network on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP05) [reviewer]
IEEE International Conference on Computer Communications and Networks (IC3N02) [reviewer]
IEEE International Workshop on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS02) [reviewer]
![]()