Falkon: a Fast and Light-weight tasK executiON framework

Falkon [2] is a software component that is a Globus Incubator project as of November 2007.  Falkon is being actively developed at University of Chicago, Computer Science Department in the Distributed Systems Laboratory under Ian Foster's guidance with funding from DOE and NASA. 

The initial idea and motivation behind Falkon emerged in 2005 with an astronomy application (an image stacking/co-adding service, aka AstroPortal) at its core.  The application involved an image dataset Sloan Digital Sky Survey (SDSS) DR4/DR5 of 10TB and involved 1000s to 10000s of object retrievals from this dataset, calibrating these objects, and finally stacking them to produce the final output.  This application was faced with several challenges when being run in tradition Grid Computing environments:

After an initial prototype of the astronomy specific application AstroPortal was complete, work began on generalizing the system so other applications could benefit from a system that could address the 3 challenges mentioned.  In January 2007, the first prototype of Falkon v0 was complete, and testing began with other applications.  In order to leverage a large pool of applications to use Falkon transparently, Falkon coordinated its efforts with the Swift project (a parallel programming system) and created a Falkon Provider to be included with Swift.  Applications from many domains (astronomy, medicine, chemistry, and economics) were tested and showed significant performance improvements.  Furthermore, the project is extending well beyond traditional grids by being ported to the IBM BlueGene/P that will be online in March 2008 at Argonne National Laboratory. 

There have been many people that contributed (ideas, writing, code, etc) to Falkon; my contributions to Falkon have been the leading of the project in general, as well as the designed and implementation of the core Falkon functionalities, to enable the rapid and efficient execution of many independent jobs on large compute clusters. Falkon combines three techniques to achieve this goal: (1) multi-level scheduling techniques to enable separate treatments of resource provisioning and the dispatch of user tasks to those resources; (2) a streamlined task dispatcher able to achieve order-of-magnitude higher task dispatch rates than conventional schedulers; and (3) performs data caching and uses a data-aware scheduler to leverage the co-located computational and storage resources to minimize the use of shared storage infrastructure. 

The latest stable release of Falkon is currently at v0.9.r11.  More detailed information about Falkon and to download the code for free, please visit Falkon Globus Incubator page at http://dev.globus.org/wiki/Incubator/Falkon.

Collaborators:

Presentations:

Papers:

Relevant Links to related projects or sub-projects of Falkon:

 

Last modified: October 13, 2009