Challenges in Data Intensive Computing
Dr. Rob Ross, Argonne National Laboratory, USA
As computing systems
mature, three distinct storage interaction models are emerging.
systems we are now seeing search capabilities
change the way users interact with the file system, lessening the role
of the traditional directory hierarchy. In the wide area, storage resource
management systems that allow users to locate and retrieve data are becoming
an important aspect of the scientific process. On HEC systems the focus
remains on extremely high performance, with applications interacting
primarily through POSIX and structured I/O interfaces and with interactive
access taking a back seat.
Some HEC sites have
long pushed for a globally accessible, secure, high-performance parallel
file system that stores data from applications, allows direct
access to this data from multiple on-site systems, and provides access
across the wide area. This vision has persisted for quite some time,
despite the inability of the community to make it a reality.
panel we will revisit the vision of data access from the user's perspective.
We will discuss the viability of this vision from a technical perspective
as well as debating how appropriate this vision is given the reality
of today's computational science and its workflows.
National Laboratories | Privacy
would you characterize the models of interaction with HEC storage
- On the HEC
- At a desktop
or other system on-site?
these models of interaction, how compelling is the vision
of a globally accessible file system?
- What parts
of the vision match with your models of interaction?
- What is missing
from the vision?
- What is unnecessary?
are the biggest technical challenges facing the implementation
of your vision?
- Are there
technical challenges that seem insurmountable?
- What technologies
can we adopt from other areas?
- What should
we be focusing on now?