Publications
2016
Storage
-
Server-side Log Data Analytics for I/O Workload
Characterization and Coordination on Large Shared Storage
Systems,
Yang Liu, Raghul Gunasekaran, Xiaosong Ma, and Sudharshan S.
Vazhkudai,
Proceedings of Supercomputing 2016 (SC16): 29th Int'l
Conference on High Performance Computing, Networking,
Storage and Analysis, Salt Lake City, UT, November
2016.
pdf
Data management
-
Constellation: A Science Graph Network for Scalable
Data and Knowledge Discovery in Extreme-Scale Scientific
Collaborations,
Sudharshan S. Vazhkudai, John Harney, Raghul Gunasekaran,
Dale Stansberry, Seung-Hwan Lim, Tom Barron, Andrew Nash and
Arvind Ramanathan,
Proceedings of the IEEE Workshop on Big Data Metadata
and Management, Washington D.C., December 2016.
pdf
-
An I/O Load Balancing Framework for Large-scale
Applications (BPIO 2.0) (Poster),
Sarah Neuwirth, Feiyi Wang, Sarp Oral, Sudharshan S.
Vazhkudai, Ulrich Bruening,
Proceedings of Supercomputing 2016 (SC16): 29th Int'l
Conference on High Performance Computing, Networking,
Storage and Analysis, Salt Lake City, UT, November
2016.
pdf
-
Using Balanced Data Placement to Address I/O
Contention in Production Environments,
Sarah Neuwirth, Feiyi Wang, Sarp Oral, Sudharshan S.
Vazhkudai, James Rogers, Ulrich Bruening,
Proceedings of the Int'l Symposium on Computer Architecture
and High Performance Computing, Los Angeles, CA, October
2016.
pdf
-
TagIt: An Integrated Search and Discovery Service
for Extreme-Scale File Systems (Poster),
Hyogi Sim, Youngjae Kim, Sudharshan S. Vazhkudai, Geoffroy
R. Vallee, Seung-Hwan Lim, and Ali R. Butt,
Proceedings of the USENIX Annual Technical Conference
(ATC), Denver, CO, June 2016.
System Architecture and Resilience
-
A Multi-faceted Approach to Job Placement for Improved
Performance on Extreme-Scale Systems,
Christopher Zimmer, Saurabh Gupta, Scott Atchley, Sudharshan
S. Vazhkudai, and Carl Albing,
Proceedings of Supercomputing 2016 (SC16): 29th Int'l
Conference on High Performance Computing, Networking,
Storage and Analysis, Salt Lake City, UT, November 2016.
pdf
2015
Storage
-
A Practical Approach to Reconciling Availability,
Performance, and Capacity in Provisioning Extreme-scale
Storage Systems,
Lipeng Wan, Feiyi Wang, Sarp Oral, Devesh Tiwari, Sudharshan
S. Vazhkudai, Qing Cao,
Proceedings of Supercomputing 2015 (SC15): 28th Int'l
Conference on High Performance Computing, Networking,
Storage and Analysis, Austin, TX, November 2015.
pdf
Non-Volatile Memory
-
AnalyzeThis: An Analysis Workflow-Aware Storage
System,
Hyogi Sim, Youngjae Kim, Sudharshan S. Vazhkudai, Devesh
Tiwari, Ali Anwar, Ali R. Butt, Lavanya Ramakrishnan,
Proceedings of Supercomputing 2015 (SC15): 28th Int'l
Conference on High Performance Computing, Networking,
Storage and Analysis, Austin, TX, November 2015.
pdf
System Architecture and Resilience
-
Spatial Locality-Aware Cache Partitioning for
Effective Cache Sharing,
Saurabh Gupta and Huiyang Zhou,
To appear in The 44th International Conference on Parallel
Processing (ICPP 2015), September, 2015. pdf
-
Understanding and Exploiting Spatial Properties of
System Failures on Extreme-Scale HPC Systems,
Saurabh Gupta, Devesh Tiwari, Christopher J. Jantzi, James
H. Rogers, Don Maxwell,
In The 45th IEEE Conference on Dependable
Systems and Networks (DSN 2015), June, 2015.
pdf
-
Experience with GPUs on the Titan Supercomputer from
a Reliability, Performance and Power Perspective,
Devesh Tiwari, Saurabh Gupta, Jim Rogers, Don Maxwell,
In The 37th Cray User Group (CUG 2015), April, 2015.
-
Understanding GPU Errors on Large-scale HPC Systems
and the Implications for System Design and
Operation,
Devesh Tiwari, Saurabh Gupta, Jim Rogers, Don Maxwell, Paolo
Rech, Sudharshan Vazhkudai, Daniel Oliveira, Dave Londo,
Nathan Debardeleben, Philippe Navaux, Luigi Carro, and
Arthur Buddy Bland,
In Proceedings of 21st IEEE Symposium on High
Performance Computer Architecture (HPCA 2015), February,
2015.
pdf
2014
File and Storage Systems
-
Improving Large-scale Storage System Performance via
Topology-aware and Balanced Data Placement,
Feiyi Wang, Sarp Oral, Saurabh Gupta, Devesh Tiwari, and
Sudharshan Vazhkudai,
In Proceedings of 20th IEEE International Conference on
Parallel and Distributed Systems (ICPADS 2014),
December, 2014.
pdf
-
Best Practices and Lessons Learned from Deploying
and Operating Large-Scale Data-Centric Parallel File
Systems,
Sarp Oral, James Simmons, Jason Hill,
Dustin Leverman, Feiyi Wang, Matt Ezell, Ross Miller,
Douglas Fuller, Raghul Gunasekaran, Youngjae Kim, Saurabh
Gupta, Devesh Tiwari, Sudharshan S. Vazhkudai, James H.
Rogers, David Dillow, Arthur S. Bland, Galen M. Shipman,
Proceedings of Supercomputing 2014 (SC14): 27th IEEE/ACM
Int'l Conference on High Performance Computing, Networking,
Storage and Analysis, New Orleans, Louisiana, November
2014. (Best Paper Finalist)
- Automatic Identification of Applications I/O
Signatures from Noisy Server-Side Traces,
Yang Liu, Raghul Gunasekaran, Xiaosong Ma, Sudharshan S.
Vazhkudai.
Proceedings of the USENIX Conference on File
and Storage Technologies (FAST 2014), Santa Clara,
California, February 2014.
-
HybridPlan: A Capacity Planning Technique for
Projecting Storage Requirements in Hybrid Storage
Systems,
Youngjae Kim, Aayush Gupta, Bhuvan Urgaonkar, Piotr Berman,
Anand Sivasubramaniam,
Springer Journal of Supercomputing
(JSC), Vol. 67, No. 1, pp. 277-303, January 2014.
- Synchronous I/O Scheduling of Independent Write
Caches for an Array of SSDs,
Junghee Lee, Youngjae Kim, Jongman Kim, Galen M. Shipman.
(to appear) IEEE Computer Architecture Letters
(CAL), 2014.
-
Realizing Accelerated Cost-Effective Distributed
RAID,
Aleksandr Khasymski, M. Mustafa Rafique, Ali R. Butt,
Sudharshan S. Vazhkudai, and Dimitrios S. Nikolopoulos,
Chapter in Handbook on Data Centers, edited by
Albert Y. Zomaya and Samee U. Khan, Springer, 2014. pdf
Non-Volatile Memory
- Coordinated Garbage Collection for RAID Array of
Solid State Disks,
Inventors (ordered by inventor's last name) David Dillow,
Youngjae Kim, Sarp Oral, Galen Shipman, Feiyi Wang,
U.S. Patent No. 8,713,268, Issued: April 29, 2014.
-
Coordinating Garbage Collection for Arrays of
Solid-state Drives,
Youngjae Kim, Junghee Lee, Sarp Oral, David Dillow, Feiyi
Wang, Galen M. Shipman,
IEEE Transactions on Computers (IEEE TC), Vol. 63,
No. 4, pp. 888-901, April 2014.
Data Management
Networking
System Architecture and Resilience
-
Feedback Computing in Leadership Compute
Systems,
Raghul Gunasekaran and Youngjae Kim,
Proceedings of the 9th International Workshop of
Feedback Computing in conjunction with ICAC'14 (Feedback
Computing'14), Philadelphia, June 2014.
-
Lazy Checkpointing: Exploiting Temporal Locality in
Failures to Mitigate Checkpointing Overheads on
Extreme-Scale Systems,
Devesh Tiwari, Saurabh Gupta, Sudharshan S. Vazhkudai,
Proceedings of the 44th Annual IEEE/IFIP International
Conference on Dependable Systems and Networks (DSN
2014), Atlanta, Georgia, June 2014. (Best Paper Finalist)
pdf
-
MapReuse: Reusing Computation in an In-Memory
MapReduce System,
Devesh Tiwari and Yan Solihin,
Proceedings of the 29th IEEE International Parallel &
Distributed Processing Symposium (IPDPS), May, 2014.
-
I/O Router Placement and Fine-Grained Routing on
Titan to Support Spider II,
Matt Ezell, Dave Dillow, Sarp Oral, Feiyi Wang, Devesh
Tiwari, Don Maxwell, Dustin Leverman, Jason Hill,
Proceedings of the Cray User Group Conference
(CUG), May, 2014.
-
SSD Provisioning for Exascale Storage System: When,
Where and How much?,
Devesh Tiwari, Sarp Oral, Feiyi Wang, Saurabh Gupta and
Josh Judd,
Proceedings of the Lustre User Group Meetings (LUG),
April, 2014.
-
Transparent Fault Tolerance for Job Input Data in
HPC Environments,
Chao Wang, Sudharshan S. Vazhkudai, Xiaosong Ma, Frank Mueller,
Chapter in Handbook on Data Centers, edited by
Albert Y. Zomaya and Samee U. Khan, Springer, 2014.
pdf
2013
File and Storage Systems
- Asynchronous Object Storage with QoS for Scientific
and Commercial Big Data, Michael J. Brim, David A.
Dillow, Sarp Oral, Bradley W. Settlemyer, Feiyi Wang,
Proceedings of the 8th Parallel Data Storage Workshop (PDSW)
held in conjunction with SC'13, Denver, CO, November 2013.
- Performance and Scalability Evaluation of the Ceph
Parallel File System, Feiyi Wang, Mark Nelson, Sarp
Oral, Scott Atchley, Sage Weil, Brad Settlemyer, Blake Caldwell,
Jason Hill, Proceedings of the 8th Parallel Data Storage
Workshop (PDSW) held in conjunction with SC'13, Denver, CO,
November 2013.
- OLCF's 1 TB/s, Next-Generation Lustre File
System, Sarp Oral, David A. Dillow, Douglas Fuller,
Jason Hill, Dustin Leverman, Sudharshan S. Vazhkudai, Feiyi
Wang, Youngjae Kim, James Rogers, James Simmons, Ross Miller,
Proceedings of the Cray User Group Conference (CUG),
Napa Valley, California, May 2013.
- Taking Advantage of Multicore for the Lustre Gemini
LND Driver, James A. Simmons and John Lewis,
Proceedings of the Cray User Group Conference (CUG),
Napa Valley, California, May 2013.
Non-Volatile Memory
- A Temporal Locality-aware Page-Mapped Flash
Translation Layer, Youngjae Kim, Aayush Gupta, Bhuvan
Urgaonkar, Journal of Computer Science and Technology
(JCST), Vol. 28, No. 6, pp. 1025-1044, November 2013.
- Preemptible I/O Scheduling of Garbage Collection
for Solid-state Drives, Junghee Lee, Youngjae Kim,
Galen M. Shipman, Sarp Oral, Jongman Kim,
IEEE Transactions on Computer-Aided Design of Integrated Circuits
and Systems (IEEE TCAD), Vol. 32, No. 2, pp. 247-260, February 2013.
- Active Flash: Towards Energy-Efficient, In-Situ
Data Analytics on Extreme-Scale Machines, Devesh
Tiwari, Simona Boboila, Sudharshan Vazhkudai, Youngjae Kim,
Xiaosong Ma, Peter Desnoyers, Yan Solihin.
Proceedings of the USENIX Conference on File and Storage
Technologies (FAST 2013), 2013.
Data Management
- Design and Implementation of a Scalable Climate
Data System in Support of HPC Environment,
Feiyi Wang (1), John Harney (1), Tom Barron (1), Galen Shipman (1),
Dean Williams (2), Luca Cinquini (3);
(1) Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831;
(2) Lawrence Livermore National Laboratory, Livermore, California 94550;
(3) Jet Propulsion Laboratory, Pasadena, California 91109.
In submission.
Networking
- End-to-End Data Movement Using MPI-IO Over Routed
Terabits Infrastructures, Geoffroy R. Vallee,
Scott Atchley, Youngjae Kim, Galen M. Shipman.
Proceedings of the IEEE/ACM International Workshop on
Network-aware Data Management in conjunction with
SC'13, Denver, CO, November 2013.
- Layout-aware I/O Scheduling for Terabit Data
Movement, Youngjae Kim, Scott Atchley, Geoffroy R.
Vallee, Galen M. Shipman. Proceedings of the Workshop on
Distributed Storage Systems and Coding for Big Data held in
conjunction with IEEE Big Data'13, San Jose, CA, October
2013.
-
On Timely Staging of HPC Job Input Data,
H. Monti, A. R. Butt, S.S. Vazhkudai,
IEEE Transactions on Parallel and Distributed Systems
(TPDS), Vol. 24, No. 9, pp. 1841-1851, September 2013.
pdf
Supplement
(pdf)
- SeaStar Unchained: Multiplying the Performance of
the Cray SeaStar Network, David A. Dillow, Scott
Atchley, Proceedings of the Cray User Group Conference
(CUG), Napa Valley, California, May 2013.
2012
File and Storage Systems
- Characterizing Output Bottlenecks in a
Supercomputer, Bing Xie, Jeff Chase, David Dillow,
Oleg Drokin, Scott Klasky, Sarp Oral, Norbert
Podhorszki, Proceedings of Supercomputing (SC): the 25th
IEEE/ACM Int'l Conference on High Performance Computing,
Networking, Storage and Analysis (SC 2012), Salt Lake
City, UT, November
2012. pdf
- A Next-Generation Parallel File System Environment
for the OLCF, David A. Dillow, Douglas Fuller, Raghul
Gunasekaran, Jason J. Hill, Youngjae Kim, Sarp Oral, Doug M.
Reitz, Galen M. Shipman, James A. Simmons, Feiyi Wang,
Proceedings of the Cray User Group conference (CUG
2012), Stuttgart, Germany, April 29-May 3, 2012.
- Technical Overview of the OLCF's Next Generation
Parallel File System, Galen M. Shipman, David A.
Dillow, Jason J. Hill, Youngjae Kim, Sarp Oral, Doug M. Reitz,
James A. Simmons, Proceedings of Lustre User Group
Meetings (LUG 2012), Austin, TX, April, 2012. talk
(pdf)
- Distributed Storage Systems for Data Intensive
Computing, Sudharshan S. Vazhkudai, Ali R. Butt,
Xiaosong Ma, Book Chapter in Data Intensive Distributed
Computing: Challenges and Solutions for Large-scale
Information Management, Editor: Tevfik Kosar, IGI Global
Books, January 2012. (ISBN: 9781615209712, DOI:
10.4018/978-1-61520-971-2)
Non-Volatile Memory
- Reducing Data Movement Costs using Energy-Efficient
Active Computation on SSD, Devesh Tiwari, Sudharshan
S. Vazhkudai, Youngjae Kim, Xiaosong Ma, Simona Boboila, Peter
J. Desnoyers, Proceedings of the USENIX Workshop on
Power-Aware Computing and Systems (HotPower'12, co-located
with OSDI'12), Hollywood, CA, October 2012.
pdf
- NVMalloc: Exposing an Aggregate SSD Store as a
Memory Partition in Extreme-Scale Machines, Chao
Wang, Sudharshan S. Vazhkudai, Xiaosong Ma, Fei Meng, Youngjae
Kim, Christian Engelmann, Proceedings of the 26th IEEE
Int'l Parallel & Distributed Processing Symposium (IPDPS
2012), Shanghai, China, May 2012. pdf
- Active Flash: Out-of-core Data Analytics on Flash
Storage, Simona Boboila, Youngjae Kim, Sudharshan S.
Vazhkudai, Peter J. Desnoyers, Galen M. Shipman,
Proceedings of the 28th IEEE Conference on Mass Storage
Systems and Technologies (MSST 2012), Monterey, CA, April
2012. pdf
- Comparing Coordinated Garbage Collection Algorithms
for Arrays of Solid-state Drives, Junghee Lee,
Youngjae Kim, Sarp Oral, Galen M. Shipman, David A. Dillow,
Feiyi Wang, Proceedings of the 3rd Non-Volatile Memories
Workshop (NVMW 2012), San Diego, CA, March 2012.
- Active Flash: Performance-Energy Tradeoffs for
Out-of-Core Processing on Non-Volatile Memory
Devices, Simona Boboila, Youngjae Kim, Sudharshan S.
Vazhkudai, Peter J. Desnoyers, Galen M. Shipman, Poster in
the Proceedings of the 3rd Non-Volatile Memories Workshop
(NVMW 2012), San Diego, CA, March 2012. paper
(pdf), poster
(pdf)
- Multi-level Hybrid Cache: Impact and
Feasibility, Zhe Zhang, Youngjae Kim, Xiaosong Ma,
Galen M. Shipman, Yuanyuan Zhou, Technical Report
(ORNL/TM-2010/297), National Center for Computational
Sciences, Oak Ridge National Laboratory, February 2012.
pdf
Data Management
- Workload Characterization and Performance Implications of
Large-Scale Blog Servers, Myeongjae Jeon, Youngjae Kim, Jaeho
Hwang, Joonwon Lee, Euiseong Seo, ACM Transactions on the Web
(ACM TWEB), Volume 6, Issue 4, November, 2012.
- Big Data Platforms as a Service: Challenges and
Approach, James Horey, Edmon Begoli, Raghul
Gunasekaran, Seung-Hwan Lim, James Nutaro, Proceedings of
the 3rd USENIX Workshop on Hot Topics in Cloud Computing
(HotCloud 2012), Boston, MA, June 2012.
pdf
- Practical Application of Parallel Coordinates for
Climate Model Analysis, Chad A. Steed, Galen M.
Shipman, Peter E. Thornton, Daniel M. Ricciuto, David J.
Erickson III, Marcia L. Branstetter, Proceedings of the
International Conference on Computer Science--Workshop on Data
Mining in Earth Science, pp. 877–886, June 2012.
- On Timely Staging of HPC Job Input Data, Henry M.
Monti, Ali R. Butt, Sudharshan S. Vazhkudai, IEEE
Transactions on Parallel and Distributed Systems (TPDS),
2012.
System Architecture and Resilience
- On the Use of GPUs in Realizing Cost-Effective
Distributed RAID, Aleksandr Khasymski, M. Mustafa
Rafique, Ali R. Butt, Sudharshan S. Vazhkudai, Dimitrios S.
Nikolopoulos, Proceedings of the IEEE International
Symposium on Modeling, Analysis and Simulation of Computer and
Telecommunication Systems (MASCOTS 2012), Washington,
D.C., August 2012. pdf
- D-Factor: A Quantitative Model of Application
Slow-Down in Shared Service Systems with Multiple
Resources, Seung-Hwan Lim, Jae-Seok Huh, Youngjae
Kim, Galen M. Shipman, Chita Das, Proceedings of the ACM
Int'l Conference on Measurement and Modeling of Computer
Systems (SIGMETRICS 2012), London, United Kingdom, June
11-15, 2012. pdf
2011
File and Storage Systems
- Dynamic Thermal Management for High-Performance Storage
Systems, Youngjae Kim, Sudhanva Gurumurthi, Anand
Sivasubramaniam. Book Chapter in Handbook of Energy-aware
and Green Computing, Editors: Ishfaq Ahmad and Sanjay
Ranka, Publisher: Chapman and Hall/CRC Press Taylor and Francis
Group LLC, December 26, 2011. ISBN-13: 978-1439850804.
- Enhancing I/O Throughput via Efficient Routing and
Placement for Large-scale Parallel File Systems,
David Dillow, Sarp Oral, Galen M. Shipman, Zhe Zhang, Youngjae
Kim, Proceedings of the 30th IEEE Int'l Performance
Computing and Communications Conference (IPCCC 2011),
Orlando, FL, November 17-19, 2011. pdf
- Oak Ridge Leadership Computing Facility Position
Paper, Sarp Oral, Jason Hill, Kevin G. Thach, Norbert
Podhorszki, Scott A. Klasky, James H. Rogers, Galen M.
Shipman, Fifth U.S. Department of Energy Best Practices
Workshop on File Systems & Archives, San Francisco,
CA, September, 2011.
- Provisioning a Multi-Tiered Data Staging Area for
Extreme-Scale Machines, Ramya Prabhakar, Sudharshan
Vazhkudai, Youngjae Kim, Ali Butt, Min Li, Mahmut Kandemir,
Proceedings of the 31st Int'l Conference on Distributed
Computing Systems (ICDCS 2011), Minneapolis, Minnesota,
June 20-24, 2011. pdf
- I/O Congestion Avoidance via Routing and Object
Placement, David A. Dillow, Galen M. Shipman, Sarp
Oral, Zhe Zhang, Proceedings of the Cray User Group
conference (CUG 2011), Alaska, May, 2011.
- Testing methodology for large-scale file
systems, Sarp Oral, Proceedings of Lustre User
Group Meetings (LUG 2011), Orlando, FL, April, 2011.
Non-Volatile Memory
- HybridStore: A Cost-Efficient, High-Performance
Storage System Combining SSDs and HDDs, Youngjae Kim,
Aayush Gupta, Bhuvan Urgaonkar, Piotr Berman, Anand
Sivasubramaniam, Proceedings of the 19th IEEE Int'l
Symposium on Modeling, Analysis and Simulation of Computer and
Telecommunication Systems (MASCOTS 2011), Singapore, July
25-27, 2011. pdf
- Harmonia: A Globally Coordinated Garbage Collector
for Arrays of Solid-state Drives, Youngjae Kim, Sarp
Oral, Galen M. Shipman, Junghee Lee, David Dillow, Feiyi Wang.
Proceedings of the 27th IEEE Symposium on Massive Storage
Systems and Technologies (MSST 2011), Denver, Colorado,
May 23-27, 2011. pdf
- A Semi-Preemptive Garbage Collector for Solid State
Drives, Junghee Lee, Youngjae Kim, Galen M. Shipman,
Sarp Oral, Feiyi Wang, Jongman Kim, Proceedings of the
IEEE Int'l Symposium on Performance Analysis of Systems and
Software (ISPASS 2011), Austin, TX, April 10-12, 2011.
(Best Paper Finalist)
pdf
- A Comprehensive Study on Energy Efficiency and
Performance of Flash-based SSD, Seon-Yeong Park,
Youngjae Kim, Bhuvan Urgaonkar, Joonwon Lee, Euiseong
Seo, Elsevier Journal of Systems Architecture (Elsevier
JSA), Volume 57, Issue 4, Pages 354-365, April 2011.
- Pathological Behavior of SSDs and Application in
HPC Storage, Youngjae Kim, Junghee Lee, Galen M.
Shipman, Proceedings of the 2nd Non-Volatile Memories
Workshop (NVMW 2011), San Diego, CA, March 2011.
Data Management
- Building a Large-Scale Climate Data Systems for HPC
Environment, Feiyi Wang, John Harney, Galen Shipman,
Dean Williams, Luca Cinquini, Proceedings of IEEE 7th
International Conference on Next Generation Web Service
Practices, Salamanca, Spain, October 19-21, 2011. pdf
- Metadata - Beyond Hierarchy and POSIX Attributes,
Galen M. Shipman, HEC FSIO 2011 Workshop, August, 2011.
- Characterizing Application Runtime Behavior from
System Logs and Metrics, Raghul Gunasekaran, David A.
Dillow, Galen M. Shipman, Richard Vuduc, Edmond Chow,
Proceedings of Workshop on Characterizing Applications for
Heterogeneous Exascale Systems (co-located with ICS'11),
Tucson, AZ, June, 2011. pdf
- Real-Time System Log Monitoring/Analytics
Framework, Raghul Gunasekaran, Sarp Oral, David A.
Dillow, Byung-Hoon Park, Galen M. Shipman, Al Geist,
Proceedings of the Cray User Group conference (CUG
2011), Alaska, May, 2011. pdf
Networking
- The Common Communication Interface (CCI),
Scott Atchley, David A. Dillow, Galen M. Shipman, Patrick
Geoffray, Jeffrey M. Squyres, George Bosilca, Ronald Minnich,
Proceedings of the 2011 IEEE 19th Annual Symposium on High
Performance Interconnects (HOTI 2011), August, 2011.
pdf
- Migration, Assignment, and Scheduling of Jobs in
Virtualized Environment, Seung-Hwan Lim, Jae-Seok
Huh, Youngjae Kim, Chita Das, Proceedings of the 3rd
USENIX Workshop on Hot Topics in Cloud Computing (HotCloud
2011), Portland, OR, June 2011.
2010
File and Storage Systems
- Workload Characterization of a Leadership Class
Storage, Youngjae Kim, Raghul Gunasekaran, Galen M.
Shipman, David Dillow, Zhe Zhang, Brad Settlemyer,
Proceedings of the 5th Petascale Data Storage Workshop
(PDSW'10, co-located with SC'10), New Orleans, LA,
November 2010. pdf
- Parallelism in System Tools, Kenneth D.
Matney, Sr, Galen M. Shipman, Proceedings of the Cray User
Group conference (CUG 2010), Edinburgh, United Kingdom,
May, 2010.
- Lessons Learned in Deploying the World's Largest
Scale Lustre File System, Galen M. Shipman, David A.
Dillow, Sarp Oral, Feiyi Wang, Douglas Fuller, Jason Hill, Zhe
Zhang, Proceedings of the Cray User Group conference (CUG
2010), Edinburgh, United Kingdom, May, 2010. pdf,
citation
- Efficient Journaling for the Spider Storage
System, Sarp Oral, Feiyi Wang, Galen M. Shipman,
David A. Dillow, Ross Miller, Proceedings of the 8th USENIX
conference on File and storage technologies (FAST 2010), San
Jose, CA, February, 2010. pdf
Non-Volatile Memory
- An Empirical Study of Redundant Array of Independent
Solid-State Drives (RAIS), Youngjae Kim, Sarp Oral,
David Dillow, Feiyi Wang, Douglas Fuller, Steve Poole, Galen
M. Shipman. Technical Report (ORNL/TM-2010/61). National
Center for Computational Sciences, Oak Ridge National
Laboratory, March 2010. pdf
System Architecture and Resilience
- Functional Partitioning to Optimize End-to-End
Performance on Many-Core Architectures, Min Li,
Sudharshan Vazhkudai, Ali Butt, Fei Meng, Xiaosong Ma,
Youngjae Kim, Christian Engelmann, Galen M. Shipman.
Proceedings of Supercomputing (SC): the 23rd IEEE/ACM
Int'l Conference on High Performance Computing, Networking,
Storage and Analysis (SC 2010), New Orleans, LA, November
2010. pdf
- Reducing Application Runtime Variability on Jaguar
XT5, Sarp Oral, Feiyi Wang, David A. Dillow, Ross
Miller, Galen M. Shipman, Don Maxwell, Dave Henseler, Jeff
Becklehimer, Jeff Larkin, Proceedings of the Cray User
Group conference (CUG 2010), Edinburgh, United Kingdom,
May, 2010. pdf
2009 and earlier
File and Storage Systems
- The Spider Center Wide File System; From Concept to
Reality, Galen M. Shipman, David A. Dillow, Sarp
Oral, Feiyi Wang, Proceedings of the Cray User Group
conference (CUG 2009), Atlanta, GA, May, 2009. pdf
- Understanding Lustre Internals, Feiyi Wang,
Sarp Oral, Galen M. Shipman, Oleg Drokin, Tom Wang, Isaac
Huang, Technical Report (ORNL/TM-2009/117). National
Center for Computational Sciences, Oak Ridge National
Laboratory, April, 2009. pdf
- A first look at scalable I/O in Linux
commands, Kenneth D. Matney, Sr, Sarp Oral, Richard
Shane Canon, Proceedings of the 9th LCI International
Conference on High-Performance Clustered Computing,
Urbana-Champaign, IL, April,
2008. citation
- Performance Characterization and Optimization of
Parallel I/O on the Cray XT, Weikuan Yu, Jeffrey
Vetter, and Sarp Oral, Proc. of IEEE Int. Symp. on
Parallel and Distributed Processing (IPDPS) 2008, Miami,
Florida, April, 2008.
- Empirical Analysis of a Large-Scale Hierarchical
Storage System, Weikuan Yu, Sarp Oral, R. Shane
Canon, Jeffrey S. Vetter, and Ramanan Sankaran, Lecture
Notes in Computer Science, pp. 130-140, Volume 5168/2008,
ISBN 978-3-540-85450-0, Springer Berlin/Heidelberg, 2008.
- XT7? Integrating and Operating a Conjoined XT3+XT4
System, R. Shane Canon, Don E. Maxwell, Josh Lothian,
Kenneth D. Matney, Sr., Makia Minich, Sarp Oral, Jeff L.
Becklehimer, and Cathy H. Williams, Cray Users Group (CUG)
Meeting, Seattle, Washington, May, 2007.
- Efficiency Evaluation of Cray XT Parallel IO
Stack, Weikuan Yu, Sarp Oral, Jeffrey Vetter, and
Richard Barrett, Cray User Group (CUG) Meeting, Seattle,
Washington, May, 2007.
- A Center-Wide File System Using Lustre, R.
Shane Canon and Sarp Oral, Proc. of Cray Users Group (CUG)
Meeting, Lugano, Switzerland, May, 2006.
- Lustre: A How-to Guide for Installing and
Configuring Lustre 1.4.1, Richard Alexander, Chad
Kerner, Jeffrey Kuehn, Jeff Layton, Patrice Lucas, Hong Ong,
Sarp Oral, Lex Stein, Joshua Schroeder, Steve Woods, and Scott
Studham, ORNL Technical Report, May 2005.
System Architecture and Resilience
- MPI Support for Multi-Core Architectures: Optimized
Shared Memory Collectives, Galen M. Shipman, Richard
L. Graham, Proceedings of the 15th European PVM/MPI Users'
Group Meeting on Recent Advances in Parallel Virtual Machine
and Message Passing Interface, Dublin, Ireland,
September, 2008.