Using Locker

Locker is a cost optimized, high-capacity, large file storage service for research data.  Locker provides high performance for large files, and allows investigators across U-M to connect their data to computing resources necessary for their research, including U-M’s HPC clusters


  • Locker is available to researchers from any academic unit.
  • Turbo can be accessed from Mac OSX (Mavericks and Yosemite), Windows 7+, and Linux computers using NFSv3.
  • Locker space can be purchased in 1TB increments
  • Locker uses the Globus File Transfer Service
  • Locker does not yet provide the option of secure storage for regulated and/or sensitive data, but it is scheduled on the roadmap.
  • Turbo allows optional daily snapshots and backups of stored research data

Getting Started

To Request or Modify a Locker Storage Volume

To request a Locker CIFS or NFS volume, click on one of the following links:

Globus Server Endpoint

Locker can be made available on existing ARC-TS Globus servers to provide high performance transfers, data sharing and access to Locker from off campus.  To access Locker via Globus, request your Locker volume be added to Globus.

ARC-TS Compute System Support

Locker can be accessed from any ARC-TS compute service that supports the same data classifications as your export.  To have your Locker export added to an ARC-TS resource contact us with the export name and system name. Locker will be available on all login and data transfer nodes at a minimum.

Mounts will be located at

Research groups may also request system group creation to control group access to Locker volumes.

Optional Features

Replication – (Recommended) Optional second copy of all data in a different geographic location.

Snapshots – (Highly Recommended) Tracking of how data in a volume changes over time allowing users to recover deleted, modified, or otherwise damaged data.

Access snapshots at:

Using Locker

Mounting on Windows CIFS
Instructions provided when provisioned

Mounting on Linux NFS
Instructions provided when provisioned

Mounting on Apple OSX
Instructions provided when provisioned

Storage Resource Software

If you are unsure which of our storage services should be used to host your data, we have written some software that you can download and execute to analyze your files to understand how much of your data is stored in large files, how much of your data has been accessed recently, and the distribution of file sizes and access times. The software is accessible here

This software doesn’t examine the contents of any data files, it merely scans file attributes, it also does not store any file names after searching through the filesystem. 

If you have any questions on this software, please send us an email at with your inquiry.  If you are unsure about any of the recommendations the tool sends you, you should also contact us at with your inquiry.

Group Access Controls

Linux Set GID

Using Set GID (SGID) on a directory will force all files created in that directory to inherit the same group permissions as the parent directory even if the user creating them primary or effective group is different.  The benefit of this combined with the creation of a group on shared systems is that all files will be created owned and accessible (by default) to members of that group

#list available group
chgrp <groupname> folder
chmod g+s folder

Windows AD Groups



Small File Limitation

Locker’s target audience are those research projects with massive data volumes in large files. Because of this design each 1 TByte of Locker capacity provides only 1 Million files.  Eg. 10 TByte provides 10 Million files. This works out to 1 Mbyte per file average size.

Sensitive Data — ePHI/HIPAA/ITAR/EAR/CUI

Locker is not currently supported for PHI or other data types.  It is scheduled to be reviewed for support at a later date.

System Abuse

Abuse of Locker intentionally or not may result in performance or access being limited to preserve performance and access for other users.  In the event this happens staff will be in contact with the users to engineer solutions.

Frequently Asked Questions

Q: How do I Check Locker Space and File Usage?
A: Linux or OSX Terminal use:

    Space: df -h <mount-path>
    Files: df -h -i <mount-path>

Q: Can Locker be Mounted on All ARC-TS Cluster Compute Nodes?
A: Currently we do not allow Locker to be mounted by very large numbers of clients.  This could change in the future so let us know if this would help you. Currently we recommend using Globus to stage data between cluster scratch and Locker between runs.  Globus provides a CLI so you can script.

Q: Can I Simultaneously Access Locker from Linux and Windows?
A: Currently Locker supports NFS (Linux) OR CIFS (Windows), Apple OSX supports both. This is known as Multi-Protocol or simultaneous NFS and CIFS access.  Because Linux and Windows have different permissions schemes this is complex to manage. We don’t currently support his on Locker but do support it on Turbo.  At this time, we recommend using Globus to ease data movement between Locker and systems that cannot mount it natively.

Q: Why can’t we use Locker as general purpose storage?
A: To maintain performance, encryption, professional support, and cost,  Locker’s design does not make it well suited for general purpose primary storage. For this see the Turbo and MiStorage services.

Q: I deleted data but Locker still reports full?
A: Likely your export has snapshots enabled.  Snapshots store changes to Locker exports over time thus deleted data is just ‘moved’ to a snapshot.  Eventually snapshots will age out and free space on their own. Snapshot consumption does count against volume space used.  To delete or disable snapshots to free space early contact support.

Q: I have free space but Locker reports full?
A:  Likely you are at your file quota and are running average file size smaller than 1 MByte. This use case is outside the support of Locker’s design and the small files should move to another storage service.

Q: I don’t see my .snapshots folder?
A: Your volume might not have snapshots enabled.  If it does it is a hidden file on Linux and OSX terminals use ls -a to view all files including hidden files.  To show hidden files in OSX and Windows user interfaces varies by version and can be found in their documentation and online.

Q: My volume shows 2x the size I requested!
A: The system Locker is built on tracks all copies of data in its file system.  if a volume requests replication (2 copies of all data) total space will represent the primary and replica copy in total.  Thus 1TB of new data will consume 2TB of Locker space.

Advanced Topics

System Configuration

 Locker consists of two DDN GS14KX-E Gridscaler clusters running IBM Spectrum Scale.  Each cluster is located in different data centers with dedicated fiber for data replication between the two sites.  Each GS14KX-E cluster can hold 1680 hard drives for capacity of 10PB usable using 8TByte drives. Each hard drive is 7200RPM self encrypting and can be added to the system online. If at capacity additional GS14KX-E can be clustered to add additional performance and capacity.

By not including dedicated metadata or flash/NVMe storage we are able to keep the cost of Locker lower than other solutions such at Turbo. Thus Locker will not perform well with small IO operations and is built for capacity.  Thus why we offer both services. The GS14KX-E does have support for adding NVMe/Flash for meta-data and tiering at a later date should the price of such devices become more reasonable.

Locker is directly connected to the Data Den Research Archive via dedicated data movers and to the ARC-TS research network by two IBM Cluster Export Services (CES) nodes or Protocol Nodes.  Each CES node is connected with 100Gbps network connections and work in a active-active high availability configuration. Outside the ARC-TS network performance is limited to 40Gbps from the campus backbone.

Citing and Grants

Order Service

Locker is now available on a pilot basis. Potential pilot users should contact

The rate for Locker will be $40.09 per terabyte per year.

Contact with any questions.


To order Locker, the following information is required:

  • Amount of storage needed (1TB increments 10TB Minimum)
  • MCommunity Group name (group members will receive service-related notification, and can request service changes)
  • Shortcode for billing
  • NFS
    • Hostnames or IP addresses for each permitted user on the wired U-M network. (If forward and reverse records exist in DNS, please use the fully qualified hostname. If the records do not exist, provide the IP address.)
    • Numeric user ID of person who will administer the top level Locker directory and grant access to other users
  • CIFS
    • UMROOT AD Group Name
  • Specify if regulated or sensitive data will be use
  • Specify if your Locker account should be accessible on the Flux HPC cluster

Fill out this form to order Locker CIFS.

Fill out this form to order Locker NFS.

Related Event

September 22 @ 10:00 am - 12:00 pm

Introduction to Python’s NumPy library

This workshop will introduce you to the NumPy library in Python, which is useful in scientific computing. We will cover NumPy’s n-dimensional array object and associated functions in depth, along…

September 22 @ 1:00 pm - 4:00 pm

Introduction to the Linux Command Line

This course will familiarize the student with the basics of accessing and interacting with Linux computers using the GNU/Linux operating system’s Bash shell, also generically referred to as “the command…

September 23 @ 10:00 am - 12:00 pm

SPSS: Variables

Each section will go over one chapter from the materials at Section 1: Basics of SPSS (9/16, 10am – 12pm) Section 2: Variables (9/23, 10am – 12pm) Section 3:…

September 23 @ 1:00 pm - 4:00 pm

Introduction to the Linux Command Line

This course will familiarize the student with the basics of accessing and interacting with Linux computers using the GNU/Linux operating system’s Bash shell, also generically referred to as “the command…