Singularity, which is new “application container” software, has been installed on the Flux and Armis HPC clusters. An application container is a program — a single file — that can be used to combine an application with the system software it needs to run. This enables applications to run on the clusters even if the system software is different. For example, an older application that is needed to finish a project can continue to be used even if it is incompatible with the updated cluster. An application that needs a different Linux distribution can be containerized to run on the cluster.
Singularity containers cannot be created on Flux or Armis, but they can be created and brought to the clusters to run. Singularity provides tools to convert Docker containers for use on Flux and Armis. Please contact email@example.com if you are interested in using Singularity and would like more information about how to create and run Singularity containers or would like a referral to unit support who can help.
The most complex crystal designed and built from nanoparticles has been reported by researchers at Northwestern University and the University of Michigan. The work demonstrates that some of nature’s most complicated structures can be deliberately assembled if researchers can control the shapes of the particles and the way they connect using DNA.
The U-M researcher is Sharon C. Glotzer, the John W. Cahn Distinguished University Professor of Engineering and the Stuart W. Churchill Collegiate Professor of Chemical Engineering. The work is published in the March 3 issue of Science. ARC’s computational resources supported the work.
ANN ARBOR, MI and LONDON — The Michigan Institute of Data Science (MIDAS) at the University of Michigan and the Centre for Data Science and Big Data Institute at UCL (University College London) have signed a five-year agreement of scientific and academic cooperation.
The agreement sets the stage for collaborative research projects between faculty of both institutions; student exchange opportunities; and visiting scholar arrangements, among other potential partnerships.
“There is a lot of common ground in what we do,” said Patrick Wolfe, Executive Director of UCL’s Centre for Data Science and Big Data Institute. “Both MIDAS and UCL cover the full spectrum of data science domains, from smart cities to healthcare to transportation to financial services, and both promote cross-cutting collaboration between scientific disciplines.”
Alfred Hero, co-director of MIDAS and professor of Electrical Engineering and Computer Science at U-M, said that one of the original goals of the institute when it was founded in 2015 under U-M’s $100 million Data Science Initiative was to reach out to U.S. and international partners.
“It seemed very natural that this would be the next step,” Hero said, adding that it would complement MIDAS’s recent partnership with the Shenzhen Research Institute of Big Data in China. “UCL epitomizes the collaboration, multi-disciplinarity and multi-institutional involvement that we’re trying to establish in our international partnerships.”
Wolfe visited Ann Arbor in early January to sign a memorandum of understanding along with Hero and Brian Athey, professor of bioinformatics and the other MIDAS co-director.
The agreement lists several potential areas of cooperation, including:
- joint research projects
- exchange of academic publications and reports
- sharing of teaching methods and course design
- joint symposia, workshops and conferences
- faculty development and exchange
- student exchange
- exchange of visiting research scholars.
Follow UCL’s data science activities @uclbdi
Follow MIDAS at @ARC_UM
A series of training workshops in high performance computing will be hed Jan. 31 through Feb. 22, 2017, presented by CSCAR in conjunction with Advanced Research Computing – Technology Services (ARC-TS). All sessions are held at East Hall, Room B254, 530 Church St.
Introduction to the Linux command Line
This course will familiarize the student with the basics of accessing and interacting with Linux computers using the GNU/Linux operating system’s Bash shell, also known as the “command line.”
Dates: (Please sign up for only one)
• Tuesday, Jan. 31, 12:30 – 3:30 p.m. (full description | registration)
• Tuesday, Feb. 2, 9 a.m. – noon (full description | registration)
• Tuesday, Feb. 7, 9 a.m. – noon (full description | registration)
Introduction to the Flux cluster and batch computing
This workshop will provide a brief overview of the components of the Flux cluster, including the resource manager and scheduler, and will offer students hands-on experience.
Dates: (Please sign up for only one)
• Thursday, Feb. 9, 1 – 4:30 p.m. (full description | registration)
• Monday, Feb. 13, 1 – 4:30 p.m. (full description | registration)
Advanced batch computing on the Flux cluster
This course will cover advanced areas of cluster computing on the Flux cluster, including common parallel programming models, dependent and array scheduling, and a brief introduction to scientific computing with Python, among other topics.
Dates: (Please sign up for only one)
• Wednesday, Feb. 22, 9 a.m. – noon (full description | registration)
• Friday, Feb. 24, 9 a.m. – noon (full description | registration)
Five research projects — three in health and two in social science — have been awarded funding in the second round of the Michigan Institute for Data Science Challenge Initiative program.
The projects will receive funding from MIDAS as part of the Data Science Initiative announced in fall 2015.
The goal of the multiyear MIDAS Challenge Initiatives program is to foster data science projects that have the potential to prompt new partnerships between U-M, federal research agencies and industry. The challenges are focused on four areas: transportation, learning analytics, social science and health science. For more information, visit midas.umich.edu/challenges.
The projects, determined by a competitive submission process, are:
- Title: Michigan Center for Single-Cell Genomic Data Analysis
Description: The center will establish methodologies to analyze sparse data collected from single-cell genome sequencing technologies. The center will bring together experts in mathematics, statistics and computer science with biomedical researchers.
Lead researchers: Jun Li, Department of Human Genetics; Anna Gilbert, Mathematics
Research team: Laura Balzano, Electrical Engineering and Computer Science; Justin Colacino, Environmental Health Sciences; Johann Gagnon-Bartsch, Statistics; Yuanfang Guan, Computational Medicine and Bioinformatics; Sue Hammoud, Human Genetics; Gil Omenn, Computational Medicine and Bioinformatics; Clay Scott, Electrical Engineering and Computer Science; Roman Vershynin, Mathematics; Max Wicha, Oncology.
- Title: From Big Data to Vital Insights: Michigan Center for Health Analytics and Medical Prediction (M-CHAMP)
Description: The center will house a multidisciplinary team that will confront a core methodological problem that currently limits health research — exploiting temporal patterns in longitudinal data for novel discovery and prediction.
Lead researchers: Brahmajee Nallamothu, Internal Medicine; Ji Zhu, Statistics; Jenna Wiens, Electrical Engineering and Computer Science; Marcelline Harris, Nursing.
Research team: T. Jack Iwashyna, Internal Medicine; Jeffrey McCullough, Health Management and Policy (SPH); Kayvan Najarian, Computational Medicine and Bioinformatics; Hallie Prescott, Internal Medicine; Andrew Ryan, Health Management and Policy (SPH); Michael Sjoding, Internal Medicine; Karandeep Singh, Learning Health Sciences (Medical School); Kerby Shedden, Statistics; Jeremy Sussman, Internal Medicine; Vinod Vydiswaran, Learning Health Sciences (Medical School); Akbar Waljee, Internal Medicine.
- Title: Identifying Real-Time Data Predictors of Stress and Depression Using Mobile Technology
Description: Using an app platform that integrates signals from both mobile phones and wearable sensors, the project will collect data from over 1,000 medical interns to identify the dynamic relationships between mood, sleep and circadian rhythms. These relationships will be utilized to inform the type and timing of personalized data feedback for a mobile micro-randomized intervention trial for depression under stress.
- Lead researchers: Srijan Sen, Psychiatry; Margit Burmeister, Molecular and Behavioral Neuroscience.
Research team: Lawrence An, Internal Medicine; Amy Cochran, Mathematics; Elena Frank, Molecular and Behavioral Neuroscience; Daniel Forger, Mathematics; Thomas Insel (Verily Life Sciences); Susan Murphy, Statistics; Maureen Walton, Psychiatry; Zhou Zhao, Molecular and Behavioral Neuroscience.
- Title: Computational Approaches for the Construction of Novel Macroeconomic Data
Description: This project will develop an economic dataset construction system that takes as input economic expertise as well as social media data; will deploy a data construction service that hosts this construction tool; and will use this tool and service to build an “economic datapedia,” a compendium of user-curated economic datasets that are collectively published online.
Lead researcher: Matthew Shapiro, Department of Economics
Research team: Michael Cafarella, Computer Science and Engineering; Jia Deng, Electrical Engineering and Computer Science; Margaret Levenstein, Inter-university Consortium for Political and Social Research.
- Title: A Social Science Collaboration for Research on Communication and Learning based upon Big Data
Description: This project is a multidisciplinary collaboration meant to introduce social scientists, computer scientists and statisticians to the methods and theories of engaging observational data and the results of structured data collections in two pilot projects in the area of political communication and one investigating parenting issues. The projects involve the integration of geospatial, social media and longitudinal data.
Lead researchers: Michael Traugott, Center for Political Studies, ISR; Trivellore Raghunathan, Biostatistics
Research team: Leticia Bode, Communications, Georgetown University; Ceren Budak, U-M School of Information; Pamela Davis-Keane, U-M Psychology, ISR; Jonathan Ladd, Public Policy, Georgetown; Zeina Mneimneh, U-M Survey Research Center; Josh Pasek, U-M Communications; Rebecca Ryan, Public Policy, Georgetown; Lisa Singh, Public Policy, Georgetown; Stuart Soroka, U-M Communications.
The Michigan Institute for Data Science (MIDAS) will hold a faculty meeting at noon on Thursday, January 19 (Suite 7625, School of Public Health I, 1415 Washington Heights) for the NSF 17-534 “Critical Techniques, Technologies and Methodologies for Advancing Foundations and Applications of Big Data Sciences and Engineering (BIGDATA)” solicitation.
The meeting will include an overview of the NSF solicitation, U-M Data Science Resources (MIDAS, CSCAR, ARC-TS) available to faculty responding to the NSF call, and an opportunity to network with other faculty.
MIDAS has also arranged for Sylvia Spengler, NSF CISE Program Director, to be available at 1:30 pm to answer questions regarding the BIGDATA solicitation.
We invite you to participate in the faculty meeting to share your ideas and interest in responding to this BIGDATA solicitation as well as interact with other faculty looking to respond to this funding mechanism.
For those unable to participate in person, you can join virtually using GoToMeeting:
- You can also dial in using your phone.
United States: +1 (872) 240-3212
Access Code: 795-426-685
A box lunch will be provided at the faculty meeting. Your RSVP (https://goo.gl/forms/OYAuB8mWCOlx3fw73) is appreciated.
Sharon Broude Geva, the Director of Advanced Research Computing at the University of Michigan, has been elected vice-chair of the Coalition for Academic Scientific Computation (CASC).
Founded in 1989, CASC advocates for the use of advanced computing technology to accelerate scientific discovery for national competitiveness, global security, and economic success. The organization’s members represent 83 institutions of higher education and national labs.
The vice-chair position is one of four elected CASC executive officers. The officers work closely as a team with the director of CASC. The vice-chair also leads CASC meeting program committees, is responsible for recruitment of new members, substitutes for the chair in his or her absences, and assists with moderating CASC meetings.
Geva served as CASC secretary in 2015 and 2016. Her term as vice-chair is effective for the 2017 calendar year.
The other executive officers for 2017 are are Rajendra Bose, Chair, Columbia University; Neil Bright, Secretary, Georgia Institute of Technology; and Andrew Sherman, Treasurer, Yale University. Curt Hillegas of Princeton University is immediate past chair.
Several University of Michigan researchers and research IT staff made presentations at the SC16 conference in Salt Lake City Nov. 13-17. Material from many of the talks is now available for viewing online:
- Shawn McKee (Physics) and Ben Meekhof (ARC-TS) presented a demonstration of the Open Storage Research Infrastructure (OSiRIS) project at the U-M booth. The demonstration extended the OSiRIS network from its participating institutions in Michigan to the conference center in Utah. Meekhof also presented at a”Birds of a Feather” session on Ceph in HPC environments. More information, including slides, is available on the OSiRIS website.
- Todd Raeker (ARC-TS) made a presentation on ConFlux, U-M’s new computational physics cluster, at the NVIDIA booth. Slides and video are available.
- Nilmini Abeyratne, a Ph.D student in computer science, presented her project “Low Design-Risk Checkpointing Storage Solution for Exascale Supercomputers” at the Doctoral Showcase. A summary, slides, and poster can be viewed on the SC16 website.
- Jeremy Hallum (ARC-TS) presented information on the Yottabyte Research Cloud at the U-M booth. His slides are available here.
Other U-M activity at the conference included Sharon Broude Geva, Director of Advanced Research Computing, participating in a panel titled “HPC Workforce Development: How Do We Find Them, Recruit Them, and Teach Them to Be Today’s Practitioners and Tomorrow’s Leaders?”; Quentin Stout (EECS) and Christiane Jablonowski (CLASP) teaching the “Parallel Computing 101” tutorial.
To accommodate upgrades to software and operating systems, Flux, Armis, and their storage systems (/home and /scratch) will be unavailable starting at 9am Saturday, January 7th, returning to service on Monday, January 9th. Additionally, external Turbo mounts will be unavailable 11pm Saturday, January 7th, until 7am Sunday, January 8th.
During this time, the following updates are planned:
- Operating system and software updates (minor updates) on Flux and Armis. This should not require any changes to user software or processes.
- Resource manager and job scheduling software updates.
- Operating system updates on Turbo.
For HPC jobs, you can use the command “maxwalltime” to discover the amount of time before the beginning of the maintenance. Jobs that cannot complete prior to the beginning of the maintenance will be able to start when the clusters are returned to service.
We will post status updates on our Twitter feed ( https://twitter.com/arcts_um ) and send an email to all HPC users when the outage has been completed.