NDAR has established a foundation of initial standards and conventions for data-sharing within the ASD research community. As a research portal, NDAR provides a platform for investigators to define, refine, and standardize data definition for autism research.
NDAR has established the following special interest groups for community involvement:
Please contact us at ndar@mail.nih.gov to join a special interest group. Meeting announcements will be sent to your e-mail inbox.
One of the most important standards NDAR supports is the NDAR Global Unique Identifier (GUID). The GUID is a universal subject ID that allows researchers to share data specific to a study participant without exposing personally identifiable information (PII). The GUID has been approved by the NIH Office of General Counsel.
The GUID system was conceived by the Simons Foundation Autism Research Initiative (SFARI). The GUID software was designed, developed and tested in close collaboration between SFARI and NDAR project teams.
The system is implemented as an NDAR Web service; an investigator inputs identifying information about a participant into a client application and sends encrypted information to a server application, which then returns a GUID.
Generic unique identifiers have the potential to link collections of research data, augment the amount and types of data available for individuals, support detection of overlap between collections and facilitate replication of research findings.
Four pieces of identifying information must be collected from each study participant to generate a valid GUID:
The GUID is generated using a free software application installed at the research site. The four items from the birth certificate are encrypted into a hash code, which is then transmitted to NDAR. NDAR then encrypts the hash code to generate a GUID and sends it back to the research site for use. The personally identifiable information (PII) about each participant remains at the research site.
NDAR supports an unlimited number of clinical, demographic, and phenotypic data associated with ASD human subjects research. Although much smaller in size than rich data types like imaging and genomics, the definition of clinical data is especially important for data to be aggregated across projects and data repositories. Provided are some of NDAR's primary tools to be used for this purpose:
NDAR GUID — A common research subject identifier to be used across all research projects is essential for data aggregation. The NDAR GUID provides this capability.
NDAR Data Dictionary — Working with the community, NDAR has defined over 200 clinical, imaging, and genomic research data structures that are now being used in autism research including the common measures used by the NIH funded Autism Centers of Excellence (ACE). These measures include:
Researchers are encouraged to extend these definitions using the NDAR data dictionary tool, which is available within the NDAR Portal. Instructions on using the NDAR data dictionary to support investigator defined data is explained in the Data Dictionary Video Tutorial.
Data Validation Tool — Before data will be accepted by NDAR, all data must pass the formatting, value range, and intra-field validation checks defined in the NDAR Data Dictionary. The Validation Tool (Production Environment or Demonstration Environment) is publically available. It allows a researcher to check their data and fix any discrepancies prior to that data being submitted to NDAR. Refer to the video on Preparing Clinical Data for Submission to learn more about using the Validation Tool. NDAR supports clinical data submissions formats of tab delimited, comma separated value (CSV), or XML. Sample data using these formats compatible with NDAR's Demonstration Portal is available.
|
Preparing Clinical Data for Submission * [ Launch Video ] |
For general information on preparing data for submission to NDAR, please follow the NDAR Data-Sharing Checklist.
NDAR supports the receipt of unprocessed brain images in DICOM format. NDAR also supports processed images in a variety of formats including DICOM, MINC 1.0 and 2.0, Analyze, NIfTI-1, AFNI and SPM. If you are using a different file format, please contact ndarhelp@mail.nih.gov to allow us add it to our list of supported standards.
To submit imaging data to NDAR, researchers are required to run a component of the MIPAV (Medical Image Processing, Analysis, and Visualization) application. Using the MIPAV Component for NDAR, you can quickly prepare your image data for submission by following the steps below.
|
[ Launch Video ] |
NDAR supports multiple imaging data structures.
For questions or more information on working with the NDAR image submission tool, please contact us at ndarhelp@mail.nih.gov.
NDAR piloted genomic data submission in July 2010 using the Minimal Information About a Microarray Experiment (MIAME) format. The results of the pilot indicated that a more precise definition of genomics is needed to define and query genomics before it will be possible to aggregate genomics data across research projects and repositories.
Beginning in January 2011, NDAR has created a genomics definition tool for investigators to use for raw genomics data submission. The use of the tool is designed to make genomics submissions very simple. The steps for genomics submission are:
|
Defining a Genomics Experiment in NDAR * [ Launch Video ] |
For any questions about this process, contact NDAR at ndarhelp@mail.nih.gov.
Please note that excel files must be saved as tab delimited, csv, or XML to create an NDAR submission package.
NDAR has outlined a schedule that includes separate timelines for "descriptive data" and "experimental data," as defined in the [Data Sharing Policy]. This policy is included in the terms and conditions of most ASD-related grant awards. Please contact ndarhelp@mail.nih.gov if more specific guidance is needed.
This page was last updated: Jul 12, 2011