Reset Password

NDAR provides a single access to de-identified autism research data. For permission to download data, you will need an NDAR account with approved access to NDAR or a connected repository (AGRE, IAN, or the ATP). For NDAR access, you need to be a research investigator sponsored by an NIH recognized institution with federal wide assurance. See Request Access for more information.

Warning Notice

This is a U.S. Government computer system, which may be accessed and used only for authorized Government business by authorized personnel. Unauthorized access or use of this computer system may subject violators to criminal, civil, and/or administrative action.

All information on this computer system may be intercepted, recorded, read, copied, and disclosed by and to authorized personnel for official purposes, including criminal investigations. Such information includes sensitive data encrypted to comply with confidentiality and privacy requirements. Access or use of this computer system by any person, whether authorized or unauthorized, constitutes consent to these terms. There is no right of privacy in this system.

You have logged in with a temporary password. Please update your password. Passwords must contain 8 or more characters and must contain at least 3 of the following types of characters:

Subscribe to our mailing list

Mailing List(s)
Email Format

You are now leaving the National Database for Autism Research (NDAR) web site to go to:

Click on the address above if the page does not change within 10 seconds.


NDAR is not responsible for the content of this external site and does not monitor other web sites for accuracy.

Selected Filters
No filters selected

The filters you have selected from various query interfaces will be stored here, in the 'Filter Cart'. The database will be queried using filters added to your 'Filter Cart', when multiple filters are defined, each will be executed using 'AND' logic, so with each filter that is applied the result set gets smaller.

From the 'Filter Cart' you can inspect each of the filters that have been defined, and you also have the option to remove filters. The 'Filter Cart' itself will display the number of filters applied along with the number of subjects that are identified by the combination of those filters. For example a GUID filter with two subjects, followed by a GUID filter for just one of those subjects would return only data for the subject that is in both GUID filters.

If you have a question about the filter cart, or underlying filters please contact the help desk at The NDA Help Desk

Value Range
Data Structures with shared data
No filters have been selected

1 Numbers reported are subjects by age
New Trial
New Project

Format should be in the following format: Activity Code, Institute Abbreviation, and Serial Number. Grant Type, Support Year, and Suffix should be excluded. For example, grant 1R01MH123456-01A1 should be entered R01MH123456

Please select an experiment type below

New Documentation

Please enter the name of the data structure to search or if your definition does not exist, please upload that definition so that it can be appropriately defined for submission. Multiple data structures may be associated with a single Data Expected entry. Please add only one data structure per assessment.

Please provide a reason for the requested submission exemption and the
time-frame during which the exemption will be active.

Collection Owners and those with Collection Administrator permission, may edit a collection. The following is currently available for Edit on this page:


Title, investigators, and Collection Description may be edited along with the Collection Phase. For Collection Phase, the options Pre-enrollment, Enrollment, and Completed can be chosen allowing the Collection Owner to indicate the stage of data collection.

Funding Source

The ability to associate the funding source for the project is provided. For NIH funded grants, linkage to Project Reporter information (e.g. R01MH123456) is supported. Projects funded by others, including the URL of the project, are listed. Non NIH funded projects will become available here to link that data with the appropriate funding agency.

Supporting Documentation

Any documents related to the project may be uploaded clarifying the data or acquisition methods used may be uploaded and made available here. The default is to share these documents to the general public. An option to share only to qualified Researchers is also an option.

Clinical Trials

For clinical trials, the option to link to the clinical trial in clinicaltrials.gov is optionally provided.

Collection Summary Collection Charts
Collection Title Collection Investigators Collection Description
Deep sequencing of autism candidate genes in 2000 families from the Simons Simplex Collection (SSC)
Michael Wigler 
343 families from the Simons Simplex Collection. Each family includes father, mother, a proband and an unaffected sibling.

No Data Shared



No Data Shared


Chart Expander
NIH - Extramural None

sequencing_files_readme_col_1936.pdf Background Readme for sequencing files Qualified Researchers

RC2MH090028-01 Deep sequencing of autism candidate genes in 2000 families from the Simons Simple 09/30/2009 08/31/2011 Not Reported Not Reported COLD SPRING HARBOR LABORATORY $2,779,842.00

Collection Owners and those with Collection Administrator permission, may edit a collection. The following is currently available for Edit on this page:


To create a new Omics, eye tracking, fMRI, or EEG experiment, press the "+ New Experiment" button. Once an experiment is created, then raw files for these types of experiments should be provided, associating the experiment – through Experiment_ID – with the metadata defined in the experiments interface.

IDNameCreated DateStatusType
66SSC samples exome sequencing08/03/2012ApprovedOmics
170Whole genome sequencing of 8 SSC samples10/22/2014ApprovedOmics

Collection Owners and those with Collection Administrator permission, may edit a collection. The following is currently available for Edit on this page:

Shared Data

Data structures with the number of subjects submitted and shared are provided.

Genomics Sample Genomics 4651
Genomics Subject Genomics 4645

Collection Owners and those with Collection Administrator permission, may edit a collection. The following is currently available for Edit on this page:


Publications relevant to NDAR data are listed below. Most displayed publications have been associated with the grant within Pubmed. Use the "+ New Publication" button to add new publications. Publications relevant/not relevant to data expected are categorized. Relevant publications are then linked to the underlying data by selecting the Create Study link. Study provides the ability to define cohorts, assign subjects, define outcome measures and lists the study type, data analysis and results. Analyzed data and results are expected in this way.

PubMed IDStudyTitleJournalAuthorsDateStatus
22542183Study (318)De novo gene disruptions in children on the autistic spectrum.NeuronIossifov I, Ronemus M, Levy D, Wang Z, Hakker I, Rosenbaum J, Yamrom B, Lee YH, Narzisi G, Leotta A, Kendall J, Grabowska E, Ma B, Marks S, Rodgers L, Stepansky A, Troge J, Andrews P, Bekritsky M, Pradhan K, Ghiban E, Kramer M, Parla J, Demeter R, Fulton LL, et al.April 26, 2012Relevant

This tab provides a general status on the data expected to be shared. There are two types of data expected.

  1. By Relevant publications — Those publications that reported for the collection's grant and have a status of "relevant" for sharing are listed first. The grantee is expected to share the data specific to those publications using the NDA Study feature. If a publication is erroneously marked relevant, the PI should simply change the status. When sharing a study, only the outcome measures for the subjects/time-points are shared. Other data that have not met the share date, defined below, will remain embargoed. To initiate study creation, simply login, mark your publication as relevant and click on the link listed to begin.

  2. By Data Structure — The number of subjects expected, received and shared is provided. Investigators are expected to update the data that they are collecting, the initial submission date and initial share dates. The NIMH Data Archive shares data when those dates are met.

  3. Submission Exemption — Those with Administrative or Submission Access to the Collection may request an exemption for submission for a defined period by stating the reason and timeframe. Note that the program officer on the grant may review this request.

Relevant Publications
PubMed IDStudyTitleJournalAuthorsDate
No records found.

For those with privileges to edit the collection, it is possible to upload your data definitions using this interface. NDA support staff will then follow up with a harmonized data definition for you to use in providing additional data.

Data Expected
Data ExpectedTargeted EnrollmentInitial SubmissionSubjects SharedStatus
genomics/omics info iconApproved
Research Subject and Pedigree info iconApproved
Structure not yet defined

Collection Owners and those with Collection Administrator permission, may edit a collection. The following is currently available for Edit on this page:

Associated Studies

Studies that have been defined using data from a Collection are important criteria to determine the value of data shared. The number of subjects column displays the counts from this Collection that are included in a Study, out of the total number of subjects in that study. The Data Use column represents whether or not the study is a primary analysis of the data or a secondary analysis. State indicates whether the study is private or shared with the research community.

Study Name Description Number of Subjects
Collection / Total
Data Use State
Transmission disequilibrium of small CNVs in simplex autism. Cohorts: 411 ASD Quads from Simons Simplex Collection 177 Quads from Sanders et al. (PubMed ID: 22495306) 166 Quads from I. Iossifov et al. (PubMed ID: 22542183) 71 Quads from O'Roak et al. (PubMed ID: 22495309) Publication Abstract: We searched for disruptive, genic rare copy-number variants (CNVs) among 411 families affected by sporadic autism spectrum disorder (ASD) from the Simons Simplex Collection by using available exome sequence data and CoNIFER (Copy Number Inference from Exome Reads). Compared to high-density SNP microarrays, our approach yielded ¿2× more smaller genic rare CNVs. We found that affected probands inherited more CNVs than did their siblings (453 versus 394, p = 0.004; odds ratio [OR] = 1.19) and that the probands' CNVs affected more genes (921 versus 726, p = 0.02; OR = 1.30). These smaller CNVs (median size 18 kb) were transmitted preferentially from the mother (136 maternal versus 100 paternal, p = 0.02), although this bias occurred irrespective of affected status. The excess burden of inherited CNVs among probands was driven primarily by sibling pairs with discordant social-behavior phenotypes (p < 0.0002, measured by Social Responsiveness Scale [SRS] score), which contrasts with families where the phenotypes were more closely matched or less extreme (p > 0.5). Finally, we found enrichment of brain-expressed genes unique to probands, especially in the SRS-discordant group (p = 0.0035). In a combined model, our inherited CNVs, de novo CNVs, and de novo single-nucleotide variants all independently contributed to the risk of autism (p < 0.05). Taken together, these results suggest that small transmitted rare CNVs play a role in the etiology of simplex autism. Importantly, the small size of these variants aids in the identification of specific genes as additional risk factors associated with ASD. 462 / 1644 Secondary Analysis Shared
The contribution of de novo coding mutations to autism spectrum disorder Whole exome sequencing has proven to be a powerful tool for understanding the genetic architecture of human disease. Here we apply it to more than 2,500 simplex families, each having a child with an autistic spectrum disorder. By comparing affected to unaffected siblings, we show that 13% of de novo missense mutations and 43% of de novo likely gene-disrupting (LGD) mutations contribute to 12% and 9% of diagnoses, respectively. Including copy number variants, coding de novo mutations contribute to about 30% of all simplex and 45% of female diagnoses. Almost all LGD mutations occur opposite wild-type alleles. LGD targets in affected females significantly overlap the targets in males of lower intelligence quotient (IQ), but neither overlaps significantly with targets in males of higher IQ. We estimate that LGD mutation in about 400 genes can contribute to the joint class of affected females and males of lower IQ, with an overlapping and similar number of genes vulnerable to contributory missense mutation. LGD targets in the joint class overlap with published targets for intellectual disability and schizophrenia, and are enriched for chromatin modifiers, FMRP-associated genes and embryonically expressed genes. Most of the significance for the latter comes from affected females. PLEASE NOTE: Additional data on these subjects, unrelated to this publication exist in other NDAR Studies. These data include realigned BAM files, unfiltered SNV/InDel variant calls (made by GATK and FreeBayes), and CNVs. Please see this news item for more details: https://ndar.nih.gov/ndarpublicweb/aboutNDAR.html#news_item_201 4558 / 9456 Secondary Analysis Shared
De Novo Gene Disruptions in Children on the Autistic Spectrum Public Abstract: Exome sequencing of 343 families, each with a single child on the autism spectrum and at least one unaffected sibling, reveal de novo small indels and point substitutions, which come mostly from the paternal line in an age-dependent manner. We do not see significantly greater numbers of de novo missense mutations in affected versus unaffected children, but gene-disrupting mutations (nonsense, splice site, and frame shifts) are twice as frequent, 59 to 28. Based on this differential and the number of recurrent and total targets of gene disruption found in our and similar studies, we estimate between 350 and 400 autism susceptibility genes. Many of the disrupted genes in these studies are associated with the fragile X protein, FMRP, reinforcing links between autism and synaptic plasticity. We find FMRP-associated genes are under greater purifying selection than the remainder of genes and suggest they are especially dosage-sensitive targets of cognitive disorders. 1372 / 1372 Primary Analysis Shared
Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci Analysis of de novo CNVs (dnCNVs) from the full Simons Simplex Collection (SSC) (N = 2,591 families) replicates prior findings of strong association with autism spectrum disorders (ASDs) and confirms six risk loci (1q21.1, 3q29, 7q11.23, 16p11.2, 15q11.2-13, and 22q11.2). The addition of published CNV data from the Autism Genome Project (AGP) and exome sequencing data from the SSC and the Autism Sequencing Consortium (ASC) shows that genes within small de novo deletions, but not within large dnCNVs, significantly overlap the high-effect risk genes identified by sequencing. Alternatively, large dnCNVs are found likely to contain multiple modest-effect risk genes. Overall, we find strong evidence that de novo mutations are associated with ASD apart from the risk for intellectual disability. Extending the transmission and de novo association test (TADA) to include small de novo deletions reveals 71 ASD risk loci, including 6 CNV regions (noted above) and 65 risk genes (FDR ≤ 0.1). 4111 / 9975 Secondary Analysis Shared
Recurrent de novo mutations implicate novel genes underlying simplex autism risk. Autism spectrum disorder (ASD) has a strong but complex genetic component. Here we report on the resequencing of 64 candidate neurodevelopmental disorder risk genes in 5,979 individuals: 3,486 probands and 2,493 unaffected siblings. We find a strong burden of de novo point mutations for these genes and specifically implicate nine genes. These include CHD2 and SYNGAP1, genes previously reported in related disorders, and novel genes TRIP12 and PAX5. We also show that mutation carriers generally have lower IQs and enrichment for seizures. These data begin to distinguish genetically distinct subtypes of autism important for aetiological classification and future therapeutics. 1888 / 6400 Primary Analysis Shared
* Data not on individual level