Open Access Series of Imaging Studies OASIS Cross
Open Access Series of Imaging Studies OASIS Cross
INTRODUCTION
data in an open-access fashion, the OASIS project can
The Open Access Series of Imaging Studies (OASIS) is provide such exemplar images to the neuroimaging com-
a project aimed at making magnetic resonance imaging munity. Finally, OASIS data can be used for educational
(MRI) data sets of the brain freely available to the purposes as examples or as working data sets for courses
scientific community. By compiling and freely distribut- and workshops.
ing MRI data sets, we hope to facilitate future discoveries A number of data-sharing procedures have been
in basic and clinical neuroscience. Specifically, the OASIS adopted by the OASIS project to ensure the integrity
project is intended to play a number of roles. First, and usability of the provided data.
OASIS images and related measures serve as data sets for
continued scientific exploration. Beginning with the ini- 1. Quality control. All images are carefully screened
tial set of images obtained from more than 400 individ- for acquisition and processing errors. Images with severe
uals with and without dementia across the adult life artifacts are excluded from the distribution. Images with
span, OASIS data sets are selected to encourage inves- visible artifacts typical of MR acquisitions are included.
tigation of high interest topics and provide data that This variability in quality is intended to provide tool
would be difficult for individual laboratories to acquire. builders with realistic target images.
Second, OASIS data are targets for researchers creating 2. Documentation. Detailed acquisition protocols and
and furthering analytic techniques. Because the images parameters, demographics, and supporting measures
are acquired from subjects over a range of ages and (e.g., Mini-Mental State Examination [MMSE] scores) are
health conditions, OASIS data can be used to test the provided for all included images.
robustness and validity of techniques across the full 3. Preparation. In addition to the raw acquisition data,
range of the human brain’s varied landscapes. Third, images representing common postprocessing procedures
OASIS data can be used as benchmark targets for com- (e.g., atlas registration, bias field correction) are distrib-
paring similar analytic techniques. Standard images pro- uted as a convenience and for comparison with similar
vide a common point of reference for demonstrating and postprocessing methods.
contrasting methods. By providing carefully screened 4. Anonymization. In compliance with privacy regula-
tions, distributed data are assigned random identification
numbers (IDs), and all identifying information are re-
1
Washington University, 2Harvard University, 3Massachusetts Gen- moved from the image files. All links between original and
eral Hospital, 4Howard Hughes Medical Institute random IDs are permanently destroyed before release.
D 2007 Massachusetts Institute of Technology Journal of Cognitive Neuroscience 19:9, pp. 1498–1507
5. Access. The data are available on DVD and acces- METHODS
sible for viewing and downloading via a dedicated Web Subjects
site. Third parties are encouraged to redistribute the
data in ways that add to its general availability. Subjects aged 18 to 96 years were selected from a larger
6. Ongoing support. The OASIS Web site will serve as database of individuals who had participated in MRI
a forum for announcements and discussions regarding studies at Washington University, based on the availabil-
the distributions and as a clearinghouse for third party ity of at least three acquired T1-weighted images, right-
derived data sets. hand dominance, and a recent clinical evaluation for
7. Unrestricted usage. OASIS data may be freely older adults. Young and middle-aged subjects were re-
Figure 1. Typical MRI data set. (A) Individual scan before defacing. (B) Same scan after defacing. Note that the defacing process leaves the
cranial vault intact while identifying facial features are removed. (C) Averaged motion-corrected image. Note improved signal-to-noise ratio.
(D) Atlas-registered gain-field-corrected image. (E) Tissue classification image.
score of 0, indicating no dementia, and 100 had a CDR The slope of the best fit line was 1.00 (r = 1.00) for eTIV
score greater than zero (70 CDR = 0.5, 28 CDR = 1, 2 and was 1.00 (r = 1.00) for nWBV. The mean absolute
CDR = 2), indicating a diagnosis of very mild to mod- percent differences before and after defacing were
erate AD. Additional demographics and clinical charac- 0.024% for eTIV and 0.059% for nWBV. The defacing
teristics of the older subjects are shown in Table 3. process, therefore, has little impact on morphometric
measures of interest.
Having established that defacing has minimal impact,
Anatomical Characteristics
eTIV and nWBV are plotted across the adult life span in
As a means of determining whether defacing techniques Figure 4 for the data included as part of the OASIS dis-
impact morphometric analysis, eTIV and nWBV were tribution. Analyses are selective and used to illustrate prop-
calculated before and after defacing for each subject. erties of the data. More detailed normative estimates of
Age Group Total n n Mean Male Female n Mean Male Female CDR 0.5/1/2
The sample consisted of 198 individuals (98 without dementia and 100 with AD). Values given are mean ± SD. Values in parentheses represent the
range. Full clinical data were not available for 23 subjects. Compared with the adults without dementia, the older adults with dementia had lower
scores on the MMSE [t(194) = 10.66, p < .001] and slightly fewer years of education [t(194) = 2.60, p < .05].
MMSE scores range from 30 (best) to 0 (worst).
HBP = high blood pressure.
Reliability Scans
To provide a benchmark for determining the reliability
of analytic procedures, 20 subjects were scanned on two
separate occasions with a mean delay of 20.55 days
(range = 1–89 days). These subjects did not have de-
mentia and ranged in age from 19 to 29 years. Because
of the short latency between scan sessions, differences
in the images and measures derived from them are un-
likely to be attributable to anatomical change and in-
stead can be attributed to sources of noise (e.g., head
positioning, day-to-day scanner variability) and instability
in analytic procedures. Figure 5 shows eTIV (top) and
nWBV (bottom) from the first session versus the second
session for each subject in the reliability data set. The
mean absolute percent difference was 0.54% for eTIV
and 0.44% for nWBV. The coefficients of variation (CVs,
the standard deviation of the difference between test and
retest volumes divided by the overall mean, expressed
in percent) were 0.47% for eTIV and 0.44% for nWBV.
Obtaining and Using the Data Figure 5. Plots of automated anatomical measures for reliability
scans. Each point represents the values of the anatomical measure
OASIS data can be obtained at www.oasis-brains.org. Re- calculated from images obtained from the same subject without
quests for DVD distributions of the data can be sub- dementia on two separate scan sessions separated by a short
interval (<90 days). The reliability of eTIV (top, r = 0.99) and
mitted at the Web site. For convenience, we also provide nWBV (bottom, r = 0.96) are high.
the data using the open-source Extensible Neuroimaging
Archive Toolkit (Marcus, Olsen, Ramaratnam, & Buckner,
2007). It provides tools to search, visualize, and down- sion, the following image files are included in the
load the data. Before downloading and requesting distribution: three to four individual scan images; an
data, users are asked to abide by the OASIS Data Usage image in which the individual scans have been aligned
Agreement. and coregistered; an image that has been gain-field-
OASIS data are distributed in GNU zip archive files, corrected and registered to the atlas of Talairach and
which can be uncompressed using freely available soft- Tournoux (1988) (henceforth T88); a masked T88 image
ware. All images are distributed in Analyze 7.5 format, in which the intensity of all nonbrain voxels has been set
which can be visualized and processed using a number to zero; and a segmented T88 image in which each voxel
of commercial and open source applications, including has been labeled as gray matter, white matter, or
Neurolens, ImageJ, and MRIcro. For each imaging ses- cerebral spinal fluid. Demographic, clinical, and derived