This is a descriptive, tabular dataset of publications related to microbial or genomic research conducted within PIE. Assession numbers for genetic sequences generated from PIE samples are provided where available, followed by a very brief description of analysis type and study objectives. Sampling locations within PIE, sampling dates, and habitat type (sea water, fresh water, sediment, marsh) are also given. Environmental data are included in some publications and are listed here (if brief) or availability is described. Links to sequence archives are given in Methods.
Microbial genomic sequence data is available by conducting searches using accession numbers associated with the sequence of interest.
Examples of of searches are:
Entrez search, easily search by single accession number; returns sequence and publication information. Link: http://www.ncbi.nlm.nih.gov/sites/gquery
Batch Entrez, Batch search a series of accession numbers after creating a Fasta-formatted text file; returns links to GenBank, Fasta, Graphics, Population Set and Related Sequences for each accession number.
Batch Entrez Fasta-formatted text file should be a single column list of accession numbers with a ".1" added like a file extension. File should have no header line. Go to http://ncbi.nlm.nih.gov/sites/batchentrez, follow the web site directions for using Batch Entrez
Fasta-formatted text file example:
To access MG-RAST annotation and raw sequences, the user will need to navigate to this page:
and enter the project id listed in the PIE-Microbe-Genomics tab (e.g. mgp84173) into the text box at the upper left of the page adjacent to the "search" button.
The search will return a list of the samples associated with the project and a link to download the annotation, fasta files, and associated metadata.
Data can also be accessed through the API according to the instructions here
Also available, courtesy of Joe Vineis, is a small git repository that includes some simple instructions for handling the API for beginners:
Version 01: March 6, 2012, updated data and metadata. Used MarcrosExportEML_HTML (working)pie_excel2007.xlsm 2/28/2012 8:18 PM for QA/QC to EML 2.1.0
Version 02: April 20, 2018, data and metadata created to comply with importation to Drupal and LTER PASTA. Used MarcrosExportEML_HTML (working)pie_excel2007_Jan2018.xlsm 1/30/2018 2:24 PM for QA/QC to EML 2.1.0
Version 03: August 13, 2019, edited URL made shorter, data and metadata updates to comply with importation to DEIMS7 and LTER Data Portal. Used MarcrosExportEML_HTML (working)pie_excel2007_Jun2019.xlsm 6/7/19 12:58 PM for QA/QC to EML 2.1.0.
Version 04: August 26, 2019, data and metadata updates to comply with importation to DEIMS7 and LTER Data Portal. Used MarcrosExportEML_HTML (working)pie_excel2007_Jun2019.xlsm 6/7/19 12:58 PM for QA/QC to EML 2.1.0.
Version 05:July 29, 2021, Data and metadata updates. Used MarcrosExportEML_HTML (working)pie_excel2007_Jul2021.xlsm 7/26/2021 9:04 AM for QA/QC to EML 2.1.0.