The Data Integrator finds items in different tracks that overlap by position, and unlike the Table Browser's intersection function, the Data Integrator can output all fields from all selected tracks. Up to 5 different tracks may be queried at a time.
This section contains a brief overview of Data Integrator controls. For more information on using the tools, see the Data Integrator User's Guide.
Select Genome Assembly and Region
The controls in this section are for selecting a genome assembly and region to search.
- group: A species group: Mammal, Vertebrate, Insect etc.
- genome: A single species such as Human or Mouse (not available for certain tracks with restrictions on data sharing).
- assembly: A version of the reference genome assembly such as GRCh37/hg19.
Configure Data Sources
Currently selected data sources (tracks, custom tracks, hub tracks etc) are listed with icons for reordering the data sources. The first data source is special in that data from the remaining data sources appear only when they overlap with the first data source. Under "Add Data Source", several menus display available data sources:
- track group: A category of data track, for example "Genes and Gene Prediction" or "Regulation".
- track: One or more data tables containing results of an experiment or a group of closely related experiments. Some tracks are not available when the region is set to genome due to the data provider's restrictions on sharing.
- table: This appears only when the selected track has more than one data table.
These sections can be reordered by dragging on the section title or arrow icon on the left. To remove a section, click the icon to the right of the title. Click on the Add button to add a new data source.Output Options
- Send output to file: check this box to have output sent to a local file instead of to the web browser window.
When this is checked, additional options appear:- name: the file name to which output will be saved
- compress with gzip: check this box to have the output file compressed by gzip (.gz). This saves disk space and may reduce network transfer time.
- Choose fields: click this button to pop up a dialog box with a checkbox for each field of each data source. If a checkbox is checked, that field will appear in the output. Some tracks also have related mysql tables that can be selected and added to output. However, some of those tables may be unavailable when region is set to genome due to the data provider's restrictions on sharing.