User Manual

IPOP is a multi-omics, multi-plant database, in which dataset contains Histone Modification, Chromatin Accessibility, Chromatin Interaction, Population Genomics, DNA 5mC Modification, Transcriptome, RNA m6A Modification data, and the results of Evolutionary Genomics studies; involves 13 species. IPOP provides multiple omics results data search and download functions, and six tools for users to analyze multiple omics data.

1 Browser

The Browser bar on homepage Provides the ability to view genomic annotations, as well as different omics data for each species.

Figure 1 Browser module

Select the species on the browser page to enter the Jbrowse to view all the omics sequence information of the species.

2 Search

The Search bar on the home page provides the ability to search by gene IDs, key words and sample IDs.

Figure 2 Search module

The figure above shows the result page of the search function, and the number of terms related to the search content is displayed at items.

3 Evolutionary Genomics

3.1 Index page

The Evolutionary Genomics bar provides evolutionary information of all genes in IPOP.

Figure 3 Evolutionary genomics module

3.2 Phylostratigraphic Stage

Figure 4 Phylostratigraphic Stage page

Click nodes in the phylostratigraphic tree on the index page jump to Phylostratigraphic Stage page. Phylostratigraphic Stage page provides detailed information and statistics of the phylostratigraphic stage.

3.3 Orthologous Groups

Figure 5 Orthologous Groups page

Click bars of the Orthologous Groups Evolutionary Origin part and Species Orthologous Groups / Genes Information part in Evolutionary Genomics page to jump to Orthologous Group page. Orthologous Group page provides statistics of genes related to the orthogroup.

3.4 Duplication Event

Figure 6 Duplication Event page

Click bars of Duplication Events part in Evolutionary Genomics page to jump to Duplication Event page. Duplication Event page provides detailed duplication events and mechanisms of which.

3.5 Search Result page

Figure 7 Gene page

Search by Orthologous groups and Evolution Origin jump to Gene page. Gene page provides detailed evolutionary information of related genes. Search by Duplication Events jump to Duplication Events page, which was introduced in 3.4.

4 Omics-Hub Portal

4.1 Histone Modification

4.1.1 Sample Statistics Page

Figure 8 Sample statistics of histone modification

The Sample statistics page shows the sample statistics of different strains and tissues, and lists the basic information of each sample.

Figure 9 Search module of histone modification

Search module of histone modification provides the ability to search for histone modification peaks using gene IDs and genome location.

4.1.2 Sample Infomation Page

Figure 10 Sample infomation page of histone modification

Sample infomation displays the basic information of the sample and the statistics of the distribution of peaks identified in the sample. The details of each peak are listed in the Histone Modification Peaks table, which is as follows:

Columns Details
Modificaion Modification type of the peak
Chromosome Chromosome of the peak
Start Start position of the peak
End End position of the peak
Abundance Abundance of the peak
Related Gene Related gene of the peak
Related Gene Region Related gene region of the peak

4.2 Chromatin Accessibility

4.2.1 Sample Statistics Page

Figure 11 Sample statistics of chromatin accessibility

The Sample statistics page shows the sample statistics of different strains and tissues, and lists the basic information of each sample.

Figure 12 Search module of chromatin accessibility

Search module of chromatin accessibility provides the ability to search for chromatin accessibility regions using gene IDs and genome location.

4.2.2 Sample Infomation Page

Figure 13 Sample infomation page of chromatin accessibility

Sample infomation displays the basic information of the sample and the statistics of the distribution of peaks identified in the sample. The details of each peak are listed in the Accessible Chromatin Regions table, which is as follows:

Columns Details
Chromosome Chromosome of the accessible region
Start Start position of the accessible region
End End position of the accessible region
Abundance Abundance of the accessible region
Related Gene Related gene of the accessible region
Related Gene Region Related gene region of the accessible region
Distance to Transcription Start Site Distance to Transcription Start Site of the accessible region

Figure 14 Data quality

Intersize, correlation and reads distribution of each replicate are visualized in the Data Quality part.

4.3 Chromatin Interaction

4.3.1 Sample Statistics Page

Figure 15 Sample statistics of chromatin interaction

The Sample statistics page shows the sample statistics of different strains and tissues, and lists the basic information of each sample.

Figure 16 Search module of chromatin interaction

Search module of chromatin interaction provides the ability to search for loop regions and tad regions using gene IDs and genome location.

4.3.2 Sample Infomation Page

Figure 17 Sample infomation page of chromatin accessibility

Sample infomation displays the basic information of the sample. The details of each loop region and tad region are listed in the tables, which are as follows:

Columns Details
Chromosome (Interaction Region 1) Chromosome of the loop region 1
Start (Interaction Region 1) Start position of the loop region 1
End (Interaction Region 1) End position of the loop region 1
Chromosome (Interaction Region 2) Chromosome of the loop region 2
Start (Interaction Region 2) Start position of the loop region 2
End (Interaction Region 2) End position of the loop region 2
Columns Details
Chromosome Chromosome of the tad region
Start Start position of the tad region
End End position of the tad region

4.4 Population Genomics

Figure 18 Genome wide variant-trait associations’ information

The population genomics page shows the sample statistics of variant-trait associatoins and variant related traits, and lists detailed information of each significant association in Associations table, which are as follows:

Columns Details
Variation Position of the variation
Alleles Alleles of the variation
Minor Alleles Minor alleles of the variation
Minor Allele Frequence Minor allele frequence of the variation
Trait Trait that the variation was significantly associated to
P-value P-value of the variant-trait association
Genome Version Genome Version
Mapped Gene Mapped gene of the variation
Pubmed ID Pubmed ID of the study

Figure 19 Search module of population genomics

Search module of population genomics provides the ability to search for variant-trait associations using gene IDs and genome location.

4.5 DNA 5mC Modification

4.5.1 Sample Statistics Page

Figure 20 Sample statistics of DNA 5mC Modification

The Sample statistics page shows the sample statistics of different strains and tissues, and lists the basic information of each sample.

Figure 21 Search module of DNA 5mC modification

Search module of DNA 5mC modification provides the ability to search for DNA 5mC modified genes using gene IDs and genome location.

4.5.2 Sample Infomation Page

Figure 22 Sample infomation page of DNA 5mC modification

Sample infomation displays the basic information of the sample. Statisitcs of each type methylation level is displayed in barplot. The details of each DNA 5mC modified gene are listed in the tables, which are as follows:

Columns Details
Gene ID DNA 5mC methylated gene ID
Context DNA 5mC methylation type
Gene Body Methylation type of gene body
Promoter Methylation level of gene promoter
Intron Methylation level of gene intron
Exon Methylation level of gene exon

Figure 23 Data quality of DNA 5mC modification

Coverage distribution of each methylation type and genome-wide methylation level are visualized in the data quality part.

4.6 Transcriptome

4.6.1 Sample Statistics Page

Figure 24 Sample statistics of transcriptome

The Sample statistics page shows the sample statistics of different strains and tissues, and lists the basic information of each sample.

Figure 25 Search module of transcriptome

Search module of transcriptome provides the ability to search for expression level of genes using gene IDs and genome location.

4.6.2 Sample Infomation Page

Figure 26 Sample infomation page of transcriptome

Sample infomation displays the basic information of the sample. The details of each gene are listed in the tables, which are as follows:

Columns Details
Gene ID Gene ID
Chromosome Chromosome of the expression gene
Strand Strand of the expression gene
Start Start position of the expression gene
End End position of the expression gene
TPM Expression level of the expression gene

4.7 RNA m6A Modification

4.7.1 Sample Statistics Page

Figure 27 Sample statistics of RNA m6A modification

The Sample statistics page shows the sample statistics of different strains and tissues, and lists the basic information of each sample.

Figure 28 Search module of transcriptome

Search module of transcriptome provides the ability to search for m6A peaks using gene IDs and genome location.

4.7.2 Sample Infomation Page

Figure 29 Sample infomation page of RNA m6A modification

Sample infomation displays the basic information of the sample and the statistics of the distribution of peaks identified in the sample. The details of each peak are listed in the tables, which are as follows:

Columns Details
Chromosome Chromosome of the m6A peak
Start Start position of the m6A peak
End End position of the m6A peak
Abundance Abundance of the m6A peak
Related Gene Related gene of the m6A peak
Related Gene Region Related gene region of the m6A peak

Figure 30 Enriched motif

Enriched motifs of m6A peaks in the sample are listed in Enriched Motif part.

Figure 31 m6A regulator candidates

Transcriptional profile of m6A regulator candidate genes is displayed in bar plot in the m6A Regulator Candidates part.

Figure 32 Data quality

Reads distribution and correlation between replicates of the sample are visualized in Data Quality part.

5 Tools

5.1 Omics Comparison Analysis Utility

5.1.1 Submit Tasks

Figure 33 Submission of Omics Comparison Analysis Tool

The above figure shows the way of task submission and result viewing of the comparison tool. On the Submission page, click the jobID to jump to the result page. If the task is not completed, it will wait on the running page, and it will automatically jump after it is completed.

5.1.2 Result Page

5.1.2.1 Chromatin Accessibility

Figure 34 Sample information

Detail informaion of two compared samples is listed here.

Figure 35 Overlap of accessible chromatin regions

Overlap of Accessible Chromatin Regions plot. The green bar indicates overlapping genes of two samples, the blue bar and the yellow bar indicates unique genes of two sample respectively.

Figure 36 Distribution of accessible chromatin regions

The results page displays the accessible chromatin regions with different distances from the TSS in the form of a bar plot and the accessible chromatin regions feature distribution.

5.1.2.2 Histone Modification

Figure 37 Sample information

Detail informaion of two compared samples is listed here.

Figure 38 Overlap of histone modification peaks

Overlap of Accessible Chromatin Regions plot. The green bar indicates overlapping genes of two samples, the blue bar and the yellow bar indicates unique genes of two sample respectively.

Figure 39 Distribution of histone modification peaks

The results page displays the histone modification peaks with different distances from the TSS and the histone modification peaks feature distribution in the form of bar plots.

5.1.2.3 RNA m6A Modification

Figure 40 Transcription profiles of compared m6A-seq samples

As is shown in figure 11, the transcription part of RNA m6A modification results page draws a volcano plot showing genes with significantly different expression levels. The expression of m6a-regulated candidate genes in the two samples is displayed in the form of a bar plot.

Figure 41 RNA m6A modification profiles of compared m6A-seq samples

In the modification part, the gene and peak differences of the two samples were analyzed separately, and the difference and overlapping modified genes and peaks were displayed in the form of bar plots; and the volcano plot was drawn to show the genes with significant m6a modification differences and the peaks with significant differences.

5.1.2.4 DNA Methylation

Figure 42 Differentially methylated region

The DNA methylation results page displays the differentially methylated regions in tabular form. The table details are as follows:

Columns Details
Chromosome Chromosome of differentially methylated region
Start Site Leftmost position of differential methylation region
End Site Rightmost position of differential methylation region
Statistics of T Test Statistics of T test
P-Value P-value of T test
Methtylation Level of Sample 1 Methylation level of differential methylation region in sample 1
Methtylation Level of Sample 2 Methylation level of differential methylation region in sample 2

5.1.2.5 Transcriptome

Figure 43 Transcription profiles of compared RNA-seq samples

The Transcriptome Results page displays different and overlapping genes between the two samples in a bar plot, and a volcano plot showing genes with significantly different expression.

5.2 Multi-omics Integration Analysis Utility

5.2.1 Submit Tasks

Figure 44 Submission of Integrated Omics Analysis Tool

The figure above shows how to submit tasks for integration tools, which can integrate up to three omics.

5.2.2 Result Page

Figure 45 Result of Multi-omics Integration Analysis tool

The integration tool results page lists the intersection of two omics significant regions in tabular form and visualized with a circos plot. The table details are as follows:

Columns Details
Chromosome (1) Chromatin of intersect peak in sample 1
Start (1) Leftmost position of intersect peak in sample 1
End (1) Rightmost position of intersect peak in sample 1
Chromosome (2) Chromatin of intersect peak in sample 2
Start (2) Leftmost position of intersect peak in sample 2
End (2) Rightmost position of intersect peak in sample 2

5.3 Evolutionary Information Annotation Utility

5.3.1 Submit Tasks

Figure 46 Submission of Evolutionary Information Analysis Tool

The figure above shows how the evolutionary analysis tool is used. You can enter text or upload a file to submit genes to search.

5.3.2 Result Page

5.3.2.1 Genes Evolutionary Annotation

The Evolutionary Information Annotation Tool result page presents evolutionary information for the searched genes in various formats.

Figure 47 Genes evolutionary annotation

The orthologous groups and phylostratigraphic stages associated with each gene are displayed in tabular form. The table details are as follows:

Columns Details
Gene ID Queried gene ID
Orthologous Group Orthologous group of queried gene
Phylostratigraphic stage Phylostratigraphic stage of the orthologous group to which the queried gene belongs
Species Species of queried gene

5.3.2.2 Phylostratigraphic Tree

Figure 48 Genes Phylostratigraphic tree

The phylostratigraphic stages involved in the genes entered by the user are marked in red in the phylostratigraphic tree.

5.3.2.3 Genes Evolutionary Origin

Figure 49 Genes evolutionary origin

The number of genes at different phylostratigraphic stages is displayed in the form of a bar plot.

5.3.2.4 Gene Related Orthologous Groups

Figure 50 Gene related orthologous groups

The number of genes associated with different orthologous groups is displayed as a bar plot.

5.3.2.5 Duplication Events

Figure 51 Duplication events

The bar plot shows the number of genes with different duplication event mechanisms, and the table shows the repeat mechanism of each duplicate gene pair.

5.4 Transcriptome-wide Association Analysis Utility

5.4.1 Submit Tasks

Figure 52 Submission of Transcriptome-wide Association Analysis Tool

The above figure shows the usage of the twas analysis tool. Users can view the existing TWAS results in the database or analyze their own data. The content and usage of the Submission page is the same as the omics comparison analysis tool.

5.4.2 Result Page

Figure 53 Result of Transcriptome-wide Association Analysis Tool

The TWAS results are visualized in the form of a Manhattan map, and details of the TWAS results are given in the table, and the contents of the table are as follows:

Columns Details
ID Feature/gene identifier
CHR Chromosome
BEST_GWAS_ID rsID of the most significant GWAS SNP in locus
P0 Gene start
P1 Gene end
HSQ Heritability of the gene
MODEL Best performing model
NSNP Number of SNPs in the locus
TWAS_P TWAS P-value
TWAS_Z TWAS Z-score
MODELCV_PV Gene cross-validation P-value of the best performing model
MODELCV_R2 cross-validation R2 of the best performing model
eQTL_GWAS_Z GWAS Z-score for this eQTL
BEST_GWAS_Z Z-score of the most significant GWAS SNP in locus
eQTL_ID rsID of the best eQTL in the locus
eQTL_R2 cross-validation R2 of the best eQTL in the locus
eQTL_Z Z-score of the best eQTL in the locus

5.5 Phylogenetic Tree Construction Utility

5.5.1 Submit Tasks

Figure 54 Submission of Phylogenetic Tree Construction tool

The Phylogenetic Tree Construction tool can be used to build an evolutionary tree using either a Fasta file or gene IDs that matches the genome version in the database.

5.5.2 Result Page

Figure 55 Result of Phylogenetic Tree Construction tool

5.6 Basic Local Alignment Search Utility

5.6.1 Submit Tasks

Figure 56 Submission of Basic Local Alignment Search tool

5.6.2 Result Page

Figure 57 Result of Basic Local Alignment Search tool

BLAST results are displayed in the form of a table, and the contents of the table are as follows:

Columns Details
Qseqid Query sequence ID
Sseqid Subject sequence ID
Pident Percentage of identical matches
Length Alignment length
Qstart Start of alignment in query
Qend End of alignment in query
Sstart Start of alignment in subject
Send End of alignment in subjuct
Evalue Expect value
Bitscore Bit score

6 Download

The Download bar on the home page provides downloads of all the result data in the database.

Figure 58 Download module

7 Related software used in study

Software Reference Resource
BLAST https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-10-421 https://blast.ncbi.nlm.nih.gov/Blast.cgi
FastTree https://academic.oup.com/mbe/article/26/7/1641/1128976 https://github.com/PavelTorgashov/FastTree
DESeq2 https://genomebiology.biomedcentral.com/articles/10.1186/s13059-014-0550-8 https://github.com/thelovelab/DESeq2
CGmapTools https://doi.org/10.1093/bioinformatics/btx595 https://github.com/guoweilong/cgmaptools
BEDTools https://doi.org/10.1093/bioinformatics/btq033 https://github.com/arq5x/bedtools2
FUSION https://www.nature.com/articles/ng.3506 http://gusevlab.org/projects/fusion/
Fastp https://doi.org/10.1093/bioinformatics/bty560 https://github.com/OpenGene/fastp
MultiQC https://doi.org/10.1093/bioinformatics/btw354 https://multiqc.info/
BWA-MEM https://arxiv.org/abs/1303.3997 https://github.com/lh3/bwa
SAMtools https://doi.org/10.1093/gigascience/giab008 https://github.com/samtools/samtools
Picard / https://broadinstitute.github.io/picard/
MACS2 https://genomebiology.biomedcentral.com/articles/10.1186/gb-2008-9-9-r137 https://github.com/macs3-project/MACS
DeepTools https://doi.org/10.1093/nar/gkw257 https://github.com/deeptools/deepTools
ChIPseeker https://doi.org/10.1093/bioinformatics/btv145 https://bioconductor.org/packages/release/bioc/html/ChIPseeker.html
DiffBind https://www.nature.com/articles/nature10730 https://github.com/aeron15/DiffBind
HIC-Pro https://genomebiology.biomedcentral.com/articles/10.1186/s13059-015-0831-x http://github.com/nservant/HiC-Pro
Juicer https://doi.org/10.1016/j.cels.2016.07.002 https://github.com/aidenlab/Juicebox/wiki/Download
Bismark https://doi.org/10.1093/bioinformatics/btr167 https://github.com/FelixKrueger/Bismark
CGmapTools https://doi.org/10.1093/bioinformatics/btx595 https://cgmaptools.github.io/
MethGo https://bmcgenomics.biomedcentral.com/articles/10.1186/1471-2164-16-S12-S11 http://paoyangchen-laboratory.github.io/methgo/
HISAT2 https://www.nature.com/articles/nmeth.3317 https://daehwankimlab.github.io/hisat2/download/
StringTie https://www.nature.com/articles/nprot.2016.095 https://ccb.jhu.edu/software/stringtie/
featureCounts https://doi.org/10.1093/bioinformatics/btt656 https://sourceforge.net/projects/subread/files/subread-2.0.6/
PEA https://doi.org/10.1093/bioinformatics/bty421 https://github.com/cma2015/PEA
IRanges https://doi.org/10.1371/journal.pcbi.1003118 https://bioconductor.org/packages/release/bioc/html/IRanges.html
VCFtools https://doi.org/10.1093/bioinformatics/btr330 https://vcftools.sourceforge.net/downloads.html
Trimmomatic https://doi.org/10.1093/bioinformatics/btu170 https://github.com/usadellab/Trimmomatic
STAR https://www.nature.com/articles/ng.3506 https://doi.org/10.1093/bioinformatics/bts635
OrthoFinder https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1832-y https://github.com/davidemms/OrthoFinder
DIAMOND https://www.nature.com/articles/nmeth.3176 http://ab.inf.uni-tuebingen.de/software/diamond
TimeTree https://doi.org/10.1093/molbev/msx116 http://www.timetree.org
DupGen_finder https://genomebiology.biomedcentral.com/articles/10.1186/s13059-019-1650-2 https://github.com/qiao-xin/DupGen_finder