https://github.com/tanghaibao/jcvi/wiki/MCscan-%28Python-version%29
Tang, H., Bowers, J. E., Wang, X., Ming, R., Alam, M., & Paterson, A. H. (2008). Synteny and collinearity in plant genomes. Science, 320(5875), 486-488.
Kiełbasa, S. M., Wan, R., Sato, K., Horton, P., & Frith, M. C. (2011). Adaptive seeds tame genomic sequence comparison. Genome research, 21(3), 487-493.
ShinySyn
could use mcscan
output files directly to
visualize synteny blocks
( Main View)
or invoke mcscan
pipeline
( MCscan Pipeline)
with genome sequence (fasta
file) and gene coordinates (gff3
file containing CDS annotation)
as input.
This document will compare Arabidopsis thaliana TAIR10 and
Arabidopsis lyrata v2.1 as an example to demonstrate usage of
ShinySyn
.
Firstly we will need to downlaod the genmoe fasta and the gene gff3 file from phytozome:
With the input files ready, user could navigate to MCscan Pipeline page and
start running MCscan with several clicks,
please note all input files could be compressed by gzip
(with .gz
suffix):
Athaliana167
)Athaliana_167_TAIR10.fa.gz
)Athaliana_167_gene_exons.gff3.gz
)Alyrata384
)Alyrata_384_v1.fa.gz
)Alyrata_384_v2.1.gene.gff3.gz
)cscore
cut-offRun Pipeline
buttonPlease note the default cscore
cut-off is 0.7, users could use 0.99 to retrieve
the reciprocal best hits (RBHs) of the orthologous genes as suggested in MCscan’s document.
More explanation of csocre
could refer to this discussion
mentioned in the MCscan’s wiki page.
After pipeline finished, user could download all the result file by
clicking Download Result
. The result will be
a tarball named as mcscan_result.<time>.tgz
. User could extract all the files
with tar
in linux or 7-zip
in windows.
The result files include:
Athaliana167.bed
)Alyrata384.bed
)Athaliana167.Alyrata384.anchors
)Athaliana167.Alyrata384.lifted.anchors
)User could either use the output files generated externally by running Mcscan pipeline
on command line, or the output files returned in the previous section as
the inputs of ShinySyn
’s visualization.
After inputing query/subject species name, as well as uploading the four output file mentioned above, user could generate macro-synteny plot with just one click. We provide two layouts of macro-synteny: parallel and circular, which could be switched in the setting block.
Parallel Layout:
Circular Layout:
If user put mouse over the ribbons (blocks), they will be highlighted and detial information containing start/end query/subject genes will be displayed. When moving over chromosomes/contigs, all the ribbons associated to the selected one will be highlighted. User could make the highlight persist by hovering for more than 8 seconds.
If user click the ribbon, the micro-synteny associated with the selected region will be rendered in the lower panel. There will a heatmap indicating gene density of the selected region (from query species), and user could brush a small region from the heatmap, the genes pairs within will be shown as a typical micro-synteny view as mcscan, but with a zoom in/out capability. The micro-synteny will be automatically updated when user choose a different “focused” region from heatmap.
By default, for each query anchor gene, only the best hit from subject
genes will be retained. User could unpick Extract one best Subject
in
the setting block to disable this and keep all the possible orthologous
pairs. This will retain the “multiple-to-multiple” relationship.
Additionally, a table containing all the gene pairs in the
selected macro-synteny will be shown. The gene pairs are extracted from
.lifted.anchors
file, and contains “low quality anchor genes close to
high quality anchors”. User could search any gene of interest, and clicking
any row from table will automatically update micro-synteny view with
the anchor highlighted.
A tuned color scheme was used in ShinySyn
, however, user is able to simply
customize this in the setting block.
If Generate Dot Viw
was picked in the setting block, a dot plot will be
generated at the same time. User could navigate to
Dot View page to check the result.
There will be a table containing all high quality anchors (from .anchors
file)
aside of the plot. User could select a small rectangle region to zoom in on the dot plot,
and the table will be updated as well. Double clicking will reset the zoom level.