0%

#HiC-Pro软件 ##1. HiCPro自带的digest_genome.py程序获得消化片段的BED文件及chromosomes' size表格文件,需要限制酶酶切位点及参考基因组信息。 cd module load HiC-Pro/2.11.1 /public/home/software/opt/bio/software/HiC-Pro/2.11.1/HiC-Pro_2.11.1/bin/utils/digest_genome.py -r A^AGCTT -o /public/home/jyli/Gossypium_PAN/ADK_Manuscript/ReAnalysis_KHiC/HindIII_resfrag_K.bed ~/Gossypium_PAN/HAU_10Genomes/RawGenomes/K2/K2.Chr.fa ##2. 设置配置文件:https://www.jianshu.com/p/f5050eacdc1e 注:原始数据文件夹下面建立rawdata,且将数据放入 配置文件主要修改内容 BOWTIE2_IDX_PATH = # bowtie2建立的索引所在的路径,记住绝对路径 REFERENCE_GENOME = # bowtie2建立的索引 GENOME_SIZE = # 一个文件记录着参考基因组中每条序列的大小 GENOME_FRAGMENT = 消化片段的BED文件所在的路径 LIGATION_SITE = #连接位点 若单个测序数据则 PAIR1_EXT = SRR824846_Q20L10_1 PAIR2_EXT = SRR824846_Q20L10_2

miRNA prediction software
The clean data of small RNA sequencing (miRNA and siRNA reads) were mapped to G. barbadense using Bowtie, which allowed 200 multiple mapping positions and zero mismatch for each read. We adopted structure-based annotation and probability-based annotation to predict miRNA loci as suggested by Paterson et al. [22]. For the structure-based annotation, RNAfold was employed to predict secondary structures and miRcheck was used to evaluate secondary structures [65]. We then utilized miRDP to filter the putative precursors of the structure-based annotation [66]. The cutoff value of the largest miRNA family size was set at 30 owing to the genome doubling of teraploid relative to diploid. All the annotated mature miRNAs were searched against the miRBase (Release 20) to categorize into cotton conserved and non-conserved miRNA gene families [67]. We also employed the CleaveLand pipeline to predict putative miRNA targets based on the degradation data [68]. The bona fide miRNA targets were detected based on the criterion suggested by Addo-Quaye et al. [69]. After eliminating miRNA reads from small sequencing datasets, the remaining were then regarded as siRNAs. Wang et al.,2015 NP
Target predict
Annotated mRNA database of upland cotton from NCBI and assembled contigs from our cotton EST database was employed to identify miRNA targets by Target-align with default
parameters(Xie and Zhang, 2010). To validate predicted miRNA targets, Cleveland (Version 4.3, http://axtell-lab-psu.weebly.com/cleaveland.html) (Addo-Quaye et al., 2009) was used
to do degradome analysis with some modifications, where maximal mismatches between miRNAs and target sites are less than four and no mismatches is allowed on the 10th and
11th nucleotides on miRNAs (Meyers et al., 2008). Xie et al.,2015 PBJ