This page was rendered with marked and may have layout issues. For the best experience, visit: https://xxu.do/posts/academic/De-novo-assemble-RNA-seq-sequence
A research paper published on bioRxiv identified a new coronavirus subgenus, and I would like to find out whether there are any changes in its protease. However, the sequence data has not been published.
Fortunately, a similar sequence is available on NCBI. Unfortunately, it is only available as RNA-seq data.
So I need to assemble the RNA-seq reads first, and then BLAST the sequence I am interested in against the assembled data.
Set up the environment with conda:
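A minimal sketch, assuming the environment provides SRA Toolkit, FastQC, fastp, Trinity, and BLAST+ (on bioconda these are packaged as `sra-tools`, `fastqc`, `fastp`, `trinity`, and `blast`; this tool list is an assumption based on the steps in this post). Once the environment is activated, a small Python check can confirm that the executables are on `PATH`. The helper name `missing_tools` is hypothetical.

```python
import shutil

# Executables the later steps call. This list is an assumption based on the
# tools named in this post (fastp, Trinity, BLAST) plus SRA Toolkit and FastQC.
REQUIRED_TOOLS = [
    "fasterq-dump",
    "fastqc",
    "fastp",
    "Trinity",
    "makeblastdb",
    "blastn",
]


def missing_tools(tools, which=shutil.which):
    """Return the tools that cannot be found on PATH."""
    return [tool for tool in tools if which(tool) is None]


missing = missing_tools(REQUIRED_TOOLS)
if missing:
    print("Missing from PATH:", ", ".join(missing))
else:
    print("All tools found.")
```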
Fetch the data:
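As a hedged sketch, the reads could be downloaded with `fasterq-dump` from SRA Toolkit, driven from Python. The accession `SRRXXXXXXX` is a placeholder, and the helper name, output directory, and thread count are assumptions for illustration.

```python
import shlex


def fasterq_dump_command(accession, outdir="raw", threads=4):
    """Build a fasterq-dump call that writes paired reads as separate files."""
    return [
        "fasterq-dump",
        "--split-files",  # write read 1 and read 2 to separate FASTQ files
        "--threads", str(threads),
        "--outdir", outdir,
        accession,
    ]


# "SRRXXXXXXX" is a placeholder, not a real accession.
cmd = fasterq_dump_command("SRRXXXXXXX")
print(shlex.join(cmd))
# To actually download the reads:
#   import subprocess
#   subprocess.run(cmd, check=True)
```

For a paired-end run this would typically leave `raw/SRRXXXXXXX_1.fastq` and `raw/SRRXXXXXXX_2.fastq` on disk.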
Data quality check
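One common choice for this check is FastQC. The sketch below builds a FastQC command from Python. The file names are placeholders, and the helper name and output directory are assumptions.

```python
import shlex


def fastqc_command(fastq_files, outdir, threads=2):
    """Build a FastQC call that writes its reports into `outdir`."""
    return ["fastqc", "--outdir", outdir, "--threads", str(threads), *fastq_files]


# File names are placeholders following the fasterq-dump naming pattern.
cmd = fastqc_command(
    ["raw/SRRXXXXXXX_1.fastq", "raw/SRRXXXXXXX_2.fastq"],
    outdir="qc_raw",
)
print(shlex.join(cmd))
# FastQC expects the output directory to exist before it runs:
#   import os, subprocess
#   os.makedirs("qc_raw", exist_ok=True)
#   subprocess.run(cmd, check=True)
```

The same helper can be pointed at the cleaned reads and a different output directory for the post-cleaning check.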
Quality control using fastp
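A minimal sketch of a paired-end fastp call, built from Python. The input and output file names, the report prefix, and the thread count are assumptions. fastp's default filtering and adapter detection are left untouched here.

```python
import shlex


def fastp_command(in1, in2, out1, out2, report_prefix="fastp", threads=4):
    """Build a paired-end fastp call with HTML and JSON reports."""
    return [
        "fastp",
        "--in1", in1,
        "--in2", in2,
        "--out1", out1,
        "--out2", out2,
        "--html", f"{report_prefix}.html",
        "--json", f"{report_prefix}.json",
        "--thread", str(threads),
    ]


# All paths below are placeholders.
cmd = fastp_command(
    "raw/SRRXXXXXXX_1.fastq",
    "raw/SRRXXXXXXX_2.fastq",
    "clean/SRRXXXXXXX_1.clean.fastq",
    "clean/SRRXXXXXXX_2.clean.fastq",
)
print(shlex.join(cmd))
# To run it: import subprocess; subprocess.run(cmd, check=True)
```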
Data quality check (post-cleaning data)
Assemble with Trinity
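A hedged sketch of a paired-end Trinity run, built from Python. The CPU and memory values are assumptions and should be sized to the machine. The output name `trinity` is inferred from the `trinity.Trinity.fasta` file used later in this post: recent Trinity releases write the assembly to `<output>.Trinity.fasta`, while older releases write `Trinity.fasta` inside the output directory.

```python
import shlex


def trinity_command(left, right, output="trinity", cpu=8, max_memory="16G"):
    """Build a Trinity call for paired-end FASTQ input."""
    return [
        "Trinity",
        "--seqType", "fq",  # input reads are FASTQ
        "--left", left,
        "--right", right,
        "--CPU", str(cpu),
        "--max_memory", max_memory,
        "--output", output,
    ]


# Read file names are placeholders for the cleaned reads.
cmd = trinity_command(
    "clean/SRRXXXXXXX_1.clean.fastq",
    "clean/SRRXXXXXXX_2.clean.fastq",
)
print(shlex.join(cmd))
# To run it: import subprocess; subprocess.run(cmd, check=True)
```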
Check the Trinity result:
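Trinity ships a `TrinityStats.pl` utility for this kind of summary. As an independent sketch of what such a check looks at, the Python below counts contigs and computes the N50 of an assembly. The FASTA content is a tiny made-up example, and the function names are hypothetical.

```python
def read_fasta_lengths(lines):
    """Map each FASTA record ID to its sequence length."""
    lengths = {}
    name = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]  # ID is the first word of the header
            lengths[name] = 0
        elif name is not None:
            lengths[name] += len(line)
    return lengths


def n50(lengths):
    """Length L such that contigs of length >= L cover half of all bases."""
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    return 0


# Made-up records; in practice iterate over open("trinity.Trinity.fasta").
demo = [">c1 len=8", "ACGTACGT", ">c2", "ACGT", ">c3", "AC"]
lengths = read_fasta_lengths(demo)
print("contigs:", len(lengths))
print("total bases:", sum(lengths.values()))
print("N50:", n50(lengths.values()))
```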
BLAST the sequence of interest
Put your sequence in query.fasta.
Make BLAST database and run:
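A minimal sketch of the two BLAST+ calls, built from Python. The database name, output file, and E-value cutoff are assumptions. The program is a parameter because the choice depends on the query: `blastn` for a nucleotide query, or `tblastn` for a protein query searched against the nucleotide assembly.

```python
import shlex


def makeblastdb_command(fasta, db_name, dbtype="nucl"):
    """Build a makeblastdb call that indexes `fasta` as a BLAST database."""
    return ["makeblastdb", "-in", fasta, "-dbtype", dbtype, "-out", db_name]


def blast_command(program, query, db_name, out, evalue="1e-5"):
    """Build a BLAST+ search that writes tabular output (outfmt 6)."""
    return [
        program,
        "-query", query,
        "-db", db_name,
        "-outfmt", "6",  # tab-separated, one hit per line
        "-evalue", evalue,
        "-out", out,
    ]


# "trinity_db" and "hits.tsv" are hypothetical names.
db_cmd = makeblastdb_command("trinity.Trinity.fasta", "trinity_db")
search_cmd = blast_command("blastn", "query.fasta", "trinity_db", "hits.tsv")
print(shlex.join(db_cmd))
print(shlex.join(search_cmd))
# To run them in order:
#   import subprocess
#   subprocess.run(db_cmd, check=True)
#   subprocess.run(search_cmd, check=True)
```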
Check the BLAST result:
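If the search was written in BLAST's default tabular format (`-outfmt 6`, which is an assumption about the run), the result can be inspected with a few lines of Python. The sketch below parses the twelve default columns and picks the hit with the highest bit score. The sample rows and transcript IDs are made up.

```python
import csv
from typing import NamedTuple


class Hit(NamedTuple):
    qseqid: str
    sseqid: str
    pident: float
    length: int
    mismatch: int
    gapopen: int
    qstart: int
    qend: int
    sstart: int
    send: int
    evalue: float
    bitscore: float


def parse_outfmt6(lines):
    """Parse default 12-column BLAST tabular output into Hit records."""
    hits = []
    for row in csv.reader(lines, delimiter="\t"):
        if not row or row[0].startswith("#"):
            continue
        hits.append(Hit(
            row[0], row[1], float(row[2]),
            *(int(value) for value in row[3:10]),
            float(row[10]), float(row[11]),
        ))
    return hits


def best_hit(hits):
    """Return the hit with the highest bit score, or None if there are none."""
    return max(hits, key=lambda hit: hit.bitscore) if hits else None


# Made-up rows; in practice pass an open file such as open("hits.tsv").
sample = [
    "query1\tTRINITY_DN1_c0_g1_i1\t98.5\t900\t12\t1\t1\t900\t150\t1049\t0.0\t1600",
    "query1\tTRINITY_DN2_c0_g1_i1\t80.0\t300\t60\t0\t1\t300\t400\t101\t1e-50\t210",
]
top = best_hit(parse_outfmt6(sample))
print(top.sseqid, top.sstart, top.send, top.evalue)
```

A subject start greater than the subject end, as in the second sample row, means the hit lies on the reverse strand of the transcript.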
Extract the sequence from trinity.Trinity.fasta
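A minimal pure-Python sketch of the extraction, assuming the transcript ID and, optionally, the subject coordinates come from the BLAST hit. Coordinates are 1-based and inclusive, as BLAST reports them, and a start greater than the end is treated as a reverse-strand hit. The function names and the demo records are hypothetical.

```python
def read_fasta(lines):
    """Map each FASTA record ID (first word of the header) to its sequence."""
    records = {}
    name = None
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract(records, seq_id, start=None, end=None):
    """Return a whole record, or the 1-based inclusive region start..end."""
    seq = records[seq_id]
    if start is None or end is None:
        return seq
    if start <= end:
        return seq[start - 1:end]
    # start > end: the hit is on the reverse strand
    return reverse_complement(seq[end - 1:start])


# Made-up records; in practice use open("trinity.Trinity.fasta").
demo = [">t1 len=10", "AAACC", "CGGGT", ">t2", "ACGT"]
records = read_fasta(demo)
print(extract(records, "t1"))
print(extract(records, "t1", 2, 5))
print(extract(records, "t1", 5, 2))
```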
You can also BLAST with the predicted sequence:
Make BLAST database and run: