questions about how to get genes from the output #53

Open
3 tasks done
alexyfyf opened this issue Aug 2, 2023 · 4 comments


alexyfyf commented Aug 2, 2023

Please report

  • version of RNA-Bloom with java -jar RNA-Bloom.jar -version
    RNA-Bloom v2.0.1
  • version of java with java -version
    openjdk version "18.0.1" 2022-04-19
  • exact command used to run RNA-Bloom
    rnabloom -long ${FILE} -t 48 -outdir ${NAME}

Hi Ka Ming,

I'm using RNA-Bloom2 to assemble long-read cDNA RNA-seq data, and I have a question about the output. I can see that the transcripts.fa files have the sequence for each transcript, but how can I tell which transcripts come from the same gene?
I don't see that information in the headers. Some example headers are shown here:

>rb_90719 l=1982 c=0.25546062 path=[94775+,95098+]
>rb_90720 l=407 c=0.21744472 s=103012

Also, I'm not sure why some headers show s while others show path. Is there any difference?

I would really appreciate it if you could help explain this.

Cheers,
Alex

kmnip added the question label (Further information is requested) on Aug 3, 2023
Collaborator

kmnip commented Aug 3, 2023

RNA-Bloom does not make any inference about genes.

path indicates that the transcript was assembled from the listed sequences, which come from the previous step of the assembly.
s indicates that the transcript originates from a single sequence.
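
For anyone scripting against these headers, here is a minimal sketch of a parser for the two header shapes quoted above. It is not part of RNA-Bloom. The function name `parse_header` is hypothetical, the meaning of `l` and `c` is not stated in this thread (they look like length and coverage, but that is a guess), and treating the trailing `+` in each path entry as an orientation sign is also an assumption. It assumes fields are separated by whitespace and that the `path=[...]` list contains no spaces, as in the examples.

```python
def parse_header(line):
    """Parse one transcripts.fa header line into a dict.

    Assumed layout (taken from the two examples in this thread):
        >ID l=<int> c=<float> path=[<id><sign>,...]   or
        >ID l=<int> c=<float> s=<id>
    """
    fields = line.strip().lstrip(">").split()
    record = {"id": fields[0], "l": None, "c": None, "path": None, "s": None}
    for field in fields[1:]:
        key, _, value = field.partition("=")
        if key == "l":
            record["l"] = int(value)      # probably length (assumption)
        elif key == "c":
            record["c"] = float(value)    # probably coverage (assumption)
        elif key == "path":
            # "[94775+,95098+]" -> [("94775", "+"), ("95098", "+")]
            # The last character is treated as a sign (assumption).
            members = [m for m in value.strip("[]").split(",") if m]
            record["path"] = [(m[:-1], m[-1]) for m in members]
        elif key == "s":
            record["s"] = value           # ID of the single source sequence
    return record


if __name__ == "__main__":
    for header in (
        ">rb_90719 l=1982 c=0.25546062 path=[94775+,95098+]",
        ">rb_90720 l=407 c=0.21744472 s=103012",
    ):
        print(parse_header(header))
```

A record with a non-empty `path` was assembled from several sequences of the previous step, while a record with `s` set came from a single sequence. Neither field says anything about gene membership.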

kmnip self-assigned this on Aug 3, 2023
Author

alexyfyf commented Aug 6, 2023

Thank you so much for your reply. From your experience, do you have any suggestions on how to infer genes from RNA-Bloom2 output?

Cheers,
Alex

Collaborator

kmnip commented Aug 9, 2023

You can possibly try this:
http://arthropods.eugenes.org/EvidentialGene/other/sra2genes_testdrive/sra2genes4v_testdrive/

If you are interested in a crude gene grouping of assembled transcripts, I can make it a feature request (but at very low priority).

Author

alexyfyf commented Aug 9, 2023

Thank you so much. I would definitely like to have this feature in the future.

kmnip added the enhancement label (New feature or request) on Aug 15, 2023