HMMs database #20

Open
sugar-sugar1 opened this issue Mar 7, 2023 · 1 comment
@sugar-sugar1

Hello,

Will the HMM databases (cellular organisms and GVOG) be updated?
If I want to build my own database, how should I configure it to be compatible with viralrecall?
How can I skip Prodigal and use a .faa file directly as input? (The genomes of some cellular organisms are too large.)

Thanks for your time and consideration!

@faylward
Owner

There are currently no plans to update the GVOG database; it should still work well for Nucleocytoviricota. There is no straightforward way to change the database, because the scores for each GVOG have been calibrated according to their prevalence in Nucleocytoviricota vs. other viruses. If you wanted to do that, you'd have to change the path to the GVOG database in viralrecall.py and also alter the files in acc/ so that your new HMMs are present. Lastly, in the bin/ folder I have left a prodigal executable whose source code has been altered to allow for longer contigs; if you put that prodigal in your PATH you should be good.
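
For anyone unsure whether the modified prodigal is the one actually being picked up, here is a minimal sketch of prepending a directory to PATH and checking which prodigal a PATH lookup resolves to. The checkout location `viralrecall/bin` and the helper names `prepend_to_path` and `resolve_prodigal` are assumptions for illustration only, not part of viralrecall.

```python
import os
import shutil
from pathlib import Path


def prepend_to_path(bin_dir, env=None):
    """Return a copy of env with bin_dir placed first on PATH."""
    env = dict(os.environ if env is None else env)
    bin_dir = str(Path(bin_dir).resolve())
    current = env.get("PATH", "")
    env["PATH"] = bin_dir + (os.pathsep + current if current else "")
    return env


def resolve_prodigal(env):
    """Return the prodigal executable a PATH lookup would pick, or None."""
    return shutil.which("prodigal", path=env.get("PATH", ""))


if __name__ == "__main__":
    # Hypothetical location of a local viralrecall checkout; adjust as needed.
    repo_bin = Path("viralrecall") / "bin"
    env = prepend_to_path(repo_bin)
    print("prodigal resolves to:", resolve_prodigal(env))
```

If the printed path points into the bin/ folder, a process started with that environment (for example via `subprocess.run(..., env=env)`) should find the modified executable first. If it prints `None` or another location, the file is probably missing or not marked executable.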
