0. redundant_CDS/
- all_redundant_CDS.faa.gz : All redundant CDS sequences (amino acid, faa)
- all_redundant_CDS.fna.gz : All redundant CDS sequences (nucleotide, fna)
1. HRGMv2_Unique_Proteins/
- HRGMv2_Unique_Proteins_rep_seq.faa.gz : Representative sequences for unique CDS sequences (amino acid, faa)
- HRGMv2_Unique_Proteins.cluster_info.updated.tsv.gz : Cluster info for Unique proteins
- HRGMv2_Unique_Proteins.taxonomic_map.tsv.gz : Taxonomy info for Unique proteins
2~6. HRGMv2_{identity - 100, 95, 90, 70, 50}_Proteins/
- HRGMv2_{identity}_Proteins_rep_seq.faa.gz : Representative sequences {identity}% protein families (amino acid, faa)
- HRGMv2_{identity}_Proteins_cluster.tsv.gz : Cluster info for {identity}% protein families (representative sequences - member)
- HRGMv2_{identity}_Proteins.cluster_info.updated.tsv.gz : Cluster info for {identity}% protein families
- HRGMv2_{identity}_Proteins.taxonomic_map.tsv.gz : Taxonomy info for {identity}% protein families
- emapper_results : eggNOG-mapper results for {identity}% proteins families
++ deepgoplus_results : deepgoplus results for 90% protein families (after filtering)