0. redundant_CDS/
- all_redundant_CDS.faa.gz : All redundant CDS sequences (amino acid, faa)
- all_redundant_CDS.fna.gz : All redundant CDS sequences (nucleotide, fna)

1. HRGMv2_Unique_Proteins/
- HRGMv2_Unique_Proteins_rep_seq.faa.gz : Representative sequences for unique CDS sequences (amino acid, faa)
- HRGMv2_Unique_Proteins.cluster_info.updated.tsv.gz : Cluster info for Unique proteins
- HRGMv2_Unique_Proteins.taxonomic_map.tsv.gz : Taxonomy info for Unique proteins

2~6. HRGMv2_{identity - 100, 95, 90, 70, 50}_Proteins/
- HRGMv2_{identity}_Proteins_rep_seq.faa.gz : Representative sequences {identity}% protein families (amino acid, faa)
- HRGMv2_{identity}_Proteins_cluster.tsv.gz : Cluster info for {identity}% protein families (representative sequences - member)
- HRGMv2_{identity}_Proteins.cluster_info.updated.tsv.gz : Cluster info for {identity}% protein families
- HRGMv2_{identity}_Proteins.taxonomic_map.tsv.gz : Taxonomy info for {identity}% protein families
- emapper_results : eggNOG-mapper results for {identity}% proteins families
++ deepgoplus_results : deepgoplus results for 90% protein families (after filtering)


Present directory - data/protein_catalog/0.HRGMv2_Proteins

Name Last modified Size
Parent Directory--
0.redundant_CDS2025-02-17 03:51:48-
1.HRGMv2_Unique_Proteins2025-02-17 03:52:56-
2.HRGMv2_100_Proteins2025-02-17 03:55:01-
3.HRGMv2_95_Proteins2025-02-17 03:58:06-
4.HRGMv2_90_Proteins2025-02-17 03:52:22-
5.HRGMv2_70_Proteins2025-02-17 03:58:54-
6.HRGMv2_50_Proteins2025-02-17 03:59:16-
README.txt2025-04-23 20:17:041 KB