0. redundant_CDS/ - all_redundant_CDS.faa.gz : All redundant CDS sequences (amino acid, faa) - all_redundant_CDS.fna.gz : All redundant CDS sequences (nucleotide, fna) 1. HRGMv2_Unique_Proteins/ - HRGMv2_Unique_Proteins_rep_seq.faa.gz : Representative sequences for unique CDS sequences (amino acid, faa) - HRGMv2_Unique_Proteins.cluster_info.updated.tsv.gz : Cluster info for Unique proteins - HRGMv2_Unique_Proteins.taxonomic_map.tsv.gz : Taxonomy info for Unique proteins 2~6. HRGMv2_{identity - 100, 95, 90, 70, 50}_Proteins/ - HRGMv2_{identity}_Proteins_rep_seq.faa.gz : Representative sequences {identity}% protein families (amino acid, faa) - HRGMv2_{identity}_Proteins_cluster.tsv.gz : Cluster info for {identity}% protein families (representative sequences - member) - HRGMv2_{identity}_Proteins.cluster_info.updated.tsv.gz : Cluster info for {identity}% protein families - HRGMv2_{identity}_Proteins.taxonomic_map.tsv.gz : Taxonomy info for {identity}% protein families - emapper_results : eggNOG-mapper results for {identity}% proteins families ++ deepgoplus_results : deepgoplus results for 90% protein families (after filtering)