Query lcl|NC_021557.1_cdsid_YP_008129846.1 [gene=RHYG_00032] [protein=hypothetical protein] [protein_id=YP_008129846.1] [location=22147..22617] Match_columns 156 No_of_seqs 103 out of 211 Neff 6.3 Searched_HMMs 1612 Date Thu Nov 7 17:50:19 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_32 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_32_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99848 Length: 172 100.0 1.3E-50 8E-54 294.1 15.5 145 1-156 1-168 (172) 2 protein:vir:79074 Length: 150 100.0 3.5E-49 2.2E-52 286.2 15.6 144 2-156 1-146 (150) 3 protein:vir:107864 Length: 150 100.0 9.3E-49 5.8E-52 283.9 15.8 146 2-154 1-150 (150) 4 protein:vir:1993 Length: 141 # 100.0 1.2E-47 7.7E-51 277.7 15.6 137 2-156 1-137 (141) 5 protein:vir:99222 Length: 138 100.0 3.1E-46 1.9E-49 270.1 15.6 138 2-156 1-138 (138) 6 protein:vir:79253 Length: 138 100.0 3.1E-46 1.9E-49 270.1 15.6 138 2-156 1-138 (138) 7 protein:vir:103846 Length: 138 100.0 6.4E-46 4E-49 268.4 15.6 138 2-156 1-138 (138) 8 protein:vir:80967 Length: 131 96.6 7.9E-05 4.9E-08 43.1 10.5 107 2-127 1-131 (131) 9 protein:vir:43 Length: 131 # N 96.4 0.00015 9.1E-08 41.6 10.5 106 2-127 1-131 (131) 10 protein:vir:98481 Length: 136 95.8 0.0003 1.9E-07 39.9 9.9 119 1-156 1-136 (136) 11 protein:vir:2432 Length: 124 # 94.8 0.00052 3.2E-07 38.6 7.9 123 2-150 1-124 (124) 12 protein:vir:94761 Length: 132 94.5 0.00091 5.6E-07 37.3 8.4 114 1-155 1-132 (132) 13 protein:vir:4228 Length: 125 # 93.6 0.0012 7.2E-07 36.7 7.3 123 2-150 1-125 (125) 14 protein:vir:9576 Length: 131 # 93.1 0.00035 2.2E-07 39.5 3.7 126 1-155 1-131 (131) 15 protein:vir:98900 Length: 132 93.1 0.0045 2.8E-06 33.4 9.8 123 2-156 1-131 (132) 16 protein:vir:2505 Length: 128 # 92.8 0.00021 1.3E-07 40.8 2.1 115 1-156 4-119 (128) 17 protein:vir:9761 Length: 140 # 92.4 0.00083 5.1E-07 37.5 4.8 127 1-150 1-140 (140) 18 protein:vir:78478 Length: 149 92.3 0.0036 2.2E-06 34.0 8.2 125 2-154 1-149 (149) 19 protein:vir:78254 Length: 149 92.3 0.0036 2.2E-06 34.0 8.2 125 2-154 1-149 (149) 20 protein:vir:7773 Length: 123 # 92.1 0.0015 9.1E-07 36.1 5.7 122 2-150 1-123 (123) 21 protein:vir:1640 Length: 132 # 91.5 0.0012 7.3E-07 36.6 4.6 127 1-155 1-132 (132) 22 protein:vir:81159 Length: 95 # 91.2 0.0099 6.1E-06 31.6 9.4 95 2-121 1-95 (95) 23 protein:vir:104088 Length: 125 91.2 0.0025 1.6E-06 34.8 6.1 123 2-150 1-125 (125) 24 protein:vir:2345 Length: 125 # 88.3 0.0097 6E-06 31.6 6.9 122 1-150 1-125 (125) 25 protein:vir:93592 Length: 108 87.4 0.026 1.6E-05 29.3 8.7 99 1-122 1-108 (108) 26 protein:vir:1329 Length: 122 # 84.0 0.035 2.2E-05 28.6 7.7 116 2-147 1-122 (122) 27 protein:vir:6243 Length: 122 # 81.0 0.04 2.5E-05 28.3 6.8 116 2-147 1-122 (122) 28 protein:vir:106583 Length: 105 80.2 0.067 4.2E-05 27.0 7.8 94 1-120 1-105 (105) 29 protein:vir:99002 Length: 158 70.3 0.042 2.6E-05 28.1 3.9 132 1-156 1-149 (158) 30 protein:vir:108221 Length: 150 66.4 0.17 0.0001 24.8 6.4 133 1-156 4-146 (150) 31 protein:vir:80389 Length: 172 60.4 0.18 0.00011 24.7 5.4 125 1-156 14-160 (172) 32 protein:vir:95176 Length: 172 56.0 0.16 0.0001 24.9 4.4 125 1-156 16-164 (172) 33 protein:vir:4788 Length: 130 # 55.2 0.48 0.0003 22.3 7.4 117 2-156 1-130 (130) 34 protein:vir:106596 Length: 128 52.3 0.51 0.00032 22.2 6.4 108 1-141 9-128 (128) 35 protein:vir:94955 Length: 170 50.1 0.17 0.0001 24.8 3.4 124 1-156 13-153 (170) 36 protein:vir:95004 Length: 169 49.0 0.39 0.00024 22.9 5.2 125 1-156 14-156 (169) 37 protein:vir:78383 Length: 169 46.3 0.44 0.00027 22.5 5.1 125 1-156 14-158 (169) 38 protein:vir:103957 Length: 110 40.1 0.99 0.00061 20.6 7.8 98 4-141 1-110 (110) 39 protein:vir:97145 Length: 110 40.1 0.99 0.00061 20.6 7.8 98 4-141 1-110 (110) 40 protein:vir:96221 Length: 110 40.1 0.99 0.00061 20.6 7.8 98 4-141 1-110 (110) 41 protein:vir:78849 Length: 110 40.1 0.99 0.00061 20.6 7.8 98 4-141 1-110 (110) 42 protein:vir:99796 Length: 110 40.1 0.99 0.00061 20.6 7.8 98 4-141 1-110 (110) 43 protein:vir:9311 Length: 110 # 40.1 0.99 0.00061 20.6 7.8 98 4-141 1-110 (110) 44 protein:vir:96390 Length: 110 40.1 0.99 0.00061 20.6 7.8 98 4-141 1-110 (110) 45 protein:vir:79701 Length: 144 36.9 0.3 0.00018 23.5 2.6 122 1-156 1-143 (144) 46 protein:vir:3615 Length: 110 # 36.6 1.1 0.0007 20.3 5.8 98 4-141 1-110 (110) 47 protein:vir:2738 Length: 112 # 35.4 1.2 0.00076 20.1 7.8 99 1-141 1-112 (112) 48 protein:vir:7410 Length: 107 # 34.3 1.3 0.00081 20.0 8.1 107 3-137 1-107 (107) 49 protein:vir:4831 Length: 105 # 32.5 1.4 0.00088 19.7 9.7 96 3-118 1-105 (105) 50 protein:vir:97267 Length: 172 31.2 0.88 0.00054 20.9 4.2 130 1-156 15-166 (172) 51 protein:vir:4904 Length: 113 # 31.1 1.5 0.00095 19.6 6.7 100 1-141 1-113 (113) 52 protein:vir:96488 Length: 113 30.2 1.6 0.00099 19.5 8.4 100 2-141 1-113 (113) 53 protein:vir:3846 Length: 126 # 29.4 1.7 0.001 19.4 9.0 125 1-138 1-126 (126) 54 protein:vir:100245 Length: 113 27.0 1.9 0.0012 19.1 9.5 101 2-124 1-113 (113) 55 protein:vir:9821 Length: 138 # 26.8 1.9 0.0012 19.0 8.1 119 1-156 5-133 (138) 56 protein:vir:192 Length: 108 # 26.0 2 0.0012 18.9 10.4 104 1-135 5-108 (108) 57 protein:vir:1887 Length: 108 # 26.0 2 0.0012 18.9 10.4 104 1-135 5-108 (108) No 1 >protein:vir:99848 Length: 172 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164077;genbank:gi:56692609;genbank:GeneID:3192576 Probab=100.00 E-value=1.3e-50 Score=294.10 Aligned_cols=145 Identities=23% Similarity=0.340 Sum_probs=126.0 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhccccc----------------------CccccccccHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNL----------------------NDMAGRTLDVAKIETAITFAEDILVGYSRA 58 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~----------------------~~~~~~~~D~~~v~~Al~dA~~~id~YL~~ 58 (156) |.||||.+||+++||++||+|||++.+. ++..++++|.++|++||+||+++|||||++ T Consensus 1 ~~mYaT~~dl~~r~g~~el~qLt~~~~~~~~~~~l~d~~~~~~~~~~~~~~~~~~g~~d~~~i~~Al~dA~aeIDgYL~~ 80 (172) T protein:vir:99 1 MAVYITLPELAERPGAEELSQAATPRPLQAVDSELLDALLRGLPVDRWTPEEIEVGHATVEVINSAVSDAQGYIDGFLQR 80 (172) T ss_pred CcccccHHHHHhhcCHHHHHHHhccccccCCHHHHHHHhhcchhhhhcccccccccccCHHHHHHHHHHHHHHHHHHHhc Confidence 9999999999999999999999988753 234578899999999999999999999999 Q ss_pred c-ccccccCCCccchHHHHHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccce Q lcl|NC_021557. 59 R-YAVIETLTPENTPQLVKGLIGDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTR 137 (156) Q Consensus 59 R-Y~~~~~lpl~~vp~~L~~~~~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~ 137 (156) | |. +||+++|.+|+++||||||||||.+++++.+.+|++++|||+||+|||+|++||++||++.+.++ ++++. T Consensus 81 R~Y~----lPL~~vP~~L~~~a~dIArY~L~~~r~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~--~~~~~ 154 (172) T protein:vir:99 81 RGYS----LPLAKRYGVVTGWTRAIARYLLHQDRLGPGAEKDPIVRDYRDALKFLQLIAEGKFSLGPDDPLTP--PGGGV 154 (172) T ss_pred cccc----CCCcccchHHHHHHHHHHHHHHHhccCCcccCCHHHHHHHHHHHHHHHHHhcCccccCCCCCCCC--CCCCc Confidence 9 85 79999999999999999999999999888888999999999999999999999999998765443 33455 Q ss_pred eeeecCcchhhhhhccccC Q lcl|NC_021557. 138 TEAVIPPSRIAGILYGWNS 156 (156) Q Consensus 138 v~~~~~~~r~~~~l~g~~~ 156 (156) ++++.++ |+|||+| T Consensus 155 ~~v~~~~-----r~F~rd~ 168 (172) T protein:vir:99 155 PQVLAPA-----RTFSHDT 168 (172) T ss_pred eeeecCC-----CccChhh Confidence 6665554 4555555 No 2 >protein:vir:79074 Length: 150 # NCBI annotation: gp10, conserved hypothetical protein # Family: family:all:348 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111210;genbank:gi:134288797;genbank:GeneID:4960748 Probab=100.00 E-value=3.5e-49 Score=286.20 Aligned_cols=144 Identities=18% Similarity=0.248 Sum_probs=125.6 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccC--ccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLN--DMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLI 79 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~--~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~ 79 (156) |+|||.+||+.+||+++|++||++.+.+ ++..+++|+++|++||+||+++|||||++||. +||.++|.+|+++| T Consensus 1 M~Y~T~~Dl~~r~ge~~l~~Ltd~~~~~~~~~~~~~~d~~~i~~Al~dA~~~IDgyL~~RY~----lPl~~vP~~L~~~a 76 (150) T protein:vir:79 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESAVRQAEEIVDAHLRGRYN----LPLSPVPTVIKDVT 76 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccccccccccccCHHHHHHHHHHHHHHHHHHHhhhcc----CCcccccHHHHHHH Confidence 9999999999999999999999876543 35668899999999999999999999999996 68899999999999 Q ss_pred HHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCcchhhhhhccccC Q lcl|NC_021557. 80 GDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPPSRIAGILYGWNS 156 (156) Q Consensus 80 ~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~~r~~~~l~g~~~ 156 (156) |||||||||.+++...+.+|++++|||+||+||++|++||++||+++++.+++++. +++..++ |+|||+| T Consensus 77 ~dIA~Y~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~--~~v~~~~-----r~f~r~~ 146 (150) T protein:vir:79 77 VNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSGPATPEPGE--MKVRARR-----RQFDADL 146 (150) T ss_pred HHHHHHHHHhcccCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCccCCCCCCc--eeeecCC-----CccChhh Confidence 99999999999987778899999999999999999999999999988776666554 4444343 4555555 No 3 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=100.00 E-value=9.3e-49 Score=283.91 Aligned_cols=146 Identities=18% Similarity=0.256 Sum_probs=125.1 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccC--ccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLN--DMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLI 79 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~--~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~ 79 (156) |+|||.+||+.+||++||+|||++.+.+ ++..+++|+++|++||+||+++|||||++||. +||.++|.+|+++| T Consensus 1 M~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgYL~~RY~----lPl~~vP~~L~~~a 76 (150) T protein:vir:10 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAHLRGRYN----LPLSPVPTVIKDVT 76 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHHHhhhcc----CCcccccHHHHHHH Confidence 9999999999999999999999876543 34567899999999999999999999999996 68899999999999 Q ss_pred HHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCcchhhhh--hccc Q lcl|NC_021557. 80 GDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPPSRIAGI--LYGW 154 (156) Q Consensus 80 ~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~~r~~~~--l~g~ 154 (156) ||||||+||.++++..+.+|++++|||+||+||++|++||++||+++++.+++++. +.+..++ |.|+| |.|| T Consensus 77 ~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~--~~v~~~~-r~f~r~~l~gf 150 (150) T protein:vir:10 77 VNLARHWLYARRPEGAALPDTVSQTFKASMHMLEKIRDNKLTIGDPSGPATPEPGE--MKVRARR-RQFDADLLERF 150 (150) T ss_pred HHHHHHHHHhcccccCCCCHHHHHHHHHHHHHHHHHhcCcccCCCCCCCCCCCCce--eeeecCC-CccChhhccCC Confidence 99999999999987778899999999999999999999999999987766665544 4444443 23322 4555 No 4 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=100.00 E-value=1.2e-47 Score=277.74 Aligned_cols=137 Identities=18% Similarity=0.238 Sum_probs=121.8 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGD 81 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~d 81 (156) |+|||.+||+++||+++|.+||++. ..++++|+++|++||++|+++|||||++||. +|+.++|.+|+++||| T Consensus 1 M~Y~T~~Dl~~~~ge~~l~~Lt~d~----~~~g~~d~~~i~~Al~dA~~eIdgyL~~RY~----lPl~~~P~~L~~~a~d 72 (141) T protein:vir:19 1 MNYATVNDLCARYTRTRLDILTRPK----TADGQPDDAVAEQALADASAFIDGYLAARFV----LPLTVVPSLLKRQCCV 72 (141) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcCC----CCccccCHHHHHHHHHHHHHHHHHHHhhccc----CCccccchHHHHHHHH Confidence 9999999999999999999999643 3467899999999999999999999999996 6899999999999999 Q ss_pred HHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCcchhhhhhccccC Q lcl|NC_021557. 82 VARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPPSRIAGILYGWNS 156 (156) Q Consensus 82 IArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~~r~~~~l~g~~~ 156 (156) ||||+||.++ .+|++++|||+||+|||+|++||++||++.++..++++.+.+++..++ ++|||+| T Consensus 73 IA~Y~L~~~~-----~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~~-----r~f~r~~ 137 (141) T protein:vir:19 73 VAWFYLNESQ-----PTEQITATYRDTVRWLEQVRDGKTDPGVESRTAASPEGEDLVQVQSDP-----PVFSRKQ 137 (141) T ss_pred HHHHHHhcCC-----CChHHHHHHHHHHHHHHHHhcCccccCCCCCCCCCCCCCceeEeecCC-----cccCccc Confidence 9999999764 579999999999999999999999999988776666667777776665 5777777 No 5 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=100.00 E-value=3.1e-46 Score=270.08 Aligned_cols=138 Identities=15% Similarity=0.174 Sum_probs=120.2 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGD 81 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~d 81 (156) |+|||.+||+++||+++|+|||++.+ ..++++|+++|++||+||+++|||||++||. +||.++|.+|+++||| T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~---~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~----lPl~~vP~~L~~~a~d 73 (138) T protein:vir:99 1 MSYCTLADLIEQYSEQKIREVSDRVN---KPATTIDTVIVDRAIADADSEIDLHLHGRYQ----LPLASVPTALKRIACG 73 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCc---cccCcccHHHHHHHHHHHHHHHHHHHhhccc----CCccccchHHHHHHHH Confidence 99999999999999999999997643 2468899999999999999999999999996 6899999999999999 Q ss_pred HHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCcchhhhhhccccC Q lcl|NC_021557. 82 VARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPPSRIAGILYGWNS 156 (156) Q Consensus 82 IArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~~r~~~~l~g~~~ 156 (156) ||+|+||.++. .+|.+++|||+||+||++|++||++||++.+...+ ++++.+++++++ |+||||= T Consensus 74 IA~Y~L~~~~~----~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~-~~~~~~~~~~~~-----r~F~Rd~ 138 (138) T protein:vir:99 74 LAYANLHIVLK----EENPVYKTAEHLRKLLSGIANGKLSLALDADGKPA-PVANTVQISEGR-----NDWGADW 138 (138) T ss_pred HHHHHHhcCCC----CcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCC-CCCCceeeecCC-----CCCCCCC Confidence 99999997652 35789999999999999999999999998765443 455667777775 4777777 No 6 >protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834 Probab=100.00 E-value=3.1e-46 Score=270.08 Aligned_cols=138 Identities=15% Similarity=0.174 Sum_probs=120.2 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGD 81 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~d 81 (156) |+|||.+||+++||+++|+|||++.+ ..++++|+++|++||+||+++|||||++||. +||.++|.+|+++||| T Consensus 1 M~YaT~~dl~~r~ge~~l~~Ltd~~~---~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~----lPl~~vP~~L~~~a~d 73 (138) T protein:vir:79 1 MSYCTLADLIEQYSEQKIREVSDRVN---KPATTIDTVIVDRAIADADSEIDLHLHGRYQ----LPLASVPTALKRIACG 73 (138) T ss_pred CCCCCHHHHHHhcCHHHHHHHhccCc---cccCcccHHHHHHHHHHHHHHHHHHHhhccc----CCccccchHHHHHHHH Confidence 99999999999999999999997643 2468899999999999999999999999996 6899999999999999 Q ss_pred HHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCcchhhhhhccccC Q lcl|NC_021557. 82 VARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPPSRIAGILYGWNS 156 (156) Q Consensus 82 IArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~~r~~~~l~g~~~ 156 (156) ||+|+||.++. .+|.+++|||+||+||++|++||++||++.+...+ ++++.+++++++ |+||||= T Consensus 74 IA~Y~L~~~~~----~~e~i~~rY~~Ai~~L~~Ia~Gk~~Lg~~~~~~~~-~~~~~~~~~~~~-----r~F~Rd~ 138 (138) T protein:vir:79 74 LAYANLHIVLK----EENPVYKTAEHLRKLLSGIANGKLSLALDADGKPA-PVANTVQISEGR-----NDWGADW 138 (138) T ss_pred HHHHHHhcCCC----CcHHHHHHHHHHHHHHHHHhcCcccCCCCCCCcCC-CCCCceeeecCC-----CCCCCCC Confidence 99999997652 35789999999999999999999999998765443 455667777775 4777777 No 7 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=100.00 E-value=6.4e-46 Score=268.36 Aligned_cols=138 Identities=17% Similarity=0.168 Sum_probs=119.2 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGD 81 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~d 81 (156) |+|||.+||+++||+++|.|||++.+ ..++++|+++|++||++|+++|||||++||. +||.++|.+|+++||| T Consensus 1 M~Y~T~~dl~~r~ge~~l~~Ltd~~~---~~~~~~d~~~i~~Al~dA~~eIdgyL~~RY~----lPl~~vP~~L~~~a~d 73 (138) T protein:vir:10 1 MSYCTQADLVEQYGEASIRQLSDRVN---KPATTIDPAVVAQAIADADAEIDLHLHARYQ----LPLAQVPVVLKRVACV 73 (138) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccC---CCcCccCHHHHHHHHHHHHHHHHHHHhhccc----CCccccchHHHHHHHH Confidence 99999999999999999999997643 3457899999999999999999999999996 6899999999999999 Q ss_pred HHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCcchhhhhhccccC Q lcl|NC_021557. 82 VARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPPSRIAGILYGWNS 156 (156) Q Consensus 82 IArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~~r~~~~l~g~~~ 156 (156) |||||||.++. .+|++++|||+||+|||+|++||++||++..+..++ +++.++++.++ |+||++= T Consensus 74 IA~Y~L~~~~~----~~e~~~~rY~~Ai~~L~~Ia~G~~~Lg~~~~~~~~~-~~~~~~~~s~~-----r~Fg~d~ 138 (138) T protein:vir:10 74 LAFANLHTQVK----DDHPAILDAERKRKLLGGISSGKLSLALTSSGTPAP-IANTVQISSQR-----NDFGGTW 138 (138) T ss_pred HHHHHHhcCCC----CChHHHHHHHHHHHHHHHHhcCcccCCCCCCcccCC-CCCceeeecCC-----ccCCCCC Confidence 99999997652 468999999999999999999999999987655543 34556676665 4666655 No 8 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=96.62 E-value=7.9e-05 Score=43.09 Aligned_cols=107 Identities=10% Similarity=0.002 Sum_probs=62.0 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCC-ccchHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTP-ENTPQLVKGLIG 80 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl-~~vp~~L~~~~~ 80 (156) |+|+|.+++.+.|+. ..+.++-....+..|+..||.+...|++-.-.-.+ +.+|..++..|| T Consensus 1 M~Y~d~~~Y~~~y~G-----------------~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~d~~~~~~~~~vk~A~c 63 (131) T protein:vir:80 1 MPYTTLEFYTNEYAG-----------------EHLEQDEFAKLLKHAERKIDSVTFYRIRKSGIEAFSEFIQHQIQLATC 63 (131) T ss_pred CCCCCHHHHHHhhCC-----------------CCCchhHHHHHHHHHHHHHHHHhcccccccccccCchhHHHHHHHHHH Confidence 999999999998862 23456668999999999999999999863110011 467889999999 Q ss_pred HHHHHHHHHhc----CCC-------------------CCCCHHHHHHHHHHHHHHHHHhcCccccCCCCC Q lcl|NC_021557. 81 DVARYRLRDKS----GGQ-------------------GQVETTVRERHDAAMSNIKAVATGKFELPIAGE 127 (156) Q Consensus 81 dIArY~L~~~~----~~~-------------------~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~ 127 (156) ..+-|.--... ... ...++.-...+++|+.||+. .|=+-=|+..- T Consensus 64 ~q~e~~~~~g~~~~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~--TGLlyrGV~~~ 131 (131) T protein:vir:80 64 NQIEYFKEAGGTSELAVSKPDNVSIGRTSISDSNFASTATSLNSGLVGSDVRSYLAH--TGLLYNGVGVR 131 (131) T ss_pred HHHHHHHHhhhhhhhcccccCeeeeCceEEeeccccchhhhhhhhhhHHHHHHHHhc--cCCeecCCCCC Confidence 99986543100 000 00000111134444444442 12111111111 No 9 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=96.36 E-value=0.00015 Score=41.59 Aligned_cols=106 Identities=9% Similarity=0.021 Sum_probs=62.2 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccc--cCCCccchHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIE--TLTPENTPQLVKGLI 79 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~--~lpl~~vp~~L~~~~ 79 (156) |+|+|.+.+.+.||. ..|.++-....+..|+..||.+...||...- .+| +.+|..++..| T Consensus 1 M~Y~d~~~Y~~~y~g-----------------~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~~~~~-~~~~~~vk~A~ 62 (131) T protein:vir:43 1 MPYTTLEFYNDEYAG-----------------EHLEQDEFDKLLKHAERKIDSVTFYRIRKGGIESFS-EFIQHQIQLAT 62 (131) T ss_pred CCCCCHHHHHHhhCC-----------------CCCCHhHHHHHHHHHHHHHHHHhcccccccCccccc-hhhHHHHHHHH Confidence 999999999988862 2355677899999999999999999996311 112 45788999999 Q ss_pred HHHHHHHHHHh----cCCCC-------------------CCCHHHHHHHHHHHHHHHHHhcCccccCCCCC Q lcl|NC_021557. 80 GDVARYRLRDK----SGGQG-------------------QVETTVRERHDAAMSNIKAVATGKFELPIAGE 127 (156) Q Consensus 80 ~dIArY~L~~~----~~~~~-------------------~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~ 127 (156) |..+-|.-... ....+ ...+.-..-+++|..||+. .|=+-=|+..- T Consensus 63 c~q~e~~~~~g~~s~~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~--TGLlyrGV~~~ 131 (131) T protein:vir:43 63 CNQIEYFKEAGGTSELAVSKPDNVSIGRTSISDSNFASTATSLNSGLIGSDVRSYLAH--TGLLYNGVGVR 131 (131) T ss_pred HHHHHHHHHhHHHhhhhccccCeeecCceEEeecccccchhhhchhhhHHHHHHHHhc--cCCeecCCCCC Confidence 99998664310 00000 0000001124444444441 12111111111 No 10 >protein:vir:98481 Length: 136 # NCBI annotation: ORF3 # Family: family:all:32440 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958281;genbank:gi:41057255;uniprot:Q38596;genbank:GeneID:2732865 Probab=95.85 E-value=0.0003 Score=39.87 Aligned_cols=119 Identities=16% Similarity=0.290 Sum_probs=64.5 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIG 80 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~ 80 (156) |-+|||++|++.|++.. ++ + .+-+.+.++.-|+|||.+|..+.-. ..++-|..++.++| T Consensus 1 M~~fAtv~Dl~~rw~~~----~~-----d----ee~~ra~~~~lL~dAS~~ir~~~p~--------~~~~~~~~~~~V~~ 59 (136) T protein:vir:98 1 MAAYATVEDYQARAAVT----LP-----D----GSPRRAQVEAYLDDASALMARHIPT--------GHTPDPGTLRAICV 59 (136) T ss_pred CCccCCHHHHHHHhccC----CC-----C----chhHHHHHHHHHHHHHHHHHHhCCC--------CCCCChhHHHHHHH Confidence 99999999999999731 11 0 1223456777899999999887521 22345899999999 Q ss_pred HHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCcc--------ccCCCCCcC---------CCCCccceeeeecC Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKF--------ELPIAGEPV---------NGEAGSTRTEAVIP 143 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~--------~Lg~~~~~~---------~~e~~~~~v~~~~~ 143 (156) ++++ ++. +.+. +..++. ---|-+.+.+ .|.+ .||.....- ...+|.- -...-| T Consensus 60 ~~V~-R~~-~np~-G~~s~T-aG~ys~s~t~-----~G~Lylt~~E~~~Lg~~rqr~~~~d~a~si~~~~~~~-~~~~dp 129 (136) T protein:vir:98 60 AVVR-RVM-ANPG-GYRQRT-IGQYAETLGE-----DGGLYLTEDEKGQLQPPDQTAPDADAAYSLDLDPGTR-AWVDDP 129 (136) T ss_pred HHHH-HHh-hCCC-Cccccc-chhHHHhhhc-----CCCcccChHHHHHhCCCCCcccccccceecccCCCcC-CcCCCC Confidence 9997 333 2332 333333 4468777766 4553 233321100 0001000 011123 Q ss_pred cchhhhhhccccC Q lcl|NC_021557. 144 PSRIAGILYGWNS 156 (156) Q Consensus 144 ~~r~~~~l~g~~~ 156 (156) ..| ||.- T Consensus 130 ~~~------~~~~ 136 (136) T protein:vir:98 130 AGC------GWPR 136 (136) T ss_pred CCC------CCCC Confidence 333 2333 No 11 >protein:vir:2432 Length: 124 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046834;genbank:gi:9630402;genbank:GeneID:1261576 Probab=94.80 E-value=0.00052 Score=38.59 Aligned_cols=123 Identities=13% Similarity=0.118 Sum_probs=68.3 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhccc-ccccCCCccchHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYA-VIETLTPENTPQLVKGLIG 80 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~-~~~~lpl~~vp~~L~~~~~ 80 (156) |+|||.+|+.++++. .||+ -..+.++.=|+|||..|-. |+. .+--..-+..|..++.++| T Consensus 1 ~~~At~~Dv~~rw~r----~Lt~-----------~E~~~ve~lL~dAs~~ir~----r~P~l~~~~~~~~~~~~v~~V~a 61 (124) T protein:vir:24 1 MAYATADDVVTLWAK----EPEP-----------EVMALIERRLEQVERMIRR----RIPDLDARVSSDIFRADLIDIEA 61 (124) T ss_pred CCCCCHHHHHHHhCC----CCCH-----------HHHHHHHHHHHHHHHHHHh----cCCCcchhcCCCCChhhHHHHHH Confidence 999999999999863 2221 1356788899999999874 553 1000111245788999999 Q ss_pred HHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCcchhhhh Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPPSRIAGI 150 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~~r~~~~ 150 (156) ++.. ++. +.+. + ...+-.-.|...+.+ ....|++-|--. +-..-.++...-.++..|+...+. T Consensus 62 ~~V~-R~~-rnP~-G-~~s~T~G~Ys~sl~~--~~~~g~Lylt~~-E~~~Lg~~r~~~~~~i~p~~~~~~ 124 (124) T protein:vir:24 62 DAVL-RLV-RNPE-G-YLSETDGAYTYQLQA--DLSQGKLVILDE-EWTTLGVNRLSRMSTLVPNIVMPT 124 (124) T ss_pred HHHH-HHh-hCCC-C-ceecccchhHHhhhh--cccCCceeeCHH-HHHhhCcccccceeEeecceeeCC Confidence 9987 444 3332 2 233334788888887 455677655221 111101111111122223333332 No 12 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=94.46 E-value=0.00091 Score=37.26 Aligned_cols=114 Identities=18% Similarity=0.227 Sum_probs=56.1 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccC-CC--ccchHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETL-TP--ENTPQLVKG 77 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~l-pl--~~vp~~L~~ 77 (156) |-+|||++|+..+|. +|+++ -.++++.-|++||.+|..=.-.++..+... +. ...+.++++ T Consensus 1 m~~fAtv~Dl~~r~r-----~L~~d-----------E~~ra~~LL~dAs~~iR~~~~~~~~~~~~~~~~~~d~~~~~~k~ 64 (132) T protein:vir:94 1 MNPFATVDDLTMLWR-----PLKGD-----------EKERAEKLLEIVSDTLREEADKVGRDLDVMISEKPSYFSSVVKS 64 (132) T ss_pred CCCcCCHHHHHHHhc-----cCChh-----------HHHHHHHHHHHHHHHHHHHHhhhccccccccCCCCccchhHHHH Confidence 999999999999985 34421 147899999999999975554444321111 11 124678999 Q ss_pred HHHHHHHHHHHHhcCCCCCCCHHH------HHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCc------- Q lcl|NC_021557. 78 LIGDVARYRLRDKSGGQGQVETTV------RERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPP------- 144 (156) Q Consensus 78 ~~~dIArY~L~~~~~~~~~~~e~v------~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~------- 144 (156) +||++++=-|-. +. + .+.+ .--|-+...|+ ...|. +.....+ T Consensus 65 V~~~~V~Ral~~--~~-~--~~g~tq~S~TaG~ys~S~T~~--np~G~------------------lylt~~e~~~LGl~ 119 (132) T protein:vir:94 65 VTVDIVARTLMT--ST-D--QEPMTQTTESALGYSVSGSYL--VPGGG------------------LFIKNSELSRLGLK 119 (132) T ss_pred HHHHHHHHHhcC--CC-C--CCCceeeeeecccceeeeeee--cCCCC------------------ceeChHHHHhhCCC Confidence 999999754421 10 0 0111 11111111110 11111 1111111 Q ss_pred -chhhhh-hcccc Q lcl|NC_021557. 145 -SRIAGI-LYGWN 155 (156) Q Consensus 145 -~r~~~~-l~g~~ 155 (156) .|.+.. ++|-| T Consensus 120 ~~r~~~i~~~~~~ 132 (132) T protein:vir:94 120 KQRFGVIDFYGND 132 (132) T ss_pred CCceEEEeecCCC Confidence 011111 22333 No 13 >protein:vir:4228 Length: 125 # NCBI annotation: predicted 14.0Kd protein # Family: family:all:2817 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039683;swissprot:sw:q05225;genbank:gi:9625449;uniprot:Q05225;genbank:GeneID:2942926 Probab=93.60 E-value=0.0012 Score=36.68 Aligned_cols=123 Identities=15% Similarity=0.130 Sum_probs=67.0 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCC-CccchHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLT-PENTPQLVKGLIG 80 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lp-l~~vp~~L~~~~~ 80 (156) |+|||.+|+.++++. .||+ -..+-|+.=|++|+..|-..+=. ++--+. .+..++.++.+++ T Consensus 1 m~~A~~eDV~a~w~r----~lt~-----------~e~~~v~~~L~~Ae~~Ir~riPd---L~~r~~~~~~~~~~v~~Vea 62 (125) T protein:vir:42 1 MAYATAEDVVTLWAK----EPEP-----------EVMALIERRLQQIERMIKRRIPD---LDVKAAASATFRADLIDIEA 62 (125) T ss_pred CCcccHhHHHHHhCC----CCCh-----------HHHHHHHHHHHHHHHHHHHhCCC---chhhhcccCcchhhHHHHHH Confidence 999999999999873 2221 14677888999999988554321 000011 2345677788877 Q ss_pred HHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCC-CcCCCCCccceeeeecCcchhhhh Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAG-EPVNGEAGSTRTEAVIPPSRIAGI 150 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~-~~~~~e~~~~~v~~~~~~~r~~~~ 150 (156) +..+ +|+.. + ++ ....-.-.|...+.+ +.+.|++.+--.. ..-.+.. .+. .+...|....+. T Consensus 63 ~aV~-Rv~RN-p-eG-y~s~T~G~Ys~~l~~--~~~~g~L~it~eEw~~L~p~~-~~g-~~~i~P~~~~~~ 125 (125) T protein:vir:42 63 DAVL-RLVRN-P-EG-YLSETDGAYTYQLQA--DLSQGKLTILDEEWEILGVNS-QKR-MAVIVPNVVMPT 125 (125) T ss_pred HHHH-HHHhC-C-Cc-cccccchhHHHhhhc--ccccCceeeCHHHHHhhCccc-ccc-ceeecccceeCC Confidence 7665 56532 2 22 222333778877776 6778887663211 1111111 111 122233333332 No 14 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=93.12 E-value=0.00035 Score=39.51 Aligned_cols=126 Identities=16% Similarity=0.122 Sum_probs=59.0 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhccc--ccccCCCccchHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYA--VIETLTPENTPQLVKGL 78 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~--~~~~lpl~~vp~~L~~~ 78 (156) |-+|||++|+..++. +|+.. ..++++.=|++||..|..=+-.... -.+..+.+..+..++.+ T Consensus 1 m~~fAtv~D~~~rwr-----~Lt~~-----------E~~ra~~LL~~As~~ir~~~p~~~~~l~~~~~~~~~~~~~~~~V 64 (131) T protein:vir:95 1 MENFATVEDLKKLWR-----ALKFD-----------EEKRAEALLEVVSHSLRVEAKKVGKDLDGLVATDPSFTMVVKSV 64 (131) T ss_pred CCccCCHHHHHHHhc-----CCCHH-----------HHHHHHHHHHHHHHHHHHhhhhccCCccccccCCccchHHHHHH Confidence 999999999999984 23321 2568999999999998765432111 01112223467899999 Q ss_pred HHHHHHHHHHHhcCCCCCCC--HHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCcchhhhh-hcccc Q lcl|NC_021557. 79 IGDVARYRLRDKSGGQGQVE--TTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPPSRIAGI-LYGWN 155 (156) Q Consensus 79 ~~dIArY~L~~~~~~~~~~~--e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~~r~~~~-l~g~~ 155 (156) ||++++.-|-...... ..+ -+-.-.|-+...|+ ...|.+-|.-. |-.. . .....|.+.. ++|-| T Consensus 65 ~~~~V~Ral~~~~~~~-G~tq~S~TaG~ys~S~t~~--~p~g~lylt~~------e~~~--L--Gl~~~r~~~i~~~~~~ 131 (131) T protein:vir:95 65 TVDVVARTLMTSTDQE-PMTQVAESALGYSFSGSYL--VPGGGLFIKDS------ELKR--L--GLKKQRYGVIDIYGTD 131 (131) T ss_pred HHHHHHHHhcCCCCCC-Cceeeeeecccceeeeeee--cCCCCceeChH------HHHH--h--CCCCCceeEEeeccCC Confidence 9999997763211000 000 01112222222221 11222111000 0000 0 0000111111 34444 No 15 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=93.08 E-value=0.0045 Score=33.43 Aligned_cols=123 Identities=17% Similarity=0.190 Sum_probs=68.9 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCC-ccchHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTP-ENTPQLVKGLIG 80 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl-~~vp~~L~~~~~ 80 (156) |+|+|.+.+.+..| ..++++..++-+..|+..||.+...||...-.-.+ +.++..++..+| T Consensus 1 M~Y~t~~~Y~~~~G------------------~~i~e~~F~~l~~rAs~~ID~iT~~ri~~~~~~~d~~~~~~~vk~A~c 62 (132) T protein:vir:98 1 MPYLTYEEFMDLNG------------------RDIDDKKFEKLLPKASAIIDGVTGHFYQKVDMEKDNAWRVNQFKLALC 62 (132) T ss_pred CCCCCHHHHHhhcC------------------CCCCHHHHHHHHHHHHHHHHHHhcccccCCCccccChHHHHHHHHHHH Confidence 99999999976433 23566779999999999999999999963211112 235667888888 Q ss_pred HHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCcchhhhhh-------cc Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPPSRIAGIL-------YG 153 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~~r~~~~l-------~g 153 (156) ..+-|. +. .|. . .++.+-.-+.-+.-|+.++..................+ ..+..-| .| T Consensus 63 ~qiey~-~~-~G~--~-------sae~~~~~~~S~svG~~Svs~~s~~~~~~~~~~~~~~~---~~a~~~L~~tGLLyrG 128 (132) T protein:vir:98 63 AQIEYF-DA-LGA--T-------TFEEINNSPQTFQAGRTSVSNASRYNPSGANESKPLVA---EDVYIYLQGTGLLFQG 128 (132) T ss_pred HHHHHH-Hh-ccc--h-------hhhhccCccceeeeCcEEEEeeccCCcccccccccchH---HHHHHHHhhcCCcccc Confidence 888754 22 111 0 13333344666788888876543221111111111111 1111112 22 Q ss_pred ccC Q lcl|NC_021557. 154 WNS 156 (156) Q Consensus 154 ~~~ 156 (156) =.| T Consensus 129 V~~ 131 (132) T protein:vir:98 129 VKT 131 (132) T ss_pred CCC Confidence 223 No 16 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=92.84 E-value=0.00021 Score=40.76 Aligned_cols=115 Identities=15% Similarity=0.238 Sum_probs=61.7 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIG 80 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~ 80 (156) --+++|.+|+..+++. .||++ +++.+..-|++|+.+|++||. +|.+ | .+.|..++++|| T Consensus 4 ~~alAtvdDv~~~lrr----~Lt~d-----------E~~~a~~Ll~eAsdlI~g~l~-~~~v----p-~~~p~~v~rVvA 62 (128) T protein:vir:25 4 CKALATSQDVKRALRR----DLTEA-----------EQTDLSELLAEATDLVVGYLH-PYPV----P-TPTPGPIKRVVA 62 (128) T ss_pred chhccCHHHHHHHhcC----CCCHH-----------HHHHHHHHHhcchheeeeecC-CCCC----C-CCCCchHHHHHH Confidence 3469999999999874 34321 356677789999999999997 4543 3 467889999999 Q ss_pred HHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecC-cchhhhhhccccC Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIP-PSRIAGILYGWNS 156 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~-~~r~~~~l~g~~~ 156 (156) .|+.=-| .+|++ ..+ -.+-+..|-++-++.. ..+++ .+--.+. ..++.+---|-+| T Consensus 63 ~ivarAl--tr~~~-~~p------------e~~S~TAgpfs~~ft~----~~~~~-g~yLTaa~k~~Lrp~R~~~~s 119 (128) T protein:vir:25 63 SMVAAVL--TRPTQ-ILP------------ETQSLTADGFGVTFTP----GGNSP-GPYLSAALKQRLRPYRTGMVA 119 (128) T ss_pred HHHHHHh--hCCCc-cCC------------CceeeecccccccccC----CCCCC-CceEcHHHHhhcccccceeeE Confidence 9987554 34432 222 1222233333222211 11111 1212211 2222221222222 No 17 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=92.42 E-value=0.00083 Score=37.48 Aligned_cols=127 Identities=10% Similarity=0.098 Sum_probs=62.3 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhc-ccccccCCCc-cchHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRAR-YAVIETLTPE-NTPQLVKGL 78 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~R-Y~~~~~lpl~-~vp~~L~~~ 78 (156) |-+|||++|+..++. +|+++ -.++++.-|++||..|....-.. +.++...+.. ..+.+++.+ T Consensus 1 m~~fATv~Dv~~rwr-----~Lt~d-----------E~~ra~~LL~dAS~~iR~~~p~~g~~~~~~~~~~~~~~~~~k~V 64 (140) T protein:vir:97 1 MGNFATTDDVILLWR-----PLSVD-----------ELKRANALLKVVSDTLRMEADKVGKDLDKTMVDKPYFVNVIKSV 64 (140) T ss_pred CCcCCCHHHHHHHhc-----CCCHh-----------HHHHHHHHHHHHHHHHHHhhhhccCCcchhcccCccchhHHHHH Confidence 999999999999985 23321 14688999999999998766421 2333223332 346788999 Q ss_pred HHHHHHHHHHHhcCCCCCCC--HHHHHHHHHHHHHHHHHhcCcccc--------CCCCCcCCCCCccceeee-ecCcchh Q lcl|NC_021557. 79 IGDVARYRLRDKSGGQGQVE--TTVRERHDAAMSNIKAVATGKFEL--------PIAGEPVNGEAGSTRTEA-VIPPSRI 147 (156) Q Consensus 79 ~~dIArY~L~~~~~~~~~~~--e~v~~rY~~Ai~~L~~Va~Gk~~L--------g~~~~~~~~e~~~~~v~~-~~~~~r~ 147 (156) ||+|.+=-|-... +....+ -+---.|.+...|+ ...|.+-| |+... .-+...... -...++- T Consensus 65 ~~~mV~Ral~~~~-d~~G~tq~S~TaG~ys~S~T~~--np~G~lylt~~e~~~LGl~~~----r~~~i~~~g~~~~~~~~ 137 (140) T protein:vir:97 65 TVDIVARTLMTST-QGEPMSQESQSALGYTWSGTYL--VPGGGLFIKDNELKRLGLKKQ----RYGGIELYGEIKRDNDY 137 (140) T ss_pred HHHHHHHHhcCCC-CCCcceeeeeeccchhheeeee--cCCCCceeChHHHHHhCCCCC----ceeeecccCccccCccc Confidence 9999874443110 111110 12334555555553 22333322 22100 000000000 0011222 Q ss_pred hhh Q lcl|NC_021557. 148 AGI 150 (156) Q Consensus 148 ~~~ 150 (156) |+| T Consensus 138 ~~~ 140 (140) T protein:vir:97 138 FDR 140 (140) T ss_pred ccC Confidence 222 No 18 >protein:vir:78478 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491587;genbank:gi:157786410;genbank:GeneID:5625630 Probab=92.32 E-value=0.0036 Score=33.98 Aligned_cols=125 Identities=14% Similarity=0.143 Sum_probs=64.7 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhccc-ccccCCCccchHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYA-VIETLTPENTPQLVKGLIG 80 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~-~~~~lpl~~vp~~L~~~~~ 80 (156) |+|||++|+.++++. .||. -..++++.-|++|+..|-. |+. ++--.|-+..-+.++.++| T Consensus 1 ~afAtv~Dve~rw~r----~LT~-----------eE~~~ae~lL~dAs~~IR~----~iP~La~~~~dp~~~a~v~~V~~ 61 (149) T protein:vir:78 1 MAYAEPSDVVARLGR----PLTD-----------DEETQVETFLEDAEIEIRS----RIPDLDDKAEDEDYLKRVIKVEA 61 (149) T ss_pred CCcCCHHHHHHHhCC----CCCH-----------HHHHHHHHHHHHHHHHHHH----hccccccccCCcchhhHHHHHHH Confidence 999999999999863 2331 1356789999999999876 432 1111121122257889999 Q ss_pred HHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccc--------cCCCCC---------------cCCCCCccce Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFE--------LPIAGE---------------PVNGEAGSTR 137 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~--------Lg~~~~---------------~~~~e~~~~~ 137 (156) ++.+= +. +.+. + ....-.-.|-+.+.+. ...|++- ||.... +.-|.=|+.. T Consensus 62 ~mV~R-~~-rnpe-G-~~S~T~G~YS~slt~~--np~G~LylT~~E~a~LG~~r~~G~~~i~p~~~~~~~~~~~~~~~~~ 135 (149) T protein:vir:78 62 SAVTR-LI-RNPD-G-YIGETDGNYSYQLNWR--LNTGAIEITDKEWAQLGLSKNVGVLNVRPKTPLERSGEYPAFGSVE 135 (149) T ss_pred HHHHH-Hh-cCCC-C-eeeeecchhhhhhhcc--CCCCceeeCHHHHHhhCCcccccceeecccCccccCCCCCccccee Confidence 99874 33 3332 2 2222336888877773 3334432 222111 1111111111 Q ss_pred eeeecCcchhhhhhccc Q lcl|NC_021557. 138 TEAVIPPSRIAGILYGW 154 (156) Q Consensus 138 v~~~~~~~r~~~~l~g~ 154 (156) -++. .+.-.-.+|| T Consensus 136 ~~~~---~~~~~~~~~~ 149 (149) T protein:vir:78 136 WQVF---QQSSPLYWGY 149 (149) T ss_pred eeee---eccCcccccC Confidence 1111 1122235666 No 19 >protein:vir:78254 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491668;genbank:gi:157786492;genbank:GeneID:5625732 Probab=92.32 E-value=0.0036 Score=33.98 Aligned_cols=125 Identities=14% Similarity=0.143 Sum_probs=64.7 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhccc-ccccCCCccchHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYA-VIETLTPENTPQLVKGLIG 80 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~-~~~~lpl~~vp~~L~~~~~ 80 (156) |+|||++|+.++++. .||. -..++++.-|++|+..|-. |+. ++--.|-+..-+.++.++| T Consensus 1 ~afAtv~Dve~rw~r----~LT~-----------eE~~~ae~lL~dAs~~IR~----~iP~La~~~~dp~~~a~v~~V~~ 61 (149) T protein:vir:78 1 MAYAEPSDVVARLGR----PLTD-----------DEETQVETFLEDAEIEIRS----RIPDLDDKAEDEDYLKRVIKVEA 61 (149) T ss_pred CCcCCHHHHHHHhCC----CCCH-----------HHHHHHHHHHHHHHHHHHH----hccccccccCCcchhhHHHHHHH Confidence 999999999999863 2331 1356789999999999876 432 1111121122257889999 Q ss_pred HHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccc--------cCCCCC---------------cCCCCCccce Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFE--------LPIAGE---------------PVNGEAGSTR 137 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~--------Lg~~~~---------------~~~~e~~~~~ 137 (156) ++.+= +. +.+. + ....-.-.|-+.+.+. ...|++- ||.... +.-|.=|+.. T Consensus 62 ~mV~R-~~-rnpe-G-~~S~T~G~YS~slt~~--np~G~LylT~~E~a~LG~~r~~G~~~i~p~~~~~~~~~~~~~~~~~ 135 (149) T protein:vir:78 62 SAVTR-LI-RNPD-G-YIGETDGNYSYQLNWR--LNTGAIEITDKEWAQLGLSKNVGVLNVRPKTPLERSGEYPAFGSVE 135 (149) T ss_pred HHHHH-Hh-cCCC-C-eeeeecchhhhhhhcc--CCCCceeeCHHHHHhhCCcccccceeecccCccccCCCCCccccee Confidence 99874 33 3332 2 2222336888877773 3334432 222111 1111111111 Q ss_pred eeeecCcchhhhhhccc Q lcl|NC_021557. 138 TEAVIPPSRIAGILYGW 154 (156) Q Consensus 138 v~~~~~~~r~~~~l~g~ 154 (156) -++. .+.-.-.+|| T Consensus 136 ~~~~---~~~~~~~~~~ 149 (149) T protein:vir:78 136 WQVF---QQSSPLYWGY 149 (149) T ss_pred eeee---eccCcccccC Confidence 1111 1122235666 No 20 >protein:vir:7773 Length: 123 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817607;genbank:gi:29566037;genbank:GeneID:1259231 Probab=92.06 E-value=0.0015 Score=36.12 Aligned_cols=122 Identities=17% Similarity=0.162 Sum_probs=63.6 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhccc-ccccCCCccchHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYA-VIETLTPENTPQLVKGLIG 80 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~-~~~~lpl~~vp~~L~~~~~ 80 (156) |+|||.+|+.++++. .||. -..+.++.=|+|||..|-. |+. ++--.+-+..-+.++.++| T Consensus 1 ~~~At~~Dv~ar~~r----~LT~-----------~E~~~ve~lL~dAs~~ir~----r~P~l~~~a~d~~~~~~~~~V~~ 61 (123) T protein:vir:77 1 MPYATASDVTSRWAR----QPTD-----------EETALINVRLADVERMIKR----RIPDLATKVTDPDYLEDLKQVEA 61 (123) T ss_pred CCcCCHHHHHHHhCC----CCCH-----------HHHHHHHHHHHHHHHHHHH----hccCcccccCCcchhHHHHHHHH Confidence 999999999999863 2331 1356788899999999876 443 1101121112267889999 Q ss_pred HHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCcchhhhh Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPPSRIAGI 150 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~~r~~~~ 150 (156) ++.. ++. +.+. + ....-.-.|.+.+.+ ....|++-|--.. -..-.++...+ ++..|....+. T Consensus 62 ~~V~-R~~-rnpe-G-~~s~T~G~ys~sl~~--a~~~g~Lylt~~E-~~~Lg~~~~~~-~~i~p~~~~~~ 123 (123) T protein:vir:77 62 DAVL-RLV-RNPE-G-YLSETDGNYTYMLRS--DLASGKLEIFPEE-WEILGYRRSRM-TVIVPNPVMPT 123 (123) T ss_pred HHHH-HHh-hCCC-C-ceecccchhhhhhcc--cCCCCcceeCHHH-HHhhcCCCCce-eEEeeceecCC Confidence 9886 444 3333 2 233333688888775 5566766442111 00000111111 11111111111 No 21 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=91.49 E-value=0.0012 Score=36.63 Aligned_cols=127 Identities=19% Similarity=0.213 Sum_probs=59.7 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccC--C-CccchHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETL--T-PENTPQLVKG 77 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~l--p-l~~vp~~L~~ 77 (156) |-+|||++|+..+|+ +|+.+ ..++++.-|++|+.+|..=+-.+..-+-.. | ....+..+++ T Consensus 1 m~~fAtv~Dv~~r~r-----~L~~~-----------E~~ra~~lL~dAs~~ir~~~p~~~~~l~a~~~e~~~~~~~~~~~ 64 (132) T protein:vir:16 1 MNPFATVDDLTMLWR-----PLKGD-----------EKERAEKLLEIVSDSLREEADKVGRDLYAMIAEKPSYFASVVKS 64 (132) T ss_pred CCccCCHHHHHHHhc-----CCCHh-----------HHHHHHHHHHHHHHHHHHhhhhhccccccccccccccchhHHHH Confidence 999999999999985 34321 246899999999999865443322110001 1 1234678999 Q ss_pred HHHHHHHHHHHHhcCCCC-CCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCcchhhhh-hcccc Q lcl|NC_021557. 78 LIGDVARYRLRDKSGGQG-QVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPPSRIAGI-LYGWN 155 (156) Q Consensus 78 ~~~dIArY~L~~~~~~~~-~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~~r~~~~-l~g~~ 155 (156) +||++++=-|-.-.+..+ .-..+---.|-+...|+ ...|.+-|.-..-..-+ . ..+|.+.. ++|-| T Consensus 65 V~~~~V~Ral~~~~~~~G~tq~S~TaG~ys~S~t~~--~p~G~lylt~~e~~~LG-~---------~~~r~~~i~~~~~~ 132 (132) T protein:vir:16 65 VTVDIVARTLMTSTDQEPMTQTTESALGYSVSGSYL--VPGGGLFIKNSELSRLG-L---------KKQRFGVIDFYGND 132 (132) T ss_pred HHHHHHHHHhcCCCCCCCceeeeeeccchheeeeee--cCCCcceeChHHHHhhC-C---------CCCceEEEeecCCC Confidence 999999855532110000 00012223344444443 22333222100000000 0 00111111 33334 No 22 >protein:vir:81159 Length: 95 # NCBI annotation: putative DNA packaging protein # Family: family:all:316 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285813;genbank:gi:148747734;genbank:GeneID:5247202 Probab=91.24 E-value=0.0099 Score=31.58 Aligned_cols=95 Identities=18% Similarity=0.161 Sum_probs=66.6 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGD 81 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~d 81 (156) ||++|.+++....- + ++ .-|.+.|+.-|+.|...|.+|++.++. ..|+.++..++- T Consensus 1 Mm~vtLee~K~~LR------I---D~-------d~dD~lI~~li~aA~~~i~~~~g~~~~--------~~~~~~~~Avl~ 56 (95) T protein:vir:81 1 MMIVTLEEVKNWLR------V---DF-------SDDDALITTLINAAEEYLKNATGTTFD--------ATNHLAKIFCMT 56 (95) T ss_pred CCcCCHHHHHHHcC------C---CC-------CcchHHHHHHHHHHHHHHHHhhccccc--------cCchHHHHHHHH Confidence 99999999875422 1 11 126788999999999999999987653 345677777777 Q ss_pred HHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccc Q lcl|NC_021557. 82 VARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFE 121 (156) Q Consensus 82 IArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~ 121 (156) ++-+| |..|......+.++-.-.+.-|..|+....|.-. T Consensus 57 lv~~~-YeNRe~~~~~~~~~p~~v~sll~~lr~~~~~~~~ 95 (95) T protein:vir:81 57 LIADW-YENRELVGRASDQVRPILQSILAQLTYAYGGETA 95 (95) T ss_pred HHHHH-HhhccccccccccccHHHHHHHHHhhhccccccC Confidence 77655 7767543333446777778777778776666654 No 23 >protein:vir:104088 Length: 125 # NCBI annotation: gp20 # Family: family:all:2817 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655599;genbank:gi:109392470;genbank:GeneID:4156956 Probab=91.23 E-value=0.0025 Score=34.84 Aligned_cols=123 Identities=11% Similarity=0.111 Sum_probs=65.7 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCC-CccchHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLT-PENTPQLVKGLIG 80 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lp-l~~vp~~L~~~~~ 80 (156) |+|||.+|+.++++. .||+ -..+.|+.=|++|+..|--.+=. +..-+. .+..++-++.+++ T Consensus 1 ma~A~~~Dv~~~w~r----~lT~-----------~E~~~v~~~L~~Ae~~Ir~riP~---L~~r~~a~~~~~~~v~~Vea 62 (125) T protein:vir:10 1 MAYANAQDVVTLWAK----EPEP-----------EVMELIERRLAQVERMIKRRIPN---LDLKVAADATFQADLIDIEA 62 (125) T ss_pred CCcCCHHHHHHHhCC----CCCH-----------HHHHHHHHHHHHHHHHHHHhCCC---hhhhhhcCCCccccHHHHHH Confidence 999999999999863 2221 14677888999999998654320 000011 3456667777766 Q ss_pred HHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCC-CcCCCCCccceeeeecCcchhhhh Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAG-EPVNGEAGSTRTEAVIPPSRIAGI 150 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~-~~~~~e~~~~~v~~~~~~~r~~~~ 150 (156) +..+ +|+. ++ ++ ....-.-.|.+.+.+ +.+.|++-|--.. ..-.+..+ +.. +...|+...+. T Consensus 63 ~aV~-Rv~r-NP-eG-y~s~T~G~Ys~~l~~--~~~~g~L~it~~Ew~~Lg~~r~-s~~-~~i~p~~~~~~ 125 (125) T protein:vir:10 63 DAVL-RLVR-NP-EG-YISETDGAYTYQLQT--DLSQGRLTILDDEWTTLGVNRL-SRM-SVIAPNIVMPT 125 (125) T ss_pred HHHH-HHhc-CC-Cc-ccccccchhHHhhhc--ccccCceeeCHHHHHhhccccc-cce-eeeecccccCC Confidence 6554 5653 22 22 233333677777776 6677877653211 11111111 111 22223333332 No 24 >protein:vir:2345 Length: 125 # NCBI annotation: gp15 # Family: family:all:2817 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075282;genbank:gi:12657869;genbank:GeneID:920134 Probab=88.26 E-value=0.0097 Score=31.62 Aligned_cols=122 Identities=11% Similarity=0.069 Sum_probs=68.0 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhccc-ccccCC-CccchHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYA-VIETLT-PENTPQLVKGL 78 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~-~~~~lp-l~~vp~~L~~~ 78 (156) |-.|+|.+|.+++++. .||+ -..+.|+.=|++|+..|- .|+. +.-.+. .+.-+..++.+ T Consensus 1 ma~~A~~eDV~a~w~R----~lt~-----------eE~~~V~~~L~~ae~~ir----rriPdL~~r~~~~~~~~~~v~~V 61 (125) T protein:vir:23 1 MATLATHEDVTAFWAR----TPTA-----------EEIVLINRRLAQAERMLL----RAIPELLIKASSDPVFRAEVIDI 61 (125) T ss_pred CCcccCHHHHHHHhCC----CCCH-----------HHHHHHHHHHHHHHHHHH----HhcCChhhhhcCCCcchhhHHHH Confidence 9999999999999863 2221 146778889999999987 4442 100011 34567888888 Q ss_pred HHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCC-CCcCCCCCccceeeeecCcchhhhh Q lcl|NC_021557. 79 IGDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIA-GEPVNGEAGSTRTEAVIPPSRIAGI 150 (156) Q Consensus 79 ~~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~-~~~~~~e~~~~~v~~~~~~~r~~~~ 150 (156) +++..+ +++. .+ ++ ....-.-.|-..+.| +++.|++-+--. ...-.+..++..+....+ ..+. T Consensus 62 ~a~~V~-Rv~r-nP-eG-y~seT~g~Yt~~l~~--~~~~g~L~it~~E~a~Lg~~~s~~~vi~p~~---~~p~ 125 (125) T protein:vir:23 62 EAEAVL-RLVR-NH-EG-YLSETDGNYTYMLQA--QDPNRKLEILPEEWEVLGIVRSGLGILVPTV---VLPS 125 (125) T ss_pred HHHHHH-HHhc-CC-CC-ccccccchhhhhhhc--cCCCCceeecHHHHHhhccccccceEEeece---ecCC Confidence 888766 5552 32 22 222333778877777 667788755221 111112222222222222 1111 No 25 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=87.38 E-value=0.026 Score=29.27 Aligned_cols=99 Identities=13% Similarity=0.176 Sum_probs=54.3 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhccc----ccccCCCccchHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYA----VIETLTPENTPQLVK 76 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~----~~~~lpl~~vp~~L~ 76 (156) ||+++|.++++...-- ++ .-|.+-|+.-|..|++.|-+||..... -.......++|..++ T Consensus 1 mm~~vtLeevK~hLRI---------d~-------d~dD~li~~~i~aA~~~v~~~l~~~~~~~~~~~~~~~~~~~~~~i~ 64 (108) T protein:vir:93 1 MTALLTLEEIKAHLRV---------DH-------DADDDMLMDKVRQATAVLLAYIQGSRDKVIREDGELIPGEALTRMK 64 (108) T ss_pred CCcCCCHHHHHHHcCC---------CC-------CcChHHHHHHHHHHHHHHHHHhccccccccccccccccccCChHHH Confidence 9999999999876331 11 126788999999999999999964321 111111224577788 Q ss_pred HHHHHHHHHHHHHhcCCCCCCCHHHHH-----HHHHHHHHHHHHhcCcccc Q lcl|NC_021557. 77 GLIGDVARYRLRDKSGGQGQVETTVRE-----RHDAAMSNIKAVATGKFEL 122 (156) Q Consensus 77 ~~~~dIArY~L~~~~~~~~~~~e~v~~-----rY~~Ai~~L~~Va~Gk~~L 122 (156) ..++-++=|| |..|...+. ..+.. -.+.-|.-+++ =.+ | T Consensus 65 ~AvLlLv~~~-YenRe~~~~--~~~~~~elP~~v~~Ll~~~R~---p~~-~ 108 (108) T protein:vir:93 65 GAAMRLTGML-YRNPDLAER--EELLQGELPFSVSVLIYDLRC---PTV-L 108 (108) T ss_pred HHHHHHHHHH-Hhccccccc--cccccccCCHHHHHHHHHccc---ccc-C Confidence 8888777765 766632111 01111 11211111111 111 1 No 26 >protein:vir:1329 Length: 122 # NCBI annotation: gp37 # Family: family:all:11657 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047928;swissprot:trembl:q9zxb0;genbank:gi:9631146;uniprot:Q9ZXB0;genbank:GeneID:2715909 Probab=83.96 E-value=0.035 Score=28.58 Aligned_cols=116 Identities=21% Similarity=0.322 Sum_probs=66.9 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGD 81 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~d 81 (156) |+|+|.++|.+.-|.++-. ....+.+..||+...+.+..|.+..+++.. .|.|..++-+... T Consensus 1 mayatieelraldglddsa--------------lfsdellsdaidfsvetveaycgrkwdtae----dptpetirwcvrt 62 (122) T protein:vir:13 1 MAYATIEELRALDGLDDSA--------------LFSDELLSDAIDFSVETVEAYCGRKWDTAE----DPTPETIRWCVRT 62 (122) T ss_pred CcchhhhhhhhhcCccchh--------------hhhhhhhhhhhhhhhhhhhhhhCcccCCcC----CCChhHHHHHHHH Confidence 9999999998776643222 234567888999999999999999997532 4578888877888 Q ss_pred HHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCC---Cc-CCCCCcc--ceeeeecCcchh Q lcl|NC_021557. 82 VARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAG---EP-VNGEAGS--TRTEAVIPPSRI 147 (156) Q Consensus 82 IArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~---~~-~~~e~~~--~~v~~~~~~~r~ 147 (156) +||-+.-+. +..--+.|+..-. -=|.+.|.... -+ .-||... +.-++..|.-+. T Consensus 63 larqyvldh----------vsripdralqlqs--efgsiqlaqaggnwrptslpevnaklnlyrvrlpfifm 122 (122) T protein:vir:13 63 LARQYVLDH----------VSRIPDRALQLQS--EFGSIQLAQAGGNWRPTSLPEVNAKLNLYRVRLPFIFM 122 (122) T ss_pred HHHHHHHHH----------hhhcchhhhhhhh--cccceeeeccCCCcccCcccccccceeeeeeecceeeC Confidence 998776543 2222223333211 22555553322 11 1233221 222333332211 No 27 >protein:vir:6243 Length: 122 # NCBI annotation: gp37 # Family: family:all:11657 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813697;swissprot:trembl:q859c0;genbank:gi:29366757;uniprot:Q859C0;genbank:GeneID:1258898 Probab=81.04 E-value=0.04 Score=28.28 Aligned_cols=116 Identities=20% Similarity=0.293 Sum_probs=65.7 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGD 81 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~d 81 (156) |+|+|.++|.+.-|- |+. ...-.+.+..||+...+.+.-|.++.+++.. .|.|.+++-+... T Consensus 1 mayatieelralegi------------dda--slfpdellsdaidfsvetvevycgqkwdtae----nptpevirwcvrt 62 (122) T protein:vir:62 1 MAYATIEELRALEGI------------DDA--SLFPDELLSDAIDFSVETVEVYCGQKWDTAE----NPTPEVIRWCVRT 62 (122) T ss_pred CccchhhhhHhhccc------------ccc--ccchhhhhhhhhhhhhhhhhhhcCcccCCcC----CCchHHHHHHHHH Confidence 999999998766442 221 2334567889999999999999999998532 4678888877788 Q ss_pred HHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCC---CCcC-CCCCcc--ceeeeecCcchh Q lcl|NC_021557. 82 VARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIA---GEPV-NGEAGS--TRTEAVIPPSRI 147 (156) Q Consensus 82 IArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~---~~~~-~~e~~~--~~v~~~~~~~r~ 147 (156) +||-+.-+. +..--+.|+..-. -=|.+.|... +-++ -||... +.-++..|.-+. T Consensus 63 larqyvldh----------vsripdralqlqs--efgsiqlaqaggtwrptslpevnaklnlyrvrlpfifm 122 (122) T protein:vir:62 63 LARQYVLDH----------VSRIPDRALQLQS--EFGSIQLAQAGGTWRPTSLPEVNAKLNLYRVRLPFIFM 122 (122) T ss_pred HHHHHHHHH----------hhhcchhhhhhhh--cccceeeeccCCccccCcCcccccceeeeEeecceeeC Confidence 998776543 2222223333221 2255555322 2111 233221 222333332211 No 28 >protein:vir:106583 Length: 105 # NCBI annotation: putative protein # Family: family:all:6481 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958586;genbank:gi:41179246;genbank:GeneID:2717119 Probab=80.24 E-value=0.067 Score=27.01 Aligned_cols=94 Identities=11% Similarity=0.182 Sum_probs=61.5 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIG 80 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~ 80 (156) || ..+.+...+- .|.. .+++.+...++.-|++|.+.+=+|+.. ..+|..|-.+++ T Consensus 1 ~~---~~~~~~e~ik-----~L~~-------~~d~~~DelL~~lieda~~~vl~y~nr----------~~ip~~l~~~v~ 55 (105) T protein:vir:10 1 ML---NVDQLTEIVS-----ALST-------RLENVNNALLTELVKESIAQVLDYTGQ----------KKLVGSMDIYVK 55 (105) T ss_pred CC---chHHHHHHHH-----HHhc-------cCCCchhHHHHHHHHHHHHHHHHHcCC----------cccchhHHHHHH Confidence 44 2333332221 1211 123457789999999999999999863 256889999999 Q ss_pred HHHHHHHHHhcCCCCCCCH-----------HHHHHHHHHHHHHHHHhcCcc Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVET-----------TVRERHDAAMSNIKAVATGKF 120 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e-----------~v~~rY~~Ai~~L~~Va~Gk~ 120 (156) ++|+.. |.|+|.++..++ .+-+-|.+.|+--++-.-|++ T Consensus 56 evav~~-fNR~G~EG~tS~SegGvS~sy~~~~~~~~~~~l~~yR~~~v~~~ 105 (105) T protein:vir:10 56 KLAVIN-YNRLGIEGETQRSEGGITNYLETGIPKDIRQGLNSYRIAKVKKL 105 (105) T ss_pred HHHHHH-hcccCCcccceeecCCeeeeeeccCcHHHHHHHHHHhhhcccCC Confidence 999977 678887555432 344667767766666666666 No 29 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=70.27 E-value=0.042 Score=28.12 Aligned_cols=132 Identities=17% Similarity=0.206 Sum_probs=60.9 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIG 80 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~ 80 (156) |-+|+|++||..|+-. .++ ..++--.+.++.-|++||++..-+=+.+.. +| +.+|..++.+|. T Consensus 1 ~~alasvee~~trl~~----------~lp--~~~~r~~a~a~~vLd~~S~~ar~~~gr~W~----~~-~daP~~vr~ivL 63 (158) T protein:vir:99 1 MAALVSVEEFTTFLRV----------PLP--EEGSEKYTQMEFLLTLASDWARELSCKPWL----LP-ADAPVTARGIIL 63 (158) T ss_pred CcceeeHhhhhhhhcc----------cCC--hhhhHHHHHHHHHHHHHHHHHHHhcCccCC----CC-CcchhHHHHHHH Confidence 9999999999998721 111 011111233334499999998776655431 22 468999999999 Q ss_pred HHHHHHHHHhcCCCCCCCH----HHHHH-----------HHHHHHHHHHHhcCccccCCCCCc-CCCCCccceeeee-cC Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVET----TVRER-----------HDAAMSNIKAVATGKFELPIAGEP-VNGEAGSTRTEAV-IP 143 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e----~v~~r-----------Y~~Ai~~L~~Va~Gk~~Lg~~~~~-~~~e~~~~~v~~~-~~ 143 (156) .-|+=.+- +++ ..+. +--.+ +++=++.|++...-+ =|+-+-. ..++.....-.+- .+ T Consensus 64 ~aa~R~~~--NP~--g~~~~~~G~~~~~~~~~g~~~~ffT~~E~~~L~r~~~s~--GG~~~~~ttR~d~~~~~~yv~v~~ 137 (158) T protein:vir:99 64 AASRREWN--NPK--RVSYVVKGPQSATFMQSAYPPGFFTDAEEAKLRSYGRST--GNWGVIETYRDDEEQLNGYLEVYP 137 (158) T ss_pred HHHHHHHh--cCC--ceEEeeecchhhhcccccCCCcccCHHHHHHHHHhhccc--CceeEEEeecCccccCCceecccC Confidence 88874432 221 1111 01111 233345555553222 1221111 1122111111111 11 Q ss_pred cchhhhhhccccC Q lcl|NC_021557. 144 PSRIAGILYGWNS 156 (156) Q Consensus 144 ~~r~~~~l~g~~~ 156 (156) ..-.|+ |||-+- T Consensus 138 ~GdpfP-~~~~~d 149 (158) T protein:vir:99 138 HGGLMP-VYHPDD 149 (158) T ss_pred CCCccc-ccCccc Confidence 111111 444444 No 30 >protein:vir:108221 Length: 150 # NCBI annotation: gp11 # Family: family:all:28004 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552340;genbank:gi:160700660;genbank:GeneID:5758941 Probab=66.42 E-value=0.17 Score=24.84 Aligned_cols=133 Identities=17% Similarity=0.119 Sum_probs=71.3 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIG 80 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~ 80 (156) |.+|+++++|.++|. -|++. -..+.+.=|.+|+..|.. |-.....-|++.-+.+.+.++| T Consensus 4 ~~pFadv~~lea~Wr-----pLt~~-----------E~~~Ae~LL~~As~~IR~----~~Pa~a~a~l~~dd~~A~~Vs~ 63 (150) T protein:vir:10 4 VTPFIDVSQFEAMFR-----PLGDG-----------ERLLAEVLLKAAAIRIRD----RVAAAGRAPLEPDDAMAILVSF 63 (150) T ss_pred CccccchhhhHhhhc-----ccChh-----------HHHHHHHHHHHHHHHHhh----cccccCCCCCCCCcchhHHHHH Confidence 889999999999987 34421 134455667888887765 3332223467778889999999 Q ss_pred HHHHHHHH--HhcCCCCCCCHHHH-----HHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecCc-ch--hhhh Q lcl|NC_021557. 81 DVARYRLR--DKSGGQGQVETTVR-----ERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIPP-SR--IAGI 150 (156) Q Consensus 81 dIArY~L~--~~~~~~~~~~e~v~-----~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~~-~r--~~~~ 150 (156) +|.+=-|- .+..+..+.+...- --|-+.-.-|.--..-|-.||++.... |+-|.. ...|.. .| -..+ T Consensus 64 ~vVk~Am~~~~e~~G~ss~S~T~G~rses~T~snPag~L~ft~~~k~lLGis~ta~-P~~~~~--~~df~~~~~~~~~~~ 140 (150) T protein:vir:10 64 EVTRDAMPPIPEMAGRTQYSITTDDRTEQATMATAAGLLDFNERHWSLLGISATAG-PEYGGM--GGDFGQLGRANPYPI 140 (150) T ss_pred HHHHHhccccccccccchhhhccccccccccccchhhhhhhhHHHHHHhCCCccCC-ccccCC--CcchhhhcCCCCcce Confidence 99986652 11111112111111 124445555666666677788854322 222211 111110 00 0123 Q ss_pred hccccC Q lcl|NC_021557. 151 LYGWNS 156 (156) Q Consensus 151 l~g~~~ 156 (156) +.|-+. T Consensus 141 ~~~~~~ 146 (150) T protein:vir:10 141 VIGSDA 146 (150) T ss_pred EecCCc Confidence 444333 No 31 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=60.40 E-value=0.18 Score=24.68 Aligned_cols=125 Identities=18% Similarity=0.171 Sum_probs=57.9 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHH----HHhhcccc-------------- Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVG----YSRARYAV-------------- 62 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~----YL~~RY~~-------------- 62 (156) -=+|+|.+|+.+.+... + -+++..-.+.||..|+..||+ |.+.|=.. T Consensus 14 anSYvt~~~a~aY~~~r--------g-------~~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~ 78 (172) T protein:vir:80 14 ANTYAGADFVIAYAQAR--------G-------VTVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPRHDAVVD 78 (172) T ss_pred ccccccHHHHHHHHHHc--------C-------CCcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccccCcccC Confidence 45799999997776431 2 123344579999999999999 33332100 Q ss_pred cccCCCccchHHHHHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeec Q lcl|NC_021557. 63 IETLTPENTPQLVKGLIGDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVI 142 (156) Q Consensus 63 ~~~lpl~~vp~~L~~~~~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~ 142 (156) ...+|...+|.-|+..||-+|.+.+ . +.+.. .... ..++ --|+| |.++.--.... +...+..+. T Consensus 79 g~~~~~~~IP~~v~~A~~elA~~~~-~--g~~~~--~~~~---~~~v-~~ekV--G~i~~eY~~~~-----~~~~~~~~~ 142 (172) T protein:vir:80 79 GFVIPSDVIPKELQSAVAAAVIEQV-N--GFELQ--QSQD---QWAV-RIEKV--DVIEVQYAAGG-----GGQSASANA 142 (172) T ss_pred cccccccchhHHHHHHHHHHHHHHh-c--CCccC--cCCC---Ccee-eEEec--cceEEeeeccc-----Ccccccccc Confidence 1124556799999999999996444 2 21111 1111 1111 11233 33332211110 011111111 Q ss_pred Ccc----hhhhhhccccC Q lcl|NC_021557. 143 PPS----RIAGILYGWNS 156 (156) Q Consensus 143 ~~~----r~~~~l~g~~~ 156 (156) |-. .+..-|.+|=. T Consensus 143 ~~~~~~~~v~~LL~p~l~ 160 (172) T protein:vir:80 143 PMKPTFPKIDALLNPLLV 160 (172) T ss_pred CCccchHHHHHHHhhhhc Confidence 111 11222222211 No 32 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=56.02 E-value=0.16 Score=24.90 Aligned_cols=125 Identities=13% Similarity=0.090 Sum_probs=60.6 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHH----Hhhcccc-------------- Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGY----SRARYAV-------------- 62 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~Y----L~~RY~~-------------- 62 (156) -=+|+|.+|+.+.+... .. .-..|++..+.||..|+..||+| .+.|=.. T Consensus 16 anSYvtv~ea~aY~~~r--------g~-----~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~ 82 (172) T protein:vir:95 16 ANSYVSVADARIYASNR--------GV-----ELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPRTGVFLN 82 (172) T ss_pred ccccccHHHHHHHHHhc--------CC-----cCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCcCCcccC Confidence 45799999998875532 10 11236777899999999999985 2221100 Q ss_pred cccCCCccchHHHHHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeec Q lcl|NC_021557. 63 IETLTPENTPQLVKGLIGDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVI 142 (156) Q Consensus 63 ~~~lpl~~vp~~L~~~~~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~ 142 (156) ...+|...+|.-|+..||-+|.+-+ . +..-+.+..-.+ .+ --++| |.++.--......+ ..-.+ T Consensus 83 ~~~v~~~~IP~~V~~A~~elA~~~~-~--~~~~~~~~~~~~----~v-k~~kV--G~I~veY~~~~~~~------~~~~~ 146 (172) T protein:vir:95 83 EDEVPSNVIPKSLIAAQVQLTMAIN-A--GFDLQPNVSPQD----YV-TREKV--GPIETEYADPLSVG------IMPTF 146 (172) T ss_pred cccccccchhHHHHHHHHHHHHHHH-c--CccccccCCccc----ce-eEEec--cceEEeeccCCCCC------CcccH Confidence 1113556789999999999996433 2 111111111111 11 01222 44443221111010 01111 Q ss_pred Ccchhhhhhccc------cC Q lcl|NC_021557. 143 PPSRIAGILYGW------NS 156 (156) Q Consensus 143 ~~~r~~~~l~g~------~~ 156 (156) .-+..-|.|+ |+ T Consensus 147 --~~v~~LL~p~l~~~~~~~ 164 (172) T protein:vir:95 147 --TAANALLAPLFGECASNK 164 (172) T ss_pred --HHHHHHHhhhhcccCCcc Confidence 1233334444 33 No 33 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=55.24 E-value=0.48 Score=22.31 Aligned_cols=117 Identities=15% Similarity=0.016 Sum_probs=51.0 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccc--cccCCCccchHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAV--IETLTPENTPQLVKGLI 79 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~--~~~lpl~~vp~~L~~~~ 79 (156) |+|+|.+++.+.-|+ . .+-.+.=+..|+..||.+...+|.. .+.-+...+-..+++.. T Consensus 1 M~YlT~eey~el~~~------------------~--~~~F~kl~k~A~~~ID~~t~~~y~~~~~~~~~~~~r~~~vK~A~ 60 (130) T protein:vir:47 1 MTYLTQEEFDELDFD------------------E--VTDFEKLAKRAKIAIDLYTNGIYQKDIDFEKEIAYRKSAVKLAM 60 (130) T ss_pred CCCCchhhHhhcCCC------------------C--hhhHHHHHHHHHHHHHHHhcccccccCCccCcchHHHHHHHHHH Confidence 999999999754332 1 1127888999999999999999952 22233333444444433 Q ss_pred HHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCC-CCccceeeeecCcchhhhhhc----c- Q lcl|NC_021557. 80 GDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNG-EAGSTRTEAVIPPSRIAGILY----G- 153 (156) Q Consensus 80 ~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~-e~~~~~v~~~~~~~r~~~~l~----g- 153 (156) |-=.-|. .. .| .... +. -.-..-|.-|+.++--...+.+. +.+. ..+ . .+-.-|. | T Consensus 61 a~QieY~-~~-~G-~~s~-~~--------~~~~~S~svGrtSis~~~~~~~~~~~~~---~vs-~--da~~~L~~tGL~L 122 (130) T protein:vir:47 61 AFQIAYL-DA-SG-IMSA-DD--------KQLANSVSIGRTSISYSTSQSTLAGQRF---NLS-M--DAENALRQAGFSL 122 (130) T ss_pred HHHHHHH-HH-hc-cccc-hh--------ccCcceeeecceeeecCcCccccccCCc---ccc-H--HHHHHHHhccccc Confidence 3322222 11 11 1111 11 12233444555444322211111 1111 010 0 0000010 0 Q ss_pred cc-----C Q lcl|NC_021557. 154 WN-----S 156 (156) Q Consensus 154 ~~-----~ 156 (156) |. - T Consensus 123 y~GV~yd~ 130 (130) T protein:vir:47 123 VVGVAYDR 130 (130) T ss_pred ccCCCccC Confidence 00 0 No 34 >protein:vir:106596 Length: 128 # NCBI annotation: ORF042 # Family: family:all:372 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239495;genbank:gi:66395254;genbank:GeneID:4555750 Probab=52.28 E-value=0.51 Score=22.19 Aligned_cols=108 Identities=13% Similarity=0.131 Sum_probs=55.6 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIG 80 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~ 80 (156) -.-+...++|.+.. +.+.-+.+. ++ ......++..|++|...|-.||.... ..+|..|.-++. T Consensus 9 ~~~~~~~~~~m~~L--e~vK~~LgI---~d----~~~D~lL~~lI~~a~~~i~~~l~~~~--------~~iP~~L~~Iv~ 71 (128) T protein:vir:10 9 SLLQLNSGEVMNYL--DDVKSRIGL---ND----NEQDKQLNSIINNVAAELLSRLPVDT--------ISIPDKLQFIVV 71 (128) T ss_pred chheecHHHHHHHH--HHHHHHhCC---CC----cchhhHHHHHHHHHHHHHHHHcCCCh--------hhhhhhHHHHHH Confidence 01122222222211 111111211 11 12245799999999999999998432 247899999999 Q ss_pred HHHHHHHHHhcCCCCCCCHH-----------HHHHHHHHH-HHHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETT-----------VRERHDAAM-SNIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~-----------v~~rY~~Ai-~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) ++|+-+ |.|+++++..++. .-.-|++-| +|++. .|+ .+.+.|+|- T Consensus 72 evaVkr-yNR~g~EG~~S~SeeG~S~tf~dnd~~~Y~~~L~~y~~~--~~~-------------~~kG~v~F~ 128 (128) T protein:vir:10 72 EVSTKR-YNRIGAEGMSTDSQDGRSNTFERNDFEEYQSIIDALYPK--LDS-------------SERGSVNFY 128 (128) T ss_pred HHHHHH-hcccCccCcceeeeCceeeeeccCCcchhHHHHHHHHhh--ccC-------------CCCCceeeC Confidence 999866 7778877655432 233454333 23332 111 111223332 No 35 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=50.11 E-value=0.17 Score=24.85 Aligned_cols=124 Identities=18% Similarity=0.158 Sum_probs=65.1 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHH---HHhhcccc--------------c Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVG---YSRARYAV--------------I 63 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~---YL~~RY~~--------------~ 63 (156) -=+|+|.+|...-+...-. . ......|++..+.||..|+..||+ |++.|=.. . T Consensus 13 AnSYvtv~ea~aY~~~r~~---~-------~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~~dg 82 (170) T protein:vir:94 13 ANSYVTVAEANSYFDGSYG---R-------PLWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNAVIGG 82 (170) T ss_pred ccceecHHHHHHHHHhhcc---c-------cccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhcccccCcccCc Confidence 4689999999765543211 1 112356888999999999999996 33333110 1 Q ss_pred ccCCCccchHHHHHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeecC Q lcl|NC_021557. 64 ETLTPENTPQLVKGLIGDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVIP 143 (156) Q Consensus 64 ~~lpl~~vp~~L~~~~~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~~ 143 (156) ..+|-..+|..|+..||-+|.+.+- +.... ... +.+++ -++| |.++..-.... +..+..+ T Consensus 83 ~~~~~~~IP~~V~~Aq~elA~~~~~---~~~~~--~~~----~~~v~-~~kV--G~i~veY~~~~-~~~~~~~------- 142 (170) T protein:vir:94 83 MTLSQVSIPVKVKIAVFELAYFMLE---SGAAL--SFA----DQTID-SVKV--GTIRVEFTKNS-TDAGLPT------- 142 (170) T ss_pred cccccchhhHHHHHHHHHHHHHHHh---CcccC--ccc----cccee-eEec--ceeEEEecCCC-CCCccHH------- Confidence 1235567999999999999986662 11111 111 11222 2344 66655442111 1111111 Q ss_pred cchhhhhhccccC Q lcl|NC_021557. 144 PSRIAGILYGWNS 156 (156) Q Consensus 144 ~~r~~~~l~g~~~ 156 (156) .++.-|.|+=+ T Consensus 143 --~v~~LL~p~l~ 153 (170) T protein:vir:94 143 --FVEAMLSGFGS 153 (170) T ss_pred --HHHHHhhhhhc Confidence 22222322221 No 36 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=49.03 E-value=0.39 Score=22.85 Aligned_cols=125 Identities=11% Similarity=0.008 Sum_probs=60.7 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHH----Hhhccc--ccc---------- Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGY----SRARYA--VIE---------- 64 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~Y----L~~RY~--~~~---------- 64 (156) -=+|+|.+|+.+.+... +. +-..|+...+.+|..|+..||+| .+.|=. -++ T Consensus 14 anSYvt~~ea~aY~~~r--------g~-----~~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~ 80 (169) T protein:vir:95 14 ADSYVSLEDGRALAAKY--------GL-----ELPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPRTGIDLH 80 (169) T ss_pred ccccccHHHHHHHHHHc--------CC-----cCCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhccccCCceec Confidence 33699999998876542 11 11236778999999999999983 333211 011 Q ss_pred --cCCCccchHHHHHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeec Q lcl|NC_021557. 65 --TLTPENTPQLVKGLIGDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVI 142 (156) Q Consensus 65 --~lpl~~vp~~L~~~~~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~ 142 (156) .+|...+|.-++..||-+|.+.+- +.....+.. ..+ + .+..-.|.++..-... +.......+ T Consensus 81 g~~~~~~~IP~~V~~A~~elA~~~~~---g~~~~~~~~-~~~----v--~~e~v~G~i~veY~~~------~~~~~~~~~ 144 (169) T protein:vir:95 81 GFPQPSNVIPSLVIQAQVMAAVEYGA---GTDVRGSTD-GRE----V--QTERVEGAVTVSYFKN------GYSGGTVSI 144 (169) T ss_pred ccccccccchHHHHHHHHHHHHHHHc---CccccCCCC-ccc----e--eeeeeccceeEeecCC------CCcCccccH Confidence 135677999999999999987773 111111110 010 0 0001124443322111 111111111 Q ss_pred CcchhhhhhccccC Q lcl|NC_021557. 143 PPSRIAGILYGWNS 156 (156) Q Consensus 143 ~~~r~~~~l~g~~~ 156 (156) | .++.-|.++=. T Consensus 145 ~--a~~~LL~p~l~ 156 (169) T protein:vir:95 145 T--AADDALRPLLC 156 (169) T ss_pred H--HHHHhhhhhcc Confidence 1 12222222211 No 37 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=46.26 E-value=0.44 Score=22.54 Aligned_cols=125 Identities=11% Similarity=-0.004 Sum_probs=59.1 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHH----Hhhcccc--c----------- Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGY----SRARYAV--I----------- 63 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~Y----L~~RY~~--~----------- 63 (156) -=+|+|.+|+.+.+...-. .-..|+...+.||..|+..||+| ++.|=.. + T Consensus 14 anSYvtv~~a~aY~~~rg~-------------~~~~d~~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~ 80 (169) T protein:vir:78 14 ADSYVSLEDGRALAAKYGL-------------ELPEDDTAAEAALRNGAVYVGLFESQMCGRRVSANQALAFPRTGVTLH 80 (169) T ss_pred ccccccHHHHHHHHHHcCC-------------cCCCChHHHHHHHHHHHHHhhhccccceeeeCCcccccccccCCceec Confidence 3369999999888653211 11236788999999999999973 3333210 1 Q ss_pred -ccCCCccchHHHHHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeec Q lcl|NC_021557. 64 -ETLTPENTPQLVKGLIGDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVI 142 (156) Q Consensus 64 -~~lpl~~vp~~L~~~~~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~ 142 (156) ..+|...+|.-++..||-+|.+.|- +.....++ -.++ ++ .|+ -.|.++.--.... +.++.. .+ T Consensus 81 g~~~~~~~IP~~v~~A~~elA~~~~~---g~~~~~~~-~~~~----v~-~e~-v~G~i~veY~~~~----~~~~~~--~~ 144 (169) T protein:vir:78 81 GFPQPSNVIPPLVIQAQVMAAVEYGA---GTDVRGST-DGRE----VQ-TER-VEGAVTVSYFKNG----YSGGTV--SI 144 (169) T ss_pred ccccccccchHHHHHHHHHHHHHHhc---CcccCCCC-Ccce----eE-EEE-ecCceeEeecCCC----CCCCcc--cH Confidence 1234567999999999999986662 21111111 0011 00 000 0133322111111 001111 11 Q ss_pred Ccchhhhhhcccc--C Q lcl|NC_021557. 143 PPSRIAGILYGWN--S 156 (156) Q Consensus 143 ~~~r~~~~l~g~~--~ 156 (156) | .++.-|.+|= | T Consensus 145 ~--~~~~LL~p~l~~~ 158 (169) T protein:vir:78 145 T--TADDALRPLLCGS 158 (169) T ss_pred H--HHHHHhhhhcccC Confidence 1 1111222221 0 No 38 >protein:vir:103957 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873994;genbank:gi:118430769;genbank:GeneID:4525451 Probab=40.10 E-value=0.99 Score=20.62 Aligned_cols=98 Identities=17% Similarity=0.229 Sum_probs=57.1 Q ss_pred cCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHHHH Q lcl|NC_021557. 4 FLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGDVA 83 (156) Q Consensus 4 YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~dIA 83 (156) -.+.+++..+.| +++ ......++..|++|...|-.||+-.. ..+|..|.-+++++| T Consensus 1 M~~L~~vK~~lg------I~d----------~~~D~lL~~ii~~a~~~i~~~l~~~~--------~~iP~~l~~iv~ev~ 56 (110) T protein:vir:10 1 MTTLADVKKRIG------LKD----------EKQDEQLEEIIKSCESQLLSMLPIEV--------EQIPERFSYMIKEVA 56 (110) T ss_pred CchHHHHHHHhC------CCC----------CchhHHHHHHHHHHHHHHHHHhccch--------hhhhhHHHHHHHHHH Confidence 234555655544 111 12345799999999999999996222 358999999999999 Q ss_pred HHHHHHhcCCCCCCCHH-----------HHHHHHHHHH-HHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 84 RYRLRDKSGGQGQVETT-----------VRERHDAAMS-NIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 84 rY~L~~~~~~~~~~~e~-----------v~~rY~~Ai~-~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) +.+ |.|+++++..++. .-.-|++-|. |++. .|+ .+.+.|+|- T Consensus 57 vkr-yNR~g~EG~~S~S~eG~S~sf~d~d~~~y~~~l~~y~~~--~~~-------------~~kG~v~Fl 110 (110) T protein:vir:10 57 VKR-YNRIGAEGMTSEAVDGRSNAYELNDFKEYEAIIDNYFNA--RTR-------------TKKGRAVFF 110 (110) T ss_pred HHH-hcccCccccceeecCceeeeecccccchHHHHHHHHHhh--cCC-------------CCCceeeeC Confidence 876 6778876655432 2344554443 3221 111 111223332 No 39 >protein:vir:97145 Length: 110 # NCBI annotation: ORF049 # Family: family:all:372 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239728;genbank:gi:66394913;genbank:GeneID:5130878 Probab=40.10 E-value=0.99 Score=20.62 Aligned_cols=98 Identities=17% Similarity=0.229 Sum_probs=57.1 Q ss_pred cCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHHHH Q lcl|NC_021557. 4 FLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGDVA 83 (156) Q Consensus 4 YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~dIA 83 (156) -.+.+++..+.| +++ ......++..|++|...|-.||+-.. ..+|..|.-+++++| T Consensus 1 M~~L~~vK~~lg------I~d----------~~~D~lL~~ii~~a~~~i~~~l~~~~--------~~iP~~l~~iv~ev~ 56 (110) T protein:vir:97 1 MTTLADVKKRIG------LKD----------EKQDEQLEEIIKSCESQLLSMLPIEV--------EQIPERFSYMIKEVA 56 (110) T ss_pred CchHHHHHHHhC------CCC----------CchhHHHHHHHHHHHHHHHHHhccch--------hhhhhHHHHHHHHHH Confidence 234555655544 111 12345799999999999999996222 358999999999999 Q ss_pred HHHHHHhcCCCCCCCHH-----------HHHHHHHHHH-HHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 84 RYRLRDKSGGQGQVETT-----------VRERHDAAMS-NIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 84 rY~L~~~~~~~~~~~e~-----------v~~rY~~Ai~-~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) +.+ |.|+++++..++. .-.-|++-|. |++. .|+ .+.+.|+|- T Consensus 57 vkr-yNR~g~EG~~S~S~eG~S~sf~d~d~~~y~~~l~~y~~~--~~~-------------~~kG~v~Fl 110 (110) T protein:vir:97 57 VKR-YNRIGAEGMTSEAVDGRSNAYELNDFKEYEAIIDNYFNA--RTR-------------TKKGRAVFF 110 (110) T ss_pred HHH-hcccCccccceeecCceeeeecccccchHHHHHHHHHhh--cCC-------------CCCceeeeC Confidence 876 6778876655432 2344554443 3221 111 111223332 No 40 >protein:vir:96221 Length: 110 # NCBI annotation: ORF044 # Family: family:all:372 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239573;genbank:gi:66395333;genbank:GeneID:5132767 Probab=40.10 E-value=0.99 Score=20.62 Aligned_cols=98 Identities=17% Similarity=0.229 Sum_probs=57.1 Q ss_pred cCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHHHH Q lcl|NC_021557. 4 FLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGDVA 83 (156) Q Consensus 4 YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~dIA 83 (156) -.+.+++..+.| +++ ......++..|++|...|-.||+-.. ..+|..|.-+++++| T Consensus 1 M~~L~~vK~~lg------I~d----------~~~D~lL~~ii~~a~~~i~~~l~~~~--------~~iP~~l~~iv~ev~ 56 (110) T protein:vir:96 1 MTTLADVKKRIG------LKD----------EKQDEQLEEIIKSCESQLLSMLPIEV--------EQIPERFSYMIKEVA 56 (110) T ss_pred CchHHHHHHHhC------CCC----------CchhHHHHHHHHHHHHHHHHHhccch--------hhhhhHHHHHHHHHH Confidence 234555655544 111 12345799999999999999996222 358999999999999 Q ss_pred HHHHHHhcCCCCCCCHH-----------HHHHHHHHHH-HHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 84 RYRLRDKSGGQGQVETT-----------VRERHDAAMS-NIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 84 rY~L~~~~~~~~~~~e~-----------v~~rY~~Ai~-~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) +.+ |.|+++++..++. .-.-|++-|. |++. .|+ .+.+.|+|- T Consensus 57 vkr-yNR~g~EG~~S~S~eG~S~sf~d~d~~~y~~~l~~y~~~--~~~-------------~~kG~v~Fl 110 (110) T protein:vir:96 57 VKR-YNRIGAEGMTSEAVDGRSNAYELNDFKEYEAIIDNYFNA--RTR-------------TKKGRAVFF 110 (110) T ss_pred HHH-hcccCccccceeecCceeeeecccccchHHHHHHHHHhh--cCC-------------CCCceeeeC Confidence 876 6778876655432 2344554443 3221 111 111223332 No 41 >protein:vir:78849 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285363;genbank:gi:148717891;genbank:GeneID:5246980 Probab=40.10 E-value=0.99 Score=20.62 Aligned_cols=98 Identities=17% Similarity=0.229 Sum_probs=57.1 Q ss_pred cCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHHHH Q lcl|NC_021557. 4 FLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGDVA 83 (156) Q Consensus 4 YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~dIA 83 (156) -.+.+++..+.| +++ ......++..|++|...|-.||+-.. ..+|..|.-+++++| T Consensus 1 M~~L~~vK~~lg------I~d----------~~~D~lL~~ii~~a~~~i~~~l~~~~--------~~iP~~l~~iv~ev~ 56 (110) T protein:vir:78 1 MTTLADVKKRIG------LKD----------EKQDEQLEEIIKSCESQLLSMLPIEV--------EQIPERFSYMIKEVA 56 (110) T ss_pred CchHHHHHHHhC------CCC----------CchhHHHHHHHHHHHHHHHHHhccch--------hhhhhHHHHHHHHHH Confidence 234555655544 111 12345799999999999999996222 358999999999999 Q ss_pred HHHHHHhcCCCCCCCHH-----------HHHHHHHHHH-HHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 84 RYRLRDKSGGQGQVETT-----------VRERHDAAMS-NIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 84 rY~L~~~~~~~~~~~e~-----------v~~rY~~Ai~-~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) +.+ |.|+++++..++. .-.-|++-|. |++. .|+ .+.+.|+|- T Consensus 57 vkr-yNR~g~EG~~S~S~eG~S~sf~d~d~~~y~~~l~~y~~~--~~~-------------~~kG~v~Fl 110 (110) T protein:vir:78 57 VKR-YNRIGAEGMTSEAVDGRSNAYELNDFKEYEAIIDNYFNA--RTR-------------TKKGRAVFF 110 (110) T ss_pred HHH-hcccCccccceeecCceeeeecccccchHHHHHHHHHhh--cCC-------------CCCceeeeC Confidence 876 6778876655432 2344554443 3221 111 111223332 No 42 >protein:vir:99796 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004309;genbank:gi:122891763;genbank:GeneID:4712351 Probab=40.10 E-value=0.99 Score=20.62 Aligned_cols=98 Identities=17% Similarity=0.229 Sum_probs=57.1 Q ss_pred cCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHHHH Q lcl|NC_021557. 4 FLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGDVA 83 (156) Q Consensus 4 YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~dIA 83 (156) -.+.+++..+.| +++ ......++..|++|...|-.||+-.. ..+|..|.-+++++| T Consensus 1 M~~L~~vK~~lg------I~d----------~~~D~lL~~ii~~a~~~i~~~l~~~~--------~~iP~~l~~iv~ev~ 56 (110) T protein:vir:99 1 MTTLADVKKRIG------LKD----------EKQDEQLEEIIKSCESQLLSMLPIEV--------EQIPERFSYMIKEVA 56 (110) T ss_pred CchHHHHHHHhC------CCC----------CchhHHHHHHHHHHHHHHHHHhccch--------hhhhhHHHHHHHHHH Confidence 234555655544 111 12345799999999999999996222 358999999999999 Q ss_pred HHHHHHhcCCCCCCCHH-----------HHHHHHHHHH-HHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 84 RYRLRDKSGGQGQVETT-----------VRERHDAAMS-NIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 84 rY~L~~~~~~~~~~~e~-----------v~~rY~~Ai~-~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) +.+ |.|+++++..++. .-.-|++-|. |++. .|+ .+.+.|+|- T Consensus 57 vkr-yNR~g~EG~~S~S~eG~S~sf~d~d~~~y~~~l~~y~~~--~~~-------------~~kG~v~Fl 110 (110) T protein:vir:99 57 VKR-YNRIGAEGMTSEAVDGRSNAYELNDFKEYEAIIDNYFNA--RTR-------------TKKGRAVFF 110 (110) T ss_pred HHH-hcccCccccceeecCceeeeecccccchHHHHHHHHHhh--cCC-------------CCCceeeeC Confidence 876 6778876655432 2344554443 3221 111 111223332 No 43 >protein:vir:9311 Length: 110 # NCBI annotation: phi Mu50B-like protein # Family: family:all:372 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803289;genbank:gi:29028599;genbank:GeneID:1258047 Probab=40.10 E-value=0.99 Score=20.62 Aligned_cols=98 Identities=17% Similarity=0.229 Sum_probs=57.1 Q ss_pred cCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHHHH Q lcl|NC_021557. 4 FLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGDVA 83 (156) Q Consensus 4 YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~dIA 83 (156) -.+.+++..+.| +++ ......++..|++|...|-.||+-.. ..+|..|.-+++++| T Consensus 1 M~~L~~vK~~lg------I~d----------~~~D~lL~~ii~~a~~~i~~~l~~~~--------~~iP~~l~~iv~ev~ 56 (110) T protein:vir:93 1 MTTLADVKKRIG------LKD----------EKQDEQLEEIIKSCESQLLSMLPIEV--------EQIPERFSYMIKEVA 56 (110) T ss_pred CchHHHHHHHhC------CCC----------CchhHHHHHHHHHHHHHHHHHhccch--------hhhhhHHHHHHHHHH Confidence 234555655544 111 12345799999999999999996222 358999999999999 Q ss_pred HHHHHHhcCCCCCCCHH-----------HHHHHHHHHH-HHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 84 RYRLRDKSGGQGQVETT-----------VRERHDAAMS-NIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 84 rY~L~~~~~~~~~~~e~-----------v~~rY~~Ai~-~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) +.+ |.|+++++..++. .-.-|++-|. |++. .|+ .+.+.|+|- T Consensus 57 vkr-yNR~g~EG~~S~S~eG~S~sf~d~d~~~y~~~l~~y~~~--~~~-------------~~kG~v~Fl 110 (110) T protein:vir:93 57 VKR-YNRIGAEGMTSEAVDGRSNAYELNDFKEYEAIIDNYFNA--RTR-------------TKKGRAVFF 110 (110) T ss_pred HHH-hcccCccccceeecCceeeeecccccchHHHHHHHHHhh--cCC-------------CCCceeeeC Confidence 876 6778876655432 2344554443 3221 111 111223332 No 44 >protein:vir:96390 Length: 110 # NCBI annotation: ORF048 # Family: family:all:372 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239650;genbank:gi:66395410;genbank:GeneID:5132866 Probab=40.10 E-value=0.99 Score=20.62 Aligned_cols=98 Identities=17% Similarity=0.229 Sum_probs=57.1 Q ss_pred cCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHHHH Q lcl|NC_021557. 4 FLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGDVA 83 (156) Q Consensus 4 YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~dIA 83 (156) -.+.+++..+.| +++ ......++..|++|...|-.||+-.. ..+|..|.-+++++| T Consensus 1 M~~L~~vK~~lg------I~d----------~~~D~lL~~ii~~a~~~i~~~l~~~~--------~~iP~~l~~iv~ev~ 56 (110) T protein:vir:96 1 MTTLADVKKRIG------LKD----------EKQDEQLEEIIKSCESQLLSMLPIEV--------EQIPERFSYMIKEVA 56 (110) T ss_pred CchHHHHHHHhC------CCC----------CchhHHHHHHHHHHHHHHHHHhccch--------hhhhhHHHHHHHHHH Confidence 234555655544 111 12345799999999999999996222 358999999999999 Q ss_pred HHHHHHhcCCCCCCCHH-----------HHHHHHHHHH-HHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 84 RYRLRDKSGGQGQVETT-----------VRERHDAAMS-NIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 84 rY~L~~~~~~~~~~~e~-----------v~~rY~~Ai~-~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) +.+ |.|+++++..++. .-.-|++-|. |++. .|+ .+.+.|+|- T Consensus 57 vkr-yNR~g~EG~~S~S~eG~S~sf~d~d~~~y~~~l~~y~~~--~~~-------------~~kG~v~Fl 110 (110) T protein:vir:96 57 VKR-YNRIGAEGMTSEAVDGRSNAYELNDFKEYEAIIDNYFNA--RTR-------------TKKGRAVFF 110 (110) T ss_pred HHH-hcccCccccceeecCceeeeecccccchHHHHHHHHHhh--cCC-------------CCCceeeeC Confidence 876 6778876655432 2344554443 3221 111 111223332 No 45 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=36.87 E-value=0.3 Score=23.49 Aligned_cols=122 Identities=16% Similarity=0.159 Sum_probs=60.4 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhc---cccccc-----C-CCccc Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRAR---YAVIET-----L-TPENT 71 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~R---Y~~~~~-----l-pl~~v 71 (156) |.+|+|.+++...-| ..++.+..+.=+..|+..||.+..-+ |+.-.. . +.... T Consensus 1 ~~pYLTy~ef~~lg~------------------~~~~~d~F~kllk~A~~~ID~~T~y~~~~y~~~~i~~d~~~d~~~~~ 62 (144) T protein:vir:79 1 MKPYLTTSDFEKLGY------------------ELKKPDNFGKLLKSATVLINQICSYYDPAFAYHDLEADSQADPDSYL 62 (144) T ss_pred CCcccchhhhhhhCC------------------CCcchhhhhhHHHHHHHHhhhhhhhhccccccccccccccccchhhh Confidence 999999999854433 12345668899999999999977654 431000 0 00111 Q ss_pred h---HHHHH-HHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCC-CccceeeeecCcch Q lcl|NC_021557. 72 P---QLVKG-LIGDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGE-AGSTRTEAVIPPSR 146 (156) Q Consensus 72 p---~~L~~-~~~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e-~~~~~v~~~~~~~r 146 (156) + ..+++ +|..|. | +.. . -+...|+-+-+|+.-|.-|+.++.....+.+.. .+... ++ .- T Consensus 63 ~~r~~~vKkA~a~QIe-Y-~~~-~--------G~~sa~e~~~~~~~S~svGrtsvs~~~~~~~s~t~~~~~--v~---~~ 126 (144) T protein:vir:79 63 FRQAMAFKKAVALEML-F-LED-S--------GYSSAYDVAQGALNSFTVGHTSMSLNPSAGQNLTVGSTG--VV---KS 126 (144) T ss_pred hHHHHHHHHHHHHHHH-H-HHH-c--------CCcchhhhhcCccceeEecceEEeecCCCcccccccccc--cc---HH Confidence 1 22333 333444 2 221 1 133445556688888999997776533222211 11111 11 01 Q ss_pred hhhhh-------ccccC Q lcl|NC_021557. 147 IAGIL-------YGWNS 156 (156) Q Consensus 147 ~~~~l-------~g~~~ 156 (156) +-.-| .|=-| T Consensus 127 a~~yL~~tGLLYrGV~s 143 (144) T protein:vir:79 127 AYDLLGRYGLLFSGVAS 143 (144) T ss_pred HHHHHhhcCcccccccc Confidence 11111 12222 No 46 >protein:vir:3615 Length: 110 # NCBI annotation: ORF38 # Family: family:all:372 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112701;genbank:gi:13786569;genbank:GeneID:921067 Probab=36.65 E-value=1.1 Score=20.28 Aligned_cols=98 Identities=15% Similarity=0.130 Sum_probs=57.0 Q ss_pred cCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHHHH Q lcl|NC_021557. 4 FLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGDVA 83 (156) Q Consensus 4 YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~dIA 83 (156) -.+.+++..+.|. .+...++..|++|...|-.||...+ ..+|..|--+++++| T Consensus 1 M~~L~~vK~~lg~-------------------~~D~lL~~li~~a~~~i~~~~~~~~--------~eiP~~l~~iv~eva 53 (110) T protein:vir:36 1 MAITDDLKMLLGG-------------------SLDERLEVIEKRTRDRLLLILGSDI--------KEVPPELEYVVLDVS 53 (110) T ss_pred ChhHHHHHhhcCC-------------------ChhHHHHHHHHHHHHHHHHHhCCCh--------hhhhhHHHHHHHHHH Confidence 2344555555442 1345899999999999999998532 357899999999999 Q ss_pred HHHHHHhcCCCCCCCHH-----------HHHHHHHHH-HHHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 84 RYRLRDKSGGQGQVETT-----------VRERHDAAM-SNIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 84 rY~L~~~~~~~~~~~e~-----------v~~rY~~Ai-~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) +.+ |.|+++++..++. .-.-|.+-| +|++.= .+ ....+-+.|++- T Consensus 54 v~r-yNR~g~EG~~S~SeeG~S~sf~~~d~~~y~~~l~~y~~~~-~~-----------~~~~~~g~~~f~ 110 (110) T protein:vir:36 54 LKR-FNRIGQEGMQSYSQEGLSMTFSESDFDEYADEIESWRKSR-ET-----------EGDKKIGRFRLY 110 (110) T ss_pred HHH-hccccccccceeecCCceeeecccCcchHHHHHHHHHhhh-cc-----------ccCCcceeeeeC Confidence 866 6788876654432 123344333 222211 00 112223334332 No 47 >protein:vir:2738 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695111;genbank:gi:23455880;genbank:GeneID:955641 Probab=35.42 E-value=1.2 Score=20.09 Aligned_cols=99 Identities=14% Similarity=0.081 Sum_probs=54.1 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIG 80 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~ 80 (156) |-+ -+.+-| +.+..++ +..|...++.-|.+|+..+-+||.. ..+|.-|--+.+ T Consensus 1 ~~l-~~~~~L------~~iK~~l----------g~~dD~lL~~ii~~a~~~i~~~l~~----------~~iP~~l~~Iv~ 53 (112) T protein:vir:27 1 MTL-DKDKVI------KNVSVDL----------NTNDDALLKILLERVVNHFKSEYGV----------EEIDDKLAFIFE 53 (112) T ss_pred Ccc-hhHHHH------HHHHhhc----------CCChhHHHHHHHHHHHHHHHHhcCc----------cccchhHHHHHH Confidence 221 111111 1122222 1235678999999999999999852 468999999999 Q ss_pred HHHHHHHHHhcCCCCCCCHH-------------HHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETT-------------VRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~-------------v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) +||+.+ |.|+|+++..++. --.-|.+-|..... ..|+ .+.+.|+|- T Consensus 54 evavkr-yNR~g~EG~~S~SeeG~S~sf~d~~~df~~Y~~~l~~~~~-~~~~-------------~~~G~v~Fl 112 (112) T protein:vir:27 54 DCVIKR-FNRRGAEGAKSESVDGHSMSYYDNENEFKPYDDMLQRLYG-TSGQ-------------AKEGEVLFL 112 (112) T ss_pred HHHHHH-hcccCccccceeecCceeeeecccccchhhhHHHHHHHHh-hcCC-------------CCCceeeeC Confidence 999876 6788876654432 12345655553321 1111 111223332 No 48 >protein:vir:7410 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839927;genbank:gi:30089897;genbank:GeneID:1260684 Probab=34.25 E-value=1.3 Score=19.95 Aligned_cols=107 Identities=10% Similarity=0.037 Sum_probs=60.7 Q ss_pred ccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHHH Q lcl|NC_021557. 3 RFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGDV 82 (156) Q Consensus 3 ~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~dI 82 (156) |=+|.+++..... + |+ . |...|+..|..|.+.|-+.++..+.......-+++++..+..|+-+ T Consensus 1 M~v~LdeiK~~LR------I------Dd----d-DD~ll~~~i~aAe~yI~~Aig~~~~~~~fy~~e~~~~l~~~Avl~L 63 (107) T protein:vir:74 1 MSVTVDDLLDQLS------E------DD----D-RKPQLQIYFDTATAYVKNAVSSDTVDAPFFNVENVSPIYDVAVLSY 63 (107) T ss_pred CeecHHHHHHHcC------C------CC----C-hhHHHHHHHHHHHHHHhhhcCCcccccccccccCcchHHHHHHHHH Confidence 7789999876542 1 11 1 6789999999999999999997654211112245778888889999 Q ss_pred HHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccce Q lcl|NC_021557. 83 ARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTR 137 (156) Q Consensus 83 ArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~ 137 (156) |-.| |..|.. +.++----..-|..|+..=...- .........+. T Consensus 64 a~~w-YeNR~a----t~~vp~~v~siI~QLRg~y~~~~------e~~~~~~~~~~ 107 (107) T protein:vir:74 64 SMDL-WINRST----TMPPTTAVDHMVGQLRGLYSSWK------EEQGGQNLQTE 107 (107) T ss_pred HHHH-HHhccc----cccccHHHHHHHHHHhhcccchh------hhcCCCcccCC Confidence 8766 666633 23333333444444443221111 11111111111 No 49 >protein:vir:4831 Length: 105 # NCBI annotation: ORF27 # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038328;genbank:gi:9634654;genbank:GeneID:1262588 Probab=32.49 E-value=1.4 Score=19.75 Aligned_cols=96 Identities=14% Similarity=0.103 Sum_probs=56.0 Q ss_pred ccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHHH Q lcl|NC_021557. 3 RFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGDV 82 (156) Q Consensus 3 ~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~dI 82 (156) |.+|.+++....- .+. .-|.+.|+.-|..|.+.|.++++..+. + ....++|+.++..++-+ T Consensus 1 M~vtLee~K~~LR---------ID~-------dddD~lI~~~i~aA~~yi~~~ig~~~~--~-~~~~~~~~~~~~Avl~l 61 (105) T protein:vir:48 1 MSVSKTSIMQTLN---------LDE-------TDDTALIPAYIESAKQYIINAVGSDSK--F-YDLENVQPLFDTAVIAL 61 (105) T ss_pred CcccHHHHHHHcC---------CCC-------ccchHHHHHHHHHHHHHHHHhhCCCCc--c-ccccCCchHHHHHHHHH Confidence 8889999876422 111 126778999999999999999985432 1 12234677777777777 Q ss_pred HHHHHHHhcCCC-CCCCHHHHHHHHHHHHHHH--------HHhcC Q lcl|NC_021557. 83 ARYRLRDKSGGQ-GQVETTVRERHDAAMSNIK--------AVATG 118 (156) Q Consensus 83 ArY~L~~~~~~~-~~~~e~v~~rY~~Ai~~L~--------~Va~G 118 (156) +-+| |..|..- .....++-...+.-|..|+ -..+| T Consensus 62 v~~~-YeNR~~~~~~~~~~ip~~v~sli~~lR~~y~~~~e~~~~g 105 (105) T protein:vir:48 62 TSSY-FTYRVALTDTVTYPINLTLNSIIGQLRGLYATYSEVVANG 105 (105) T ss_pred HHHH-HhhhhhccCcccchhhHHHHHHHHHHhhhhhhhhhcccCC Confidence 7655 6666421 1112233334443333333 33444 No 50 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=31.15 E-value=0.88 Score=20.89 Aligned_cols=130 Identities=16% Similarity=0.074 Sum_probs=58.9 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHH---HHhhcc-c--ccc---------- Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVG---YSRARY-A--VIE---------- 64 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~---YL~~RY-~--~~~---------- 64 (156) -=+|+|.+|+...+...=. +- ..-++.-.+++|..|+.-||+ |.+.|= . -++ T Consensus 15 AnSYvtv~~a~aY~~~rg~---~~---------~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRtg~~d~ 82 (172) T protein:vir:97 15 ANAYISVEEFKTYHTDRGN---SF---------AGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDR 82 (172) T ss_pred ccccccHHHHHHHHHhcCc---cc---------CCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccCCCCCC Confidence 5679999999887754311 10 112344478899999999997 233331 1 111 Q ss_pred -cCCCccchHHHHHHHHHHHHHHHHHhcCCCCC-CCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeeec Q lcl|NC_021557. 65 -TLTPENTPQLVKGLIGDVARYRLRDKSGGQGQ-VETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAVI 142 (156) Q Consensus 65 -~lpl~~vp~~L~~~~~dIArY~L~~~~~~~~~-~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~~ 142 (156) ..|...+|.-|+..||-+|.+-|-........ .+...+-.. |.+.=|.++..-...+.+ ....-.+ T Consensus 83 ~~~~~~~IP~~v~~A~~elA~~al~~~l~~d~~~~~~~~~v~~-------kr~kvg~i~~~y~~~~~~-----~~~~p~~ 150 (172) T protein:vir:97 83 DRYYINDIPPEVKEACAEYALRALAAELNPDPERNASGVAVLS-------KSEAVGPISESVTFVGGA-----VFQMPKY 150 (172) T ss_pred cccccccccHHHHHHHHHHHHHHHhccccccccccccccccee-------eeeeecceeeEeeccCCC-----CCccccH Confidence 12345589999999999998777422111000 011110001 222223333321111111 0000001 Q ss_pred Ccchhhhhh----ccccC Q lcl|NC_021557. 143 PPSRIAGIL----YGWNS 156 (156) Q Consensus 143 ~~~r~~~~l----~g~~~ 156 (156) | -++..| .++.+ T Consensus 151 ~--~v~aLL~p~gl~~~~ 166 (172) T protein:vir:97 151 P--AADQKLVRAGLVRSG 166 (172) T ss_pred H--HHHHHHhhhccccCc Confidence 1 022222 22222 No 51 >protein:vir:4904 Length: 113 # NCBI annotation: gp113 # Family: family:all:372 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056682;genbank:gi:9635017;genbank:GeneID:1262667 Probab=31.10 E-value=1.5 Score=19.58 Aligned_cols=100 Identities=10% Similarity=0.006 Sum_probs=52.8 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIG 80 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~ 80 (156) ||.--..+-| +.+..+. +.-|...++.-|.+|++.+-.||.. ..+|.-|--+.+ T Consensus 1 m~~l~~~~~L------~~vK~~l----------gi~dD~lL~~li~~a~~~i~~~l~~----------~~iP~~l~~Iv~ 54 (113) T protein:vir:49 1 MMALDKEKVI------QNVSVDL----------NINDDNLLGILLERIVNHFKAEYGV----------DEVDDNLAFIFE 54 (113) T ss_pred CcchhHHHHH------HHHHHhc----------CCChhHHHHHHHHHHHHHHHHHhCc----------cccchHHHHHHH Confidence 5543222111 1111111 1224567999999999999999862 368999999999 Q ss_pred HHHHHHHHHhcCCCCCCCHHH---HH----------HHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTV---RE----------RHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v---~~----------rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) ++|+.+ |.|+|+++..++.. -- -|.+-|. +-..++- ..+.+.|+|- T Consensus 55 evavkr-yNR~g~EG~~S~SeeG~S~sf~d~~~df~eY~~~l~---~~~~~~~-----------~~~~G~v~Fl 113 (113) T protein:vir:49 55 DCLVKR-FNRRGAEGARSESIDGHSMSYYDNENEFDPYDNMLQ---RLYGTSG-----------QAKEGEVLFL 113 (113) T ss_pred HHHHHH-hcccCccccceeecCceeeeecccccccchhHHHHH---HHHhhcC-----------CCCCcceeeC Confidence 999876 67788766544321 11 2332222 2211110 0111223222 No 52 >protein:vir:96488 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238494;genbank:gi:66391770;genbank:GeneID:5176910 Probab=30.21 E-value=1.6 Score=19.47 Aligned_cols=100 Identities=10% Similarity=0.055 Sum_probs=57.7 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHHH Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIGD 81 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~d 81 (156) |+ |.+++.-..+ |.. ..+..|...++.-|++|++.+-.||+. ..+|.-|--+..+ T Consensus 1 M~--~L~~~K~l~~------ik~-------~~~~~~D~lL~~ii~~a~~~i~~~l~~----------~~iP~~L~~Iv~e 55 (113) T protein:vir:96 1 MM--ALDKDKVIKN------VSV-------DLNTDDDVLLKILLERVVNHFKSEYGV----------EEIDDKLAFIFED 55 (113) T ss_pred Cc--hhHHHHHHhc------CCC-------CCCCchhHHHHHHHHHHHHHHHHHhcc----------cccchhHHHHHHH Confidence 43 4555433222 221 122346788999999999999999962 4678999999999 Q ss_pred HHHHHHHHhcCCCCCCCHH-------------HHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCccceeeee Q lcl|NC_021557. 82 VARYRLRDKSGGQGQVETT-------------VRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRTEAV 141 (156) Q Consensus 82 IArY~L~~~~~~~~~~~e~-------------v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v~~~ 141 (156) +|+-+ |.|+|+++..++. --.-|.+-|..... ..|+. +.+.|+|- T Consensus 56 vavkr-yNR~g~EG~~S~S~eG~S~sf~d~~~df~eY~~~l~~~~~-~~~~~-------------~~G~v~Fl 113 (113) T protein:vir:96 56 CVIKR-FNRRGAEGAKSESVDGHSMSYYDNENEFKPYDDMLQRLYG-TSGQS-------------KEGEVLFL 113 (113) T ss_pred HHHHH-hcCCCccccceeccCceeeeecccccccchhHHHHHHHHh-hcCCC-------------CCceeeeC Confidence 99865 7788876654432 22345555553321 11111 11223332 No 53 >protein:vir:3846 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050152;swissprot:trembl:q9t1f5;genbank:gi:9633044;uniprot:Q9T1F5;genbank:GeneID:1262150 Probab=29.38 E-value=1.7 Score=19.37 Aligned_cols=125 Identities=10% Similarity=0.043 Sum_probs=64.9 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIG 80 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~ 80 (156) |.+|.|+.|=.++ .|-+|-..-..+ .-|...+...|..|+..|-+-++..+.- | .--+.++++....|. T Consensus 1 ~~~~~~~~~~l~~----~L~~lk~~lrlD-----ddDd~~l~~~l~AA~~yIk~AVG~d~~~-F-y~~e~v~plf~lAvl 69 (126) T protein:vir:38 1 MTTYLKITDGLKR----SLGYLDEDTSLD-----EGLQKRMSSALIAAESYVQHAIGTDVKD-F-YISEENKPLYTLVCN 69 (126) T ss_pred Cceeeeechhhhh----HHHHhHHhccCC-----CchHHHHHHHHHHHHHHHhhhccCCccc-c-hhccCCchHHHHHHH Confidence 9999999993322 344443221112 3378899999999999999999876531 1 123467778888888 Q ss_pred HHHHHHHHHhcC-CCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCcccee Q lcl|NC_021557. 81 DVARYRLRDKSG-GQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGSTRT 138 (156) Q Consensus 81 dIArY~L~~~~~-~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~~~v 138 (156) -+|-.| |..|. -......+|---...=|..|+..=+-. .-.......+..++...- T Consensus 70 ~La~~y-Y~nRsAtt~~~~~~Vp~~~~siI~QLRg~Y~~~-qe~~~~~~t~~~~~d~~~ 126 (126) T protein:vir:38 70 ALAASY-VQNPVSITSGAVVNVDIVTNAIIGQLRGRYAKE-LEAQDGQNTKSQSSDSEN 126 (126) T ss_pred HHHHhh-hhccchhccccccccchHHHHHHHHHHhhHHHH-hhhcCCCCcccCCccCCC Confidence 888755 44442 111122333344455555555421000 000111111111211111 No 54 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=27.01 E-value=1.9 Score=19.07 Aligned_cols=101 Identities=8% Similarity=0.023 Sum_probs=53.8 Q ss_pred CccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhccc-cccc-----------CCCc Q lcl|NC_021557. 2 PRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYA-VIET-----------LTPE 69 (156) Q Consensus 2 m~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~-~~~~-----------lpl~ 69 (156) ||++|.++++...-- + ..-|.+.|+.=|..|.+.|-.||+.++. .... -... T Consensus 1 M~~vtLee~K~hLRv------------d----~d~dD~lI~~li~AA~~~ve~~l~r~l~~~~~~~~~~~~~~~~~~~~~ 64 (113) T protein:vir:10 1 MALVELKLALGFVRA------------N----AGVEDDVVQMLLDAATQSAVDYLNRQVFETEDAMTTAIEAGTAGQNPM 64 (113) T ss_pred CCCCCHHHHHHHcCC------------C----CCcchHHHHHHHHHHHHHHHHHhCcccccccccccccccccccccccc Confidence 889999999765431 1 1127888999999999999999987641 1000 0112 Q ss_pred cchHHHHHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCC Q lcl|NC_021557. 70 NTPQLVKGLIGDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPI 124 (156) Q Consensus 70 ~vp~~L~~~~~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~ 124 (156) .+|+.++..+.-++=|| |..|-. .....+.+-=.-+-..|... +.-.|+ T Consensus 65 ~~p~~i~~AvLllv~~~-Y~nRe~--~~~~~~~~lP~~v~~Ll~~y---R~~~g~ 113 (113) T protein:vir:10 65 VVNAAIRAAILKITAEL-YANRED--TAFGPITELPLNARALLRPH---RIIPGV 113 (113) T ss_pred ccChHHHHHHHHHHHHH-Hhhhhh--hchhhhhccCHHHHHHHHHh---hhhcCC Confidence 36788888777777654 765521 11111111000011111111 112232 No 55 >protein:vir:9821 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795583;genbank:gi:28876338;genbank:GeneID:1257921 Probab=26.79 E-value=1.9 Score=19.04 Aligned_cols=119 Identities=15% Similarity=0.167 Sum_probs=52.2 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccc-----hHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENT-----PQLV 75 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~v-----p~~L 75 (156) -|+|+|.+++.+.-++ .++ -.+.=+..|+..||.+..-||... -++.. -.+= T Consensus 5 ~M~YlT~eey~~l~~~------------------~~~--dF~kllk~As~~ID~~t~~~y~~~---d~e~d~~~r~~~vK 61 (138) T protein:vir:98 5 IIAFLTQKEFEDLGFD------------------DVE--DFEKMEKRASHAVNLYCRNRYDYK---DLKKEIALVQKAVK 61 (138) T ss_pred cccccchHHHhccCCC------------------Chh--hHHHHHHHHHHHhhhhhccccccc---cccchhHHHHHHHH Confidence 6999999998653221 111 288899999999999999999731 22221 1222 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCC-CCcCCCCCccceeeeecCcchhhhhhc-- Q lcl|NC_021557. 76 KGLIGDVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIA-GEPVNGEAGSTRTEAVIPPSRIAGILY-- 152 (156) Q Consensus 76 ~~~~~dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~-~~~~~~e~~~~~v~~~~~~~r~~~~l~-- 152 (156) +-+|..|..... .| .... .. -.-+.-|.-|+.++--. ..+..+..+++.-....+. -+..-|+ T Consensus 62 kA~a~QIeY~~~---~G-~ts~--~d-------~~~~~s~svGrTSiS~~~~~~~~s~~~~~~~~~~~s~-~A~~~L~~t 127 (138) T protein:vir:98 62 RAIAYQIAYLND---SG-VMTA--ED-------KQSFAGISLGRTSISYTVGHGQGSQQKTLADRFNLCL-DAENELLVV 127 (138) T ss_pred HHHHHHHHHHHH---cC-Ccch--hh-------ccCcCceEeeeeEeecccccccccccccccccccccH-HHHHHHhhc Confidence 233444442221 11 1111 11 23445566676665310 0011111111111101110 0111111 Q ss_pred --cccC Q lcl|NC_021557. 153 --GWNS 156 (156) Q Consensus 153 --g~~~ 156 (156) .|.- T Consensus 128 GLLY~G 133 (138) T protein:vir:98 128 GLGYTG 133 (138) T ss_pred Cccccc Confidence 1111 No 56 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=25.99 E-value=2 Score=18.94 Aligned_cols=104 Identities=16% Similarity=0.105 Sum_probs=55.4 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIG 80 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~ 80 (156) -|+++|.++++...- + + ..-|.+-|+.=|..|.+.|-+|+....- . ....+|..++..+. T Consensus 5 ~M~~vtLee~K~hLR------i---d-------~dddD~lI~~~i~AA~~~v~~~~~~~~~---~-~~~~~p~~ik~AiL 64 (108) T protein:vir:19 5 VLDVISLSLFKQQIE------F---E-------EDDRDELITLYAQAAFDYCMRWCDEPAW---K-VAADIPAAVKGAVL 64 (108) T ss_pred cccccCHHHHHHHcC------C---C-------CCcchHHHHHHHHHHHHHHHHHhCCccc---c-cccccchHHHHHHH Confidence 688999999987633 1 1 1237788999999999999999975421 1 11356777777666 Q ss_pred HHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCcc Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGS 135 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~ 135 (156) -++- ++|..|-..+. .+...- .++++|-.-= +---|.+. .|.|+ T Consensus 65 llv~-~~YenRE~~~~--~~~~~~--~~~~~LL~pY--R~~~g~~~----~~~~~ 108 (108) T protein:vir:19 65 LVFA-DMFEHRTAQSE--VQLYEN--AAAERMMFIH--RNWRGKAE----SEEGS 108 (108) T ss_pred HHHH-HHHhccccccc--chhhhh--HHHHHHHHHH--HhcCCCCC----cccCC Confidence 6665 44776632211 122111 1233332211 11111111 12222 No 57 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=25.99 E-value=2 Score=18.94 Aligned_cols=104 Identities=16% Similarity=0.105 Sum_probs=55.4 Q ss_pred CCccCCHHHHHhhcCHHHHHHHhcccccCccccccccHHHHHHHHHHHHHHHHHHHhhcccccccCCCccchHHHHHHHH Q lcl|NC_021557. 1 MPRFLTVDEFTTMFGLAEVSQIAGIGNLNDMAGRTLDVAKIETAITFAEDILVGYSRARYAVIETLTPENTPQLVKGLIG 80 (156) Q Consensus 1 mm~YaT~~Dl~~~~ge~eL~~Lt~~~~~~~~~~~~~D~~~v~~Al~dA~~~id~YL~~RY~~~~~lpl~~vp~~L~~~~~ 80 (156) -|+++|.++++...- + + ..-|.+-|+.=|..|.+.|-+|+....- . ....+|..++..+. T Consensus 5 ~M~~vtLee~K~hLR------i---d-------~dddD~lI~~~i~AA~~~v~~~~~~~~~---~-~~~~~p~~ik~AiL 64 (108) T protein:vir:18 5 VLDVISLSLFKQQIE------F---E-------EDDRDELITLYAQAAFDYCMRWCDEPAW---K-VAADIPAAVKGAVL 64 (108) T ss_pred cccccCHHHHHHHcC------C---C-------CCcchHHHHHHHHHHHHHHHHHhCCccc---c-cccccchHHHHHHH Confidence 688999999987633 1 1 1237788999999999999999975421 1 11356777777666 Q ss_pred HHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHHHHHhcCccccCCCCCcCCCCCcc Q lcl|NC_021557. 81 DVARYRLRDKSGGQGQVETTVRERHDAAMSNIKAVATGKFELPIAGEPVNGEAGS 135 (156) Q Consensus 81 dIArY~L~~~~~~~~~~~e~v~~rY~~Ai~~L~~Va~Gk~~Lg~~~~~~~~e~~~ 135 (156) -++- ++|..|-..+. .+...- .++++|-.-= +---|.+. .|.|+ T Consensus 65 llv~-~~YenRE~~~~--~~~~~~--~~~~~LL~pY--R~~~g~~~----~~~~~ 108 (108) T protein:vir:18 65 LVFA-DMFEHRTAQSE--VQLYEN--AAAERMMFIH--RNWRGKAE----SEEGS 108 (108) T ss_pred HHHH-HHHhccccccc--chhhhh--HHHHHHHHHH--HhcCCCCC----cccCC Confidence 6665 44776632211 122111 1233332211 11111111 12222 Done!