Query         lcl|NC_019769.1_cdsid_YP_007151744.1 [gene=F864_gp09] [protein=hypothetical protein] [protein_id=YP_007151744.1] [location=6277..6624]
Match_columns 115
No_of_seqs    96 out of 112
Neff          6.5
Searched_HMMs 1612
Date          Thu Nov 7 16:34:34 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_9 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_9_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:195 Length: 115 # 100.0 1.3E-47 8.4E-51 277.5 12.8 115 1-115 1-115 (115)
2 protein:vir:93602 Length: 114 100.0 8.6E-47 5.3E-50 273.1 12.8 114 1-115 1-114 (114)
3 protein:vir:10368 Length: 118 100.0 2.3E-37 1.4E-40 221.5 12.3 110 1-115 1-115 (118)
4 protein:vir:81066 Length: 118 100.0 3.5E-37 2.1E-40 220.5 12.4 110 1-115 1-115 (118)
5 protein:vir:97070 Length: 118 100.0 3.7E-37 2.3E-40 220.3 12.1 110 1-115 1-115 (118)
6 protein:vir:4348 Length: 121 # 100.0 1.6E-34 9.6E-38 205.9 11.9 108 1-115 1-119 (121)
7 protein:vir:1438 Length: 115 # 100.0 1.5E-34 9.3E-38 206.0 9.2 109 1-115 1-115 (115)
8 protein:vir:1892 Length: 121 # 100.0 4.2E-34 2.6E-37 203.6 11.0 108 1-115 1-119 (121)
9 protein:vir:100116 Length: 115 100.0 1.7E-34 1.1E-37 205.7 8.7 109 1-115 1-115 (115)
10 protein:vir:100242 Length: 114 100.0 7.9E-33 4.9E-36 196.6 9.2 109 1-115 1-114 (114)
11 protein:vir:80371 Length: 115 99.9 1E-30 6.3E-34 185.0 8.8 109 1-115 1-115 (115)
12 protein:vir:102888 Length: 119 99.3 2.3E-14 1.4E-17 95.3 10.4 107 1-115 1-117 (119)
13 protein:vir:105008 Length: 119 99.3 2.3E-14 1.4E-17 95.3 10.4 107 1-115 1-117 (119)
14 protein:vir:107581 Length: 119 99.3 2.3E-14 1.4E-17 95.3 10.4 107 1-115 1-117 (119)
15 protein:vir:102086 Length: 119 99.3 2.3E-14 1.4E-17 95.3 10.4 107 1-115 1-117 (119)
16 protein:vir:1274 Length: 162 # 99.3 1.7E-14 1E-17 96.1 9.2 109 1-115 37-157 (162)
17 protein:vir:9364 Length: 131 # 99.2 4.1E-14 2.5E-17 94.0 8.2 111 1-115 1-120 (131)
18 protein:vir:78648 Length: 131 99.2 4.1E-14 2.5E-17 94.0 8.2 111 1-115 1-120 (131)
19 protein:vir:96972 Length: 131 99.2 4.1E-14 2.5E-17 94.0 8.2 111 1-115 1-120 (131)
20 protein:vir:2689 Length: 131 # 99.2 4.1E-14 2.5E-17 94.0 8.2 111 1-115 1-120 (131)
21 protein:vir:93902 Length: 131 99.2 4.7E-14 2.9E-17 93.6 8.0 111 1-115 1-120 (131)
22 protein:vir:94418 Length: 131 99.2 6E-14 3.7E-17 93.1 8.0 111 1-115 1-120 (131)
23 protein:vir:1387 Length: 116 # 99.1 2E-13 1.2E-16 90.2 8.1 108 1-115 4-113 (116)
24 protein:vir:78349 Length: 127 99.1 5.2E-13 3.2E-16 87.9 9.6 111 1-115 1-122 (127)
25 protein:vir:98343 Length: 126 99.0 3.3E-12 2E-15 83.5 8.5 114 1-115 1-118 (126)
26 protein:vir:9415 Length: 126 # 99.0 3.3E-12 2E-15 83.5 8.5 114 1-115 1-118 (126)
27 protein:vir:80001 Length: 126 99.0 4.1E-12 2.5E-15 83.0 8.9 114 1-115 1-118 (126)
28 protein:vir:81093 Length: 126 99.0 4.1E-12 2.5E-15 83.0 8.9 114 1-115 1-118 (126)
29 protein:vir:96894 Length: 140 98.9 2.4E-11 1.5E-14 78.8 10.2 105 1-115 1-131 (140)
30 protein:vir:95111 Length: 145 98.9 3.1E-11 1.9E-14 78.2 9.8 107 1-115 1-131 (145)
31 protein:vir:95961 Length: 145 98.8 3.6E-11 2.2E-14 77.8 9.8 107 1-115 1-131 (145)
32 protein:vir:94794 Length: 145 98.8 3.6E-11 2.2E-14 77.8 9.8 107 1-115 1-131 (145)
33 protein:vir:93736 Length: 145 98.8 4.5E-11 2.8E-14 77.3 9.7 107 1-115 1-131 (145)
34 protein:vir:97421 Length: 145 98.8 4.5E-11 2.8E-14 77.3 9.7 107 1-115 1-131 (145)
35 protein:vir:94488 Length: 145 98.8 4.5E-11 2.8E-14 77.3 9.7 107 1-115 1-131 (145)
36 protein:vir:1244 Length: 145 # 98.8 5.4E-11 3.4E-14 76.9 10.0 107 1-115 1-131 (145)
37 protein:vir:96125 Length: 140 98.8 5.4E-11 3.3E-14 76.9 10.0 105 1-115 3-131 (140)
38 protein:vir:97325 Length: 145 98.8 7.7E-11 4.8E-14 76.0 9.8 105 1-115 1-131 (145)
39 protein:vir:96260 Length: 141 98.7 1.2E-10 7.7E-14 74.9 9.4 107 1-115 1-131 (141)
40 protein:vir:105892 Length: 141 98.7 1.2E-10 7.7E-14 74.9 9.4 107 1-115 1-131 (141)
41 protein:vir:94096 Length: 141 98.7 1.2E-10 7.7E-14 74.9 9.4 107 1-115 1-131 (141)
42 protein:vir:105337 Length: 145 98.7 2.2E-10 1.4E-13 73.5 9.5 105 1-115 1-131 (145)
43 protein:vir:107096 Length: 145 98.7 2.2E-10 1.4E-13 73.5 9.5 105 1-115 1-131 (145)
44 protein:vir:9709 Length: 141 # 98.7 1.7E-10 1.1E-13 74.1 8.7 112 1-112 1-141 (141)
45 protein:vir:9579 Length: 111 # 98.5 1.4E-09 8.6E-13 69.1 9.9 103 1-115 1-110 (111)
46 protein:vir:101303 Length: 135 98.5 9.5E-10 5.9E-13 70.0 8.8 111 1-115 1-130 (135)
47 protein:vir:9514 Length: 135 # 98.5 9.5E-10 5.9E-13 70.0 8.8 111 1-115 1-130 (135)
48 protein:vir:100675 Length: 135 98.5 9.5E-10 5.9E-13 70.0 8.8 111 1-115 1-130 (135)
49 protein:vir:96002 Length: 133 98.5 1.7E-09 1.1E-12 68.6 9.2 111 1-115 1-129 (133)
50 protein:vir:1643 Length: 111 # 98.4 6.7E-09 4.2E-12 65.4 10.6 103 1-115 1-110 (111)
51 protein:vir:94768 Length: 111 98.4 1.1E-08 6.5E-12 64.3 10.5 103 1-115 1-110 (111)
52 protein:vir:9648 Length: 126 # 98.3 2.6E-09 1.6E-12 67.6 7.0 111 1-115 2-120 (126)
53 protein:vir:5979 Length: 134 # 98.3 1.2E-08 7.5E-12 64.0 9.9 106 1-115 1-131 (134)
54 protein:vir:9764 Length: 111 # 98.3 2E-08 1.3E-11 62.8 10.4 103 1-115 1-110 (111)
55 protein:vir:105055 Length: 129 98.2 2.7E-08 1.7E-11 62.1 9.8 106 1-115 1-124 (129)
56 protein:vir:5744 Length: 140 # 98.0 1.1E-07 7E-11 58.7 9.4 106 1-115 15-138 (140)
57 protein:vir:80105 Length: 162 97.9 2.8E-07 1.7E-10 56.5 9.3 107 1-115 13-139 (162)
58 protein:vir:3618 Length: 129 # 97.6 2.8E-06 1.7E-09 51.0 10.8 107 1-115 2-128 (129)
59 protein:vir:2741 Length: 128 # 97.6 2.3E-06 1.4E-09 51.5 10.3 107 1-115 1-127 (128)
60 protein:vir:3972 Length: 129 # 97.6 3.1E-06 1.9E-09 50.8 10.9 107 1-115 2-128 (129)
61 protein:vir:744 Length: 129 # 97.5 3.4E-06 2.1E-09 50.5 11.1 107 1-115 2-128 (129)
62 protein:vir:4907 Length: 128 # 97.3 6.5E-06 4.1E-09 49.0 9.9 106 1-114 1-128 (128)
63 protein:vir:98629 Length: 126 97.2 2.5E-06 1.5E-09 51.3 6.8 111 1-115 2-120 (126)
64 protein:vir:99537 Length: 125 97.2 1.5E-05 9.1E-09 47.1 10.8 108 1-115 1-124 (125)
65 protein:vir:96485 Length: 128 97.1 1.8E-05 1.1E-08 46.7 10.1 106 1-114 1-128 (128)
66 protein:vir:98426 Length: 131 96.8 6.3E-05 3.9E-08 43.6 10.9 104 1-115 6-126 (131)
67 protein:vir:95765 Length: 127 96.0 0.00023 1.4E-07 40.6 9.7 106 1-115 1-122 (127)
68 protein:vir:81158 Length: 109 95.8 0.00021 1.3E-07 40.7 8.8 103 1-115 4-108 (109)
69 protein:vir:80109 Length: 104 95.0 0.00041 2.6E-07 39.1 7.7 102 1-114 1-104 (104)
70 protein:vir:95371 Length: 104 94.8 0.00059 3.6E-07 38.3 8.2 101 1-114 1-104 (104)
71 protein:vir:106593 Length: 131 94.3 0.0027 1.7E-06 34.7 10.7 108 1-115 3-130 (131)
72 protein:vir:103918 Length: 127 94.1 0.0023 1.4E-06 35.0 9.9 109 1-115 1-126 (127)
73 protein:vir:96217 Length: 127 94.1 0.0023 1.4E-06 35.0 9.9 109 1-115 1-126 (127)
74 protein:vir:99769 Length: 127 94.1 0.0023 1.4E-06 35.0 9.9 109 1-115 1-126 (127)
75 protein:vir:97143 Length: 127 94.1 0.0023 1.4E-06 35.0 9.9 109 1-115 1-126 (127)
76 protein:vir:96355 Length: 127 94.1 0.0023 1.4E-06 35.1 9.9 109 1-115 1-126 (127)
77 protein:vir:78854 Length: 127 94.1 0.0023 1.4E-06 35.1 9.9 109 1-115 1-126 (127)
78 protein:vir:9313 Length: 127 # 94.0 0.0025 1.5E-06 34.9 9.9 109 1-115 1-126 (127)
79 protein:vir:6215 Length: 109 # 93.9 0.0017 1E-06 35.8 8.8 102 1-115 1-109 (109)
80 protein:vir:967 Length: 105 # 93.8 0.0013 8.1E-07 36.4 8.1 102 1-113 1-105 (105)
81 protein:vir:7994 Length: 134 # 93.1 0.0018 1.1E-06 35.6 7.6 103 1-115 1-131 (134)
82 protein:vir:102609 Length: 134 92.9 0.002 1.2E-06 35.4 7.6 103 1-115 1-131 (134)
83 protein:vir:105826 Length: 134 92.9 0.002 1.2E-06 35.4 7.6 103 1-115 1-131 (134)
84 protein:vir:106554 Length: 122 91.8 0.011 6.7E-06 31.4 10.2 105 1-115 1-111 (122)
85 protein:vir:78612 Length: 178 84.6 0.059 3.7E-05 27.3 9.7 106 1-115 9-150 (178)
86 protein:vir:106763 Length: 178 84.1 0.063 3.9E-05 27.2 10.0 106 1-115 9-150 (178)
87 protein:vir:101569 Length: 178 82.7 0.075 4.7E-05 26.8 10.1 103 1-115 9-150 (178)
88 protein:vir:9931 Length: 119 # 82.1 0.067 4.2E-05 27.0 8.5 106 1-115 6-118 (119)
89 protein:vir:3637 Length: 178 # 82.0 0.081 5E-05 26.6 10.1 103 1-115 9-150 (178)
90 protein:vir:94061 Length: 175 78.0 0.12 7.4E-05 25.7 9.6 103 1-115 8-149 (175)
91 protein:vir:8107 Length: 138 # 70.0 0.07 4.3E-05 26.9 5.0 102 1-115 1-132 (138)
92 protein:vir:8331 Length: 150 # 63.5 0.19 0.00012 24.5 6.1 102 1-115 27-144 (150)
93 protein:vir:108220 Length: 133 62.5 0.33 0.00021 23.2 10.5 109 1-115 1-126 (133)
94 protein:vir:105772 Length: 128 48.5 0.67 0.00041 21.5 8.4 106 1-115 1-126 (128)
95 protein:vir:9880 Length: 136 # 45.2 0.78 0.00048 21.2 8.7 106 1-115 1-128 (136)
96 protein:vir:3874 Length: 114 # 36.4 1.2 0.00073 20.2 7.8 103 1-106 1-114 (114)
97 protein:vir:101655 Length: 134 35.4 1.2 0.00077 20.1 7.8 101 1-115 10-131 (134)
98 protein:vir:7860 Length: 134 # 35.4 1.2 0.00077 20.1 7.8 101 1-115 10-131 (134)
99 protein:vir:103278 Length: 169 29.8 1.6 0.001 19.4 9.7 113 1-115 38-166 (169)
100 protein:vir:81228 Length: 109 28.6 1.7 0.0011 19.3 8.1 94 17-115 1-106 (109)
101 protein:vir:79637 Length: 130 27.1 1.9 0.0012 19.1 10.0 114 1-115 1-126 (130)
102 protein:vir:99874 Length: 154 23.2 2.3 0.0015 18.6 6.6 112 1-115 13-150 (154)
103 protein:vir:100200 Length: 123 21.6 2.6 0.0016 18.3 10.6 113 1-115 1-121 (123)
104 protein:vir:99925 Length: 147 20.6 1.1 0.00071 20.3 2.8 104 1-115 1-137 (147)
105 protein:vir:94547 Length: 117 20.5 2.8 0.0017 18.2 6.5 109 1-115 1-117 (117)

No 1
>protein:vir:195 Length: 115 # NCBI annotation: Gp11 # Family: family:all:896 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037705;genbank:gi:9634170;genbank:GeneID:1262540
Probab=100.00 E-value=1.3e-47 Score=277.55 Aligned_cols=115 Identities=69% Similarity=1.174 Sum_probs=113.7

Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHHHHHHHHHHH
Q lcl|NC_019769.
1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTITEARTIRNMA 80 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~~A~~l~~av 80 (115) |||++|++||+++++|||||+++|+|+++.|++++||||||++||+|+|+|||++.+++|+||||||+|+++|++||++| T Consensus 1 M~e~~i~~lL~~l~~gRvyp~~aP~~~~~~~~~~~Pyiv~q~vsg~p~~~L~G~~~~~~~vQIDvyA~t~~~A~~l~~~i 80 (115) T protein:vir:19 1 MNEDNIYALLSPLAEGRVYPYVAPLGSDGKPSVSPPWIIFSIVDDVSADVLCGQAESRVSVQVDVYSTSIAESRSLRDLV 80 (115) T ss_pred CchhHHHHHHhhhcCcccceeeccCCCCCCccccCCeEEEEeccCcccccccCCCccceEEEEEEeeCChHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999988999999999999999999999999 Q ss_pred HHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 81 LDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 81 ~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) |+||+.++|+..++.++||+||||||++|||+|+- T Consensus 81 ~~Al~~~~p~~~~~~~~ye~dt~lyR~s~d~~V~~ 115 (115) T protein:vir:19 81 LASLEPLTPTEVVKIPGYEPDYRLYRATLDFKVTP 115 (115) T ss_pred HHHhhhcCCEEecCCCCcccchhceeeEEEEEecC Confidence 99999999999999999999999999999999999 No 2 >protein:vir:93602 Length: 114 # NCBI annotation: putative structural component # Family: family:all:896 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449300;genbank:gi:157166048;uniprot:Q6H9U1;genbank:GeneID:5580424 Probab=100.00 E-value=8.6e-47 Score=273.14 Aligned_cols=114 Identities=63% Similarity=0.991 Sum_probs=110.2 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHHHHHHHHHHH Q lcl|NC_019769. 
1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTITEARTIRNMA 80 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~~A~~l~~av 80 (115) |||++||++|+++++|||||+++|++ +..+.+++||||||++||+|+|+|||++++++|+||||||+|+++|++||+|| T Consensus 1 M~e~~i~~lL~~~~~gRvyp~~~P~~-~~~~~~~~Pyiv~q~vsg~p~~~l~gp~~~~~~vQIDvyA~t~~~A~~l~~~v 79 (114) T protein:vir:93 1 MTEADLYPHLAHLAGGQVYPYVVPLL-DGRPSVALPWVVFSLISSVSADVMGGQAESSVSVQIDVYAGTVTQARQIRQDA 79 (114) T ss_pred CchHHHHHHHHhhcCcccccccCCcc-cCcCCccCceEEEEeccCcccccccCccccceEEEEEeeeCCHHHHHHHHHHH Confidence 99999999999999999999999996 55666789999999999999999999888999999999999999999999999 Q ss_pred HHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 81 LDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 81 ~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) |+||+.++++..++.++||+||||||++|||+||| T Consensus 80 ~~Al~~~~~~~~~~~~~ye~dt~lyR~~~d~~v~~ 114 (114) T protein:vir:93 80 REAIMLLAPGSVSEMQDYIPENRCYRATLEFQVTV 114 (114) T ss_pred HHHHhhcCcEeecCCCcccccccceeeEEEEEEeC Confidence 99999999999999999999999999999999999 No 3 >protein:vir:10368 Length: 118 # NCBI annotation: conserved phage protein # Family: family:all:3244 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858960;genbank:gi:32128425;genbank:GeneID:2648389 Probab=100.00 E-value=2.3e-37 Score=221.46 Aligned_cols=110 Identities=18% Similarity=0.247 Sum_probs=98.5 Q ss_pred Cc-hHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCC-CC-cceEEEEEEeeCCHHHHHHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQ-AE-SAVSVQVDVYSSTITEARTIR 77 (115) Q Consensus 1 M~-E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~-~~-~~~~vQIDvyA~t~~~A~~l~ 77 (115) |+ |++|+++|+++++|||||+++|+|. ..+||||||++||.|.|+|+|. 
++ +++||||||||+|+++|++|+ T Consensus 1 Ms~e~~l~a~L~~~~~~RVyp~~aP~~~-----~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~rvQIdvyA~t~~~A~~l~ 75 (118) T protein:vir:10 1 MSYGRVLKDLLDPVFSGRVYADIPPDSP-----PLDAYAIYQRVGGVPVYWQEGGMPEKVNARVQIQIWSRSKQEAYLAT 75 (118) T ss_pred CchHHHHHHHHhhhcCCccccccCCCCC-----CcCCEEEEEecCCcccccccCCCCccceeEEEEEEeeCCHHHHHHHH Confidence 88 9999999999999999999999974 1369999999999999999995 54 679999999999999999999 Q ss_pred HHHHHHHHhhccee--eccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 78 NMALDALQVLKPGS--IVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 78 ~av~~Al~~~~~~~--~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ++||+||+...... .+..++||+||||||+++||+|.- T Consensus 76 ~av~~al~~~~~~~~~~~~~d~ye~dt~l~r~~~Df~vw~ 115 (118) T protein:vir:10 76 VQVLRLVSEANDMQVLSQPIDDYVREIKLYGSRVDISMWY 115 (118) T ss_pred HHHHHHhhhcccceeccCCCccccccCCceEEEEEEEEee Confidence 99999999764333 345688999999999999999977 No 4 >protein:vir:81066 Length: 118 # NCBI annotation: p13 # Family: family:all:3244 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285683;genbank:gi:148727191;genbank:GeneID:5247111 Probab=100.00 E-value=3.5e-37 Score=220.47 Aligned_cols=110 Identities=18% Similarity=0.266 Sum_probs=99.8 Q ss_pred Cc-hHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCC-CC-cceEEEEEEeeCCHHHHHHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQ-AE-SAVSVQVDVYSSTITEARTIR 77 (115) Q Consensus 1 M~-E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~-~~-~~~~vQIDvyA~t~~~A~~l~ 77 (115) |+ |++|+++|+++++|||||+++|++. ...||||||++||.|.|+|+|. 
++ +++||||||||+|+++|++|+ T Consensus 1 Ms~e~~l~a~L~~~~~~Rvyp~~aP~~~-----~~~Pyiv~q~vsg~p~~~l~G~~~~~~~~rvQIdvyA~t~~~A~~l~ 75 (118) T protein:vir:81 1 MSYGRVLKDLLDPVFSGRVYADIPPDSP-----PLDAYAIYQRVGGVPVYWQEGGMPEKVNARVQIQIWSRSKQEAYLAT 75 (118) T ss_pred CchHHHHHHHHHhhcCCccccccCCCCC-----ccCceEEEEecCCcccccccCCCCCccceeEEEEEeeCCHHHHHHHH Confidence 88 9999999999999999999999974 1359999999999999999995 55 579999999999999999999 Q ss_pred HHHHHHHHhhcceeecc--CCCccccccceeeEEEEEEeC Q lcl|NC_019769. 78 NMALDALQVLKPGSIVK--TPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 78 ~av~~Al~~~~~~~~~~--~~~ye~dT~lyr~~~df~i~~ 115 (115) ++|++||+.+.+....+ .++||+||||||+++||+|.- T Consensus 76 ~av~~al~~~~~~~~~~~~~d~ye~dt~l~r~~~Df~iw~ 115 (118) T protein:vir:81 76 VQVLRLVSEAPDMQVLSQPIDDYVREIKLYGSRVDVSMWY 115 (118) T ss_pred HHHHHHhhhccceeeccCCccccccccCceeEEEEEEEEe Confidence 99999999887766543 468999999999999999888 No 5 >protein:vir:97070 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:3244 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453569;genbank:gi:84662604;genbank:GeneID:5142485 Probab=100.00 E-value=3.7e-37 Score=220.29 Aligned_cols=110 Identities=19% Similarity=0.248 Sum_probs=98.8 Q ss_pred Cc-hHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCC-CC-cceEEEEEEeeCCHHHHHHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQ-AE-SAVSVQVDVYSSTITEARTIR 77 (115) Q Consensus 1 M~-E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~-~~-~~~~vQIDvyA~t~~~A~~l~ 77 (115) |+ |++|+++|+++++|||||+++|+|+ ..+||||||++||.|.|+|+|. 
++ .++||||||||+|+++|++|+ T Consensus 1 M~~e~~l~a~L~~~~~~Rvyp~~aP~~~-----~~~Pyiv~q~vsg~p~~~ldG~~~~~~~~rvQIdvyA~t~~~A~~l~ 75 (118) T protein:vir:97 1 MSYGRMLKDLLDPVFSGRVYADIPPDSP-----PLDAYAIYQRVGGVPVYWKEGGMPDKVNARVQVQIWSRSKQEAYLAT 75 (118) T ss_pred CchHHHHHHHHhhhcCCccccccCCCCC-----CcCCEEEEEecCCcccccccCCCCCccceeEEEEEeeCCHHHHHHHH Confidence 77 9999999999999999999999985 1369999999999999999995 54 679999999999999999999 Q ss_pred HHHHHHHHhhccee--eccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 78 NMALDALQVLKPGS--IVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 78 ~av~~Al~~~~~~~--~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ++|++||+.+.... .+..++||+||||||+++||+|.- T Consensus 76 ~av~~al~~~~~~~~~~~~~~~ye~dt~lyr~~~Df~iw~ 115 (118) T protein:vir:97 76 VQVLRIVSEANDMQVLSQPIDDYVRELKLYGSRVDISMWY 115 (118) T ss_pred HHHHHHhhcccccccccCCcccccccCCceEEEEEEEEEe Confidence 99999999764433 455678999999999999999988 No 6 >protein:vir:4348 Length: 121 # NCBI annotation: Orf15 # Family: family:all:896 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061511;genbank:gi:9635607;genbank:GeneID:1262874 Probab=100.00 E-value=1.6e-34 Score=205.93 Aligned_cols=108 Identities=21% Similarity=0.368 Sum_probs=96.4 Q ss_pred CchHHHHHHHH------hhcCC---ccce-eeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCC Q lcl|NC_019769. 1 MTEDDLYPLLE------PLAGG---QVYP-YVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSST 69 (115) Q Consensus 1 M~E~~i~~lL~------~l~~~---Rvyp-~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t 69 (115) |.|. |+++|. 
+|+++ |||| +++|+++ ++||||||++||.|+|+|+|+++ +++|+||||||+| T Consensus 1 m~~~-i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~------~~Pyiv~q~vsg~p~~~l~g~~~~~~~~vQIDvyA~t 73 (121) T protein:vir:43 1 MYPP-IFKVCSSSPAVTAILGASPLRMYQFGLAPQLV------VKPYATWQTISGSPENYLWGRPDADGFTIQVDIFSAT 73 (121) T ss_pred CChH-HHHHHhhChhhhhhhcCCCceeeccCCCCCCC------cCCeEEEEEecCcccceecCCCCcceeEEEEEeeeCC Confidence 8664 455543 46654 9999 6999974 68999999999999999999755 7899999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 70 ITEARTIRNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 70 ~~~A~~l~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +++|++|+++||+||+.+++...+..++||+||||||++||++|++ T Consensus 74 ~~~A~~l~~av~~Al~~~~~~~~~~~~~ye~dT~lyR~s~Dv~w~~ 119 (121) T protein:vir:43 74 AAEARDAAKAIRDAIELSAYVVRWGGESVDPDTKTYRVSFDVDWIV 119 (121) T ss_pred HHHHHHHHHHHHHHhhhcCCcccCCCCCCcccccceeeeeEEEEee Confidence 9999999999999999999988889999999999999999999999 No 7 >protein:vir:1438 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:2712 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536367;genbank:gi:17975172;genbank:GeneID:929144 Probab=100.00 E-value=1.5e-34 Score=206.01 Aligned_cols=109 Identities=23% Similarity=0.331 Sum_probs=98.0 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHHHHHHHHH Q lcl|NC_019769. 
1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITEARTIRNM 79 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~A~~l~~a 79 (115) |--=.|+++|+++++|||||+++|+++ +.||+|||++||.|+|+|+|+++ +++|+||||||+|+++|++|+++ T Consensus 1 ~~~~~i~~aL~~l~~~RVyp~~aP~~~------~~Pyiv~q~vsg~p~~~L~G~~~~~~~~vQIDvyA~t~~~A~~l~~~ 74 (115) T protein:vir:14 1 MSVIVIRDALQGIGGAKGYLGVAPAKA------PAPYFVVTRVHGALDMALAGLTGGRSGSYQIDCYAPTFTDADRLADL 74 (115) T ss_pred CeeEeeehhhccccccccccccCCCCC------CCCEEEEEeecCcccccccCCCCCcceEEEEEEeeCCHHHHHHHHHH Confidence 887788999999999999999999975 57999999999999999999765 89999999999999999999999 Q ss_pred HHHHHHhhcceee-----ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 80 ALDALQVLKPGSI-----VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 80 v~~Al~~~~~~~~-----~~~~~ye~dT~lyr~~~df~i~~ 115 (115) |+++++...+... ++.++||+||||||+++||+|=- T Consensus 75 v~~~~~~~~~~~~~~~~~~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:14 75 AVDRAMSVQDRFSVGGVDELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred HHHHHhcCccceeeeeecCCCCCCcccccceeeEEEEEEeC Confidence 9999988765433 23467999999999999999777 No 8 >protein:vir:1892 Length: 121 # NCBI annotation: gp11 # Family: family:all:896 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037672;genbank:gi:9634130;genbank:GeneID:1262490 Probab=100.00 E-value=4.2e-34 Score=203.58 Aligned_cols=108 Identities=18% Similarity=0.290 Sum_probs=96.2 Q ss_pred CchHHHHHHHH------hhc---CCccce-eeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCC Q lcl|NC_019769. 1 MTEDDLYPLLE------PLA---GGQVYP-YVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSST 69 (115) Q Consensus 1 M~E~~i~~lL~------~l~---~~Rvyp-~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t 69 (115) |.. .||++|. 
+|+ ++|||| +++|+++ ++||+|||++||.|+|+|+|+++ +++|+||||||+| T Consensus 1 m~~-~i~~~l~~d~~v~allg~~~~Rvyp~~~aP~~~------~~Pyiv~q~vsg~p~~~l~G~~~~~~~~vQIDvyA~t 73 (121) T protein:vir:18 1 MIA-PIFSVCASSPEVTDLLGSNPVRIYPFGIQDDNV------VYPYVVWQNITGSPENYIAQRPDADFFTLQVDAYADT 73 (121) T ss_pred Cch-HHHHHHhcChhhhhhhcCCCceeeeccCCCCcC------cCCeEEEEEecCcccceecCCCCcceeEEEEEeecCC Confidence 765 4555552 344 359999 6999974 68999999999999999999755 7899999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 70 ITEARTIRNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 70 ~~~A~~l~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +++|++|+++||+||+.+++...++.++||+||||||++||++|++ T Consensus 74 ~~~A~~l~~avr~Ale~~~~~~~~~~~~ye~dT~lyR~s~Dv~~~~ 119 (121) T protein:vir:18 74 VDEVIAVATALRDAIEPHAHITRWGGQERDPETKRYRYSFDVDWIV 119 (121) T ss_pred HHHHHHHHHHHHHHhhhcCcccCCCCCCCcccccceeeeeEEEEee Confidence 9999999999999999999988889999999999999999999999 No 9 >protein:vir:100116 Length: 115 # NCBI annotation: gp10 # Family: family:all:2712 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945040;genbank:gi:38707900;genbank:GeneID:2744163 Probab=100.00 E-value=1.7e-34 Score=205.70 Aligned_cols=109 Identities=23% Similarity=0.326 Sum_probs=97.7 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHHHHHHHHH Q lcl|NC_019769. 
1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITEARTIRNM 79 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~A~~l~~a 79 (115) |--=-|+++|+++++|||||+++|+++ +.||+|||++||.|+|+|+|+++ +++|+||||||+|+++|++|+++ T Consensus 1 ~~~~~i~~aL~~l~~~RVyp~~aP~~~------~~Pyiv~q~vsg~p~~~L~G~~~~~~~~vQIDvyA~t~~~A~~l~~~ 74 (115) T protein:vir:10 1 MSVIVIRDALQGIGGAKGYLGVAPEKA------PAPYFVVTRVHGALDMALAGLTGGRSGSYQIDCYAPTFTDADRLADL 74 (115) T ss_pred CeeEEeehhhcccCCceeecccCCCCC------CCCEEEEEeecCccccccCCCCCCcceEEEEEEeeCCHHHHHHHHHH Confidence 877778999999999999999999975 57999999999999999999765 89999999999999999999999 Q ss_pred HHHHHHhhcceee-----ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 80 ALDALQVLKPGSI-----VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 80 v~~Al~~~~~~~~-----~~~~~ye~dT~lyr~~~df~i~~ 115 (115) |+++++...+... ++.++||+||||||+++||+|=- T Consensus 75 v~~~~~~~~~~~~~~~~~~~~d~ye~dt~lyR~s~D~~vWf 115 (115) T protein:vir:10 75 AVDRAMSVQDRFSVGGVDELPDDYSEDTGLFRISLELSVEF 115 (115) T ss_pred HHHHHhcCccceeEeeecCCCCCCcccccceeeEEEEEEeC Confidence 9999988665432 33467999999999999999777 No 10 >protein:vir:100242 Length: 114 # NCBI annotation: gp71 # Family: family:all:2712 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355407;genbank:gi:77864697;genbank:GeneID:3725964 Probab=99.95 E-value=7.9e-33 Score=196.58 Aligned_cols=109 Identities=23% Similarity=0.327 Sum_probs=98.3 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHHHHHHHHH Q lcl|NC_019769. 
1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITEARTIRNM 79 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~A~~l~~a 79 (115) |++-.|++.|+.|.++|+|++++|+++ ++||+|||++||.|+|+|+|+++ ++.|+||||||.|++||++|+++ T Consensus 1 ~~~~~i~~~l~~~~g~~~~~~~aP~~~------~~Py~vy~rvsg~p~~tL~G~~g~~~~r~QiD~yA~T~~eA~~La~~ 74 (114) T protein:vir:10 1 MSALTIRDAIGIVGGAKGYVSVASSAA------QSPYYVVSRVSGTRDMALGGATGGKSGMFQIDVYAKTYTEADSLADQ 74 (114) T ss_pred CceeeeehhhcccccccccCCCCCCCC------CCceEEEEeccCcccccccCCCCcceEEEEEEeeeCCHHHHHHHHHH Confidence 999999999999999999999999985 68999999999999999999755 88999999999999999999999 Q ss_pred HHHHHHhhccee---ecc-CCCccccccceeeEEEEEEeC Q lcl|NC_019769. 80 ALDALQVLKPGS---IVK-TPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 80 v~~Al~~~~~~~---~~~-~~~ye~dT~lyr~~~df~i~~ 115 (115) +++++.....+. +.+ .++||+||+|||+++||+|-- T Consensus 75 ~~~~l~~~~~f~~~~l~~~~d~ye~dT~l~Rvsld~si~f 114 (114) T protein:vir:10 75 IIDRVESTGMFSVGGVSDLPDDYSSDTGVFRVSLEISVQF 114 (114) T ss_pred HHhhcccccCeeeeccccCCCCCCcccCceEEEEEEEEeC Confidence 998887644333 333 467999999999999999999 No 11 >protein:vir:80371 Length: 115 # NCBI annotation: gp11 # Family: family:all:2712 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111090;genbank:gi:134288622;genbank:GeneID:4960618 Probab=99.94 E-value=1e-30 Score=185.03 Aligned_cols=109 Identities=23% Similarity=0.354 Sum_probs=98.2 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCC-CcceEEEEEEeeCCHHHHHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQA-ESAVSVQVDVYSSTITEARTIRNM 79 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~-~~~~~vQIDvyA~t~~~A~~l~~a 79 (115) |.-.-|..+|+++.++|.|+++||+++ +.||+|||++||.+++.|||.. 
.++.++||||||.|+++|++|+++ T Consensus 1 ~~~~vir~al~~i~~~~~~~~vAp~~~------~~pyivy~rvsga~e~~L~G~ag~~~~~~QID~yA~T~~ea~~La~~ 74 (115) T protein:vir:80 1 MSVIVVRDALQGIGGAKGYLGVAPEKA------PARYFVVTRVHGALDMALAGPTGGRSGSYQIDCYAPTFTDADRLADL 74 (115) T ss_pred CeeeeeechhhhccccccceeeccccC------cCCeEEEeecCCCccccccCCCCCceeEEEEeeecCCHHHHHHHHHH Confidence 999999999999999999999999986 6899999999999999999964 689999999999999999999999 Q ss_pred HHHHHHhhc----ceeeccC-CCccccccceeeEEEEEEeC Q lcl|NC_019769. 80 ALDALQVLK----PGSIVKT-PGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 80 v~~Al~~~~----~~~~~~~-~~ye~dT~lyr~~~df~i~~ 115 (115) +++++..+. .....+. ++||+||+|||+++||+|-. T Consensus 75 v~d~~~~~~~~~~vg~l~e~pd~Ye~DT~l~Rvs~dv~i~f 115 (115) T protein:vir:80 75 AVDRAMSVQDRFSVGGVDELPDDYSADTGLFRVSLELSVEF 115 (115) T ss_pred HHHhhhCCccccceecccCCCcccccccceEEEEEEEEEeC Confidence 999666433 2334455 68999999999999999999 No 12 >protein:vir:102888 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:517 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338141;genbank:gi:77020213;genbank:GeneID:3703797 Probab=99.30 E-value=2.3e-14 Score=95.33 Aligned_cols=107 Identities=20% Similarity=0.259 Sum_probs=90.2 Q ss_pred Cc--hHHHHHHHHh------h-cCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCH Q lcl|NC_019769. 1 MT--EDDLYPLLEP------L-AGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTI 70 (115) Q Consensus 1 M~--E~~i~~lL~~------l-~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~ 70 (115) |. -..||.+|.+ + .++|||-+..|++. ..|||+|+.+...|+++-++... 
...++|||||+++ T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~------~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~- 73 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAE------EFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKS- 73 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCC------CCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCC- Confidence 65 7888888853 4 44579989899864 36999999999999999999765 6789999999996 Q ss_pred HHHHHHHHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 71 TEARTIRNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 71 ~~A~~l~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..+|.++|+.+|..++..-....+.||+||++||-.+-|.=++ T Consensus 74 -~~~~i~~~I~~~m~~~gf~r~~~~d~ye~dt~lyhk~~Rf~~~~ 117 (119) T protein:vir:10 74 -STTAIHQKVNEIMKRIGFSRYAVADLYEEDTQIFHYAMRFAKGV 117 (119) T ss_pred -CHHHHHHHHHHHHHHcCCeeeccCCCcCChhhhheeeeeeeeee Confidence 56789999999999988776666778999999999998888777 No 13 >protein:vir:105008 Length: 119 # NCBI annotation: conserved structural protein # Family: family:all:517 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459973;genbank:gi:85701388;genbank:GeneID:3882149 Probab=99.30 E-value=2.3e-14 Score=95.33 Aligned_cols=107 Identities=20% Similarity=0.259 Sum_probs=90.2 Q ss_pred Cc--hHHHHHHHHh------h-cCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCH Q lcl|NC_019769. 1 MT--EDDLYPLLEP------L-AGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTI 70 (115) Q Consensus 1 M~--E~~i~~lL~~------l-~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~ 70 (115) |. -..||.+|.+ + .++|||-+..|++. ..|||+|+.+...|+++-++... 
...++|||||+++ T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~------~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~- 73 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAE------EFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKS- 73 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCC------CCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCC- Confidence 65 7888888853 4 44579989899864 36999999999999999999765 6789999999996 Q ss_pred HHHHHHHHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 71 TEARTIRNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 71 ~~A~~l~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..+|.++|+.+|..++..-....+.||+||++||-.+-|.=++ T Consensus 74 -~~~~i~~~I~~~m~~~gf~r~~~~d~ye~dt~lyhk~~Rf~~~~ 117 (119) T protein:vir:10 74 -STTAIHQKVNEIMKRIGFSRYAVADLYEEDTQIFHYAMRFAKGV 117 (119) T ss_pred -CHHHHHHHHHHHHHHcCCeeeccCCCcCChhhhheeeeeeeeee Confidence 56789999999999988776666778999999999998888777 No 14 >protein:vir:107581 Length: 119 # NCBI annotation: conserved phage protein # Family: family:all:517 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338192;genbank:gi:77020160;genbank:GeneID:3703712 Probab=99.30 E-value=2.3e-14 Score=95.33 Aligned_cols=107 Identities=20% Similarity=0.259 Sum_probs=90.2 Q ss_pred Cc--hHHHHHHHHh------h-cCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCH Q lcl|NC_019769. 1 MT--EDDLYPLLEP------L-AGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTI 70 (115) Q Consensus 1 M~--E~~i~~lL~~------l-~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~ 70 (115) |. -..||.+|.+ + .++|||-+..|++. ..|||+|+.+...|+++-++... 
...++|||||+++ T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~------~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~- 73 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAE------EFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKS- 73 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCC------CCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCC- Confidence 65 7888888853 4 44579989899864 36999999999999999999765 6789999999996 Q ss_pred HHHHHHHHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 71 TEARTIRNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 71 ~~A~~l~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..+|.++|+.+|..++..-....+.||+||++||-.+-|.=++ T Consensus 74 -~~~~i~~~I~~~m~~~gf~r~~~~d~ye~dt~lyhk~~Rf~~~~ 117 (119) T protein:vir:10 74 -STTAIHQKVNEIMKRIGFSRYAVADLYEEDTQIFHYAMRFAKGV 117 (119) T ss_pred -CHHHHHHHHHHHHHHcCCeeeccCCCcCChhhhheeeeeeeeee Confidence 56789999999999988776666778999999999998888777 No 15 >protein:vir:102086 Length: 119 # NCBI annotation: structural protein # Family: family:all:517 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512319;genbank:gi:89152488;genbank:GeneID:3953079 Probab=99.30 E-value=2.3e-14 Score=95.33 Aligned_cols=107 Identities=20% Similarity=0.259 Sum_probs=90.2 Q ss_pred Cc--hHHHHHHHHh------h-cCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCH Q lcl|NC_019769. 1 MT--EDDLYPLLEP------L-AGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTI 70 (115) Q Consensus 1 M~--E~~i~~lL~~------l-~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~ 70 (115) |. -..||.+|.+ + .++|||-+..|++. ..|||+|+.+...|+++-++... 
...++|||||+++ T Consensus 1 M~~i~~~I~~~L~~~~~l~~l~~~~~I~~~~~~~~~------~~p~I~~~~~~~~p~~~add~e~~~~~~~QIDVwsk~- 73 (119) T protein:vir:10 1 MINLRPDILQALENDQELVSLLGGKRIYYRKAKKAE------EFPRITYFELDNRPDGFADNQEIESEILFQVDVWAKS- 73 (119) T ss_pred CCchHHHHHHHhhcCchhhhhcCCceEEecccCCCC------CCcEEEEEecCCCCCcccCCceeeeEEEEEEEEeeCC- Confidence 65 7888888853 4 44579989899864 36999999999999999999765 6789999999996 Q ss_pred HHHHHHHHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 71 TEARTIRNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 71 ~~A~~l~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..+|.++|+.+|..++..-....+.||+||++||-.+-|.=++ T Consensus 74 -~~~~i~~~I~~~m~~~gf~r~~~~d~ye~dt~lyhk~~Rf~~~~ 117 (119) T protein:vir:10 74 -STTAIHQKVNEIMKRIGFSRYAVADLYEEDTQIFHYAMRFAKGV 117 (119) T ss_pred -CHHHHHHHHHHHHHHcCCeeeccCCCcCChhhhheeeeeeeeee Confidence 56789999999999988776666778999999999998888777 No 16 >protein:vir:1274 Length: 162 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690766;genbank:gi:22855006;genbank:GeneID:955217 Probab=99.29 E-value=1.7e-14 Score=96.11 Aligned_cols=109 Identities=22% Similarity=0.231 Sum_probs=91.0 Q ss_pred Cc---hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCH Q lcl|NC_019769. 1 MT---EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTI 70 (115) Q Consensus 1 M~---E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~ 70 (115) |+ =.+|+.+| ..|+++|+|-+.+|.+. ..|||+|+.+...|+.+-++... ...++|||||+++. 
T Consensus 37 ~~mn~~k~v~q~L~n~~~L~~l~~~~i~~l~~~~~~------~~p~Itf~e~~~~p~~yADD~e~ss~~~iQIDIwsk~s 110 (162) T protein:vir:12 37 MTYSPKIELVSTLNSSAFLKGLTSGGIHNLVANDVS------AFPRVVFSEIQDADADFADNEVYSFEVRYQISIFTQAS 110 (162) T ss_pred hhhhHHHHHHHHhcChhHHHhhCCCceEEEeecCCC------CceEEEEEeecCCCCcccccceeeEEEEEEEEEeecCC Confidence 55 46666666 46899999999887664 46999999999999999999765 66899999998754 Q ss_pred --HHHHHHHHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 71 --TEARTIRNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 71 --~~A~~l~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ++..+|..+|+..|..++..-....+.||+||++||-.+-|..+- T Consensus 111 t~~d~~~l~~~I~~lMk~~GF~R~s~~d~YE~DTklyHK~~RF~~~y 157 (162) T protein:vir:12 111 TRGKETAIASEIDRLMREIGYSRYDSQDLYETDTKVFHKARRYKKTY 157 (162) T ss_pred cchhHHHHHHHHHHHHHHcCCEeecCCCCCCChhhhhhhhheeccce Confidence 677899999999999988877777788999999999999995443 No 17 >protein:vir:9364 Length: 131 # NCBI annotation: SLT orf 131b-like protein # Family: family:all:508 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803342;genbank:gi:29028653;genbank:GeneID:1258094 Probab=99.22 E-value=4.1e-14 Score=93.98 Aligned_cols=111 Identities=16% Similarity=0.185 Sum_probs=92.8 Q ss_pred Cc-hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHH Q lcl|NC_019769. 1 MT-EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITE 72 (115) Q Consensus 1 M~-E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~ 72 (115) |- =.+||.+| ..++++|+|.+..|+..+. ..|||++..++..|+++-++... ....+|||||+.+... 
T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~----~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~~~~ 76 (131) T protein:vir:93 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAET----SKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSNNQK 76 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCcccc----ccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecCccc Confidence 33 35677776 4688999999999997654 46999999999999999888765 6789999999999999 Q ss_pred HHHHHHHHHHHHHhhcceeec-cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 73 ARTIRNMALDALQVLKPGSIV-KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 73 A~~l~~av~~Al~~~~~~~~~-~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..|..+|...|..++..-.. +.+.||+||++||.+.-|+=+- T Consensus 77 ~~~i~~~I~~~M~~~gf~q~s~~~d~Yd~dtk~y~~arRYrg~~ 120 (131) T protein:vir:93 77 TIDITKRIRYLLYQQNLIQASSQLDAYFEETKRYVMSRRYQGIP 120 (131) T ss_pred hHHHHHHHHHHHHHcCceeccCCCCccchhhHHhhhhhhccccc Confidence 999999999999987776544 4577999999999998886555 No 18 >protein:vir:78648 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429947;genbank:gi:156604001;genbank:GeneID:5525394 Probab=99.22 E-value=4.1e-14 Score=93.98 Aligned_cols=111 Identities=16% Similarity=0.185 Sum_probs=92.8 Q ss_pred Cc-hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHH Q lcl|NC_019769. 1 MT-EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITE 72 (115) Q Consensus 1 M~-E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~ 72 (115) |- =.+||.+| ..++++|+|.+..|+..+. ..|||++..++..|+++-++... ....+|||||+.+... 
T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~----~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~~~~ 76 (131) T protein:vir:78 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAET----SKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSNNQK 76 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCcccc----ccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecCccc Confidence 33 35677776 4688999999999997654 46999999999999999888765 6789999999999999 Q ss_pred HHHHHHHHHHHHHhhcceeec-cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 73 ARTIRNMALDALQVLKPGSIV-KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 73 A~~l~~av~~Al~~~~~~~~~-~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..|..+|...|..++..-.. +.+.||+||++||.+.-|+=+- T Consensus 77 ~~~i~~~I~~~M~~~gf~q~s~~~d~Yd~dtk~y~~arRYrg~~ 120 (131) T protein:vir:78 77 TIDITKRIRYLLYQQNLIQASSQLDAYFEETKRYVMSRRYQGIP 120 (131) T ss_pred hHHHHHHHHHHHHHcCceeccCCCCccchhhHHhhhhhhccccc Confidence 999999999999987776544 4577999999999998886555 No 19 >protein:vir:96972 Length: 131 # NCBI annotation: ORF035 # Family: family:all:508 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239865;genbank:gi:66395543;genbank:GeneID:5133005 Probab=99.22 E-value=4.1e-14 Score=93.98 Aligned_cols=111 Identities=16% Similarity=0.185 Sum_probs=92.8 Q ss_pred Cc-hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHH Q lcl|NC_019769. 1 MT-EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITE 72 (115) Q Consensus 1 M~-E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~ 72 (115) |- =.+||.+| ..++++|+|.+..|+..+. ..|||++..++..|+++-++... ....+|||||+.+... 
T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~----~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~~~~ 76 (131) T protein:vir:96 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAET----SKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSNNQK 76 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCcccc----ccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecCccc Confidence 33 35677776 4688999999999997654 46999999999999999888765 6789999999999999 Q ss_pred HHHHHHHHHHHHHhhcceeec-cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 73 ARTIRNMALDALQVLKPGSIV-KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 73 A~~l~~av~~Al~~~~~~~~~-~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..|..+|...|..++..-.. +.+.||+||++||.+.-|+=+- T Consensus 77 ~~~i~~~I~~~M~~~gf~q~s~~~d~Yd~dtk~y~~arRYrg~~ 120 (131) T protein:vir:96 77 TIDITKRIRYLLYQQNLIQASSQLDAYFEETKRYVMSRRYQGIP 120 (131) T ss_pred hHHHHHHHHHHHHHcCceeccCCCCccchhhHHhhhhhhccccc Confidence 999999999999987776544 4577999999999998886555 No 20 >protein:vir:2689 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075508;genbank:gi:12719437;genbank:GeneID:920159 Probab=99.22 E-value=4.1e-14 Score=93.98 Aligned_cols=111 Identities=16% Similarity=0.185 Sum_probs=92.8 Q ss_pred Cc-hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHH Q lcl|NC_019769. 1 MT-EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITE 72 (115) Q Consensus 1 M~-E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~ 72 (115) |- =.+||.+| ..++++|+|.+..|+..+. ..|||++..++..|+++-++... ....+|||||+.+... 
T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~----~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDVws~~~~~ 76 (131) T protein:vir:26 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAET----SKPFVVITPVYDLPSDFMSDKYLSEEYLIQIDVESSNNQK 76 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEeecCCcccc----ccceEEEeeCCCCCccccCCceeeeEEEEEEEEEecCccc Confidence 33 35677776 4688999999999997654 46999999999999999888765 6789999999999999 Q ss_pred HHHHHHHHHHHHHhhcceeec-cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 73 ARTIRNMALDALQVLKPGSIV-KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 73 A~~l~~av~~Al~~~~~~~~~-~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..|..+|...|..++..-.. +.+.||+||++||.+.-|+=+- T Consensus 77 ~~~i~~~I~~~M~~~gf~q~s~~~d~Yd~dtk~y~~arRYrg~~ 120 (131) T protein:vir:26 77 TIDITKRIRYLLYQQNLIQASSQLDAYFEETKRYVMSRRYQGIP 120 (131) T ss_pred hHHHHHHHHHHHHHcCceeccCCCCccchhhHHhhhhhhccccc Confidence 999999999999987776544 4577999999999998886555 No 21 >protein:vir:93902 Length: 131 # NCBI annotation: ORF029 # Family: family:all:508 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239943;genbank:gi:66395617;genbank:GeneID:5130968 Probab=99.21 E-value=4.7e-14 Score=93.64 Aligned_cols=111 Identities=17% Similarity=0.194 Sum_probs=93.1 Q ss_pred Cc-hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHH Q lcl|NC_019769. 1 MT-EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITE 72 (115) Q Consensus 1 M~-E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~ 72 (115) |- =.+||.+| ..++++|+|.+..|+..+. ..|||++..+.+.|+++-++... ...++|||||+.+... 
T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~----~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDV~s~~~~~ 76 (131) T protein:vir:93 1 MNILNTIKEILLSDAELQTYINSRIYYYKVTENAET----SKPFVVITPIYDLPSDFMSDKYLSEEYLIQIDVESSNNQK 76 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEEecCCcccc----ccceEEEeeCCCCcccccCCceeeeEEEEEEEEEecCccc Confidence 33 35677776 4688999999999997654 46999999999999999888765 6789999999999999 Q ss_pred HHHHHHHHHHHHHhhcceeecc-CCCccccccceeeEEEEEEeC Q lcl|NC_019769. 73 ARTIRNMALDALQVLKPGSIVK-TPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 73 A~~l~~av~~Al~~~~~~~~~~-~~~ye~dT~lyr~~~df~i~~ 115 (115) +.+|..+|+..|...+..-... .+.||+||++||.+.-|.=+- T Consensus 77 ~~~i~~~I~~~M~~~gf~q~s~~~d~Yd~dtk~y~~arRYrg~~ 120 (131) T protein:vir:93 77 TIDITKRIRYLLYQQNLIQASSQLDAYFEETKRYVMSRRYQGIP 120 (131) T ss_pred hHHHHHHHHHHHHHcCceeccCCCCccchhHHHhhhhhhhccch Confidence 9999999999999887765544 477999999999988886655 No 22 >protein:vir:94418 Length: 131 # NCBI annotation: ORF029 # Family: family:all:508 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240011;genbank:gi:66395684;genbank:GeneID:5133078 Probab=99.20 E-value=6e-14 Score=93.07 Aligned_cols=111 Identities=17% Similarity=0.183 Sum_probs=92.7 Q ss_pred Cc-hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHH Q lcl|NC_019769. 1 MT-EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITE 72 (115) Q Consensus 1 M~-E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~ 72 (115) |- =.+||.+| ..++++|+|.+..|+..+. ..|||++..+.+.|+++-++... ...++|||||+.+... 
T Consensus 1 ~dil~~iy~~L~~d~~L~~lv~~rI~~y~~Pe~~d~----~~p~I~i~~i~~~p~~yadn~~l~~~~~~QIDV~s~~~~~ 76 (131) T protein:vir:94 1 MNILNTIKGILLSDAELKTHINSRIYYYKVTENAET----SKPFVVITPIYDLPSDFMSDKYLSEEYLIQIDVESSNNQK 76 (131) T ss_pred CchHHHHHHHhhcchHHHhhcCCceEEEecCCcccc----ccceEEEeeCCCCcccccCCceeeeEEEEEEEEEecCccc Confidence 33 35677776 4688999999999997654 46999999999999999888765 6789999999999999 Q ss_pred HHHHHHHHHHHHHhhcceeecc-CCCccccccceeeEEEEEEeC Q lcl|NC_019769. 73 ARTIRNMALDALQVLKPGSIVK-TPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 73 A~~l~~av~~Al~~~~~~~~~~-~~~ye~dT~lyr~~~df~i~~ 115 (115) +.+|..+|...|..++..-... .+.||+||++||.+.-|.=+- T Consensus 77 ~~~i~~~I~~~M~~~gf~q~s~~~d~Yd~dtk~y~~arRYrg~~ 120 (131) T protein:vir:94 77 TIDITKRIRYLLYQQNLIQASSQLDAYFEETKRYVMSRRYQGIP 120 (131) T ss_pred hHHHHHHHHHHHHHcCceeccCCCCccchhHHHhhhhhhhccch Confidence 9999999999999877765443 477999999999988886655 No 23 >protein:vir:1387 Length: 116 # NCBI annotation: Gp10 protein # Family: family:all:517 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612839;genbank:gi:20065973;genbank:GeneID:935788 Probab=99.14 E-value=2e-13 Score=90.20 Aligned_cols=108 Identities=17% Similarity=0.067 Sum_probs=92.3 Q ss_pred Cc-hHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHHHHHHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITEARTIRN 78 (115) Q Consensus 1 M~-E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~A~~l~~ 78 (115) |. -+.|+.+|+++ +..|+....... ...|||||+.....|..+-++... 
...++|||||+++..++..|.+ T Consensus 4 m~I~~~i~~~Lk~i-~ipV~~~~y~~~------~~~~~Itf~~y~e~~~~yaDd~e~~t~~~iQVDI~sk~~~~~~~l~~ 76 (116) T protein:vir:13 4 FDIIALVYECLECL-NVPVIEGWYDEE------LNKTHITVHEYLEQDESFEDDEAREEEHNIQIDVWSKDSLEAFKLKK 76 (116) T ss_pred cchhHHHHHHHhhc-CCeeeecccCCC------CccceEEEEeeecCCCcccCCeeeeEEEEEEEEEeecCCccHHHHHH Confidence 55 57888888886 667887654432 136999999999999999999876 6789999999999999999999 Q ss_pred HHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 79 MALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 79 av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +|+..|...+..-....+.||+||++||-.+-|.=.. T Consensus 77 ~V~~lMk~~GF~r~~~~d~ye~dt~iyhk~~RF~y~~ 113 (116) T protein:vir:13 77 AIKKLLKKNNFYFDSSEDFYETKTRIYHKGLRFSYIS 113 (116) T ss_pred HHHHHHHHcCCEeeecCCCccchhhhhhhhhhheeee Confidence 9999999999888888888999999999999887777 No 24 >protein:vir:78349 Length: 127 # NCBI annotation: gp10 # Family: family:all:508 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468649;genbank:gi:157325227;genbank:GeneID:5601695 Probab=99.12 E-value=5.2e-13 Score=87.92 Aligned_cols=111 Identities=14% Similarity=0.196 Sum_probs=94.1 Q ss_pred Cc--hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCC-cceEEEEEEeeCCH Q lcl|NC_019769. 1 MT--EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAE-SAVSVQVDVYSSTI 70 (115) Q Consensus 1 M~--E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~-~~~~vQIDvyA~t~ 70 (115) |. -.+||.+| +.+.++|+|-+..|++.+. ..||||.+.+.. .|.++-++... ..+.+|||||+.+. 
T Consensus 1 M~d~l~~iy~~L~~d~~l~~~~~~~I~~~~~Pe~~d~----~~p~I~I~~i~~p~p~~yadn~~l~~~~~~QIDV~s~~r 76 (127) T protein:vir:78 1 MIDILNVIYTTLSKNDIIHTTCEERIKYYDFPGTGDS----TKTFLLIIPLDVPIPTNFSSNESRMEDFLVQIDVQSNDR 76 (127) T ss_pred CcchHHHHHHHhhcchhhhhhcCCceEEEecCCCccc----cCcEEEEeeCCCCCCCcccCCccceeEEEEEEEEEEcCC Confidence 55 67788777 4577889999999998654 459999999965 69999999776 66899999999999 Q ss_pred HHHHHHHHHHHHHHHhhcceeecc-CCCccccccceeeEEEEEEeC Q lcl|NC_019769. 71 TEARTIRNMALDALQVLKPGSIVK-TPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 71 ~~A~~l~~av~~Al~~~~~~~~~~-~~~ye~dT~lyr~~~df~i~~ 115 (115) ....+|...|...|..++..-... .+.||+|||+||-.--|.=+- T Consensus 77 ~~~~~i~~~I~~~M~~~gf~q~s~~~d~Y~~dtk~y~~arRYrg~~ 122 (127) T protein:vir:78 77 LIVKKIQDEVRKEMKQIGFGQLAGGLDEYFPETGRFVDARKYSGLP 122 (127) T ss_pred CchHHHHHHHHHHHHHcCceeccCCCCccchhhhhhhheeeeeecc Confidence 999999999999999887765553 477999999999999887766 No 25 >protein:vir:98343 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918935;genbank:gi:119443697;genbank:GeneID:4594505 Probab=98.98 E-value=3.3e-12 Score=83.54 Aligned_cols=114 Identities=13% Similarity=0.140 Sum_probs=85.3 Q ss_pred Cch--HHH-HHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHHHHHH Q lcl|NC_019769. 1 MTE--DDL-YPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITEARTI 76 (115) Q Consensus 1 M~E--~~i-~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~A~~l 76 (115) |+. .-| -+++..++.+.+-|.-.|-..+.--....|||+|......|+.+-++... ....+|||||.+. 
++-.+| T Consensus 1 ~~~~~k~l~~~~I~~li~~~L~~~nvpv~~~~y~~~~~tyItf~ey~~~~~~yaDD~e~~t~~~iQVDIw~sk-~d~~~l 79 (126) T protein:vir:98 1 MINVTKLIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQ-DEPNEQ 79 (126) T ss_pred CccchhhhhhhHHHHhhhhhhhccCceeeeeeecCCCceEEEEEeecCCCCcccccceeeeEEEEEEEEeecC-CCHHHH Confidence 441 111 22234566666666655544332222246999999999999999999875 6689999996544 446679 Q ss_pred HHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 77 RNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 77 ~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..+|+..|...++.-....+.||+|||+||-.+-|..++ T Consensus 80 ~~~V~~lMk~~GF~r~~~~dlYE~DtklyHk~~RF~~~~ 118 (126) T protein:vir:98 80 AEKIVELLKVINFQCYYREPLYESDVMSFRHIIRAKGSI 118 (126) T ss_pred HHHHHHHHHHcCCeeeecCCCccchhhhheeeeeeeeee Confidence 999999999988887778788999999999999999999 No 26 >protein:vir:9415 Length: 126 # NCBI annotation: phi PVL orf 12-like protein # Family: family:all:517 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803393;genbank:gi:29028705;genbank:GeneID:1258143 Probab=98.98 E-value=3.3e-12 Score=83.54 Aligned_cols=114 Identities=13% Similarity=0.140 Sum_probs=85.3 Q ss_pred Cch--HHH-HHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHHHHHH Q lcl|NC_019769. 1 MTE--DDL-YPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITEARTI 76 (115) Q Consensus 1 M~E--~~i-~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~A~~l 76 (115) |+. .-| -+++..++.+.+-|.-.|-..+.--....|||+|......|+.+-++... ....+|||||.+. 
++-.+| T Consensus 1 ~~~~~k~l~~~~I~~li~~~L~~~nvpv~~~~y~~~~~tyItf~ey~~~~~~yaDD~e~~t~~~iQVDIw~sk-~d~~~l 79 (126) T protein:vir:94 1 MINVTKLIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQ-DEPNEQ 79 (126) T ss_pred CccchhhhhhhHHHHhhhhhhhccCceeeeeeecCCCceEEEEEeecCCCCcccccceeeeEEEEEEEEeecC-CCHHHH Confidence 441 111 22234566666666655544332222246999999999999999999875 6689999996544 446679 Q ss_pred HHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 77 RNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 77 ~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..+|+..|...++.-....+.||+|||+||-.+-|..++ T Consensus 80 ~~~V~~lMk~~GF~r~~~~dlYE~DtklyHk~~RF~~~~ 118 (126) T protein:vir:94 80 AEKIVELLKVINFQCYYREPLYESDVMSFRHIIRAKGSI 118 (126) T ss_pred HHHHHHHHHHcCCeeeecCCCccchhhhheeeeeeeeee Confidence 999999999988887778788999999999999999999 No 27 >protein:vir:80001 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430007;genbank:gi:156604062;genbank:GeneID:5525461 Probab=98.97 E-value=4.1e-12 Score=83.04 Aligned_cols=114 Identities=13% Similarity=0.146 Sum_probs=86.4 Q ss_pred Cch--HHH-HHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHHHHHH Q lcl|NC_019769. 1 MTE--DDL-YPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITEARTI 76 (115) Q Consensus 1 M~E--~~i-~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~A~~l 76 (115) |+. +-| -+.+..++.+++=|.-.|-..+.......|||+|......|+.+-++... ....+|||||.+. 
++-.+| T Consensus 1 ~~~~~~~i~n~~I~~li~~~Lk~~nvPV~~~~y~~~~ktyItf~ey~~~~~~yADd~e~~t~~~iQIDIW~sk-~~~~~l 79 (126) T protein:vir:80 1 MINVTELIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQ-DEPNEQ 79 (126) T ss_pred CcchHHhhhhhHHHhhhhhceeeccceeccccccCCCCcEEEEEeecCCCCccccCeeeeeEEEEEEEEeeCC-CCHHHH Confidence 662 111 23335566666655555544433323356999999999999999999875 6789999999555 457889 Q ss_pred HHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 77 RNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 77 ~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..+|...|...++.-....+.||+|||+||-.+-|.-++ T Consensus 80 ~~~V~~~Mk~~GF~R~~~~d~YE~DtklyHk~~Rf~~~~ 118 (126) T protein:vir:80 80 AEKIVELLKVINFQCYYREPLYESDVMSFRHIIRAKGSI 118 (126) T ss_pred HHHHHHHHHHcCCeeeecCCCccchhhhhheeeeeeeec Confidence 999999999988877777788999999999999999999 No 28 >protein:vir:81093 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:517 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429879;genbank:gi:156603932;genbank:GeneID:5525313 Probab=98.97 E-value=4.1e-12 Score=83.04 Aligned_cols=114 Identities=13% Similarity=0.146 Sum_probs=86.4 Q ss_pred Cch--HHH-HHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHHHHHHH Q lcl|NC_019769. 1 MTE--DDL-YPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTITEARTI 76 (115) Q Consensus 1 M~E--~~i-~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~~A~~l 76 (115) |+. +-| -+.+..++.+++=|.-.|-..+.......|||+|......|+.+-++... ....+|||||.+. 
++-.+| T Consensus 1 ~~~~~~~i~n~~I~~li~~~Lk~~nvPV~~~~y~~~~ktyItf~ey~~~~~~yADd~e~~t~~~iQIDIW~sk-~~~~~l 79 (126) T protein:vir:81 1 MINVTELIRNAIIANNITDEVNVFNYTIDDHFHEKTDKPIIRIYPLPFNPDTYADDNEISREYHYQIDVWWSQ-DEPNEQ 79 (126) T ss_pred CcchHHhhhhhHHHhhhhhceeeccceeccccccCCCCcEEEEEeecCCCCccccCeeeeeEEEEEEEEeeCC-CCHHHH Confidence 662 111 23335566666655555544433323356999999999999999999875 6789999999555 457889 Q ss_pred HHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 77 RNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 77 ~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..+|...|...++.-....+.||+|||+||-.+-|.-++ T Consensus 80 ~~~V~~~Mk~~GF~R~~~~d~YE~DtklyHk~~Rf~~~~ 118 (126) T protein:vir:81 80 AEKIVELLKVINFQCYYREPLYESDVMSFRHIIRAKGSI 118 (126) T ss_pred HHHHHHHHHHcCCeeeecCCCccchhhhhheeeeeeeec Confidence 999999999988877777788999999999999999999 No 29 >protein:vir:96894 Length: 140 # NCBI annotation: ORF029 # Family: family:all:296 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240162;genbank:gi:66395835;genbank:GeneID:5133235 Probab=98.89 E-value=2.4e-11 Score=78.81 Aligned_cols=105 Identities=20% Similarity=0.199 Sum_probs=79.3 Q ss_pred Cc---h----HHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT---E----DDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~---E----~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. + .-|++.| .++++|||| +..|.+. ++|||++-.....+.+.-++. 
....++||+||+ T Consensus 1 Msms~~~aLq~Ai~a~L~ada~l~alvg~~Vy-D~~P~~~------~~Pyv~lG~~~~~~~~~~~~~-g~~~~~~i~Vws 72 (140) T protein:vir:96 1 MWVSVEPELTVQIYKRLKASPIINKFVGDRVF-DVVQEDA------VYPYIVVGESNVTNNESSTMM-RETVGIVIHVYS 72 (140) T ss_pred CCccHHHHHHHHHHHHhhcChhHHHhcCCccc-cCCccCC------CCCEEEecCceeeecCCCccc-ceEEEEEEEEEE Confidence 55 3 4455554 468899999 6788753 689999988877777765554 357899999999 Q ss_pred C--CHHHHHHHHHHHHHHHHhhcceeecc----------C-CCccccccceeeEEEEEEeC Q lcl|NC_019769. 68 S--TITEARTIRNMALDALQVLKPGSIVK----------T-PGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~--t~~~A~~l~~av~~Al~~~~~~~~~~----------~-~~ye~dT~lyr~~~df~i~~ 115 (115) . ...+|++++.+|++||.. +....+ . --+|+|...+|..+.|.++| T Consensus 73 ~~~g~~ea~~ia~av~~AL~~--~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~r~~v 131 (140) T protein:vir:96 73 QFATQYEAKQIISAIGYVLNR--PIDIENYEFQFSRIDSQSVFPDIDRFTKHGTIRLLFKY 131 (140) T ss_pred cCCCHHHHHHHHHHHHHHhCC--CccCCCCeEEEEEEeeeEEEecCCCceEEEEEEEEEEE Confidence 7 789999999999999963 222111 1 12789999999999999999 No 30 >protein:vir:95111 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240829;genbank:gi:66394699;genbank:GeneID:5133905 Probab=98.86 E-value=3.1e-11 Score=78.20 Aligned_cols=107 Identities=15% Similarity=0.139 Sum_probs=81.1 Q ss_pred Cc-------hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT-------EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~-------E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. +.-|++.| ..+++|||| +..|.+. 
++|||++-.....+.+.-++ .....++||+||+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~-D~~P~~a------~~PYV~lG~~~~~~~~~~~~-~g~~~~~ti~Vws 72 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNSIIQKQLDGRVF-DCVQKDA------VYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS 72 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCcee-cCCcCCC------CCCEEEecCceeeecCCCcc-cceEEEEEEEEEE Confidence 87 35666666 578999999 6788753 68999998887777776655 3457899999997 Q ss_pred C--CHHHHHHHHHHHHHHHHhh---cceeec-----cCC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 S--TITEARTIRNMALDALQVL---KPGSIV-----KTP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~--t~~~A~~l~~av~~Al~~~---~~~~~~-----~~~-~ye~dT~lyr~~~df~i~~ 115 (115) . ...+|++++.+|++||..- ...... ..+ -+|+|...+|..+.|..+| T Consensus 73 ~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~v 131 (145) T protein:vir:95 73 QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDRYTKHGIIRLVFKY 131 (145) T ss_pred cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEE Confidence 6 8899999999999999631 111111 111 2789999999999999999 No 31 >protein:vir:95961 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240391;genbank:gi:66396072;genbank:GeneID:5133472 Probab=98.85 E-value=3.6e-11 Score=77.85 Aligned_cols=107 Identities=15% Similarity=0.142 Sum_probs=80.5 Q ss_pred Cc-------hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT-------EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~-------E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. ..-|++.| ..+++|||| +..|.+. 
++|||++-.....+.+.-++ .....++||+||+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~-D~~P~~~------~~PYv~lG~~~~~d~~~~~~-~g~~~~~ti~Vws 72 (145) T protein:vir:95 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDGRVF-DCVQKDA------VYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS 72 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhccccc-cCCcCCC------CCCEEEecCceeeecCCCcc-cceEEEEEEEEEE Confidence 87 35566665 678999999 6788753 68999998877777766655 3457899999997 Q ss_pred C--CHHHHHHHHHHHHHHHHhh---cceeec-----cCC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 S--TITEARTIRNMALDALQVL---KPGSIV-----KTP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~--t~~~A~~l~~av~~Al~~~---~~~~~~-----~~~-~ye~dT~lyr~~~df~i~~ 115 (115) . ...+|++++.+|+.||..- ...... ..+ -+|+|...+|..+.|..+| T Consensus 73 ~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~v 131 (145) T protein:vir:95 73 QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGVIRLVFKY 131 (145) T ss_pred cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEE Confidence 6 8899999999999999631 111111 111 2789999999999999999 No 32 >protein:vir:94794 Length: 145 # NCBI annotation: ORF028 # Family: family:all:296 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240542;genbank:gi:66396219;genbank:GeneID:5133574 Probab=98.85 E-value=3.6e-11 Score=77.84 Aligned_cols=107 Identities=15% Similarity=0.140 Sum_probs=80.5 Q ss_pred Cc-------hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT-------EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~-------E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. ..-|++.| ..+++|||| +..|.+. 
++|||++-.....+.+.-++ .....++||+||+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrV~-D~~P~~~------~~PYv~lG~~~~~d~~~~~~-~g~~~~~ti~Vws 72 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNPIIQKQLDGRVF-DCVQKDA------VYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS 72 (145) T ss_pred CchhHHHHHHHHHHHHhhcCHhHHHhhccccc-cCCcCCC------CCCEEEecCceeeecCCCcc-cceEEEEEEEEEE Confidence 87 35566665 678999999 6788753 68999998877777766655 3457899999997 Q ss_pred C--CHHHHHHHHHHHHHHHHhh---cceeec-----cCC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 S--TITEARTIRNMALDALQVL---KPGSIV-----KTP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~--t~~~A~~l~~av~~Al~~~---~~~~~~-----~~~-~ye~dT~lyr~~~df~i~~ 115 (115) . ...+|++++.+|+.||..- ...... ..+ -+|+|...+|..+.|...| T Consensus 73 ~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~v 131 (145) T protein:vir:94 73 QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKY 131 (145) T ss_pred cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEE Confidence 6 8899999999999999631 111111 111 2789999999999999999 No 33 >protein:vir:93736 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240465;genbank:gi:66396143;genbank:GeneID:5133505 Probab=98.83 E-value=4.5e-11 Score=77.29 Aligned_cols=107 Identities=15% Similarity=0.140 Sum_probs=80.9 Q ss_pred Cc-------hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT-------EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~-------E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. +.-|++.| ..+++|||| +..|.+. 
++|||++-.....+.+.-++ .....++||+||+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~-D~~P~~a------~~PYV~lG~~~~~d~~~~~~-~g~~~~~ti~Vws 72 (145) T protein:vir:93 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVF-DCVQKDA------VYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS 72 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCcee-cCCcCCC------CCCEEEeCCceeeecCCCcc-cceEEEEEEEEEE Confidence 87 35666666 578899999 6788753 68999998877777766655 3457899999998 Q ss_pred C--CHHHHHHHHHHHHHHHHhh---cceeec-----cCC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 S--TITEARTIRNMALDALQVL---KPGSIV-----KTP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~--t~~~A~~l~~av~~Al~~~---~~~~~~-----~~~-~ye~dT~lyr~~~df~i~~ 115 (115) . ...+|++++.+|++||..- ...... ..+ -+|+|...+|..+.|...| T Consensus 73 ~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~v 131 (145) T protein:vir:93 73 QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKY 131 (145) T ss_pred cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEE Confidence 7 7889999999999999631 111111 111 2789999999999999999 No 34 >protein:vir:97421 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240755;genbank:gi:66396436;genbank:GeneID:5133777 Probab=98.83 E-value=4.5e-11 Score=77.29 Aligned_cols=107 Identities=15% Similarity=0.140 Sum_probs=80.9 Q ss_pred Cc-------hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT-------EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~-------E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. +.-|++.| ..+++|||| +..|.+. 
++|||++-.....+.+.-++ .....++||+||+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~-D~~P~~a------~~PYV~lG~~~~~d~~~~~~-~g~~~~~ti~Vws 72 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVF-DCVQKDA------VYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS 72 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCcee-cCCcCCC------CCCEEEeCCceeeecCCCcc-cceEEEEEEEEEE Confidence 87 35666666 578899999 6788753 68999998877777766655 3457899999998 Q ss_pred C--CHHHHHHHHHHHHHHHHhh---cceeec-----cCC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 S--TITEARTIRNMALDALQVL---KPGSIV-----KTP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~--t~~~A~~l~~av~~Al~~~---~~~~~~-----~~~-~ye~dT~lyr~~~df~i~~ 115 (115) . ...+|++++.+|++||..- ...... ..+ -+|+|...+|..+.|...| T Consensus 73 ~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~v 131 (145) T protein:vir:97 73 QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKY 131 (145) T ss_pred cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEE Confidence 7 7889999999999999631 111111 111 2789999999999999999 No 35 >protein:vir:94488 Length: 145 # NCBI annotation: ORF032 # Family: family:all:296 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240682;genbank:gi:66396364;genbank:GeneID:5133752 Probab=98.83 E-value=4.5e-11 Score=77.29 Aligned_cols=107 Identities=15% Similarity=0.140 Sum_probs=80.9 Q ss_pred Cc-------hHHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT-------EDDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~-------E~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. +.-|++.| ..+++|||| +..|.+. 
++|||++-.....+.+.-++ .....++||+||+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ada~l~alvggrI~-D~~P~~a------~~PYV~lG~~~~~d~~~~~~-~g~~~~~ti~Vws 72 (145) T protein:vir:94 1 MWVSVERYLFNKVYNKLKSNLIIQKQLDGRVF-DCVQKDA------VYPYIVVGETNVTNKETTTS-MVEDVGITLHVYS 72 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcCcee-cCCcCCC------CCCEEEeCCceeeecCCCcc-cceEEEEEEEEEE Confidence 87 35666666 578899999 6788753 68999998877777766655 3457899999998 Q ss_pred C--CHHHHHHHHHHHHHHHHhh---cceeec-----cCC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 S--TITEARTIRNMALDALQVL---KPGSIV-----KTP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~--t~~~A~~l~~av~~Al~~~---~~~~~~-----~~~-~ye~dT~lyr~~~df~i~~ 115 (115) . ...+|++++.+|++||..- ...... ..+ -+|+|...+|..+.|...| T Consensus 73 ~~~g~~eak~ia~av~~aL~~~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~v 131 (145) T protein:vir:94 73 QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKY 131 (145) T ss_pred cCCCHHHHHHHHHHHHHHhccccCCCCCeEEEeEEeeeeEeecCCcceEEEEEEEEEEE Confidence 7 7889999999999999631 111111 111 2789999999999999999 No 36 >protein:vir:1244 Length: 145 # NCBI annotation: similar to phage Spp1 gp17 # Family: family:all:296 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510943;genbank:gi:17426277;genbank:GeneID:927402 Probab=98.82 E-value=5.4e-11 Score=76.85 Aligned_cols=107 Identities=17% Similarity=0.204 Sum_probs=78.8 Q ss_pred Cc---h----HHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT---E----DDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~---E----~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. | .-|++.| ..++++|||- .+|.+ .++|||++-.....+.++-++. 
....+++||||+ T Consensus 1 M~~s~~~aLq~ai~~~L~ad~~l~~lvg~~vyD-~~P~~------~~~PyV~lG~~~~~~~~t~~~~-~~~~~lti~Vws 72 (145) T protein:vir:12 1 MWVSVERYLFNKVYNKLKSNPIIQKQLGGRVFD-CVQKD------AVYPYIVVGETNVTNKETTTSM-VEDVGITLHVYS 72 (145) T ss_pred CcccHHHHHHHHHHHHhhcChhHHHhcCccccc-CCccC------CCCCEEEeccceeeecCCCccc-ceEEEEEEEEEE Confidence 77 3 4455554 4788999997 47764 3699999988777777766663 347899999998 Q ss_pred CC--HHHHHHHHHHHHHHHHh---hcceeec-----cCC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 ST--ITEARTIRNMALDALQV---LKPGSIV-----KTP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~t--~~~A~~l~~av~~Al~~---~~~~~~~-----~~~-~ye~dT~lyr~~~df~i~~ 115 (115) .. +.+|++++.+|+.||.. +...... ..+ -+|+|+.++|..+.|..++ T Consensus 73 ~~~gr~ea~~ia~ai~~aL~~~l~l~~~~lv~l~~~~~~~~rd~d~~~~hgvl~~ra~i 131 (145) T protein:vir:12 73 QARNRDEASQIIQFLGFVLNNEIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKY 131 (145) T ss_pred cCccHHHHHHHHHHHHHHhccccCCCCceEEEEEEeeEEEEecCCCceEEEEEEEEEEE Confidence 74 78999999999999963 1111111 111 2789999999999999999 No 37 >protein:vir:96125 Length: 140 # NCBI annotation: ORF038 # Family: family:all:296 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240084;genbank:gi:66395765;genbank:GeneID:5133106 Probab=98.82 E-value=5.4e-11 Score=76.89 Aligned_cols=105 Identities=19% Similarity=0.241 Sum_probs=77.7 Q ss_pred Cc-h----HHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee-- Q lcl|NC_019769. 1 MT-E----DDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS-- 67 (115) Q Consensus 1 M~-E----~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA-- 67 (115) |. | .-||+.| .++++|||| +..|.+ .++|||++-.....+.+.-++. 
....+++|+||+ T Consensus 3 msa~~aLq~Ai~~~L~ad~~l~alvggrVy-D~~P~~------~~~PYV~lG~~~~~~~~~~~~~-g~~~~~tl~Vws~~ 74 (140) T protein:vir:96 3 VTAEPLLYNKIMNNLIENPITDKLVGGRVF-DCVQKD------VVYPYIVVGESNVTESERSPGM-REIIAITFHVYSQY 74 (140) T ss_pred cchhHHHHHHHHHHhccChhHHhhcCcccc-cCCccC------CCCCEEEeCCceeeecCCCccc-ceEEEEEEEEEEcC Confidence 66 4 3445555 478999999 667865 3689999977776776654433 347899999995 Q ss_pred CCHHHHHHHHHHHHHHHHhhcceeecc----------CC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 STITEARTIRNMALDALQVLKPGSIVK----------TP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~t~~~A~~l~~av~~Al~~~~~~~~~~----------~~-~ye~dT~lyr~~~df~i~~ 115 (115) ....+|++++.+|++||.. +....+ .+ -+|+|+..+|..+.|.++| T Consensus 75 ~g~~ea~~ia~ai~~aL~~--~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v 131 (140) T protein:vir:96 75 ENGAEARELLKYLNYACRL--NINFKDYELEWIKKDNSQVFTDIDQYTKHGVLRLLYKV 131 (140) T ss_pred CCHHHHHHHHHHHHHHhcC--CccCCCceEEEEEEeeeEEeecCCCceEEEEEEEEEEE Confidence 5899999999999999963 222111 11 2789999999999999999 No 38 >protein:vir:97325 Length: 145 # NCBI annotation: ORF030 # Family: family:all:296 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240617;genbank:gi:66396297;genbank:GeneID:5133681 Probab=98.79 E-value=7.7e-11 Score=76.02 Aligned_cols=105 Identities=15% Similarity=0.139 Sum_probs=80.3 Q ss_pred Cc-------hHHHHHHHH------hhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT-------EDDLYPLLE------PLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~-------E~~i~~lL~------~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. +.-|++.|. .+++|||| +..|.+. ++|||++-.....+.+.-++. 
....++||+||+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~~l~alvggrV~-D~~P~~a------~~PYv~lG~~~~~d~~~~~~~-g~~~~~ti~Vws 72 (145) T protein:vir:97 1 MWVSVERYLFNKVYNKLKSNLIIRKQLDGRVF-DCVQKDA------VYPYIVVGETNVTNKETTTSM-VEDVGITLHVYS 72 (145) T ss_pred CcchHhHHHHHHHHHHhhcChhHHHhhcCcee-cCCccCC------CCCEEEeCcceeeecCCCccc-ceEEEEEEEEEE Confidence 88 355666664 68899999 6788763 689999988777777666553 357899999999 Q ss_pred C--CHHHHHHHHHHHHHHHHhhcceeec----------cCC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 S--TITEARTIRNMALDALQVLKPGSIV----------KTP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~--t~~~A~~l~~av~~Al~~~~~~~~~----------~~~-~ye~dT~lyr~~~df~i~~ 115 (115) . ...+|++++.+|+.||.. +.... ..+ -+|+|...+|..+.|...| T Consensus 73 ~~~g~~eak~ia~av~~aL~~--~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~fra~v 131 (145) T protein:vir:97 73 QARNRDEASQIIQFLGFVLNN--EIEIDYYSFIKSRIDTQEVITDIDQYTKHGIIRLVFKY 131 (145) T ss_pred cCCCHHHHHHHHHHHHHHhcc--ccCCCCCeEEEeEEeeeeEeecCCCceEEEEEEEEEEE Confidence 7 789999999999999963 11111 111 2789999999999999999 No 39 >protein:vir:96260 Length: 141 # NCBI annotation: ORF027 # Family: family:all:296 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240317;genbank:gi:66395991;genbank:GeneID:5133337 Probab=98.73 E-value=1.2e-10 Score=74.89 Aligned_cols=107 Identities=18% Similarity=0.178 Sum_probs=77.2 Q ss_pred Cc---h----HHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT---E----DDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~---E----~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. + .-|++.| .++++|||| +..|.++ ++|||++-.....+.+.-++. 
....++||+||+ T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~-D~~P~~~------~~PYv~lG~~~~~~~~~~~~~-g~~~~~ti~Vws 72 (141) T protein:vir:96 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVF-DVVQDDA------VYPYIVVGESNVTNNESSATM-RETVGIVIHVYS 72 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccc-cCCccCC------CCCEEEeCCceeeecCCCccc-ceEEEEEEEEEE Confidence 55 3 4455555 468899999 6788753 689999988887777755543 357899999995 Q ss_pred C--CHHHHHHHHHHHHHHHHhh---cceeec--c--C-C-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 S--TITEARTIRNMALDALQVL---KPGSIV--K--T-P-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~--t~~~A~~l~~av~~Al~~~---~~~~~~--~--~-~-~ye~dT~lyr~~~df~i~~ 115 (115) . ...+|++++.+|++||..- ...... . . . -+|+|...+|..+.|.++| T Consensus 73 ~~~g~~eak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v 131 (141) T protein:vir:96 73 QFATQYEAKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKY 131 (141) T ss_pred cCCCHHHHHHHHHHHHHHhcccccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEE Confidence 5 6779999999999999631 111110 1 1 1 2678888899999999999 No 40 >protein:vir:105892 Length: 141 # NCBI annotation: tail protein # Family: family:all:296 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004380;genbank:gi:122891835;genbank:GeneID:4712363 Probab=98.73 E-value=1.2e-10 Score=74.89 Aligned_cols=107 Identities=18% Similarity=0.178 Sum_probs=77.2 Q ss_pred Cc---h----HHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT---E----DDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~---E----~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. + .-|++.| .++++|||| +..|.++ ++|||++-.....+.+.-++. 
....++||+||+ T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~-D~~P~~~------~~PYv~lG~~~~~~~~~~~~~-g~~~~~ti~Vws 72 (141) T protein:vir:10 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVF-DVVQDDA------VYPYIVVGESNVTNNESSATM-RETVGIVIHVYS 72 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccc-cCCccCC------CCCEEEeCCceeeecCCCccc-ceEEEEEEEEEE Confidence 55 3 4455555 468899999 6788753 689999988887777755543 357899999995 Q ss_pred C--CHHHHHHHHHHHHHHHHhh---cceeec--c--C-C-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 S--TITEARTIRNMALDALQVL---KPGSIV--K--T-P-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~--t~~~A~~l~~av~~Al~~~---~~~~~~--~--~-~-~ye~dT~lyr~~~df~i~~ 115 (115) . ...+|++++.+|++||..- ...... . . . -+|+|...+|..+.|.++| T Consensus 73 ~~~g~~eak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v 131 (141) T protein:vir:10 73 QFATQYEAKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKY 131 (141) T ss_pred cCCCHHHHHHHHHHHHHHhcccccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEE Confidence 5 6779999999999999631 111110 1 1 1 2678888899999999999 No 41 >protein:vir:94096 Length: 141 # NCBI annotation: ORF031 # Family: family:all:296 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240240;genbank:gi:66395916;genbank:GeneID:5133265 Probab=98.73 E-value=1.2e-10 Score=74.89 Aligned_cols=107 Identities=18% Similarity=0.178 Sum_probs=77.2 Q ss_pred Cc---h----HHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT---E----DDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~---E----~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. + .-|++.| .++++|||| +..|.++ ++|||++-.....+.+.-++. 
....++||+||+ T Consensus 1 Msms~~~aLQ~Ai~~~L~adaal~alvg~rI~-D~~P~~~------~~PYv~lG~~~~~~~~~~~~~-g~~~~~ti~Vws 72 (141) T protein:vir:94 1 MWVSVEPELTNQIYKRLISDPNINKLVDDRVF-DVVQDDA------VYPYIVVGESNVTNNESSATM-RETVGIVIHVYS 72 (141) T ss_pred CccchhHHHHHHHHHHhhcChhhHhhcCCccc-cCCccCC------CCCEEEeCCceeeecCCCccc-ceEEEEEEEEEE Confidence 55 3 4455555 468899999 6788753 689999988887777755543 357899999995 Q ss_pred C--CHHHHHHHHHHHHHHHHhh---cceeec--c--C-C-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 S--TITEARTIRNMALDALQVL---KPGSIV--K--T-P-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 ~--t~~~A~~l~~av~~Al~~~---~~~~~~--~--~-~-~ye~dT~lyr~~~df~i~~ 115 (115) . ...+|++++.+|++||..- ...... . . . -+|+|...+|..+.|.++| T Consensus 73 ~~~g~~eak~ia~av~~AL~~~l~l~~~~lv~l~~~~~~~~rd~dg~t~hgvl~~ra~v 131 (141) T protein:vir:94 73 QFATQYEAKLILSAIGYVLNRPIEIDNYEFQFSRIDSQAVFPDIDRFTKHGTIRLLFKY 131 (141) T ss_pred cCCCHHHHHHHHHHHHHHhcccccCCCceEEEEEEeeeeeeecCCCceEEEEEEEEEEE Confidence 5 6779999999999999631 111110 1 1 1 2678888899999999999 No 42 >protein:vir:105337 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950674;genbank:gi:119967844;genbank:GeneID:4643216 Probab=98.69 E-value=2.2e-10 Score=73.53 Aligned_cols=105 Identities=18% Similarity=0.182 Sum_probs=78.5 Q ss_pred Cc---h----HHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT---E----DDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~---E----~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. | .-||+.| ..+++||||- ..|.++ ++|||++-.....+...-++. 
....+++|+||+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD-~~P~~a------~~PyV~lG~~~~~~~~~~~~~-g~~~~~ti~Vws 72 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIVSKQLGGRVFD-CVQKDA------VYPYIVVGETNVTNKETTTSM-FEDVGVTLHVYS 72 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcccccc-CCccCC------CCCEEEeCcceeeecCCCccc-ceEEEEEEEEEE Confidence 88 3 4556555 4788999996 577653 689999987776666655544 457899999996 Q ss_pred --CCHHHHHHHHHHHHHHHHhhcceeec----------cCC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 --STITEARTIRNMALDALQVLKPGSIV----------KTP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 --~t~~~A~~l~~av~~Al~~~~~~~~~----------~~~-~ye~dT~lyr~~~df~i~~ 115 (115) ....+|++++.+|++||+. +.... ..+ -+|+|...+|..+.|...| T Consensus 73 ~~~g~~ea~~ia~av~~aL~a--~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~v 131 (145) T protein:vir:10 73 QARNRDEASQIIQYLGFVLNS--EIEINNYSFIKSRIDTQEVITDIDQYTKHGIIRLIFKY 131 (145) T ss_pred cCCCHHHHHHHHHHHHHHhCC--CcCCCCCeEEEEEEeeeeEeecCCCceEEEEEEEEEEE Confidence 5899999999999999962 22211 111 2789999999999999999 No 43 >protein:vir:107096 Length: 145 # NCBI annotation: conserved phage protein # Family: family:all:296 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950611;genbank:gi:119953691;genbank:GeneID:4643105 Probab=98.69 E-value=2.2e-10 Score=73.53 Aligned_cols=105 Identities=18% Similarity=0.185 Sum_probs=78.5 Q ss_pred Cc---h----HHHHHHH------HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEee Q lcl|NC_019769. 1 MT---E----DDLYPLL------EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYS 67 (115) Q Consensus 1 M~---E----~~i~~lL------~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA 67 (115) |. | .-||+.| ..+++||||- ..|.++ ++|||++-.....+...-++. 
....+++|+||+ T Consensus 1 Ms~s~~~aLq~Ai~~~L~ad~al~alvg~rVyD-~~P~~a------~~PyV~lG~~~~~~~~~~~~~-g~~~~~ti~Vws 72 (145) T protein:vir:10 1 MWVSVERYLFNKIYNKLKSNPIIKKQLGGRVFD-CVQKDA------VYPYIVVGETNVTNKETTTSM-FEDVGVTLHVYS 72 (145) T ss_pred CchhHHHHHHHHHHHHhhcChhHHHhhcccccc-CCccCC------CCCEEEeCcceeeecCCCccc-ceEEEEEEEEEE Confidence 88 3 4556555 4788999996 577653 689999987776666655544 457899999996 Q ss_pred --CCHHHHHHHHHHHHHHHHhhcceeec----------cCC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 68 --STITEARTIRNMALDALQVLKPGSIV----------KTP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 68 --~t~~~A~~l~~av~~Al~~~~~~~~~----------~~~-~ye~dT~lyr~~~df~i~~ 115 (115) ....+|++++.+|++||+. +.... ..+ -+|+|...+|..+.|...| T Consensus 73 ~~~g~~ea~~ia~av~~aL~a--~l~l~~~~lv~l~~~~~~~~rd~dg~~~hgvl~~ra~v 131 (145) T protein:vir:10 73 QARNRDEASQIIQYLGFVLNS--EIEINNYSFIKSRIDTQEVITDIDQYTKHGIIRLIFKY 131 (145) T ss_pred cCCCHHHHHHHHHHHHHHhCC--CcCCCCCeEEEEEEeeeeEeecCCCceEEEEEEEEEEE Confidence 5899999999999999962 22211 111 2789999999999999999 No 44 >protein:vir:9709 Length: 141 # NCBI annotation: hypothetical protein # Family: family:all:2110 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795471;genbank:gi:28876220;genbank:GeneID:1257764 Probab=98.68 E-value=1.7e-10 Score=74.08 Aligned_cols=112 Identities=18% Similarity=0.198 Sum_probs=87.4 Q ss_pred Cc-hHHHHHHHH------hhcC------------CccceeeccCCCCCCcc-ccccEEEEEecCCCccceecCCCC-cce Q lcl|NC_019769. 1 MT-EDDLYPLLE------PLAG------------GQVYPYVAPLGSDGKPS-VSPPWVIFSIITDVAADVLCGQAE-SAV 59 (115) Q Consensus 1 M~-E~~i~~lL~------~l~~------------~Rvyp~~aP~~~~~~p~-~~~Pyiv~q~vsg~p~n~l~G~~~-~~~ 59 (115) |. |.+||.+|+ .|++ +.+|-...|+...+-.+ -..|+|+++.+.|.+..+-+.... ... 
T Consensus 1 mlp~~~vy~~L~~n~~L~~lm~~~r~~~~~~~~~~~If~~~vPE~~~~~qk~~~aP~IrI~~i~~~~~~yADn~~~~~~~ 80 (141) T protein:vir:97 1 MIAETTAYKLLSNDKTLNELLDKLRGGPFKNGFKQGIFTYDIPDNPIDLRKAELAPFMRIKTTLDGPADYADDEILCNEQ 80 (141) T ss_pred CchHHHHHHHhcccHHHHHHHhhhccccccccccccccccccCCChhhhhhhccCCeEEEeccCCCcccccccccceeee Confidence 88 888988884 3543 35777788886322111 147999999999999999998765 678 Q ss_pred EEEEEEeeCCHHHHHHHHHHHHHHHHhhcceeec---cCCCccccccceee-----EEEEE Q lcl|NC_019769. 60 SVQVDVYSSTITEARTIRNMALDALQVLKPGSIV---KTPGYEPDLRYHRA-----TLEFQ 112 (115) Q Consensus 60 ~vQIDvyA~t~~~A~~l~~av~~Al~~~~~~~~~---~~~~ye~dT~lyr~-----~~df~ 112 (115) +||||+|..+..+..++-..|-..|...+..-.. .-+.+|+||++||. ++||. T Consensus 81 ~vQIdiW~~~~~~~e~i~~~Id~~M~~~gf~rY~~~~~~~~~dpD~d~~~~~rRYr~~~~~ 141 (141) T protein:vir:97 81 RITINFWCKTASEADQINKCIDNILKQGGFERYTANEKPRYKDSDIDLLMNVRKYRCFDFY 141 (141) T ss_pred eeEeeeeecChhHHHHHHHHHHHHHHhcCceeccccCCCCCCccchhhhhhhhheeeeccC Confidence 9999999999999999999999999987766433 23458999999884 67777 No 45 >protein:vir:9579 Length: 111 # NCBI annotation: gp45 # Family: family:all:1269 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862884;genbank:gi:32469476;genbank:GeneID:1461321 Probab=98.54 E-value=1.4e-09 Score=69.14 Aligned_cols=103 Identities=13% Similarity=0.063 Sum_probs=79.3 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHHHHHHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTITEARTIRNMA 80 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~~A~~l~~av 80 (115) |+|+-|..-|..-++--||-. .|. 
..+.++++..++||...|++ ++.++=|=|||.|..+|..|+.+| T Consensus 1 miE~~v~~~L~~~l~vpv~~~-vp~------~~P~~FV~vErtGG~~~~~~-----~~p~laVq~wg~S~~~Aa~La~~v 68 (111) T protein:vir:95 1 MIEIIINKYLDGHLDVPSFFE-HEA------EAPDSFVIIQKTGGKERNHS-----GSATFAFQSYAPTMQKAAELNVKV 68 (111) T ss_pred ChHHhHHHHhhhhcCeeEEee-cCC------CCCCceEEEEeeCCcccccc-----ccceEEEEeccccHHHHHHHHHHH Confidence 999999999987666444433 222 23579999999999999887 345777779999999999999999 Q ss_pred HHHHHhhcc---eeec----cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 81 LDALQVLKP---GSIV----KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 81 ~~Al~~~~~---~~~~----~~~~ye~dT~lyr~~~df~i~~ 115 (115) +.||..+.- .... ..+--|++||.||..+-|+|+- T Consensus 69 ~~a~~~l~~~~~i~~v~~~s~ynf~d~~tk~~RYQ~~~~i~~ 110 (111) T protein:vir:95 69 KSAVKGLIELDSICGVHLNSDYNFTDTETKQYRYQAVFDINY 110 (111) T ss_pred HHHHhhhhccccccccccCCccccCCCCCCCceEEEEEEEEe Confidence 999976522 1111 1222478999999999999999 No 46 >protein:vir:101303 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908836;genbank:gi:118725100;genbank:GeneID:4555874 Probab=98.53 E-value=9.5e-10 Score=70.04 Aligned_cols=111 Identities=18% Similarity=0.223 Sum_probs=87.3 Q ss_pred Cc--hHHHHHHHH------hhcC-CccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCC-cceEEEEEEeeCC Q lcl|NC_019769. 1 MT--EDDLYPLLE------PLAG-GQVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAE-SAVSVQVDVYSST 69 (115) Q Consensus 1 M~--E~~i~~lL~------~l~~-~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~-~~~~vQIDvyA~t 69 (115) |. =.+||.+|. .+++ .|++-+-.|+..+. ..|+||..-++. .|.++-++... ..+.+|||||.+. 
T Consensus 1 m~diL~eIy~~L~~d~~L~~~v~~~rIk~~~~Pe~~d~----~~p~IvI~pl~~p~p~~~~sn~~ls~~~~~QIDV~~k~ 76 (135) T protein:vir:10 1 MIDILYKVHEVISQDRIIREHVNINNIKFNKYPNVKDT----DVPFIVIDDIDDPIPTTYTDGDECAYSYIVQIDVFVKY 76 (135) T ss_pred CcchHHHHHHHhhcchHHHhhcCccceEEEecCCcccc----ccceEEEecCCCCCCccccCchhceeeeeEEEeeeeec Confidence 55 367888774 5677 49999999997653 459999999987 68999998765 6789999999999 Q ss_pred HH------HHHHHHHHHHHHH-Hhhcceeec-cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 70 IT------EARTIRNMALDAL-QVLKPGSIV-KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 70 ~~------~A~~l~~av~~Al-~~~~~~~~~-~~~~ye~dT~lyr~~~df~i~~ 115 (115) .+ .+..|...|+..| +.++..... +.+.|++||++||.+=-|+=+. T Consensus 77 ~~~~~~R~~~~~i~~~I~~~l~~~~~f~q~s~~ldeY~~et~~y~~aRRYrG~~ 130 (135) T protein:vir:10 77 NDEYNARIIRNKISNRIQKLLWSELKMGNVSNGKPEYIEEFKTYRSSRVYEGIF 130 (135) T ss_pred ccccchhhHHHHHHHHHHHHHHHHcCccccCCCCccchhhhhhhhhhheeeeec Confidence 77 4666788888888 445554443 4567999999999888887777 No 47 >protein:vir:9514 Length: 135 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835561;genbank:gi:30043946;genbank:GeneID:1260543 Probab=98.53 E-value=9.5e-10 Score=70.04 Aligned_cols=111 Identities=18% Similarity=0.223 Sum_probs=87.3 Q ss_pred Cc--hHHHHHHHH------hhcC-CccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCC-cceEEEEEEeeCC Q lcl|NC_019769. 1 MT--EDDLYPLLE------PLAG-GQVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAE-SAVSVQVDVYSST 69 (115) Q Consensus 1 M~--E~~i~~lL~------~l~~-~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~-~~~~vQIDvyA~t 69 (115) |. =.+||.+|. .+++ .|++-+-.|+..+. ..|+||..-++. .|.++-++... ..+.+|||||.+. 
T Consensus 1 m~diL~eIy~~L~~d~~L~~~v~~~rIk~~~~Pe~~d~----~~p~IvI~pl~~p~p~~~~sn~~ls~~~~~QIDV~~k~ 76 (135) T protein:vir:95 1 MIDILYKVHEVISQDRIIREHVNINNIKFNKYPNVKDT----DVPFIVIDDIDDPIPTTYTDGDECAYSYIVQIDVFVKY 76 (135) T ss_pred CcchHHHHHHHhhcchHHHhhcCccceEEEecCCcccc----ccceEEEecCCCCCCccccCchhceeeeeEEEeeeeec Confidence 55 367888774 5677 49999999997653 459999999987 68999998765 6789999999999 Q ss_pred HH------HHHHHHHHHHHHH-Hhhcceeec-cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 70 IT------EARTIRNMALDAL-QVLKPGSIV-KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 70 ~~------~A~~l~~av~~Al-~~~~~~~~~-~~~~ye~dT~lyr~~~df~i~~ 115 (115) .+ .+..|...|+..| +.++..... +.+.|++||++||.+=-|+=+. T Consensus 77 ~~~~~~R~~~~~i~~~I~~~l~~~~~f~q~s~~ldeY~~et~~y~~aRRYrG~~ 130 (135) T protein:vir:95 77 NDEYNARIIRNKISNRIQKLLWSELKMGNVSNGKPEYIEEFKTYRSSRVYEGIF 130 (135) T ss_pred ccccchhhHHHHHHHHHHHHHHHHcCccccCCCCccchhhhhhhhhhheeeeec Confidence 77 4666788888888 445554443 4567999999999888887777 No 48 >protein:vir:100675 Length: 135 # NCBI annotation: 77ORF027 # Family: family:all:508 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958611;genbank:gi:41189540;genbank:GeneID:2743821 Probab=98.53 E-value=9.5e-10 Score=70.04 Aligned_cols=111 Identities=18% Similarity=0.223 Sum_probs=87.3 Q ss_pred Cc--hHHHHHHHH------hhcC-CccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCC-cceEEEEEEeeCC Q lcl|NC_019769. 1 MT--EDDLYPLLE------PLAG-GQVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAE-SAVSVQVDVYSST 69 (115) Q Consensus 1 M~--E~~i~~lL~------~l~~-~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~-~~~~vQIDvyA~t 69 (115) |. =.+||.+|. .+++ .|++-+-.|+..+. ..|+||..-++. .|.++-++... ..+.+|||||.+. 
T Consensus 1 m~diL~eIy~~L~~d~~L~~~v~~~rIk~~~~Pe~~d~----~~p~IvI~pl~~p~p~~~~sn~~ls~~~~~QIDV~~k~ 76 (135) T protein:vir:10 1 MIDILYKVHEVISQDRIIREHVNINNIKFNKYPNVKDT----DVPFIVIDDIDDPIPTTYTDGDECAYSYIVQIDVFVKY 76 (135) T ss_pred CcchHHHHHHHhhcchHHHhhcCccceEEEecCCcccc----ccceEEEecCCCCCCccccCchhceeeeeEEEeeeeec Confidence 55 367888774 5677 49999999997653 459999999987 68999998765 6789999999999 Q ss_pred HH------HHHHHHHHHHHHH-Hhhcceeec-cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 70 IT------EARTIRNMALDAL-QVLKPGSIV-KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 70 ~~------~A~~l~~av~~Al-~~~~~~~~~-~~~~ye~dT~lyr~~~df~i~~ 115 (115) .+ .+..|...|+..| +.++..... +.+.|++||++||.+=-|+=+. T Consensus 77 ~~~~~~R~~~~~i~~~I~~~l~~~~~f~q~s~~ldeY~~et~~y~~aRRYrG~~ 130 (135) T protein:vir:10 77 NDEYNARIIRNKISNRIQKLLWSELKMGNVSNGKPEYIEEFKTYRSSRVYEGIF 130 (135) T ss_pred ccccchhhHHHHHHHHHHHHHHHHcCccccCCCCccchhhhhhhhhhheeeeec Confidence 77 4666788888888 445554443 4567999999999888887777 No 49 >protein:vir:96002 Length: 133 # NCBI annotation: ORF024 # Family: family:all:508 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239806;genbank:gi:66395472;genbank:GeneID:5132919 Probab=98.49 E-value=1.7e-09 Score=68.65 Aligned_cols=111 Identities=18% Similarity=0.196 Sum_probs=85.0 Q ss_pred Cc--hHHHHHHHH------hhcCCc-cceeeccCCCCCCccccccEEEEEecCC-CccceecCCCC-cceEEEEEEeeCC Q lcl|NC_019769. 1 MT--EDDLYPLLE------PLAGGQ-VYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAE-SAVSVQVDVYSST 69 (115) Q Consensus 1 M~--E~~i~~lL~------~l~~~R-vyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~-~~~~vQIDvyA~t 69 (115) |. =.+||.+|. .+++++ ++-+-.|+..+. ..|+||..-++. .|.++-++... ..+.+|||||+.. 
T Consensus 1 m~diL~eIy~~L~~d~~L~~~v~~~~Ik~~~~Pe~~d~----~~p~IvI~pi~~p~p~~f~sn~~ls~~~~~QIDV~sk~ 76 (133) T protein:vir:96 1 MIDILMEVYNILKSDDDLMRLIDKKNIKFNQYPDVKDK----MAPYIVIDDYDDPIPEWHSDGDRIAYNYAFQIDVMVKA 76 (133) T ss_pred CcchHHHHHHHhhcchHHHHhcCccceEEeecCCcccc----ccceEEEecCCCCCcccccCcceeeeEEEEEEeeeeec Confidence 55 367887774 577765 999999997653 459999999988 77889888765 6789999999975 Q ss_pred HH------HHHHHHHHHHHHHHhhcceeecc-CCCccccccceeeEEEEEEeC Q lcl|NC_019769. 70 IT------EARTIRNMALDALQVLKPGSIVK-TPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 70 ~~------~A~~l~~av~~Al~~~~~~~~~~-~~~ye~dT~lyr~~~df~i~~ 115 (115) .+ ..++|...|+..|...+..-..+ .+.|++||++||-+=-++=+. T Consensus 77 ~~~~~~R~~~~~i~~rI~~~m~~~gf~Q~~~~~deYd~et~~y~~aRRYrg~~ 129 (133) T protein:vir:96 77 SDAYNARKRRNEISNRISELLWKNQMKQIRNLGNEYDKNLALYRSTRRYEAIF 129 (133) T ss_pred cccccchhhhHHHHHHHHHHHHHcCceecCCCccccchhhhhhhhhheeeccc Confidence 44 45667777777777666555444 367999999999887777666 No 50 >protein:vir:1643 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695064;genbank:gi:23455755;genbank:GeneID:955492 Probab=98.41 E-value=6.7e-09 Score=65.40 Aligned_cols=103 Identities=12% Similarity=0.097 Sum_probs=81.1 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHHHHHHHHHHH Q lcl|NC_019769. 
1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTITEARTIRNMA 80 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~~A~~l~~av 80 (115) |+|..|..-|..-.+--||-.+ |.+ .+.+|++..++||...|++ ++.++=|=|||.|..+|..|+.+| T Consensus 1 miE~~i~~~L~~~l~Vpv~~e~-p~~------~P~~FV~vErtGG~~~~~~-----~~~~lAVq~w~~S~~eAa~La~~v 68 (111) T protein:vir:16 1 MIEIIIKNFLDTHLSVSSFLEK-KGE------MPLSYILFEKTGSSKSNHL-----LSSTFAFQSYAPSMYEAAKLNEQL 68 (111) T ss_pred ChHHhHHHHHhhcCCceeEeec-CCC------CCCceEEEEecCCcccccc-----ccceEEEEecchhHHHHHHHHHHH Confidence 9999999999887665666554 543 3568999999999999866 455777789999999999999999 Q ss_pred HHHHHhhcce---ee----ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 81 LDALQVLKPG---SI----VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 81 ~~Al~~~~~~---~~----~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +++|+.+... .. +..+--|++||-||..+-|+|+- T Consensus 69 ~~~l~~l~~~~~I~av~~~s~ynf~d~~tk~~RYQav~~i~~ 110 (111) T protein:vir:16 69 KEVVERLIELNEISNVSLNSDYNFTDTETKEYRYQAVFDINH 110 (111) T ss_pred HHHHhhccccccceeeecCCCCcCCCCCCCCceEEEEEEEee Confidence 9999865321 11 12222478899999999999999 No 51 >protein:vir:94768 Length: 111 # NCBI annotation: unknown # Family: family:all:1269 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996711;genbank:gi:45597426;genbank:GeneID:2769040 Probab=98.36 E-value=1.1e-08 Score=64.32 Aligned_cols=103 Identities=13% Similarity=0.098 Sum_probs=80.9 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHHHHHHHHHHH Q lcl|NC_019769. 
1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTITEARTIRNMA 80 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~~A~~l~~av 80 (115) |+|..|..-|..-.+--||-.+ |.+ .+.+|++..+.||...|++ ++.++=|=|||.|..+|..|+.+| T Consensus 1 miE~~v~~~L~~~l~vpv~~e~-p~~------~p~~FV~vErtGG~~~~~~-----~~~~lAVQ~~~~S~~eAa~La~~v 68 (111) T protein:vir:94 1 MIEIIIKNFLDTHLSVSSFLEK-KGE------MPLSYVLFEKTGSSKSNHL-----LSSTFAFQSYAPSMYEAAKLNEQL 68 (111) T ss_pred ChHHhHHHHHhhcCCcceEeec-CCC------CCCceEEEEecCCcccccc-----ccceEEEEecchhHHHHHHHHHHH Confidence 9999999999886665666554 544 2568999999999999887 345677779999999999999999 Q ss_pred HHHHHhhcce---ee----ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 81 LDALQVLKPG---SI----VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 81 ~~Al~~~~~~---~~----~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +++|+.+... .. +..+--|++||-||...-|+|+- T Consensus 69 ~~~~~~l~~~~~i~~v~~~s~Ynf~d~~tk~~RYQav~~i~~ 110 (111) T protein:vir:94 69 KEVVERLIELNEISNVSLNSDYNFTDTETKEYRYQAVFDINH 110 (111) T ss_pred HHHHhhcccccccceeecCCCcccCCCcCCCceEEEEEEEee Confidence 9999865221 11 11222478899999999999999 No 52 >protein:vir:9648 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795410;genbank:gi:28876183;genbank:GeneID:1257699 Probab=98.35 E-value=2.6e-09 Score=67.62 Aligned_cols=111 Identities=13% Similarity=0.054 Sum_probs=91.3 Q ss_pred Cc--hHHHHHHHHh---hcCCccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCC-cceEEEEEEeeCCHHHH Q lcl|NC_019769. 1 MT--EDDLYPLLEP---LAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAE-SAVSVQVDVYSSTITEA 73 (115) Q Consensus 1 M~--E~~i~~lL~~---l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~-~~~~vQIDvyA~t~~~A 73 (115) |+ =-+||.+|.. |-..|++-+-.|+..+ ...||||..-+.. .|.++-++... ...-+||||+...+... 
T Consensus 2 m~DiL~~Iy~~L~~d~~l~~~rIk~~~~Pe~~d----~~~p~IvI~pl~~P~p~~~~sd~~ls~~ylyQIDVes~~r~~~ 77 (126) T protein:vir:96 2 VRDMLAEVFDLLKADNVLKLVKIKSFERPESLL----DDQTSIVILPITAPKQSTFGSDTALSKKFLYQIEVESTSRLEC 77 (126) T ss_pred hhHHHHHHHHHHhccceecceeeeeeecCCCCC----CCcceEEEeeCCCCCCccccCchhhhhhceeeEeeeecCccch Confidence 44 2567777753 4455999999999875 3579999999987 88888888765 67899999999999999 Q ss_pred HHHHHHHHHHHHhhcceeec-cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVLKPGSIV-KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~~~~~~~-~~~~ye~dT~lyr~~~df~i~~ 115 (115) +.|...|+..|..++..-.. +.+.|++|||+|+.+=-|+=+- T Consensus 78 ~~i~~rI~~~l~~igf~q~s~gldeY~~etkry~daRRYrg~~ 120 (126) T protein:vir:96 78 KDLQCRIEKQLEKIGFYQNDAGFERFDRDTGRYLDARTFRGFS 120 (126) T ss_pred HHHHHHHHHHHHHcCccccccCcchhhhhhhhhhhhheecccc Confidence 99999999999988776654 4578999999999888777663 No 53 >protein:vir:5979 Length: 134 # NCBI annotation: hypothetical protein # Family: family:all:296 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690679;genbank:geneid:6329147;genbank:gi:22855073;uniprot:O48448;genbank:GeneID:955319 Probab=98.31 E-value=1.2e-08 Score=63.98 Aligned_cols=106 Identities=18% Similarity=0.188 Sum_probs=74.1 Q ss_pred Cc----h----HHHHHHHH------hhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEe Q lcl|NC_019769. 1 MT----E----DDLYPLLE------PLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVY 66 (115) Q Consensus 1 M~----E----~~i~~lL~------~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvy 66 (115) |+ + .-||+.|. +++ |||| +..|.+. ++|||++-.....+.+.-++. 
....+++|+|| T Consensus 1 m~~~s~~~aLq~Ai~~~L~ad~~l~alv-g~I~-D~~P~~~------~~PYV~lG~~~~~d~~~~~~~-g~~~~~ti~Vw 71 (134) T protein:vir:59 1 MTWKLASRALQKATVENLESYQPLMEMV-NQVT-ESPGKDD------PYPYVVIGDQSSTPFETKSSF-GENITMDFHVW 71 (134) T ss_pred CCccchhHHHHHHHHHHhhcChhHHHhh-hhhh-cCCCCCC------CCCEEEeCCceeeecCCCccc-ceEEEEEEEEE Confidence 76 2 33455553 577 4899 6788763 689999977666666544444 35678999999 Q ss_pred eC-CHHHHHHHHHHHHHHHHhh----cceeec-----cCC-CccccccceeeEEEEEEeC Q lcl|NC_019769. 67 SS-TITEARTIRNMALDALQVL----KPGSIV-----KTP-GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 67 A~-t~~~A~~l~~av~~Al~~~----~~~~~~-----~~~-~ye~dT~lyr~~~df~i~~ 115 (115) +. ...+|++++.+|++||... ...... ..+ -+|+|...+|..+.|...| T Consensus 72 s~~g~~ea~~ia~av~~aL~~~~L~l~~~~lv~l~~~~~~~~rd~dg~~~hg~l~fra~v 131 (134) T protein:vir:59 72 GGTTRAEAQDISSRVLEALTYKPLMFEGFTFVAKKLVLAQVITDTDGVTKHGIIKVRFTI 131 (134) T ss_pred ECCChHHHHHHHHHHHHHhcCCCcccCCceEEEeEEeeeeEEecCCCceEEEEEEEEEEE Confidence 87 3467889999999999632 111111 112 2789999999999999999 No 54 >protein:vir:9764 Length: 111 # NCBI annotation: hypothetical protein # Family: family:all:1269 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795526;genbank:gi:28876278;genbank:GeneID:1257819 Probab=98.28 E-value=2e-08 Score=62.77 Aligned_cols=103 Identities=11% Similarity=0.088 Sum_probs=80.9 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHHHHHHHHHHH Q lcl|NC_019769. 
1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTITEARTIRNMA 80 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~~A~~l~~av 80 (115) |+|.-|..-|..-++--||-.+-++ .+.+|++..+.||...|++ +..++=|=|||.|..+|..|+.+| T Consensus 1 mIE~~i~~yL~~~l~vpv~~e~p~~-------~P~~FV~vEkTGG~~~~~~-----~~a~lAvQsyg~S~~~AA~La~~V 68 (111) T protein:vir:97 1 MIEVIIKKYLDEHLDVPSFFEHQKD-------EPARFIILEKTSGAKQNHL-----LSSTFAFQSYAESLYEAALLNDKV 68 (111) T ss_pred ChhhhhhHHHhhhcCceEEEeecCC-------CCCceEEEEeeCCcccccc-----ccceEEEEecchhHHHHHHHHHHH Confidence 9999999999887777676543322 2569999999999999887 345677779999999999999999 Q ss_pred HHHHHhhcce-ee------ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 81 LDALQVLKPG-SI------VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 81 ~~Al~~~~~~-~~------~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ++||+.+... .+ +..+--|++||-||...-|.|.- T Consensus 69 ~~a~~~l~~l~~i~~v~lns~Ynf~d~~tk~yRYQa~~di~~ 110 (111) T protein:vir:97 69 KQVIEQLDVLPQVSGVHLNADYNFTDTATKRYRYQAVFDINH 110 (111) T ss_pred HHHhhhhccCccceeeeecccccCCCCCCCCccEEEEEEEee Confidence 9999865321 12 12233478999999999999988 No 55 >protein:vir:105055 Length: 129 # NCBI annotation: Gp10 # Family: family:all:11393 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006590;genbank:gi:46402096;genbank:GeneID:2777921 Probab=98.21 E-value=2.7e-08 Score=62.09 Aligned_cols=106 Identities=19% Similarity=0.214 Sum_probs=82.7 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCC---CcceEEEEEEe-eCCHHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQA---ESAVSVQVDVY-SSTITEARTI 76 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~---~~~~~vQIDvy-A~t~~~A~~l 76 (115) |+|.+|++.|..|-+-.|||...|... .-=++||+||..--| .|.. 
.-..|+||..+ -++|..+.+| T Consensus 1 MIE~~ik~~LerlT~l~vYPLlLPdt~-------~eGvtyQRISDpk~~--sGl~~T~Lv~~RfQI~~~~~dDY~~ll~l 71 (129) T protein:vir:10 1 MIELAIKNELERITGMDAYPLLLPDTV-------QEGVTFQRISDPEMY--SGTLRTGIVSARIQVNLYRVDDYTSLLQL 71 (129) T ss_pred CccHHHHHHHHHhhcCcccceecCCch-------hcCeeeeeccCcccc--chhhhheeeeeEEEEEEEEecCchHHHHH Confidence 999999999999999999999999753 344999999866544 4543 34679999999 8999999999 Q ss_pred HHHHHHHHHhhcceeecc----------C-CC---ccccccceeeEEEEEEeC Q lcl|NC_019769. 77 RNMALDALQVLKPGSIVK----------T-PG---YEPDLRYHRATLEFQVTV 115 (115) Q Consensus 77 ~~av~~Al~~~~~~~~~~----------~-~~---ye~dT~lyr~~~df~i~~ 115 (115) -+++..+-+......+++ . ++ --...+.||.+=||-|+- T Consensus 72 d~~i~~~We~i~HG~Ig~yPVQ~V~RG~~~Q~~~tltnn~~~yr~~RDfII~y 124 (129) T protein:vir:10 72 DKKIWSEWKSIVHGQLDGVPVQYVERGGIQQDKTTLTNRSIQYRLIRDFIIHY 124 (129) T ss_pred HHHHHHHhhhhcccccCCeeeeeeeeccccccceeccCCcEEEEEEeeEEEEe Confidence 999999888643332221 1 12 234568899999999998 No 56 >protein:vir:5744 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:11393 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892055;genbank:gi:33770518;uniprot:Q7Y405;genbank:GeneID:2637455 Probab=98.00 E-value=1.1e-07 Score=58.67 Aligned_cols=106 Identities=20% Similarity=0.247 Sum_probs=81.7 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCC---CcceEEEEEE-eeCCHHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQA---ESAVSVQVDV-YSSTITEARTI 76 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~---~~~~~vQIDv-yA~t~~~A~~l 76 (115) |+|.+|++.|..|-+-.|||...|... .-=++||+||+.--| .|.. .-..|+||.. 
--++|..+.+| T Consensus 15 MIE~~ik~~LerlT~l~vYPLlLPdt~-------~EGVtyQRISDPk~~--sGl~~T~LV~~RfQI~~~~~dDY~~ll~l 85 (140) T protein:vir:57 15 MIEQSLKSALERITGMNVYPLLLPDTE-------LEGVTFQRISDPEIE--TGLVRTNLIDCRFQITIHLIDDYTRLVVL 85 (140) T ss_pred HhhHHHHHHHHHhhcCcccceecCChh-------hcCeeeeeccCccch--hhhhhhheeeeEEEEEEEEecCchHHHHH Confidence 999999999999999999999999753 334999999866544 4543 3467999998 46789999999 Q ss_pred HHHHHHHHHhhcceeecc----------C-CC---ccccccceeeEEEEEEeC Q lcl|NC_019769. 77 RNMALDALQVLKPGSIVK----------T-PG---YEPDLRYHRATLEFQVTV 115 (115) Q Consensus 77 ~~av~~Al~~~~~~~~~~----------~-~~---ye~dT~lyr~~~df~i~~ 115 (115) -+++..+-+......+++ . ++ --...+.||.+=||-|+- T Consensus 86 d~~i~~~We~i~HG~Ig~yPVQ~V~RG~~~Q~~~tltnn~~~Yrl~RDFII~y 138 (140) T protein:vir:57 86 DAAIWAEWKKVVHGYIDGYPVQYVRRGGVQQGVTTLTNNSKHFWFSRDFILSF 138 (140) T ss_pred HHHHHHHHhhhcccccCCeeeeeeeeccccccceeccCCcEEEEEEeeEEEEe Confidence 999999888643332221 1 12 234568899999999998 No 57 >protein:vir:80105 Length: 162 # NCBI annotation: gp13 # Family: family:all:2729 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468717;genbank:gi:157325297;genbank:GeneID:5601796 Probab=97.86 E-value=2.8e-07 Score=56.51 Aligned_cols=107 Identities=11% Similarity=0.122 Sum_probs=65.8 Q ss_pred CchHHHHHHHHhhc-CCccceeeccCCCCCCccccccEEEEEecCCCccceecCCC--C--cceEEEEEEeeCCHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLA-GGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQA--E--SAVSVQVDVYSSTITEART 75 (115) Q Consensus 1 M~E~~i~~lL~~l~-~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~--~--~~~~vQIDvyA~t~~~A~~ 75 (115) |.-..|-.+ ..+. +..| .|+. ...|.-.+||++|+.++ |+--.++.- + -.+++||||+|.+..+|.. 
T Consensus 13 lv~~ii~~i-~~~~~gl~v----I~~~-~~g~~p~yPF~TY~v~~--pyi~~~~~~~~~e~~~~~isi~~~S~~~~eAl~ 84 (162) T protein:vir:80 13 LVKTLINAV-NELSGGLQL----IESS-SGGEQPEYPFCQYTITS--PYIAISPDIVEGEQFEIVISLTWRALSGHQALN 84 (162) T ss_pred HHHHHHHHH-HhhhcceeE----EEcc-CCCCCCCCCeEEEEEec--CccccCCcccCCcceEEEEEEEEEeCCHHHHHH Confidence 443333333 3333 3344 4443 23344479999999874 332223321 1 2468999999999999999 Q ss_pred HHHHHHHHHHhhc------------ceeeccCCC---ccccccceeeEEEEEEeC Q lcl|NC_019769. 76 IRNMALDALQVLK------------PGSIVKTPG---YEPDLRYHRATLEFQVTV 115 (115) Q Consensus 76 l~~av~~Al~~~~------------~~~~~~~~~---ye~dT~lyr~~~df~i~~ 115 (115) |+.++++.++..+ +....+.++ +--.---||..||+++.| T Consensus 85 la~~l~~~f~~~~~~~~~~~~~gIvvvdv~~~~~R~~~~~~~yerR~GFD~~~Rv 139 (162) T protein:vir:80 85 LANITNKYFRSQKGRFFMQENGGIVVVSVQNSGLRDTFISIEYERSAGIDLRLRV 139 (162) T ss_pred HHHHHHHHhhcCCceeeeeecCcEEEEecCCCccceeEeeeeeeeeecceEEEEE Confidence 9999999996311 111223333 223334699999999999 No 58 >protein:vir:3618 Length: 129 # NCBI annotation: ORF41 # Family: family:all:504 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112704;genbank:gi:13786572;genbank:GeneID:921070 Probab=97.56 E-value=2.8e-06 Score=51.02 Aligned_cols=107 Identities=13% Similarity=0.199 Sum_probs=70.1 Q ss_pred Cc--hHHHH----HHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeC--CHHH Q lcl|NC_019769. 1 MT--EDDLY----PLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSS--TITE 72 (115) Q Consensus 1 M~--E~~i~----~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~--t~~~ 72 (115) |. +++|| +.|.. .+.+||-++-+++ +++|||+.......+..+-+.. ...+.+.||||+. 
.+.+ T Consensus 2 mksp~qeL~d~~f~~l~~-lG~~vyD~lP~~~------v~YPfV~ig~~~~~~~~tKt~~-~g~v~ltihVW~~~~~R~~ 73 (129) T protein:vir:36 2 IKTRDQSIFDELFKRIQA-LGYTVYDYKPMNE------VGYPFVELENTQTIHEANKTDI-KGTVSLSLSVWGLQKKRKE 73 (129) T ss_pred CcChhHHHHHHHHHHHHh-cCCeeeeccCCCC------CCcCEEEeeeeeecCCcccccc-ccEEEEEEEEEeCCcCchh Confidence 55 56654 44555 5889997655443 5799999887766655543222 2467999999987 4678 Q ss_pred HHHHHHHHHHHHHhhc----cee--------eccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 73 ARTIRNMALDALQVLK----PGS--------IVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 73 A~~l~~av~~Al~~~~----~~~--------~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..++.++..++.... +.. ..-..|-.+++-|+|..+.+.... T Consensus 74 v~~i~~~i~~~~~~~~~t~~y~~~~~~~~~~~q~~~D~st~~~L~Hgii~l~f~~ 128 (129) T protein:vir:36 74 VSDMASNIFNQALNISATDGYSWALNSQASTIQMLDDTTTNTPLKRALINLEFRL 128 (129) T ss_pred HHHHHHHHHHHhcccccCCCeEEEEEeeeeeEEEeccCCCCceeeEEEEEEEEEe Confidence 8899999988875321 110 011234447888999876666666 No 59 >protein:vir:2741 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695114;genbank:gi:23455883;genbank:GeneID:955650 Probab=97.56 E-value=2.3e-06 Score=51.48 Aligned_cols=107 Identities=13% Similarity=0.160 Sum_probs=69.3 Q ss_pred Cc--hHHH----HHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCC--HHH Q lcl|NC_019769. 1 MT--EDDL----YPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSST--ITE 72 (115) Q Consensus 1 M~--E~~i----~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t--~~~ 72 (115) |. +++| |+.|.. .+-+||-++-+++ +++|||+.......+..+-+..+ ..+.+.||||+.. 
+.+ T Consensus 1 M~sp~qeL~~~lf~~l~~-~g~~vyD~lP~~~------~~YPfV~ig~~~~~~~~tkt~~~-g~~~l~i~vW~~~~~R~~ 72 (128) T protein:vir:27 1 MKQPDQLLHDEMYRISCE-LGYNTYTYLPPDD------AAYPFVVMGETMVLPQSTKSHLI-GRLSSTVHVWGHVDDRKT 72 (128) T ss_pred CCCHHHHHHHHHHHHHHh-cCCceeccCCCCC------CCcCEEEeccceecCCccccccc-cEEEEEEEEEECCcchhH Confidence 98 5555 445555 3668886544332 57999999888777666544432 4567999999974 677 Q ss_pred HHHHHHHHHHHHHhhccee-------ec-----cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 73 ARTIRNMALDALQVLKPGS-------IV-----KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 73 A~~l~~av~~Al~~~~~~~-------~~-----~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..++.++..++....... .. ...|-..++-|+|..+++.... T Consensus 73 v~~i~~~i~~~~~~~~~t~~y~~~~~~~~~~~qil~Dtst~~~l~Hgii~l~f~~ 127 (128) T protein:vir:27 73 LSDMAGQLMSSFFAIKKIGGKQFSAEVNESSIDSNRDNSTDEVLYHFIIYTYFKF 127 (128) T ss_pred HHHHHHHHHHHhccccccCCeeEEEEeecceEEEeeecCCCceeeEEEEEEEEEe Confidence 7888888888875332111 11 1223456788888776666555 No 60 >protein:vir:3972 Length: 129 # NCBI annotation: structural protein # Family: family:all:504 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663680;genbank:gi:21716117;genbank:GeneID:951217 Probab=97.56 E-value=3.1e-06 Score=50.81 Aligned_cols=107 Identities=12% Similarity=0.202 Sum_probs=70.0 Q ss_pred Cc--hHHHH----HHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeC--CHHH Q lcl|NC_019769. 1 MT--EDDLY----PLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSS--TITE 72 (115) Q Consensus 1 M~--E~~i~----~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~--t~~~ 72 (115) |. +++|| +.|.. .+.+||-++-+++ +++|||+.......+..+-+.. ...+.+.||||+. 
.+.+ T Consensus 2 mksp~qeL~d~~f~~l~~-lG~~vyD~lP~~~------v~YPfV~ig~~~~~~~~tKt~~-~g~v~ltihVW~~~~~R~~ 73 (129) T protein:vir:39 2 IKTRDQSIFDELFKRIQA-LGYTVYDYKQMNE------VGYPFVEMENTQTIHEPNKTDI-KGTVSLSLSVWGLQKKRKE 73 (129) T ss_pred CcChhHHHHHHHHHHHHh-cCCeeeeccCCCC------CCcCEEEeeeeeecCCcccccc-ccEEEEEEEEEeCCcCchh Confidence 55 56554 44555 5889997655443 5799999887766655543222 2467999999997 4678 Q ss_pred HHHHHHHHHHHHHhhccee-------e-----ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 73 ARTIRNMALDALQVLKPGS-------I-----VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 73 A~~l~~av~~Al~~~~~~~-------~-----~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..++.++..++....... . .-..|-.+++-|+|..+++.... T Consensus 74 v~~i~~~i~~~~~~~~~t~~y~~~~~~~~~~~q~~~Dts~~~~L~Hgvi~l~f~~ 128 (129) T protein:vir:39 74 VSDMASNIFNQALNISATDGYSWALNLQASTIQMMDDTTTGTPLKRAFINLEFRL 128 (129) T ss_pred HHHHHHHHHHHhcccccCCCeeEEEeecceeEEEecccCCCceeeeEEEEEEEEe Confidence 8888888887774321111 1 11223458899999977766666 No 61 >protein:vir:744 Length: 129 # NCBI annotation: major structural protein 2 # Family: family:all:504 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108721;genbank:gi:13487843;genbank:GeneID:920879 Probab=97.55 E-value=3.4e-06 Score=50.54 Aligned_cols=107 Identities=12% Similarity=0.189 Sum_probs=70.1 Q ss_pred Cc--hHHHH----HHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeC--CHHH Q lcl|NC_019769. 1 MT--EDDLY----PLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSS--TITE 72 (115) Q Consensus 1 M~--E~~i~----~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~--t~~~ 72 (115) |. +++|| +.|.. +|.+||-++-++. +++|||+.......+..+-+.. ...+.+.||||+. 
.+.+ T Consensus 2 mksp~qeL~d~~~~~l~~-lG~~vyD~lP~~~------v~YPfV~ig~~~~~~~~tKt~~-~g~v~ltihVW~~~~~R~~ 73 (129) T protein:vir:74 2 IKTRDQSIFDELFKRIQA-LGYTVYDYKPMNE------VGYPFVELENTQTIHEANKTDI-KGTVSLSLSVWGLQKKRKE 73 (129) T ss_pred CcChhHHHHHHHHHHHHh-cCCeeeeccCCCC------CCcCEEEeeeeeecCCcccccc-ccEEEEEEEEeeCCccchh Confidence 55 56654 44555 5889997655543 5799999887766655543222 2468999999987 4678 Q ss_pred HHHHHHHHHHHHHhhccee--ec----------cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 73 ARTIRNMALDALQVLKPGS--IV----------KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 73 A~~l~~av~~Al~~~~~~~--~~----------~~~~ye~dT~lyr~~~df~i~~ 115 (115) +..++.++..++....-.. .+ -..|-.+++-|+|..+++.... T Consensus 74 v~~i~~~i~~~~~~~~~t~~y~~~~~~~~~~~q~~~Dtst~~~L~Hgvi~l~f~~ 128 (129) T protein:vir:74 74 VSDMASNIFNQALNISATDGYSWALNSQASTIQMLDDTTTHTPLKRALINLEFRL 128 (129) T ss_pred HHHHHHHHHHHhccccccCCcEEEEeecceeEEEcccCCCCceeeeEEEEEEEEe Confidence 8889999988875321111 11 1123348889999977666666 No 62 >protein:vir:4907 Length: 128 # NCBI annotation: gp128 # Family: family:all:504 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056685;genbank:gi:9635020;genbank:GeneID:1262660 Probab=97.30 E-value=6.5e-06 Score=49.01 Aligned_cols=106 Identities=15% Similarity=0.184 Sum_probs=67.9 Q ss_pred Cc--hHHHHHHH----HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeC--CHHH Q lcl|NC_019769. 1 MT--EDDLYPLL----EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSS--TITE 72 (115) Q Consensus 1 M~--E~~i~~lL----~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~--t~~~ 72 (115) |. +++||..| ..+ +=.||-++-++. +++|||+.......+..+-++.+ ..+++-||||+. 
.+.+ T Consensus 1 m~sp~q~L~~~~f~~l~~~-g~~vyD~lP~~~------v~YPfV~ig~~~~~~~~tKt~~~-g~v~ltihVW~~~~~R~e 72 (128) T protein:vir:49 1 MKQPDQLLHDEMYRISCEL-GYNTYTYLPPDD------AAYPFVVMGETMVLPQSTKSHLI-GRLSSTVHVWGRVDDRKT 72 (128) T ss_pred CCchHHHHHHHHHHHHHhc-CCceecccCCCC------CCCCEEEeeeeeecCCccccccc-cEEEEEEEEEeCCCCchh Confidence 98 56665544 443 336775433332 57999999888777666544433 356799999987 4678 Q ss_pred HHHHHHHHHHHHHhhccee-------e-----ccCCCccccccceeeEE--EEEEe Q lcl|NC_019769. 73 ARTIRNMALDALQVLKPGS-------I-----VKTPGYEPDLRYHRATL--EFQVT 114 (115) Q Consensus 73 A~~l~~av~~Al~~~~~~~-------~-----~~~~~ye~dT~lyr~~~--df~i~ 114 (115) +..++.++..++....... . ....+-+.++-|+|..+ +|.++ T Consensus 73 v~~i~~~i~~~l~~~~~t~~y~f~~~i~~s~~~~~~D~st~~~L~Hgvl~l~f~~~ 128 (128) T protein:vir:49 73 LSDMAGQLMSSFFAIKNIGGKQFSAEINQSSIDSNRDNSTDEVLYHFVIYTYFKFV 128 (128) T ss_pred HHHHHHHHHHHhhcccccCCeEEEEEeccceEEEEeecCCCcceeeEEEEEEEEeC Confidence 8899999998885422111 1 11223456677788776 56666 No 63 >protein:vir:98629 Length: 126 # NCBI annotation: hypothetical protein # Family: family:all:508 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039928;genbank:gi:126011103;genbank:GeneID:4818465 Probab=97.22 E-value=2.5e-06 Score=51.32 Aligned_cols=111 Identities=13% Similarity=0.060 Sum_probs=89.0 Q ss_pred Cc--hHHHHHHHHh---hcCCccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCC-cceEEEEEEeeCCHHHH Q lcl|NC_019769. 1 MT--EDDLYPLLEP---LAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAE-SAVSVQVDVYSSTITEA 73 (115) Q Consensus 1 M~--E~~i~~lL~~---l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~-~~~~vQIDvyA~t~~~A 73 (115) |+ =-+||.+|.. +...|++-+-.|+..+ ...|+||-.-+.. .|.++-++... ....+||||=...+... 
T Consensus 2 m~DiL~~Iy~~L~~d~~i~~~~Ikfye~Pe~~d----~~~p~IVI~Pl~~P~p~~~~sd~~ls~~y~yQIDVes~~R~~~ 77 (126) T protein:vir:98 2 VRDMLAEVFDLLKADNVLKLVKIKSFERPESLL----DDQTSIVILPITAPKQSTFGSDTALSKKFLYQIEVESTSRLEC 77 (126) T ss_pred hhHHHHHHHHHHhcCceeceeeeeeeecCCccc----cCcceEEEeeCCCCCcccccCChhhheeeeeeeecccccccch Confidence 44 2567777753 4445999999999875 3579999998877 88888888665 67899999999999999 Q ss_pred HHHHHHHHHHHHhhcceeec-cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVLKPGSIV-KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~~~~~~~-~~~~ye~dT~lyr~~~df~i~~ 115 (115) ++|.+.|+..|..++..-.. +.+.|++|||+|+.+=-|+=+- T Consensus 78 ~~i~~rI~~~l~~~gf~q~~~gldeY~~Et~ryvdaRrY~G~~ 120 (126) T protein:vir:98 78 KDLQRRIEKQLEKIGFYQNDAGFERFDRDTGRYLDARTFRGFS 120 (126) T ss_pred HHHHHHHHHHHHHcCccccccCcchhhhhhhhhhhhhhhccCc Confidence 99999999999988766544 5678999999998776666552 No 64 >protein:vir:99537 Length: 125 # NCBI annotation: putative protein # Family: family:all:504 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958542;genbank:gi:41179324;genbank:GeneID:2717175 Probab=97.19 E-value=1.5e-05 Score=47.07 Aligned_cols=108 Identities=10% Similarity=0.122 Sum_probs=68.2 Q ss_pred Cc-hHHHHHHHHhh---cCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeC--CHHHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPL---AGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSS--TITEAR 74 (115) Q Consensus 1 M~-E~~i~~lL~~l---~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~--t~~~A~ 74 (115) |+ +++||..|=.. .|..||-++-+++ +++|||+.......+..+-.+- ...+.+-||||+. .+.++. 
T Consensus 1 m~P~q~Lfd~~f~~~~~lG~~vyD~lP~~~------v~YPFVvig~~~~~~~~tKt~~-~g~i~lti~VWg~~~~R~~v~ 73 (125) T protein:vir:99 1 MNPYEELFKTVIEYCKKTGYPTFDYLPDES------QGYPFIMVGDQINNDIYAKDFV-TGTSNLTIHVFAEYNYRAEVA 73 (125) T ss_pred CchhHHHHHHHHHHHHhcCCceeeecCCCC------CCcCEEEEeeeeecCCCCcccc-ceEEEEEEEEeeCcccchhHH Confidence 99 77776665322 4667886544332 5799999887766654432221 2467999999997 456777 Q ss_pred HHHHHHHHHHHhhcce----------eeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 75 TIRNMALDALQVLKPG----------SIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 75 ~l~~av~~Al~~~~~~----------~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) .++.++...+....-+ ...-..|-++.|-|+|..+.+...+ T Consensus 74 ~i~~~i~~~~~~~~~t~~y~~~~~~~~~qii~D~s~~t~L~Hg~l~l~F~i 124 (125) T protein:vir:99 74 TIMEQIQQLIPKFITTNHYLFGLTGSSSNILGETADSIQLQHGRLILDFNL 124 (125) T ss_pred HHHHHHHHHhccceeccCcEEEeeeeeEEEeecCCCCceeeEEEEEEEEee Confidence 7777777655432111 1112345567888888877766666 No 65 >protein:vir:96485 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238497;genbank:gi:66391773;genbank:GeneID:5176907 Probab=97.07 E-value=1.8e-05 Score=46.66 Aligned_cols=106 Identities=14% Similarity=0.177 Sum_probs=67.3 Q ss_pred Cc--hHHHHHHH----HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeC--CHHH Q lcl|NC_019769. 1 MT--EDDLYPLL----EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSS--TITE 72 (115) Q Consensus 1 M~--E~~i~~lL----~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~--t~~~ 72 (115) |. +++||..| .. .+-+||-++-+++ +++|||+.......+..+-+. -...+.+-||||+. 
.+.+ T Consensus 1 m~sp~qeL~d~~f~~l~~-~g~~vyd~lP~~~------v~YPfV~ig~~~~~~~~tKt~-~~g~v~ltihVW~~~~~R~~ 72 (128) T protein:vir:96 1 MKQPDQLLHDEMYRISSG-LGYDTYTYLPPEG------AAYPFVVMGETMVLPQSTKSH-LIGRLSSTVHVWGRVDDRKT 72 (128) T ss_pred CCCHHHHHHHHHHHHHHh-cCCeeecccCCCC------CCCCEEEEeeeeecCCccccc-cccEEEEEEEEEECCCCchh Confidence 98 56665544 44 3557886443332 579999988776666554332 23568999999987 5788 Q ss_pred HHHHHHHHHHHHHhhccee-------ecc-----CCCccccccceee--EEEEEEe Q lcl|NC_019769. 73 ARTIRNMALDALQVLKPGS-------IVK-----TPGYEPDLRYHRA--TLEFQVT 114 (115) Q Consensus 73 A~~l~~av~~Al~~~~~~~-------~~~-----~~~ye~dT~lyr~--~~df~i~ 114 (115) +..++.++..++....... ..+ ..|-..++-|+|. +++|.++ T Consensus 73 v~~i~~~i~~~l~~~~~t~~y~~~~~~~~~~~qii~D~st~~~l~Hgil~l~f~~~ 128 (128) T protein:vir:96 73 LSDMAGQLMSSFFTIKNIDGMQFSAEVNESSIDSNRDNSTDEVLYHFIIYTYFKFI 128 (128) T ss_pred HHHHHHHHHHHhhhhhccCCeEEEEEEeeeeEEEeeecCCCceeeEEEEEEEEEeC Confidence 8899999998885432111 111 1222246677777 6677777 No 66 >protein:vir:98426 Length: 131 # NCBI annotation: ORF6 # Family: family:all:12105 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958284;genbank:gi:41057258;uniprot:Q38599;genbank:GeneID:2732810 Probab=96.77 E-value=6.3e-05 Score=43.62 Aligned_cols=104 Identities=14% Similarity=0.086 Sum_probs=69.6 Q ss_pred Cc--hHHHHHHHHh-----hcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHHHH Q lcl|NC_019769. 1 MT--EDDLYPLLEP-----LAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTITEA 73 (115) Q Consensus 1 M~--E~~i~~lL~~-----l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~~A 73 (115) |- |+-+-.+|+. +.+=+|+.. .|.. .+..+|+..+.+|...|. 
..++.++=|-||+.|.++| T Consensus 6 ~pda~~v~~~~lr~~l~a~~~~V~V~t~-vP~~------RP~rfV~VertgG~~~~~----~~Dr~~L~Vq~W~~t~~~A 74 (131) T protein:vir:98 6 MPDAVAVIAGYLRAVLVARGVTVPVGSR-VPSP------RPARFVRIERIGGPANTV----VTDRPRLDVHCWGSSEEDA 74 (131) T ss_pred CCchhHHHHHHHHHHHHhcCCceEeccc-CCCC------CCceEEEEEecCCCcCCc----cccceEEEEEecCCCHHHH Confidence 22 4444444432 223355543 2321 256899999998875554 3567888889999999999 Q ss_pred HHHHHHHHHHHHh----hcceeec--cC----CCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQV----LKPGSIV--KT----PGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~----~~~~~~~--~~----~~ye~dT~lyr~~~df~i~~ 115 (115) -.|+.++|+.|.. +.....+ +. +--|+||+.+|..+...+++ T Consensus 75 ~~La~~vr~~ll~~~~~~g~~~~~~~e~~gpy~~PD~es~~~Ryq~tv~l~~ 126 (131) T protein:vir:98 75 HDLMQLCRALLGAARGSHGDTVLARPATGGPQFLPDAETGAARWAFTLDITM 126 (131) T ss_pred HHHHHHHHHHHhhcccccchheeccccCCCCCcCCCCCCCCceeEEEEEEEe Confidence 9999999986652 2222322 12 22478999999999999999 No 67 >protein:vir:95765 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950594;genbank:gi:119953789;genbank:GeneID:5076835 Probab=95.98 E-value=0.00023 Score=40.57 Aligned_cols=106 Identities=14% Similarity=0.130 Sum_probs=64.3 Q ss_pred Cc-hHHHHHHH---HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCC--HHHHH Q lcl|NC_019769. 1 MT-EDDLYPLL---EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSST--ITEAR 74 (115) Q Consensus 1 M~-E~~i~~lL---~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t--~~~A~ 74 (115) |+ +++||..+ +.+ +=-+|-+ .|.. .+++|||+.--....+..+-... ..+.+-||||+.. +.+.. 
T Consensus 1 m~P~qeLfd~~f~~~~~-Gy~vYD~-lP~~-----~v~YPFVvig~~~~~~~~tKt~~--G~i~l~i~VWg~~~~R~~vs 71 (127) T protein:vir:95 1 MTPNHALFRRLFAISNI-RVDTYDF-LPDA-----KSAYPFVYIGENNGSDIPNKDLL--GRLRQTVHLYGLRTDRANLD 71 (127) T ss_pred CchhHHHHHHHHHHHhc-CCccccc-cCcC-----CCCcCEEEEeeeeecccccceee--eEEEEEEEeecCchhhhhHH Confidence 99 77776655 332 3356654 4522 26899999987777777665422 3578889999873 45566 Q ss_pred HHHHHHHHHHHhhccee----------eccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 75 TIRNMALDALQVLKPGS----------IVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 75 ~l~~av~~Al~~~~~~~----------~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) .++.++..++....... ..-+.|-..+|-|+|..+.+..-- T Consensus 72 ~i~~~i~~~~~~~~~~~~y~~~~~~s~~qil~Dtstnt~L~Hgil~l~f~f 122 (127) T protein:vir:95 72 DISAYLESEVKRAHDGYDYHLYHVETSKQIIPDNTDVQPLLHIVLDFTFDY 122 (127) T ss_pred HHHHHHHHHhhhhcccceeEEEEecceeEEecccCCcceeEEEEEEEEEEe Confidence 66777766553221111 111233445788888776665555 No 68 >protein:vir:81158 Length: 109 # NCBI annotation: hypothetical protein # Family: family:all:1089 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285817;genbank:gi:148747738;genbank:GeneID:5247201 Probab=95.80 E-value=0.00021 Score=40.73 Aligned_cols=103 Identities=18% Similarity=0.196 Sum_probs=69.7 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCC-CcceEEEEEEeeCCHHHHHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQA-ESAVSVQVDVYSSTITEARTIRNM 79 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~-~~~~~vQIDvyA~t~~~A~~l~~a 79 (115) ||.++|+..|+. .+-.|--+-...| | .+|||||...+.... .=+|.- .....+||..|-+-.+.+. -+. 
T Consensus 4 mt~~~l~~~Lk~-~GlPvay~~F~~g----p--~pPyivY~~~~~~~~-~ADn~vy~~~~~~~IELYT~~KD~~~--E~~ 73 (109) T protein:vir:81 4 MTQAELYQALKS-IGFPVAYGSFTNP----V--TPPFITYQFAYSNDM-MADNINYVAIDDFQVELYTKKKDPVA--EQK 73 (109) T ss_pred ecHHHHHHHHHh-cCCCeeeccCCCC----C--CCceEEEEeccCcce-eccceEEEeccceEEEEEeeccChHH--HHH Confidence 999999999998 3445544444443 2 479999999753321 223332 2345899999997665443 246 Q ss_pred HHHHHHhhc-ceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 80 ALDALQVLK-PGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 80 v~~Al~~~~-~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) |+++|+.+. +.. ....|-..-|+|-...+|.++= T Consensus 74 iE~~L~~~~i~y~--k~et~IesEklyq~~Y~~~~~g 108 (109) T protein:vir:81 74 VQDKLKELGLPYR--KFETFIDTENLFQILYEIQILG 108 (109) T ss_pred HHHHHHhcCCcee--eeEEEecCCceEEEEEEEEEec Confidence 788888654 333 2235888889999999999988 No 69 >protein:vir:80109 Length: 104 # NCBI annotation: Putative aminopeptidase # Family: family:all:1089 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425609;genbank:gi:155042942;genbank:GeneID:5469534 Probab=94.96 E-value=0.00041 Score=39.14 Aligned_cols=102 Identities=17% Similarity=0.194 Sum_probs=66.8 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCC-CcceEEEEEEeeCCHHHHHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQA-ESAVSVQVDVYSSTITEARTIRNM 79 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~-~~~~~vQIDvyA~t~~~A~~l~~a 79 (115) ||-++|+..|+.+ +-.|--.-. | ..| .+|||||...++.. ..=+|.- .....+||..|.+-.+.+. -+. 
T Consensus 1 Mt~~~l~~~Lk~~-glPvay~~F--~--~~P--~pPyivy~~~~~~~-~~ADn~~y~~~~~~~IELYT~~Kd~~~--E~~ 70 (104) T protein:vir:80 1 MNLDELNTILKQT-GFPVAYSHF--G--KPQ--KPPFITYVVAYSSN-FGADDKVYQDIENVQIELYTDKKDLEA--EER 70 (104) T ss_pred CCHHHHHHHHHhc-CCCeeeecC--C--CcC--CCCEEEEEecCCcc-eeccceEEEeecceEEEEEeeccCHHH--HHH Confidence 9999999999973 222211111 1 112 47999999864322 1223332 2445899999998776543 346 Q ss_pred HHHHHHhhc-ceeeccCCCccccccceeeEEEEEEe Q lcl|NC_019769. 80 ALDALQVLK-PGSIVKTPGYEPDLRYHRATLEFQVT 114 (115) Q Consensus 80 v~~Al~~~~-~~~~~~~~~ye~dT~lyr~~~df~i~ 114 (115) |+++|+.++ +.... ..|-..-|+|-...+|.++ T Consensus 71 iE~~Ld~~~i~y~k~--et~IesEklyq~~Y~~~l~ 104 (104) T protein:vir:80 71 IKAVLDANSLYYETT--ETYIPSERLYQKVYEVRLL 104 (104) T ss_pred HHHHHhhCCCceeeE--EEEecCcceEEEEEEEEeC Confidence 777887654 33333 3588888999999999999 No 70 >protein:vir:95371 Length: 104 # NCBI annotation: aminopeptidase # Family: family:all:1089 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764481;genbank:gi:115334635;genbank:GeneID:5179258 Probab=94.81 E-value=0.00059 Score=38.31 Aligned_cols=101 Identities=17% Similarity=0.226 Sum_probs=67.1 Q ss_pred CchHHHHHHHHhhcCCcc-ceeeccCCCCCCccccccEEEEEecCCCccceecCCC-CcceEEEEEEeeCCHHHHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLAGGQV-YPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQA-ESAVSVQVDVYSSTITEARTIRN 78 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rv-yp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~-~~~~~vQIDvyA~t~~~A~~l~~ 78 (115) ||-++|+..|+.+ +-.| |-.+++ | | .+|||||...++... .=+|.- .....+||..|.+-.+.+. -+ T Consensus 1 Mt~~~l~~~Lk~~-glPvay~hF~~-~----p--~pPyivy~~~~~~~~-~ADn~~y~~~~~~~IELYT~~Kd~~~--E~ 69 (104) T protein:vir:95 1 MKLTELDDLLKAT-GLPVAYSHFSK-P----Q--KPPFITYMVAYSSNF-TADDQVYQEIENVQIELYTLKKDFEA--EE 69 (104) T ss_pred CCHHHHHHHHHhc-CCCeeeccccC-C----C--CCceEEEEecCCcce-eccceEEEeecceEEEEEeeccCHHH--HH Confidence 9999999999974 2222 222221 1 2 469999998753321 224432 2445899999998776543 34 Q ss_pred HHHHHHHhhc-ceeeccCCCccccccceeeEEEEEEe Q lcl|NC_019769. 
79 MALDALQVLK-PGSIVKTPGYEPDLRYHRATLEFQVT 114 (115) Q Consensus 79 av~~Al~~~~-~~~~~~~~~ye~dT~lyr~~~df~i~ 114 (115) .|+++|+.+. +.... ..|-+.-|+|-...+|.++ T Consensus 70 ~iE~~Ld~~~i~y~k~--et~IesEklyq~~Y~~~l~ 104 (104) T protein:vir:95 70 KVKAVLDANNLVYETS--ETYIPSEKLYQKVYEVRLL 104 (104) T ss_pred HHHHHHHhCCCceeeE--EEEecCcceEEEEEEEEeC Confidence 6777887653 33333 3588889999999999999 No 71 >protein:vir:106593 Length: 131 # NCBI annotation: ORF039 # Family: family:all:504 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239498;genbank:gi:66395251;genbank:GeneID:4555747 Probab=94.31 E-value=0.0027 Score=34.65 Aligned_cols=108 Identities=10% Similarity=0.077 Sum_probs=59.3 Q ss_pred Cc--hHHHHHHH----HhhcCCccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCCcceEEEEEEeeCC--HH Q lcl|NC_019769. 1 MT--EDDLYPLL----EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAESAVSVQVDVYSST--IT 71 (115) Q Consensus 1 M~--E~~i~~lL----~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~~~~~vQIDvyA~t--~~ 71 (115) |+ +++||.-| .. .|-.||.+.-++. .+++|||+...+.. .+..+-... ...+.+.||||+.. +. T Consensus 3 ~ksp~qeLfd~~f~~~~~-lGy~vyd~lP~~~-----ev~YPFVvig~~~~~~~~~tKt~~-~g~v~lti~VWg~~~~R~ 75 (131) T protein:vir:10 3 KTTPQQALFDSIYAQLLG-YGIDVIDFKELNS-----QLTYPFFVLRDVEANKSKYTMESV-GGELTVIIDLWNYAEDRG 75 (131) T ss_pred ccChhHHHHHHHHHHHHh-cCCceeeccCCCC-----CCCCCEEEEeeeeccCCCCccccc-ceEEEEEEEEeecchhhh Confidence 33 66665554 44 4667886655432 25799999866532 222111111 24679999999984 44 Q ss_pred HHHHHHHHHHHHHHhhcce----e-ecc-----C-CCccccccceeeEEEEEEeC Q lcl|NC_019769. 72 EARTIRNMALDALQVLKPG----S-IVK-----T-PGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 72 ~A~~l~~av~~Al~~~~~~----~-~~~-----~-~~ye~dT~lyr~~~df~i~~ 115 (115) +...++.++...+....-. . ... . ++=.+++-|.|..+.+..-. 
T Consensus 76 ~vs~i~~~i~~~~~~~~~td~y~~~~~~~~~~~i~D~sttn~~L~Hg~i~lef~~ 130 (131) T protein:vir:10 76 QHDSIVGATEWMLTGIESVEGYQLMIDDINIKTLNDVENSDRQLLHTVIIAIYKL 130 (131) T ss_pred hHHHHHHHHHHHhhcceecccceEEecceEEEEEeccCCCCceeeeEEEEEEEEe Confidence 5556666666665322111 1 111 1 12346677888776666655 No 72 >protein:vir:103918 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873997;genbank:gi:118430772;genbank:GeneID:4525410 Probab=94.11 E-value=0.0023 Score=35.05 Aligned_cols=109 Identities=10% Similarity=0.116 Sum_probs=54.9 Q ss_pred Cc-hHHHHHHHHhhcCC---ccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCCcceEEEEEEeeCC--HHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPLAGG---QVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAESAVSVQVDVYSST--ITEA 73 (115) Q Consensus 1 M~-E~~i~~lL~~l~~~---Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~~~~~vQIDvyA~t--~~~A 73 (115) |+ +++||.-|=..+-+ -+|.+.-++. .+++|+|+...+.. .+..+-... ...+.+.||||+.. +.+. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~-----ev~YPFV~ig~~q~~~~~~tKt~~-~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:10 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQ-----EIPYPFFVIKMPESNRSKYTFDSY-SGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCC-----CCCCCEEEEcceeccCCCCccccc-ceEEEEEEEEeecccccchH Confidence 99 78887766443333 4554443321 26799999765422 122111111 13579999999985 3334 Q ss_pred HHHHHHHHHHHHhhccee--ec--------cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVLKPGS--IV--------KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~~~~~--~~--------~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..++.++...+....-.. 
.+ ...+-..++-|.|..+.+..-- T Consensus 75 s~i~~~i~~~~~~~~~t~~y~~~~~~~~~~~l~d~tTn~~L~Hgvi~lef~~ 126 (127) T protein:vir:10 75 DGLVKRCIDDLTPSVKTNDYDFEEDDTNIAQLVDDTTNQELLHTSITISYKT 126 (127) T ss_pred HHHHHHHHHHhccceeccceeEEeeeeeeeecccCCCcceeeeEEEEEEEee Confidence 455555554443211110 01 1111122445777665555544 No 73 >protein:vir:96217 Length: 127 # NCBI annotation: ORF036 # Family: family:all:504 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239575;genbank:gi:66395326;genbank:GeneID:5132763 Probab=94.11 E-value=0.0023 Score=35.05 Aligned_cols=109 Identities=10% Similarity=0.116 Sum_probs=54.9 Q ss_pred Cc-hHHHHHHHHhhcCC---ccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCCcceEEEEEEeeCC--HHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPLAGG---QVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAESAVSVQVDVYSST--ITEA 73 (115) Q Consensus 1 M~-E~~i~~lL~~l~~~---Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~~~~~vQIDvyA~t--~~~A 73 (115) |+ +++||.-|=..+-+ -+|.+.-++. .+++|+|+...+.. .+..+-... ...+.+.||||+.. +.+. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~-----ev~YPFV~ig~~q~~~~~~tKt~~-~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:96 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQ-----EIPYPFFVIKMPESNRSKYTFDSY-SGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCC-----CCCCCEEEEcceeccCCCCccccc-ceEEEEEEEEeecccccchH Confidence 99 78887766443333 4554443321 26799999765422 122111111 13579999999985 3334 Q ss_pred HHHHHHHHHHHHhhccee--ec--------cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVLKPGS--IV--------KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~~~~~--~~--------~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..++.++...+....-.. 
.+ ...+-..++-|.|..+.+..-- T Consensus 75 s~i~~~i~~~~~~~~~t~~y~~~~~~~~~~~l~d~tTn~~L~Hgvi~lef~~ 126 (127) T protein:vir:96 75 DGLVKRCIDDLTPSVKTNDYDFEEDDTNIAQLVDDTTNQELLHTSITISYKT 126 (127) T ss_pred HHHHHHHHHHhccceeccceeEEeeeeeeeecccCCCcceeeeEEEEEEEee Confidence 455555554443211110 01 1111122445777665555544 No 74 >protein:vir:99769 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004312;genbank:gi:122891766;genbank:GeneID:4712324 Probab=94.11 E-value=0.0023 Score=35.05 Aligned_cols=109 Identities=10% Similarity=0.116 Sum_probs=54.9 Q ss_pred Cc-hHHHHHHHHhhcCC---ccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCCcceEEEEEEeeCC--HHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPLAGG---QVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAESAVSVQVDVYSST--ITEA 73 (115) Q Consensus 1 M~-E~~i~~lL~~l~~~---Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~~~~~vQIDvyA~t--~~~A 73 (115) |+ +++||.-|=..+-+ -+|.+.-++. .+++|+|+...+.. .+..+-... ...+.+.||||+.. +.+. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~-----ev~YPFV~ig~~q~~~~~~tKt~~-~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:99 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQ-----EIPYPFFVIKMPESNRSKYTFDSY-SGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCC-----CCCCCEEEEcceeccCCCCccccc-ceEEEEEEEEeecccccchH Confidence 99 78887766443333 4554443321 26799999765422 122111111 13579999999985 3334 Q ss_pred HHHHHHHHHHHHhhccee--ec--------cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVLKPGS--IV--------KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~~~~~--~~--------~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..++.++...+....-.. 
.+ ...+-..++-|.|..+.+..-- T Consensus 75 s~i~~~i~~~~~~~~~t~~y~~~~~~~~~~~l~d~tTn~~L~Hgvi~lef~~ 126 (127) T protein:vir:99 75 DGLVKRCIDDLTPSVKTNDYDFEEDDTNIAQLVDDTTNQELLHTSITISYKT 126 (127) T ss_pred HHHHHHHHHHhccceeccceeEEeeeeeeeecccCCCcceeeeEEEEEEEee Confidence 455555554443211110 01 1111122445777665555544 No 75 >protein:vir:97143 Length: 127 # NCBI annotation: ORF041 # Family: family:all:504 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239730;genbank:gi:66394906;genbank:GeneID:5130876 Probab=94.11 E-value=0.0023 Score=35.05 Aligned_cols=109 Identities=10% Similarity=0.116 Sum_probs=54.9 Q ss_pred Cc-hHHHHHHHHhhcCC---ccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCCcceEEEEEEeeCC--HHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPLAGG---QVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAESAVSVQVDVYSST--ITEA 73 (115) Q Consensus 1 M~-E~~i~~lL~~l~~~---Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~~~~~vQIDvyA~t--~~~A 73 (115) |+ +++||.-|=..+-+ -+|.+.-++. .+++|+|+...+.. .+..+-... ...+.+.||||+.. +.+. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~-----ev~YPFV~ig~~q~~~~~~tKt~~-~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:97 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQ-----EIPYPFFVIKMPESNRSKYTFDSY-SGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCC-----CCCCCEEEEcceeccCCCCccccc-ceEEEEEEEEeecccccchH Confidence 99 78887766443333 4554443321 26799999765422 122111111 13579999999985 3334 Q ss_pred HHHHHHHHHHHHhhccee--ec--------cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVLKPGS--IV--------KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~~~~~--~~--------~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..++.++...+....-.. 
.+ ...+-..++-|.|..+.+..-- T Consensus 75 s~i~~~i~~~~~~~~~t~~y~~~~~~~~~~~l~d~tTn~~L~Hgvi~lef~~ 126 (127) T protein:vir:97 75 DGLVKRCIDDLTPSVKTNDYDFEEDDTNIAQLVDDTTNQELLHTSITISYKT 126 (127) T ss_pred HHHHHHHHHHhccceeccceeEEeeeeeeeecccCCCcceeeeEEEEEEEee Confidence 455555554443211110 01 1111122445777665555544 No 76 >protein:vir:96355 Length: 127 # NCBI annotation: ORF038 # Family: family:all:504 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239652;genbank:gi:66395404;genbank:GeneID:5132831 Probab=94.11 E-value=0.0023 Score=35.08 Aligned_cols=109 Identities=10% Similarity=0.116 Sum_probs=54.9 Q ss_pred Cc-hHHHHHHHHhhcCC---ccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCCcceEEEEEEeeCC--HHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPLAGG---QVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAESAVSVQVDVYSST--ITEA 73 (115) Q Consensus 1 M~-E~~i~~lL~~l~~~---Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~~~~~vQIDvyA~t--~~~A 73 (115) |+ +++||.-|=..+-+ -+|.+.-++. .+++|+|+...+.. .+..+-... ...+.+.||||+.. +.+. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~-----ev~YPFV~ig~~q~~~~~~tKt~~-~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:96 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQ-----EIPYPFFVIKMPESNRSKYTFDSY-SGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCC-----CCCCCEEEEcceeccCCCCccccc-ceEEEEEEEEeecccccchH Confidence 99 78887766443333 4554443321 26799999765422 222111111 13579999999985 3334 Q ss_pred HHHHHHHHHHHHhhccee--ec--------cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVLKPGS--IV--------KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~~~~~--~~--------~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..++.++...+....-.. 
.+ ...+-..++-|.|..+.+..-- T Consensus 75 s~i~~~i~~~~~~~~~t~~y~~~~~~~~~~~l~d~ttn~~L~Hgvi~lef~~ 126 (127) T protein:vir:96 75 DGLVKRCIDDLTPSVKTNDYDFEEDDTNITQLVDDTTNQELLHTSITISYKT 126 (127) T ss_pred HHHHHHHHHHhccceeccceeEEeeeeeeEEcccCCCcceeeeEEEEEEEee Confidence 455555554443211110 01 1111222455666665555444 No 77 >protein:vir:78854 Length: 127 # NCBI annotation: hypothetical protein # Family: family:all:504 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285366;genbank:gi:148717894;genbank:GeneID:5246985 Probab=94.11 E-value=0.0023 Score=35.08 Aligned_cols=109 Identities=10% Similarity=0.116 Sum_probs=54.9 Q ss_pred Cc-hHHHHHHHHhhcCC---ccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCCcceEEEEEEeeCC--HHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPLAGG---QVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAESAVSVQVDVYSST--ITEA 73 (115) Q Consensus 1 M~-E~~i~~lL~~l~~~---Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~~~~~vQIDvyA~t--~~~A 73 (115) |+ +++||.-|=..+-+ -+|.+.-++. .+++|+|+...+.. .+..+-... ...+.+.||||+.. +.+. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~-----ev~YPFV~ig~~q~~~~~~tKt~~-~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:78 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQ-----EIPYPFFVIKMPESNRSKYTFDSY-SGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCC-----CCCCCEEEEcceeccCCCCccccc-ceEEEEEEEEeecccccchH Confidence 99 78887766443333 4554443321 26799999765422 222111111 13579999999985 3334 Q ss_pred HHHHHHHHHHHHhhccee--ec--------cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVLKPGS--IV--------KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~~~~~--~~--------~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..++.++...+....-.. 
.+ ...+-..++-|.|..+.+..-- T Consensus 75 s~i~~~i~~~~~~~~~t~~y~~~~~~~~~~~l~d~ttn~~L~Hgvi~lef~~ 126 (127) T protein:vir:78 75 DGLVKRCIDDLTPSVKTNDYDFEEDDTNITQLVDDTTNQELLHTSITISYKT 126 (127) T ss_pred HHHHHHHHHHhccceeccceeEEeeeeeeEEcccCCCcceeeeEEEEEEEee Confidence 455555554443211110 01 1111222455666665555444 No 78 >protein:vir:9313 Length: 127 # NCBI annotation: phi Mu50B-like protein # Family: family:all:504 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803291;genbank:gi:29028601;genbank:GeneID:1258049 Probab=93.98 E-value=0.0025 Score=34.86 Aligned_cols=109 Identities=10% Similarity=0.113 Sum_probs=54.9 Q ss_pred Cc-hHHHHHHHHhhcCC---ccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCCcceEEEEEEeeCC--HHHH Q lcl|NC_019769. 1 MT-EDDLYPLLEPLAGG---QVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAESAVSVQVDVYSST--ITEA 73 (115) Q Consensus 1 M~-E~~i~~lL~~l~~~---Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~~~~~vQIDvyA~t--~~~A 73 (115) |+ +++||.-|=..+-+ -+|.+.-++. .+++|+|+...+.. .+..+-... ...+.+.||||+.. +.+. T Consensus 1 mtp~qeLfd~~f~~~~~lGy~vYd~lP~~~-----ev~YPFV~ig~~q~~~~~~tKt~~-~G~v~ltIdVWg~~~~R~~v 74 (127) T protein:vir:93 1 MTPNLQLYNKAYETLQGYGFPVISRKEMQQ-----EIPYPFFVIKMPESNRSKYTFDSY-SGDTNLVIDIWSVSDDLGHH 74 (127) T ss_pred CchhHHHHHHHHHHHHhcCCceeccCCCCC-----CCCCCEEEEcceeccCCCCccccc-ceEEEEEEEEeecccccchH Confidence 99 78887766443333 4554443321 26799999765422 222111111 13579999999985 3334 Q ss_pred HHHHHHHHHHHHhhccee--ec--------cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVLKPGS--IV--------KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~~~~~--~~--------~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..++.++...+....-.. 
.+ ...+-..++-|.|..+.+..-- T Consensus 75 s~i~~~i~~~~~~~~~t~~y~~~~~~~~~~~l~d~ttn~~L~Hgvl~lef~~ 126 (127) T protein:vir:93 75 DGLVKRCIDDLTPSVKTNDYDFEEEDTNITQLVDDTTNQELLHTSVTISYKT 126 (127) T ss_pred HHHHHHHHHHhccceeccceeEEeeeeeeEEcccCCCcceeeeEEEEEEEee Confidence 455555554443211110 01 1111222455776665555444 No 79 >protein:vir:6215 Length: 109 # NCBI annotation: hypothetical protein # Family: family:all:10885 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852595;genbank:gi:31415855;genbank:GeneID:1489213 Probab=93.94 E-value=0.0017 Score=35.80 Aligned_cols=102 Identities=11% Similarity=0.091 Sum_probs=62.8 Q ss_pred Cc--hHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHHHHHHHHH Q lcl|NC_019769. 1 MT--EDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTITEARTIRN 78 (115) Q Consensus 1 M~--E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~~A~~l~~ 78 (115) |- =+++.++|+.. +-.||-+.||.|+ .+||+||+.++..--..-.+.-....-.||..|-.-..+-- . T Consensus 1 M~i~Fe~lr~~Lk~~-g~~V~RD~ap~~t------~YPyivYs~v~e~~k~AS~kv~~~~~~YQvSl~T~GtE~dl---~ 70 (109) T protein:vir:62 1 MQINFEQLRSLMKKS-GIPVSRDNAPTGI------DYPYIVYEFVNEQHKRASNKVLKDMPLYQIAVITNGTEKDY---E 70 (109) T ss_pred CcccHHHHHHHHHhc-CCceeeccCCCCC------CCceEEEEeecCceeeeccceEeecceeEEEEeeccchhHH---H Confidence 65 68999999983 5599999999985 69999999997654433333333456799999987544322 2 Q ss_pred HHHHHHHh--hcceeeccCCC---ccccccceeeEEEEEEeC Q lcl|NC_019769. 79 MALDALQV--LKPGSIVKTPG---YEPDLRYHRATLEFQVTV 115 (115) Q Consensus 79 av~~Al~~--~~~~~~~~~~~---ye~dT~lyr~~~df~i~~ 115 (115) .+.++++. ..+....++++ =|.-|++|--. 
+.|- T Consensus 71 ~l~k~f~~~~vpfs~f~gIqgDENDdTiTnfyTyV---rcie 109 (109) T protein:vir:62 71 PLKAVFNEVGVSYSQFDGMDYDENDDTITQFITYV---RCIQ 109 (109) T ss_pred HHHHHHhhcCCccccccccCCCCCcchheeeeeee---EEeC Confidence 33445554 23333444443 34445555322 1111 No 80 >protein:vir:967 Length: 105 # NCBI annotation: Orf49 # Family: family:all:1089 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076621;genbank:gi:13095729;genbank:GeneID:920255 Probab=93.85 E-value=0.0013 Score=36.38 Aligned_cols=102 Identities=19% Similarity=0.248 Sum_probs=65.9 Q ss_pred CchHHHHHHHHhhcCC-ccceeeccCCCCCCccccccEEEEEecCCCccceecCCC-CcceEEEEEEeeCCHHHHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLAGG-QVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQA-ESAVSVQVDVYSSTITEARTIRN 78 (115) Q Consensus 1 M~E~~i~~lL~~l~~~-Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~-~~~~~vQIDvyA~t~~~A~~l~~ 78 (115) ||-++|+..|+.+ |- =.|-.+. +| ..| .+||+||...++.. ..=+|.- .....+||..|.+-.+.+.. + T Consensus 1 Mt~~~l~~~L~~~-GlPvAy~hF~-~g--~~P--~pPyivy~~~~~~~-~~ADn~~y~~~~~~~IELYT~~Kd~~~E--~ 71 (105) T protein:vir:96 1 MTLEELKVILDQT-GLKVGYRLWA-VG--QAP--PLPYILYYVDEEIG-FKADNQIYAKNKDITIELYSNLKNEREE--Q 71 (105) T ss_pred CCHHHHHHHHHhc-CCCeeeeecc-cC--ccc--CCCeEEEEecCCcc-eecCceEEEeecceeEEEeeeccCHHHH--H Confidence 9999999999974 21 1222222 11 222 47999999864332 1223432 24458999999987765543 4 Q ss_pred HHHHHHHhhc-ceeeccCCCccccccceeeEEEEEE Q lcl|NC_019769. 79 MALDALQVLK-PGSIVKTPGYEPDLRYHRATLEFQV 113 (115) Q Consensus 79 av~~Al~~~~-~~~~~~~~~ye~dT~lyr~~~df~i 113 (115) .|+++|+.+. +.... 
..|-..-|+|-...+|.+ T Consensus 72 ~iE~~Ld~~~i~y~k~--et~IesEklyq~~Y~~~l 105 (105) T protein:vir:96 72 KLEKLLDDNKIVYEIY--ESYLDSEKMYLRAYEINI 105 (105) T ss_pred HHHHHHhhCCCceeee--EEEecCcceEEEEEEEeC Confidence 6777887654 22222 458888899999999999 No 81 >protein:vir:7994 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817348;genbank:gi:29565776;genbank:GeneID:1259015 Probab=93.07 E-value=0.0018 Score=35.61 Aligned_cols=103 Identities=17% Similarity=0.185 Sum_probs=67.4 Q ss_pred CchH-------HHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHHHH Q lcl|NC_019769. 1 MTED-------DLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTITEA 73 (115) Q Consensus 1 M~E~-------~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~~A 73 (115) |||. -+-+-|+++.. |=-...| .-++|+++-+++.|... -- -+.+..++||++++...++| T Consensus 1 m~~~saP~~e~~vv~WLsp~~~--va~~R~~-------~~PLPf~~V~Rv~G~d~-~e--~~tD~avvsv~~fg~~~eaA 68 (134) T protein:vir:79 1 MATDSAPSIHRVLVAWLSPLGK--VSTRRLS-------GDPLPHRVVRRVDGRDV-PE--EGSDSAVVSVHTFAASDEAA 68 (134) T ss_pred CCcccCCChheeeeeecccchh--ceeccCC-------CCCCCeEEEEEeCCCCC-cc--ccccCceeEEEEeeCCHHHh Confidence 8863 23444565421 1111111 22689999999987543 11 12356799999999999999 Q ss_pred HHHHHHHHHHHHhh---cceee--c-----c-----------CCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVL---KPGSI--V-----K-----------TPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~---~~~~~--~-----~-----------~~~ye~dT~lyr~~~df~i~~ 115 (115) +.+++.+-..|..+ .+... . . 
.-+|..|+++-|.+--+++=+ T Consensus 69 ~d~ad~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~vl~~P~~~eY~dD~~~vrytgRY~~g~ 131 (134) T protein:vir:79 69 ENEAELTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGV 131 (134) T ss_pred hHHHHHHHHHHHHHhcccccceecCCceEEEeehhhhhccceeeeeCCCceEEEEeeeeeecc Confidence 99999988887554 23211 1 1 125889999999888888877 No 82 >protein:vir:102609 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655006;genbank:gi:109392196;genbank:GeneID:4157231 Probab=92.91 E-value=0.002 Score=35.38 Aligned_cols=103 Identities=17% Similarity=0.182 Sum_probs=67.3 Q ss_pred CchH-------HHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHHHH Q lcl|NC_019769. 1 MTED-------DLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTITEA 73 (115) Q Consensus 1 M~E~-------~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~~A 73 (115) |||. -+-+-|+++.. |=-...| .-++|+++-+++.|... -- -+.+..++||++++...++| T Consensus 1 m~~~saP~~e~~vv~WLsp~~~--va~~R~~-------~~PLPf~~V~Rv~G~d~-~e--~~tD~avvsv~~fg~~~eaA 68 (134) T protein:vir:10 1 MATDSAPSIHRVLVAWLSPLGK--VSTRRLS-------GDPLPHRVVRRVDGRDV-PE--EGSDVAVVSVHTFAASDEAA 68 (134) T ss_pred CCcccCCChheeeeeecccchh--ceeccCC-------CCCCCeEEEEEeCCCCC-cc--cccccceEEEEEeeCCHHHh Confidence 8863 23444565421 1111111 22689999999987543 11 12356799999999999999 Q ss_pred HHHHHHHHHHHHhh---cceee--c-----c-----------CCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVL---KPGSI--V-----K-----------TPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~---~~~~~--~-----~-----------~~~ye~dT~lyr~~~df~i~~ 115 (115) +.+++.+-..|..+ .+... . . 
.-+|..|+++-|.+--+++=+ T Consensus 69 ~d~ad~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~v~~~P~~~eY~dD~~~vrytgRY~~g~ 131 (134) T protein:vir:10 69 ENEAELTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGV 131 (134) T ss_pred hHHHHHHHHHHHHHhcccccceecCCceEEEeehhhhhccceeeeeCCCceEEEEeeeeeecc Confidence 99999988887554 23211 1 1 125889999999888888877 No 83 >protein:vir:105826 Length: 134 # NCBI annotation: gp10 # Family: family:all:2795 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655771;genbank:gi:109522094;genbank:GeneID:4157634 Probab=92.91 E-value=0.002 Score=35.38 Aligned_cols=103 Identities=17% Similarity=0.182 Sum_probs=67.3 Q ss_pred CchH-------HHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHHHH Q lcl|NC_019769. 1 MTED-------DLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTITEA 73 (115) Q Consensus 1 M~E~-------~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~~A 73 (115) |||. -+-+-|+++.. |=-...| .-++|+++-+++.|... -- -+.+..++||++++...++| T Consensus 1 m~~~saP~~e~~vv~WLsp~~~--va~~R~~-------~~PLPf~~V~Rv~G~d~-~e--~~tD~avvsv~~fg~~~eaA 68 (134) T protein:vir:10 1 MATDSAPSIHRVLVAWLSPLGK--VSTRRLS-------GDPLPHRVVRRVDGRDV-PE--EGSDVAVVSVHTFAASDEAA 68 (134) T ss_pred CCcccCCChheeeeeecccchh--ceeccCC-------CCCCCeEEEEEeCCCCC-cc--cccccceEEEEEeeCCHHHh Confidence 8863 23444565421 1111111 22689999999987543 11 12356799999999999999 Q ss_pred HHHHHHHHHHHHhh---cceee--c-----c-----------CCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVL---KPGSI--V-----K-----------TPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~---~~~~~--~-----~-----------~~~ye~dT~lyr~~~df~i~~ 115 (115) +.+++.+-..|..+ .+... . . 
.-+|..|+++-|.+--+++=+ T Consensus 69 ~d~ad~vHrRM~kL~~~~~~~~~~~gG~~~~id~~~v~~~P~~~eY~dD~~~vrytgRY~~g~ 131 (134) T protein:vir:10 69 ENEAELTHQRMLELVVNPLTEIPVGGGVVARIDYARVLMKPVLVEYDDDGHLVRHVGRYEIGV 131 (134) T ss_pred hHHHHHHHHHHHHHhcccccceecCCceEEEeehhhhhccceeeeeCCCceEEEEeeeeeecc Confidence 99999988887554 23211 1 1 125889999999888888877 No 84 >protein:vir:106554 Length: 122 # NCBI annotation: putative protein # Family: family:all:6476 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958589;genbank:gi:41179248;genbank:GeneID:2717090 Probab=91.80 E-value=0.011 Score=31.37 Aligned_cols=105 Identities=14% Similarity=0.136 Sum_probs=69.8 Q ss_pred Cc----hHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC--cceEEEEEEeeCCHHHHH Q lcl|NC_019769. 1 MT----EDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE--SAVSVQVDVYSSTITEAR 74 (115) Q Consensus 1 M~----E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~--~~~~vQIDvyA~t~~~A~ 74 (115) |- -+.+|.+|..+.+-+-=..--|..- ..+|-++|... ..|.-.-.+..+ ...|+-||.|... .+-. T Consensus 1 m~~INiK~~vy~~L~~v~e~k~Vs~~YP~~w-----~~fP~~iY~t~-~~~~~~~~~~~E~~t~w~itIDi~~~~-~Stt 73 (122) T protein:vir:10 1 MEIYNVKALVFKTLKSMPELKLVSPSYPDKF-----TTFPAAIYSTS-QSSYIRNAQQEETDTEWKITIDLYNDH-GSLT 73 (122) T ss_pred CceeeccHHHHHHHhhcccccccCCCCCCCc-----ccCcEEEEecC-CCceeeecCcceeeEEEEEEEEEEcCC-ccHH Confidence 54 6889999999877322223334332 35899999775 455434444444 3469999999741 2334 Q ss_pred HHHHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 75 TIRNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 75 ~l~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +++-+|.++++.++.....+. 
..-+++.|..+.|+-+| T Consensus 74 ~ia~~i~~~f~~lGft~~~~~---~d~sglkr~vmr~~gIV 111 (122) T protein:vir:10 74 NIKAKLIARFSAMGFSNSVGD---QDLNGVSRVVIVFAGIV 111 (122) T ss_pred HHHHHHHHHHhhccccccCCC---CCcCCCeEEEEEEEEEE Confidence 566777778888777444333 34578999999999999 No 85 >protein:vir:78612 Length: 178 # NCBI annotation: BcepNY3gp04 # Family: family:all:3176 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294841;genbank:gi:149882904;genbank:GeneID:5291083 Probab=84.61 E-value=0.059 Score=27.32 Aligned_cols=106 Identities=18% Similarity=0.146 Sum_probs=53.4 Q ss_pred CchHHHHHHHHhhc----C----CccceeeccCCCCCCccccccEEEEEecCCCcccee----cCC-CC------cceEE Q lcl|NC_019769. 1 MTEDDLYPLLEPLA----G----GQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVL----CGQ-AE------SAVSV 61 (115) Q Consensus 1 M~E~~i~~lL~~l~----~----~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l----~G~-~~------~~~~v 61 (115) -||++||+.|.+.+ + ..|.-+- ++.-..| .-+||+.+-++....++- +.. +. ...++ T Consensus 9 ~Te~di~~alr~fL~~lf~~~~~~eVi~gq--qN~~p~P--~g~fiimt~l~~~~lsT~~~~Y~~~~~~~~~~~~~~~~~ 84 (178) T protein:vir:78 9 PSEDEVFDTLWGWVTSLFDPALASQIAKAD--QNATSTL--YGTYALIRPGVREALNQTIRTYDATAGTVSNELHTGYWY 84 (178) T ss_pred ccHHHHHHHHHHHHHHhcCcccCceEEEec--cCCCCcc--CCCEEEEecccccccccceeeccCccceeeeeeeeEEEE Confidence 67999999997643 2 2333211 0111112 247899887765544332 221 11 13578 Q ss_pred EEEEeeCCHHH-HHHHHHHHHHH-----HHh--hcceee---------ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 62 QVDVYSSTITE-ARTIRNMALDA-----LQV--LKPGSI---------VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 62 QIDvyA~t~~~-A~~l~~av~~A-----l~~--~~~~~~---------~~~~~ye~dT~lyr~~~df~i~~ 115 (115) |||||+....+ |..++...|+. ++. ++|.-. 
++-+-||+ |-+++++..+ T Consensus 85 QvD~YG~~A~d~A~~i~tl~Rs~~a~~~f~~~~iaPLYad~P~q~p~iN~e~QyE~-----Rwt~~~~lQ~ 150 (178) T protein:vir:78 85 QVDCYGPQAPDWANTIAAMWRTMWSADALRGTALIPLYADQPQQLNIVNGENQFEQ-----RYMVKLHAQV 150 (178) T ss_pred EEEEecCChhHHHHHHHHHhcChhHHHHHhcCcccceecCCcccCCccCccccccc-----eEEEEEEEEe Confidence 99999997655 55555555532 221 122211 12233554 4455554444 No 86 >protein:vir:106763 Length: 178 # NCBI annotation: gp05 # Family: family:all:3176 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944313;genbank:gi:38638612;genbank:GeneID:2657394 Probab=84.08 E-value=0.063 Score=27.16 Aligned_cols=106 Identities=19% Similarity=0.154 Sum_probs=53.6 Q ss_pred CchHHHHHHHHhhc--------CCccceeeccCCCCCCccccccEEEEEecCCCcccee----cCC-CC------cceEE Q lcl|NC_019769. 1 MTEDDLYPLLEPLA--------GGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVL----CGQ-AE------SAVSV 61 (115) Q Consensus 1 M~E~~i~~lL~~l~--------~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l----~G~-~~------~~~~v 61 (115) -||++||+.|.+.+ +..|.-+- ++.-..| .-+||+.+-++....++- +.. +. ...++ T Consensus 9 ~Te~di~~alr~fL~~lf~~~~~~eVi~gq--qN~~p~P--~g~fiimt~l~~~~lsT~~~~Y~~~~~~~~~~~~~~~~~ 84 (178) T protein:vir:10 9 PSEDEVFDTLWGWVTSLFDPALASQIAKAD--QNATSTL--YGTYALIRPGVREALNQTIRTYDATAGTVSNELHTGYWY 84 (178) T ss_pred ccHHHHHHHHHHHHHHhcCcccCceEEEec--cCCCCcc--CCCEEEEecccccccccceeeccCCcceeeeeeeeEEEE Confidence 67999999997643 23333211 0111112 247899887765544332 221 11 13578 Q ss_pred EEEEeeCCHHH-HHHHHHHHHHH-----HHh--hcceee---------ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 62 QVDVYSSTITE-ARTIRNMALDA-----LQV--LKPGSI---------VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 62 QIDvyA~t~~~-A~~l~~av~~A-----l~~--~~~~~~---------~~~~~ye~dT~lyr~~~df~i~~ 115 (115) |||||+....+ |..++...|+. ++. ++|.-. 
++-+-||+ |-+++++..+ T Consensus 85 QvD~YG~~A~d~A~~i~tl~Rs~~a~~~f~~~~iaPLYad~P~q~p~iN~e~QyE~-----Rwt~~~~lQ~ 150 (178) T protein:vir:10 85 QVDCYGPQAPDWANTIAAMWRTMWSADALRGTALIPLYADQPQQLNIVNGEGQYEQ-----RYMVKLHAQV 150 (178) T ss_pred EEEeecCChhHHHHHHHHHhcChhHHHHHhcCcccceecCCcccCCccCccccccc-----eEEEEEEEEe Confidence 99999987655 55555555532 221 122211 12233554 4455555444 No 87 >protein:vir:101569 Length: 178 # NCBI annotation: gp05 # Family: family:all:3176 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958109;genbank:gi:41057655;genbank:GeneID:2716826 Probab=82.66 E-value=0.075 Score=26.75 Aligned_cols=103 Identities=20% Similarity=0.176 Sum_probs=53.1 Q ss_pred CchHHHHHHHHhh----c----CCccce---eeccCCCCCCccccccEEEEEecCCCcccee----cCCCC-------cc Q lcl|NC_019769. 1 MTEDDLYPLLEPL----A----GGQVYP---YVAPLGSDGKPSVSPPWVIFSIITDVAADVL----CGQAE-------SA 58 (115) Q Consensus 1 M~E~~i~~lL~~l----~----~~Rvyp---~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l----~G~~~-------~~ 58 (115) -||++||+.|.+. + +..|.- .-.|. | .-+||+.+-++....++- +++.. .. T Consensus 9 ~Te~di~~alr~fL~~lf~p~~~~eVi~gqqN~~p~-----P--~g~fiimt~l~~~~lsT~~~~Y~~~~~~~~~~~~~~ 81 (178) T protein:vir:10 9 PSEDEVFDTLWEWVTSLFDPASAAQIAKADQNATST-----L--YGTYALIRPGVREALNQTIRSYDATAQTVANELHTG 81 (178) T ss_pred ccHHHHHHHHHHHHHHHcCCCCCceEEeecccCCCc-----C--CCCEEEEecccccccccceeecCCcchheeeeeeeE Confidence 6799999999864 3 222322 22221 2 247899887765544432 22111 13 Q ss_pred eEEEEEEeeCCHHH-HHHHHHHHHHH-----HHh--hcceee---------ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 59 VSVQVDVYSSTITE-ARTIRNMALDA-----LQV--LKPGSI---------VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 59 ~~vQIDvyA~t~~~-A~~l~~av~~A-----l~~--~~~~~~---------~~~~~ye~dT~lyr~~~df~i~~ 115 (115) .++|||||+....+ |..++...|+. ++. ++|.-. 
++-+-||+ |-+++++..+ T Consensus 82 ~~~QvD~YG~~A~d~A~~i~tl~Rs~~a~~~f~~~~iaPLYad~p~qlp~iN~e~QyE~-----Rwt~~~~lQ~ 150 (178) T protein:vir:10 82 YWYQVDCYGPQAPDWANTIAAMWRTMWSADALRGTALIPLYADQPQQLNIVNGEGQYEQ-----RYMVKLHAQV 150 (178) T ss_pred EEEEEEeecCChhHHHHHHHHHhcChhHHHHHhcCccccccCCCccccCccCccccccc-----eEEEEEEEEe Confidence 57899999997655 55555555532 221 112211 12233554 4455554444 No 88 >protein:vir:9931 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:2393 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795693;genbank:gi:28876455;genbank:GeneID:1258023 Probab=82.14 E-value=0.067 Score=27.00 Aligned_cols=106 Identities=15% Similarity=0.144 Sum_probs=60.9 Q ss_pred Cch--HHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCcccee---cCCCCcceEEEEEEee--CCHHHH Q lcl|NC_019769. 1 MTE--DDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVL---CGQAESAVSVQVDVYS--STITEA 73 (115) Q Consensus 1 M~E--~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l---~G~~~~~~~vQIDvyA--~t~~~A 73 (115) ||+ .+|..-|+.+ +=-|| +..|.. .+..|++|.-.=. ..+.. -|..-.+..+|||++= .++.++ T Consensus 6 ~t~~Lk~i~~kL~~~-~IPiY-fkLP~s-----di~EPF~ViGsh~--~DdsktA~~Ga~ivdt~lqIDlFyp~~sR~d~ 76 (119) T protein:vir:99 6 ETLYLKKVKNRLGVL-DIPIY-FKLPKS-----DVLEPFIVVGTNI--SDLSKTAQTGAVIDDFSLNIDAFLPGDSRLDA 76 (119) T ss_pred hhHHHHHHHHhhccc-CcceE-EeCCCC-----CcCCceEEEeccc--CccccccccceEEEeeeEEEEEeecCcccccH Confidence 554 5556666542 33566 556754 3678998854221 11222 2233367899999874 477788 Q ss_pred HHHHHHHHHHHHhhcceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 
74 RTIRNMALDALQVLKPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..+-.+...++....-....-.-|..-.-..||+.|-++=.+ T Consensus 77 eeiks~~~~~l~r~~~it~qil~DnSIGReVYhV~f~isd~i 118 (119) T protein:vir:99 77 EEIKSRMLRLLGRNNQIKAQILVDNSIGREVYRVAINITETL 118 (119) T ss_pred HHHHHHHHHHhhhhhhhhhcccccccccceeeeeeeEeeeec Confidence 888888888885433322222222222334588887776666 No 89 >protein:vir:3637 Length: 178 # NCBI annotation: gp05 # Family: family:all:3176 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705632;genbank:gi:23752317;genbank:gi:47835036;genbank:GeneID:955731 Probab=81.99 E-value=0.081 Score=26.57 Aligned_cols=103 Identities=20% Similarity=0.176 Sum_probs=52.9 Q ss_pred CchHHHHHHHHhh----c----CCccce---eeccCCCCCCccccccEEEEEecCCCcccee----cCCCC-------cc Q lcl|NC_019769. 1 MTEDDLYPLLEPL----A----GGQVYP---YVAPLGSDGKPSVSPPWVIFSIITDVAADVL----CGQAE-------SA 58 (115) Q Consensus 1 M~E~~i~~lL~~l----~----~~Rvyp---~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l----~G~~~-------~~ 58 (115) -||++||+.|.+. + +..|.- .-.|. | .-+||+.+-++....++- +++.. .. T Consensus 9 ~Te~di~~alr~fL~~lf~p~~~~eVi~gqqN~~p~-----P--~g~fiimt~l~~~~lsT~~~~Y~~~~~~~~~~~~~~ 81 (178) T protein:vir:36 9 PSEDEVFDTLWEWVTSLFDPASAAQIAKADQNATST-----L--YGTYALIRPGVREALNQTIRSYDATAQTVANELHTG 81 (178) T ss_pred ccHHHHHHHHHHHHHHHcCCCCCceEEeecccCCCc-----C--CCCEEEEecccccccccceeecCCcchheeeeeeeE Confidence 6799999999864 3 222322 22221 2 237899887765544432 22111 13 Q ss_pred eEEEEEEeeCCHHH-HHHHHHHHHHH-----HHh--hcceee---------ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 59 VSVQVDVYSSTITE-ARTIRNMALDA-----LQV--LKPGSI---------VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 59 ~~vQIDvyA~t~~~-A~~l~~av~~A-----l~~--~~~~~~---------~~~~~ye~dT~lyr~~~df~i~~ 115 (115) .++|||||+....+ |..++...|+. ++. ++|.-. 
++-+-||+ |-+++++..+ T Consensus 82 ~~~QvD~YG~~A~d~A~~i~tl~Rs~~a~~~f~~~~iaPLYad~p~qlp~iN~e~QyE~-----Rwt~~~~lQ~ 150 (178) T protein:vir:36 82 YWYQVDCYGPQAPDWANTIAAMWRTMWSADALRGTALIPLYADQPQQLNIVNGEGQYEQ-----RYMVKLHAQV 150 (178) T ss_pred EEEEEEeecCChhHHHHHHHHHhcChhHHHHHhcCccccccCCCccccCccCccccccc-----eEEEEEEEEe Confidence 57899999997655 55555555532 221 112211 12233554 4455554444 No 90 >protein:vir:94061 Length: 175 # NCBI annotation: hypothetical protein # Family: family:all:3176 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453620;genbank:gi:84662656;genbank:GeneID:5142571 Probab=78.02 E-value=0.12 Score=25.66 Aligned_cols=103 Identities=18% Similarity=0.139 Sum_probs=52.7 Q ss_pred CchHHHHHHHHhhc----C-----Cccce---eeccCCCCCCccccccEEEEEecCCCcccee-----cCCCC------c Q lcl|NC_019769. 1 MTEDDLYPLLEPLA----G-----GQVYP---YVAPLGSDGKPSVSPPWVIFSIITDVAADVL-----CGQAE------S 57 (115) Q Consensus 1 M~E~~i~~lL~~l~----~-----~Rvyp---~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l-----~G~~~------~ 57 (115) -||++||+.|.+.+ + ..|.- .-.|. | .-+||+.+-++....++- .|.+. . T Consensus 8 ~Te~di~~alr~fL~~lf~lp~~~~eVi~g~qN~~p~-----P--~g~fi~mt~l~~~~lsT~~~~Y~~~~g~~~~~~~~ 80 (175) T protein:vir:94 8 PTEDAVFDAMFGFLAKVLDLPDDTQAIIKGFQNLSST-----P--TGSCVVVSPGMMTRQDFGSRLYDPGLSKVVIEAHL 80 (175) T ss_pred ccHHHHHHHHHHHHHHHcCCCCCCceEEEeccCCCCc-----c--CCCEEEEecccccccccceeeecccccceeeeeee Confidence 67999999998644 2 22322 22221 2 247888886654432221 11121 1 Q ss_pred ceEEEEEEeeCCHHH-HHHHHHHHHHH-----HH-hhcceee---------ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 58 AVSVQVDVYSSTITE-ARTIRNMALDA-----LQ-VLKPGSI---------VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 58 ~~~vQIDvyA~t~~~-A~~l~~av~~A-----l~-~~~~~~~---------~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ...+|||||+....+ |..++...|+. +. .++|.-. 
++-+-||+ |-+++++..+ T Consensus 81 q~~~QvD~YG~~A~d~A~~~~tl~Rs~~a~~~~~~~~~PLYad~p~qlp~iN~e~QyE~-----Rwt~~~~lQ~ 149 (175) T protein:vir:94 81 TYSYQVDCYGPLAPTWASVISVAWKSMWGVDNTAPAFAPLYADAPQQLNIVNSEGQFEQ-----RFMVRLFGQV 149 (175) T ss_pred EEEEEEEeecCChHHHHHHHHHHhcChhHhhhhhcccccccCcCccccCccCccccccc-----eEEEEEEEEe Confidence 347999999997655 55555555542 11 1222221 12233554 4455555444 No 91 >protein:vir:8107 Length: 138 # NCBI annotation: gp11 # Family: family:all:2795 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817688;genbank:gi:29566119;genbank:GeneID:1259313 Probab=69.95 E-value=0.07 Score=26.93 Aligned_cols=102 Identities=18% Similarity=0.226 Sum_probs=55.1 Q ss_pred Cc----------hHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCH Q lcl|NC_019769. 1 MT----------EDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTI 70 (115) Q Consensus 1 M~----------E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~ 70 (115) |- |.-+-.-|+++.. ++-+ -.|.-++|++.-+++.|...- - -+.+..++|||+|+... T Consensus 1 ~~~~~~~~aP~~e~~vv~WLspv~~------va~~---R~~d~pLPF~~V~Rv~G~d~~-e--~~tD~avv~~~~fg~g~ 68 (138) T protein:vir:81 1 MADLHDQDAPDEEDFVVCWMQPVMR------TAVE---RDIDAELPFCEVTRIDGADDP-E--AGTDNPVIQLDFYALGA 68 (138) T ss_pred CcccccCCCCchheeeeeeccchhc------cccc---cCCCCCCCeEEEEEeCCCCCc-c--ccccCceEEEEEeecCH Confidence 32 2223344455422 1111 122336899999999875431 0 13356799999999999 Q ss_pred HHHHHHHHHHHHHHHhh---ccee-eccCC----------------CccccccceeeEEEEEEeC Q lcl|NC_019769. 71 TEARTIRNMALDALQVL---KPGS-IVKTP----------------GYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 71 ~~A~~l~~av~~Al~~~---~~~~-~~~~~----------------~ye~dT~lyr~~~df~i~~ 115 (115) ++|+.+++.+-..|..+ .+.. .++.. 
+|..| +.=|.+--+++=- T Consensus 69 eaA~d~a~~vHrRM~kL~~~~~~vTl~dGt~~~ld~~~~~~~P~~~~y~dD-~ivRYtaRY~~g~ 132 (138) T protein:vir:81 69 EAAKAAAKQGHRRMLFLFRNFPTVTLSDGTLADLDFGETLIKPFRMAFEHD-QIVRYTARYQLGT 132 (138) T ss_pred HHHHHHHHhHHHHHHHHhhcccceecCCCceEecchhhhhccccccccCCC-eeeEeeeeeeccc Confidence 99999999998877652 2221 11111 23333 2333333322211 No 92 >protein:vir:8331 Length: 150 # NCBI annotation: gp48 # Family: family:all:2795 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817899;genbank:gi:29566332;genbank:GeneID:1259527 Probab=63.51 E-value=0.19 Score=24.54 Aligned_cols=102 Identities=19% Similarity=0.218 Sum_probs=54.1 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCH---HHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTI---TEARTIR 77 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~---~~A~~l~ 77 (115) =+|.-+-.-|+++ +|+ |-.- .+.-++|+++-+++.|...- +-+.+...+|||+|+... ++|+.++ T Consensus 27 dae~~vv~wLsp~--~rv----A~~R---~~~dplPf~lv~rv~G~d~p---de~td~avvsv~~fg~~v~G~daA~~~a 94 (150) T protein:vir:83 27 DAETFVVKWLGEV--YRA----ANTR---RPGDPLPFLLIQQVAGKENL---DESTADPVVQVDILCDKVDGEDAARDIK 94 (150) T ss_pred cHHHHHHHHhhHH--hhh----hhcc---cCCCCCCeEEEEecCCCCCc---ccccccceeeeeeccccccchhhhhhhh Confidence 3344445566665 222 1111 11125899999999775320 112356799999998866 8899999 Q ss_pred HHHHHHHHhhcceeec-cC------------CCccccccceeeEEEEEEeC Q lcl|NC_019769. 78 NMALDALQVLKPGSIV-KT------------PGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 78 ~av~~Al~~~~~~~~~-~~------------~~ye~dT~lyr~~~df~i~~ 115 (115) +.+-..|-.++-.... +. 
-+|-.| +.-|.+--+++-- T Consensus 95 d~vH~RM~~l~r~tl~~Gtld~~~v~~aP~~leY~dD-~vvrYt~RY~~G~ 144 (150) T protein:vir:83 95 DRVHRRMLLLGRYLEMDGTLDWMKVFESPRRLEYTND-KVIRYTARYQFGQ 144 (150) T ss_pred hhHHHHHHHHhhhhccCCcchhhhhhccccccccCCC-eEEEeeeeeeccC Confidence 8888777655421111 11 123233 3333333332222 No 93 >protein:vir:108220 Length: 133 # NCBI annotation: gp14 # Family: family:all:6424 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552343;genbank:gi:160700663;genbank:GeneID:5758940 Probab=62.49 E-value=0.33 Score=23.19 Aligned_cols=109 Identities=18% Similarity=0.181 Sum_probs=64.1 Q ss_pred Cch--------HHHHHHHHhhcCCccce-eeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCHH Q lcl|NC_019769. 1 MTE--------DDLYPLLEPLAGGQVYP-YVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTIT 71 (115) Q Consensus 1 M~E--------~~i~~lL~~l~~~Rvyp-~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~~ 71 (115) |.. ..|++-|..-++-++=- +..|++ + .+....|++|---=| .|.| =+-....+|.|-+||.-.. T Consensus 1 m~~~Rvp~D~~~~Ik~~L~~~l~a~v~~~~~lPdd-W-~~~s~~P~vvV~dDg-gpv~---wpv~t~~~IRvtv~a~gr~ 74 (133) T protein:vir:10 1 MSDVRVVGDPVPPVKAYLAAFWGARVRIADEVPDD-W-HVETDVPLIVVDDDG-GPID---WPVKSDPLVRCGIYANGKQ 74 (133) T ss_pred CCCcccCCCChHHHHHHHHhhccccceeeeecCCC-c-cccCCceEEEEecCC-Cccc---cceeccceEEEEEeecCCh Confidence 653 66777665544433321 334432 1 111234666644322 2221 1223345788999999999 Q ss_pred HHHHHHHHHHHHHHhhccee--------eccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 72 EARTIRNMALDALQVLKPGS--------IVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 72 ~A~~l~~av~~Al~~~~~~~--------~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +|++|+.++-.+|=...-.. .+-...+|++|+-|=++|-+.=.. 
T Consensus 75 ~Ar~l~~~~~g~LLa~~i~Gva~ii~~g~glL~aRD~~tgg~iAsfTV~A~~ 126 (133) T protein:vir:10 75 TAKNLRRITMGALLAEPIPGIAHIQRTGIGYVDARDPDTGADIASFTVTATV 126 (133) T ss_pred hHHHHHHHHHHHHhcCCCCceeEEcCCCceEEecCCCCCCceEEEEEEEeee Confidence 99999999988885432211 122456999999999887765444 No 94 >protein:vir:105772 Length: 128 # NCBI annotation: gp15 # Family: family:all:10994 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224153;genbank:gi:62362228;genbank:GeneID:3342525 Probab=48.50 E-value=0.67 Score=21.55 Aligned_cols=106 Identities=9% Similarity=0.050 Sum_probs=56.4 Q ss_pred CchHHHHHHHHh------hcCC----ccceeeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCCH Q lcl|NC_019769. 1 MTEDDLYPLLEP------LAGG----QVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSSTI 70 (115) Q Consensus 1 M~E~~i~~lL~~------l~~~----Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t~ 70 (115) |+-++++..+.. |-+| ..+|.- +.|. ...||||||.-||... -++-+.+.+++ ++|=+++. T Consensus 1 ~~~~~m~~~vr~~l~daGLt~GftvQl~~W~d-~~g~-----~~e~~iV~qpNGGt~i--~d~~~~dy~~i-~~Vsg~~d 71 (128) T protein:vir:10 1 MTRSEVYDALRVWLQSHGFDVGYRVQKRFWNE-QEGT-----EGERYLVIQQNGGGKP--EEAITRDFFRI-LVLSGQND 71 (128) T ss_pred CchhHHHHHHHHHHHhCCCcchheeeeeeeec-cCCC-----CCceEEEEecCCCCch--hhhcccceeEE-EEEeecCC Confidence 999999888853 4444 333321 1222 2459999999877742 23344444444 45555522 Q ss_pred ---HHHHHHHHHHHHHHHhhccee-------eccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 71 ---TEARTIRNMALDALQVLKPGS-------IVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 71 ---~~A~~l~~av~~Al~~~~~~~-------~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) .++.+.+++|++.+....-.. .+++..=-.+-+-+=-++.|+++. 
T Consensus 72 ~~~~~ve~ra~~Ii~yv~~np~~~cig~i~n~Ggippi~T~EgR~ifrL~f~~i~ 126 (128) T protein:vir:10 72 SDINEVEDRADAIRQAMIDDYRTECIISMQPVGGITAIQTEEGRYLFDISFQTII 126 (128) T ss_pred CcchhHHHHHHHHHHHHHhCccccccceeeccCCCCCccccCCceeeeehhhhhh Confidence 358888899998885422111 222321111222333344455554 No 95 >protein:vir:9880 Length: 136 # NCBI annotation: hypothetical protein # Family: family:all:1887 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795642;genbank:gi:28876399;genbank:GeneID:1257930 Probab=45.22 E-value=0.78 Score=21.18 Aligned_cols=106 Identities=16% Similarity=0.199 Sum_probs=66.5 Q ss_pred Cch----HHHHHHH----HhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCC-CcceEEEEEEeeC--- Q lcl|NC_019769. 1 MTE----DDLYPLL----EPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQA-ESAVSVQVDVYSS--- 68 (115) Q Consensus 1 M~E----~~i~~lL----~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~-~~~~~vQIDvyA~--- 68 (115) |-. .++.+.. ..--|-||| +.+|.+ .+.|+--.-.|+..|+++- +. .+.+++||-.++. T Consensus 1 mLKkLsl~~l~~aV~~~iee~tgL~c~-d~~p~~------ep~Pfyfie~I~~rpe~sK--tmw~e~y~~~IHais~~g~ 71 (136) T protein:vir:98 1 MLKKLGLVDLHASIKQKIEDKTGLMAY-DHVPED------MPSPFYFIEVVDKRPEDTK--VMWCEVFTVWIHAIAEAGK 71 (136) T ss_pred CccccchHHHHHHHHHHhhccCCceEE-EecccC------CCCCEEEEEeecCCccccc--eeeeeEEEEEEEEEcCCCC Confidence 432 1222222 222344888 556754 3578877777777777532 11 2457999999998 Q ss_pred CHHHHHHHHHHHHHHHHh---h-cceeec-----cCCC-ccccccceeeEEEEEEeC Q lcl|NC_019769. 69 TITEARTIRNMALDALQV---L-KPGSIV-----KTPG-YEPDLRYHRATLEFQVTV 115 (115) Q Consensus 69 t~~~A~~l~~av~~Al~~---~-~~~~~~-----~~~~-ye~dT~lyr~~~df~i~~ 115 (115) |.-.--...+++..||.. + .+...+ +.+. 
-+++|+.+++...|.|+| T Consensus 72 t~~~~~~mI~~l~EAlte~i~Lpe~y~l~~q~~~G~q~~~~~etge~HAi~~fei~v 128 (136) T protein:vir:98 72 SKIAIYDMIEKLEEALTEELVLPEEIDILRQSEVGMQSLQEDETGEMHAIVAYEIKV 128 (136) T ss_pred ccchHHHHHHHHHhhhhceeecCCCeEEEEEechhhhheecccCCceeeeeeEEEEE Confidence 666666777888778753 1 222222 2333 578999999999999999 No 96 >protein:vir:3874 Length: 114 # NCBI annotation: putative head-tail joining protein # Family: family:all:28620 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680491;swissprot:trembl:p94215;genbank:gi:22296531;uniprot:P94215;genbank:GeneID:951676 Probab=36.42 E-value=1.2 Score=20.20 Aligned_cols=103 Identities=22% Similarity=0.269 Sum_probs=58.8 Q ss_pred Cc-hHHHHHHHHh---hcCCcccee---eccCCCCCCccccccEEEEEecCCCccceecC-CCCcceEEEEEEeeC--CH Q lcl|NC_019769. 1 MT-EDDLYPLLEP---LAGGQVYPY---VAPLGSDGKPSVSPPWVIFSIITDVAADVLCG-QAESAVSVQVDVYSS--TI 70 (115) Q Consensus 1 M~-E~~i~~lL~~---l~~~Rvyp~---~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G-~~~~~~~vQIDvyA~--t~ 70 (115) |. |-+++..|+. |+ ..+|=. .-|+.. -.++-..|||--.-+-|-.+.+-+. +--+--|||||-|-. .. T Consensus 1 ~~PE~~vaDiLsad~~lv-~~mYipift~tpdd~-fik~SsAPWiRiTpiPGDda~yaDD~R~~EYPrVqVDfWvr~e~~ 78 (114) T protein:vir:38 1 MAPEKRVYDILSANLDIA-DKVYIGTPNFNNQTS-ATPESLAPWVRITYLPGDAADYADDSRILEYPKVQVDFWVGITDW 78 (114) T ss_pred CCchhhhhhhhccchhhh-hheeccCCCCCCCCc-ccccccCCeeEeeecCCccccccccceeeecCceeEEEeeccCCh Confidence 87 8889999864 32 245532 233321 2333457888777776666555444 223456999999965 66 Q ss_pred HHHHHHHHHHHHHHHhhcceeeccCCCcc-cccccee Q lcl|NC_019769. 71 TEARTIRNMALDALQVLKPGSIVKTPGYE-PDLRYHR 106 (115) Q Consensus 71 ~~A~~l~~av~~Al~~~~~~~~~~~~~ye-~dT~lyr 106 (115) ++-..+-++|-++|......-... +.|- --..-|. 
T Consensus 79 d~~e~iqe~IY~~Lha~gweRYY~-nsY~D~~~~~~~ 114 (114) T protein:vir:38 79 DQQEKIETQIYQALHAADWERYYR-NSYVDGIPQPFA 114 (114) T ss_pred hhHHHHHHHHHHHHHhcCcceeee-ccccCCCCCCCC Confidence 778889999999998654322111 1111 0011111 No 97 >protein:vir:101655 Length: 134 # NCBI annotation: gp18 # Family: family:all:2795 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654773;genbank:gi:109302771;genbank:GeneID:4156089 Probab=35.35 E-value=1.2 Score=20.08 Aligned_cols=101 Identities=19% Similarity=0.269 Sum_probs=62.4 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cc-eEEEEEEeeCCHHHHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SA-VSVQVDVYSSTITEARTIRN 78 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~-~~vQIDvyA~t~~~A~~l~~ 78 (115) -.|.-+-+.|++.+.+- +-..--+ .+.|+|...+.-| .|+++ .+ .-+.|-|++++.++|-.|++ T Consensus 10 naeklvcaylspffenv-----ashrwvd---aptpfilvkrlpg------ggqgevsdcalmsikvfgkdvdeagdlad 75 (134) T protein:vir:10 10 NAEKLVCAYLSPFFENV-----ASHRWVD---APTPFILVKRLPG------GGQGEVSDCALMSIKVFGKDVDEAGDLAD 75 (134) T ss_pred chhhhhhhhhhhHHhhh-----hcccccc---CCCceEEEeeCCC------CCCccccceeeeeeeeeccccccccchHH Confidence 23556677888876552 2111111 2469998887654 24665 33 57999999999999999999 Q ss_pred HHHHHHHhhcceeec-------cC-----------CCcccccc-ceeeEEEEEEeC Q lcl|NC_019769. 79 MALDALQVLKPGSIV-------KT-----------PGYEPDLR-YHRATLEFQVTV 115 (115) Q Consensus 79 av~~Al~~~~~~~~~-------~~-----------~~ye~dT~-lyr~~~df~i~~ 115 (115) .|-..++.+.|-... ++ -+|-.||. 
.|-++.=++.-| T Consensus 76 evhermrkwkpkdtvsygghsfginllevedapfwldygddteecytarywvhlrv 131 (134) T protein:vir:10 76 EVHERMRKWKPKDTVSYGGHSFGINLLEVEDAPFWLDYGDDTEECYTARYWVHLRV 131 (134) T ss_pred HHHHHHhccCcccccccCchhhcceeEeecCCceeeecCCCccceeeeeEEEEEEE Confidence 999999877653211 11 12444443 454544444444 No 98 >protein:vir:7860 Length: 134 # NCBI annotation: gp17 # Family: family:all:2795 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817467;genbank:gi:29565896;genbank:GeneID:1259089 Probab=35.35 E-value=1.2 Score=20.08 Aligned_cols=101 Identities=19% Similarity=0.269 Sum_probs=62.4 Q ss_pred CchHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cc-eEEEEEEeeCCHHHHHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SA-VSVQVDVYSSTITEARTIRN 78 (115) Q Consensus 1 M~E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~-~~vQIDvyA~t~~~A~~l~~ 78 (115) -.|.-+-+.|++.+.+- +-..--+ .+.|+|...+.-| .|+++ .+ .-+.|-|++++.++|-.|++ T Consensus 10 naeklvcaylspffenv-----ashrwvd---aptpfilvkrlpg------ggqgevsdcalmsikvfgkdvdeagdlad 75 (134) T protein:vir:78 10 NAEKLVCAYLSPFFENV-----ASHRWVD---APTPFILVKRLPG------GGQGEVSDCALMSIKVFGKDVDEAGDLAD 75 (134) T ss_pred chhhhhhhhhhhHHhhh-----hcccccc---CCCceEEEeeCCC------CCCccccceeeeeeeeeccccccccchHH Confidence 23556677888876552 2111111 2469998887654 24665 33 57999999999999999999 Q ss_pred HHHHHHHhhcceeec-------cC-----------CCcccccc-ceeeEEEEEEeC Q lcl|NC_019769. 79 MALDALQVLKPGSIV-------KT-----------PGYEPDLR-YHRATLEFQVTV 115 (115) Q Consensus 79 av~~Al~~~~~~~~~-------~~-----------~~ye~dT~-lyr~~~df~i~~ 115 (115) .|-..++.+.|-... ++ -+|-.||. 
.|-++.=++.-| T Consensus 76 evhermrkwkpkdtvsygghsfginllevedapfwldygddteecytarywvhlrv 131 (134) T protein:vir:78 76 EVHERMRKWKPKDTVSYGGHSFGINLLEVEDAPFWLDYGDDTEECYTARYWVHLRV 131 (134) T ss_pred HHHHHHhccCcccccccCchhhcceeEeecCCceeeecCCCccceeeeeEEEEEEE Confidence 999999877653211 11 12444443 454544444444 No 99 >protein:vir:103278 Length: 169 # NCBI annotation: phage-related conserved hypothetical protein # Family: family:all:5121 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277458;genbank:gi:71834100;genbank:GeneID:3562389 Probab=29.81 E-value=1.6 Score=19.42 Aligned_cols=113 Identities=13% Similarity=0.023 Sum_probs=55.0 Q ss_pred Cc---hHHHHHHHHhhcCCc--cceeeccCCCCCCc-cccccEEEEEecCC-CccceecCCCCc-ceEEEEEEeeC---C Q lcl|NC_019769. 1 MT---EDDLYPLLEPLAGGQ--VYPYVAPLGSDGKP-SVSPPWVIFSIITD-VAADVLCGQAES-AVSVQVDVYSS---T 69 (115) Q Consensus 1 M~---E~~i~~lL~~l~~~R--vyp~~aP~~~~~~p-~~~~Pyiv~q~vsg-~p~n~l~G~~~~-~~~vQIDvyA~---t 69 (115) |- -..+.++|...+..+ -+|-..| +..-.| +..-+|+-+..+-+ .-.+.|+|.... .-.+||+|-.+ - T Consensus 38 ~h~ei~~a~rk~l~~~a~a~~~~LpVA~E-NVaFtPp~dG~~YLr~~~lPadT~~~~L~gd~R~y~GVfQIsVV~PaGtG 116 (169) T protein:vir:10 38 VHYEMMVAARKLVSDAAVDIAGSLPVAYE-NCGFTPPKNGSSWLKFDYTEVDSVTWGLQRTCRYYVGMVQVSIFFSPGEG 116 (169) T ss_pred hHHHHHHHHHHHHHHHHhhcccCCcEeeC-CCCcCCCCCCccEEEEEEecCCceeeeccCCCceEEEEEEEEEEecCCCC Confidence 22 233444444433332 2222222 222222 33457887765544 444466665543 45899997643 5 Q ss_pred HHHHHHHHHHHHHHHHhh-----cceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 
70 ITEARTIRNMALDALQVL-----KPGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 70 ~~~A~~l~~av~~Al~~~-----~~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) .++|+.+++++.+++..- ++++ ++...|-+-+.--.-.+-++.+| T Consensus 117 ~~ka~qiAdeiadlF~~gt~L~~Gyi~-~~~~~~p~i~~~s~~~iPvr~~~ 166 (169) T protein:vir:10 117 TDRPRQLAGRLSEAFADGTMLDSGYIY-EGGSVFPPVKSQSGWFIPVRFYV 166 (169) T ss_pred cchhHHHHHHHHHhhhCCceeeceeec-CCCeECCeeecCCceEEeEEEEE Confidence 678999999999988742 2221 11122333332222222333333 No 100 >protein:vir:81228 Length: 109 # NCBI annotation: gp11 # Family: family:all:10297 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456741;genbank:gi:157168384;uniprot:Q9MBJ4;genbank:GeneID:5580351 Probab=28.55 E-value=1.7 Score=19.27 Aligned_cols=94 Identities=18% Similarity=0.181 Sum_probs=57.0 Q ss_pred ccceeeccCCCCCCccccccEEEEEecCCCccc--eecCCCCcceEEEEEEeeC----CHHHHHHHHHHHHHHHHhhcc- Q lcl|NC_019769. 17 QVYPYVAPLGSDGKPSVSPPWVIFSIITDVAAD--VLCGQAESAVSVQVDVYSS----TITEARTIRNMALDALQVLKP- 89 (115) Q Consensus 17 Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n--~l~G~~~~~~~vQIDvyA~----t~~~A~~l~~av~~Al~~~~~- 89 (115) -||- ..|.+. |.-.+|.+=.|-++ ...| .+...+.+-+-+-||.|+. .-..|.+++.+||.-|..... T Consensus 1 ~VY~-E~PH~~---~~~~LP~i~~Q~~G-P~a~~~A~Na~G~D~VD~DiD~~~~~D~~~sG~A~E~A~~IRs~~~r~R~~ 75 (109) T protein:vir:81 1 MVYV-EVPHNA---PLDKLPCIDLQPAG-PGSNLGAFNALGGDLVDVDVDFYAPVDYFNSGTAYEHASAIRSFLSKIRLP 75 (109) T ss_pred Ceee-ecCCCC---cccccCeeEeecCC-CcccccchhccCceeEeeeeeeeeeecccccchhHHHHHHHHHHHHhhccc Confidence 4553 234433 33468988888774 3333 2333556667777777776 457899999999999986432 Q ss_pred -eeec----cCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 90 -GSIV----KTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 90 -~~~~----~~~~ye~dT~lyr~~~df~i~~ 115 (115) +++. 
..+--|-..+..|..+...|-| T Consensus 76 ~~~V~~~~~P~~~PD~N~~IRR~G~T~TVAV 106 (109) T protein:vir:81 76 GFNVVAVTYPISLPDRNPRIRRLGMTATVAV 106 (109) T ss_pred cceeeeccCCccCCCCCcceeeeceeEEEEe Confidence 2221 1222444556677777777777 No 101 >protein:vir:79637 Length: 130 # NCBI annotation: gp41 # Family: family:all:5121 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285530;genbank:gi:148734513;genbank:GeneID:5219995 Probab=27.10 E-value=1.9 Score=19.08 Aligned_cols=114 Identities=15% Similarity=0.008 Sum_probs=61.8 Q ss_pred Cc---hHHHHHHHHhhcCCccceeeccCCCCCCccccccEEEEEecCC-CccceecCCCCc-ceEEEEEEeeC---CHHH Q lcl|NC_019769. 1 MT---EDDLYPLLEPLAGGQVYPYVAPLGSDGKPSVSPPWVIFSIITD-VAADVLCGQAES-AVSVQVDVYSS---TITE 72 (115) Q Consensus 1 M~---E~~i~~lL~~l~~~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg-~p~n~l~G~~~~-~~~vQIDvyA~---t~~~ 72 (115) |- -..+++++.....++ +|-..+--.-..|+..-.|+-+..+-+ .-.+.|.|.... .-.+||+|-.+ -.++ T Consensus 1 ~~~e~~~aaR~~~~~~~~~~-lpVA~ENv~FtPp~~G~~YLr~~~lpa~T~~~~L~~d~r~y~Gv~QI~VV~paG~G~~~ 79 (130) T protein:vir:79 1 MHYELSVAARMALAQEYESE-YMIAYENVEFTPPKGGGIWLKYDYKEADTIIHDLKRKCISYIGMVQIGIEFPPGSGIDK 79 (130) T ss_pred CcchhhHHHHHHHHhhhhhh-CceeecCCCcCCCCCCceEEEEEecCCCceeeeccCCCceEEEEEEEEEEecCCCCcch Confidence 43 245566776666663 665544333222343456887775544 444467776553 45999997654 5678 Q ss_pred HHHHHHHHHHHHHhhcc---eee-ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 
73 ARTIRNMALDALQVLKP---GSI-VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 73 A~~l~~av~~Al~~~~~---~~~-~~~~~ye~dT~lyr~~~df~i~~ 115 (115) |+.+++++.+.+..-.- .-+ ++-..|-+-+.--.-.+-++.+| T Consensus 80 a~~iA~ei~dlF~~g~~L~~Gyi~~~~~~~p~i~~~~~~~iPvr~~~ 126 (130) T protein:vir:79 80 ARKLAKNIADFFEDGKMLSNGYISEGAKVHQVQKSESGWFYPVRFYV 126 (130) T ss_pred hhHHHHHHHHhccCCceeeceeecCCCeECCeeecCCceEEeEEEEE Confidence 99999999988864211 111 11122433333323333333333 No 102 >protein:vir:99874 Length: 154 # NCBI annotation: hypothetical protein # Family: family:all:964 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164078;genbank:gi:56692610;genbank:GeneID:3192602 Probab=23.16 E-value=2.3 Score=18.55 Aligned_cols=112 Identities=14% Similarity=0.190 Sum_probs=56.5 Q ss_pred CchHHHHHHHHhhc--CC-ccceeeccCCCCCCccccccEEEEEe--cCCCccceecCCCCcceE------EEEEEeeC- Q lcl|NC_019769. 1 MTEDDLYPLLEPLA--GG-QVYPYVAPLGSDGKPSVSPPWVIFSI--ITDVAADVLCGQAESAVS------VQVDVYSS- 68 (115) Q Consensus 1 M~E~~i~~lL~~l~--~~-Rvyp~~aP~~~~~~p~~~~Pyiv~q~--vsg~p~n~l~G~~~~~~~------vQIDvyA~- 68 (115) ++-+.|++....+. .| -=|-.+. + ..+.| .+.-|+++.. .++.+...-.|.-..+.. +=++.|.+ T Consensus 13 ~Vi~RLra~~p~l~~V~gaadlAal~-~-~~~~p-~PaAyVlp~~d~~~~~~~~~~~g~~~Q~i~~~f~Vvl~v~~~~d~ 89 (154) T protein:vir:99 13 LVIERLRDQVKVLKHVGGAAELGTIT-Q-LRDFR-TPAAYVLLAQETLSPKPAGHAGGATRQMANVHFAITVAVRNYRDN 89 (154) T ss_pred HHHHHHHHhCcchhhhhhhhhhhhhh-h-hcCCC-CceEEEEecccccCCCCCCccccceeeeeeeEEEEEEEeeccCcc Confidence 44455553333221 11 1121111 1 11111 2345666654 334443333332222222 22233432 Q ss_pred ----CHHHHHHHHHHHHHHHHhhcceee----------ccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 69 ----TITEARTIRNMALDALQVLKPGSI----------VKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 69 ----t~~~A~~l~~av~~Al~~~~~~~~----------~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ..++...++.+|+.||-.+.|... 
++.-+|+..+-+|+-.|.....+ T Consensus 90 ~G~~a~d~l~~lr~~v~~AL~GW~P~~~~G~~pi~~~gG~l~d~~~g~l~y~~~F~~~~~l 150 (154) T protein:vir:99 90 KGVTAADDLRPVLGDVRKALIGWTPPGLAGARDCQLVQGQVVDYDASVLIWTDLYQTQHAI 150 (154) T ss_pred cchhhHHHHHHHHHHHHHHHhCCCCCcccCCceeeecCcceeeccCcEEEEeeeeeeeeec Confidence 456777889999999998887521 12346888888888877777777 No 103 >protein:vir:100200 Length: 123 # NCBI annotation: putative tail component # Family: family:all:1192 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025035;genbank:gi:48697268;genbank:GeneID:2948298 Probab=21.60 E-value=2.6 Score=18.33 Aligned_cols=113 Identities=16% Similarity=0.157 Sum_probs=69.3 Q ss_pred Cc-hHHHHHHHHhh-cC--CccceeeccCCCCCCccccccEEEEEecCCCccceecCCCC-cceEEEEEEeeCCHH--HH Q lcl|NC_019769. 1 MT-EDDLYPLLEPL-AG--GQVYPYVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAE-SAVSVQVDVYSSTIT--EA 73 (115) Q Consensus 1 M~-E~~i~~lL~~l-~~--~Rvyp~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~-~~~~vQIDvyA~t~~--~A 73 (115) |. =.+++.+|.+. .+ .+||.+..|.+..+ ....--+.-+.+.+.|..+=..... -+.+|||-+|-+.-. +. T Consensus 1 M~~v~~v~~ll~~~~~~~iD~vy~~~Ip~e~~~--~~~~T~vLiTe~~~~~~~ygnn~f~~~~~~VeIQIfY~~~~~~d~ 78 (123) T protein:vir:10 1 MSAVDDAVTVLNQAHIAGIDAIYGNNLPKSELD--NVNKTVVLVTDSADDPPSFGNNDFWSLNQEVELQIWYAQLLDSDT 78 (123) T ss_pred CCcHHHHHHHHHhcCCCccceeeecCCCccccc--CCceeEEEEeccCCCcccccCCceeeeEEEEEEEEEeccCCCCCH Confidence 98 68999999642 23 38999988865432 2334566666666555554444443 346899998888632 35 Q ss_pred HHHHHHHHHHHHhhccee-eccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 74 RTIRNMALDALQVLKPGS-IVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 74 ~~l~~av~~Al~~~~~~~-~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) ...-.++-.++...+-.. 
.......||||+-.-.++-|.=+= T Consensus 79 ~~~E~~L~~~f~~~~W~i~~s~~h~~DPdT~Q~~~t~~~~k~~ 121 (123) T protein:vir:10 79 EAIEIAMMKAFTHQHWQVAAVRQRTLDPDTQQLFNTFYFSRTK 121 (123) T ss_pred HHHHHHHHHHHhcCCcEEEecCCCcCCCCCCeEEEEEEEEeee Confidence 555555555665444322 234467999998666665554333 No 104 >protein:vir:99925 Length: 147 # NCBI annotation: gp12 # Family: family:all:11707 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655529;genbank:gi:109392299;genbank:GeneID:4157094 Probab=20.57 E-value=1.1 Score=20.26 Aligned_cols=104 Identities=16% Similarity=0.222 Sum_probs=61.3 Q ss_pred CchHHH-HHHHHhhc----------CCccceee----ccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEE Q lcl|NC_019769. 1 MTEDDL-YPLLEPLA----------GGQVYPYV----APLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDV 65 (115) Q Consensus 1 M~E~~i-~~lL~~l~----------~~Rvyp~~----aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDv 65 (115) ||.-++ -+-.+++. +.|-.|.. .|+|. +.-|+..++++...+-++ .++-+.+-| T Consensus 1 ~~~~~~~~P~v~P~~A~RaYLl~~L~~Rg~~L~VgatpPeG~------Pt~Yallsr~~s~r~~~l-----~~~LIRvRV 69 (147) T protein:vir:99 1 MTAPEMVGPTMEPAIACRAYLMRRLDDRGIDLSVGATPPDGK------PTRYVLVNQVDSRRRGPV-----ADYLIRTRV 69 (147) T ss_pred CCCccccCCcchhHHHHHHHHHHHHhhcCCcccccccCCCCC------CcceEEEecCCCCceeeh-----hheeEEEEe Confidence 653222 22222222 33444433 34432 356888888865554333 456777788 Q ss_pred eeCCHHHHHHHHHHHHHHHHhh------cc--eeec--------cCCCc--cccccceeeEEEEEEeC Q lcl|NC_019769. 66 YSSTITEARTIRNMALDALQVL------KP--GSIV--------KTPGY--EPDLRYHRATLEFQVTV 115 (115) Q Consensus 66 yA~t~~~A~~l~~av~~Al~~~------~~--~~~~--------~~~~y--e~dT~lyr~~~df~i~~ 115 (115) |..+.-+..+-++.+-.+|... 
.| ...| +-.++ |+++-||+...-+-|+| T Consensus 70 yd~D~~~~~r~A~LLHa~LlgA~h~kvv~Pd~G~vWiTGa~H~~GPad~~DD~~v~LfG~q~aVFWTi 137 (147) T protein:vir:99 70 YNADAYECGQHATLLHAALLGAAQARIVFPDVGQLWVTGTEHVSGPSDITDDDTTTLFGQAISVFWTV 137 (147) T ss_pred ecchhhhhccchhHHHHHHhhhhcceeeecCCCceEeecccccccccccCCCCCccccchhhheeeee Confidence 9988888877777777666421 11 1111 22333 67899999999999999 No 105 >protein:vir:94547 Length: 117 # NCBI annotation: hypothetical protein # Family: family:all:31527 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223894;genbank:gi:62327106;genbank:GeneID:5075563 Probab=20.50 E-value=2.8 Score=18.17 Aligned_cols=109 Identities=17% Similarity=0.168 Sum_probs=57.9 Q ss_pred CchHHHHHHHHh-hcCCccce-eeccCCCCCCccccccEEEEEecCCCccceecCCCCcceEEEEEEeeCC---HHHHHH Q lcl|NC_019769. 1 MTEDDLYPLLEP-LAGGQVYP-YVAPLGSDGKPSVSPPWVIFSIITDVAADVLCGQAESAVSVQVDVYSST---ITEART 75 (115) Q Consensus 1 M~E~~i~~lL~~-l~~~Rvyp-~~aP~~~~~~p~~~~Pyiv~q~vsg~p~n~l~G~~~~~~~vQIDvyA~t---~~~A~~ 75 (115) ||-++-|.-|.. +-..-+-| ++-|...+ .+|.+ +-.+.-...|.-.-.-...+.-|||.|... ..|=.. T Consensus 1 mtls~wy~~~~~~~tadgl~~~f~qp~~~~-----~lpl~-~vnvh~d~d~ssk~~tl~~v~qqid~y~~~~~~~~e~e~ 74 (117) T protein:vir:94 1 MTLSEWYLSLRDTCTADGLTVKFKQPSTDD-----ALPLL-HVNVHTDSDNSTKIDTLNQVSQQIDLYCENTISVIEFET 74 (117) T ss_pred CchHHHHHHHHhhhcccCceeEeecCCccc-----cceeE-EEEEeecCccccccchhhhhhheeeeeecCCCChhhHHH Confidence 998888877753 32333444 56676443 35653 444433333322212224578899999874 566667 Q ss_pred HHHHHHHHHHhhc---ceeeccCCCccccccceeeEEEEEEeC Q lcl|NC_019769. 76 IRNMALDALQVLK---PGSIVKTPGYEPDLRYHRATLEFQVTV 115 (115) Q Consensus 76 l~~av~~Al~~~~---~~~~~~~~~ye~dT~lyr~~~df~i~~ 115 (115) +-+.|+..|+... -......-+-...-.+.|+.|=+..++ T Consensus 75 ~v~kvk~s~sk~~rw~slt~~~~idts~g~~~rr~m~lit~ti 117 (117) T protein:vir:94 75 LVNKVKNSISKTIRWDSLTTQTMVDTSTGRDIRRAMFLVTFTI 117 (117) T ss_pred HHHHHhhhhhhhhhccccccccccccccchhhhhhhhhheecC Confidence 7778877776421 111111111111115677776666666 Done!