Query lcl|NC_021296.1_cdsid_YP_008050642.1 [gene=9] [protein=hypothetical protein] [protein_id=YP_008050642.1] [location=7231..7695] Match_columns 154 No_of_seqs 80 out of 85 Neff 5.4 Searched_HMMs 1612 Date Thu Nov 7 16:36:21 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_9 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_9_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99002 Length: 158 100.0 1.7E-44 1E-47 260.6 12.9 147 1-154 1-156 (158) 2 protein:vir:98481 Length: 136 100.0 2.7E-32 1.7E-35 193.7 10.4 129 1-141 1-136 (136) 3 protein:vir:9761 Length: 140 # 99.9 2E-29 1.2E-32 177.9 11.6 126 1-144 1-140 (140) 4 protein:vir:94761 Length: 132 99.9 2.3E-29 1.4E-32 177.6 11.5 117 1-121 1-132 (132) 5 protein:vir:9576 Length: 131 # 99.9 4.5E-29 2.8E-32 176.0 12.0 117 1-121 1-131 (131) 6 protein:vir:1640 Length: 132 # 99.9 5.4E-29 3.3E-32 175.6 11.6 117 1-121 1-132 (132) 7 protein:vir:2505 Length: 128 # 99.9 6.5E-29 4E-32 175.1 7.9 118 1-129 4-128 (128) 8 protein:vir:78254 Length: 149 99.9 2.4E-28 1.5E-31 172.0 10.8 137 2-148 1-149 (149) 9 protein:vir:78478 Length: 149 99.9 2.4E-28 1.5E-31 172.0 10.8 137 2-148 1-149 (149) 10 protein:vir:2432 Length: 124 # 99.9 2.1E-28 1.3E-31 172.3 9.5 116 2-122 1-124 (124) 11 protein:vir:7773 Length: 123 # 99.9 9.6E-28 6E-31 168.7 9.8 115 2-123 1-123 (123) 12 protein:vir:2345 Length: 125 # 99.9 3.7E-26 2.3E-29 160.0 9.8 116 1-123 1-125 (125) 13 protein:vir:104088 Length: 125 99.8 9E-25 5.6E-28 152.4 9.1 116 2-122 1-125 (125) 14 protein:vir:4228 Length: 125 # 99.8 7E-24 4.3E-27 147.5 9.1 116 2-122 1-125 (125) 15 protein:vir:108221 Length: 150 99.6 1.2E-17 7.5E-21 113.3 9.0 130 1-149 4-150 (150) 16 protein:vir:101652 Length: 188 98.7 1.9E-10 1.2E-13 73.9 9.1 103 1-111 1-188 (188) 17 protein:vir:7857 Length: 188 # 98.7 1.9E-10 1.2E-13 73.9 9.1 103 1-111 1-188 (188) 18 protein:vir:99922 Length: 165 98.4 3E-09 1.9E-12 67.3 9.1 139 1-149 9-165 (165) 19 protein:vir:8189 Length: 151 # 98.3 4E-09 2.5E-12 66.6 7.7 135 1-153 1-151 (151) 20 protein:vir:80668 Length: 153 98.3 9E-09 5.6E-12 64.7 9.3 138 1-144 1-153 (153) 21 protein:vir:81255 Length: 180 98.0 9.1E-08 5.7E-11 59.2 8.6 105 1-110 1-180 (180) 22 protein:vir:79253 Length: 138 97.9 3E-07 1.8E-10 56.4 9.5 126 1-146 1-138 (138) 23 protein:vir:99222 Length: 138 97.9 3E-07 1.8E-10 56.4 9.5 126 1-146 1-138 (138) 24 protein:vir:103846 Length: 138 97.8 2.8E-07 1.8E-10 56.5 9.1 126 1-146 1-138 (138) 25 protein:vir:105823 Length: 189 97.7 2.8E-07 1.8E-10 56.5 7.7 116 2-117 1-189 (189) 26 protein:vir:102606 Length: 189 97.7 2.8E-07 1.8E-10 56.5 7.7 116 2-117 1-189 (189) 27 protein:vir:7991 Length: 189 # 97.7 3.4E-07 2.1E-10 56.0 7.7 116 2-117 1-189 (189) 28 protein:vir:1993 Length: 141 # 97.5 1.8E-06 1.1E-09 52.0 8.6 130 1-152 1-141 (141) 29 protein:vir:99848 Length: 172 97.3 3.9E-06 2.4E-09 50.3 9.1 129 1-148 1-172 (172) 30 protein:vir:95004 Length: 169 97.2 1E-05 6.2E-09 48.0 10.3 118 1-119 14-169 (169) 31 protein:vir:9928 Length: 118 # 97.2 1.2E-05 7.4E-09 47.6 10.2 115 1-132 1-118 (118) 32 protein:vir:99517 Length: 124 97.1 1.9E-05 1.2E-08 46.4 10.8 113 1-124 1-124 (124) 33 protein:vir:97145 Length: 110 97.1 1.6E-05 9.9E-09 46.9 9.9 102 4-115 1-110 (110) 34 protein:vir:99796 Length: 110 97.1 1.6E-05 9.9E-09 46.9 9.9 102 4-115 1-110 (110) 35 protein:vir:9311 Length: 110 # 97.1 1.6E-05 9.9E-09 46.9 9.9 102 4-115 1-110 (110) 36 protein:vir:96221 Length: 110 97.1 1.6E-05 9.9E-09 46.9 9.9 102 4-115 1-110 (110) 37 protein:vir:78849 Length: 110 97.1 1.6E-05 9.9E-09 46.9 9.9 102 4-115 1-110 (110) 38 protein:vir:103957 Length: 110 97.1 1.6E-05 9.9E-09 46.9 9.9 102 4-115 1-110 (110) 39 protein:vir:96390 Length: 110 97.1 1.6E-05 9.9E-09 46.9 9.9 102 4-115 1-110 (110) 40 protein:vir:95774 Length: 115 96.9 1.9E-05 1.2E-08 46.4 9.3 108 1-117 1-115 (115) 41 protein:vir:79074 Length: 150 96.9 1.5E-05 9.3E-09 47.0 8.6 128 2-148 1-150 (150) 42 protein:vir:3970 Length: 110 # 96.9 3E-05 1.9E-08 45.4 9.8 101 4-117 1-110 (110) 43 protein:vir:78383 Length: 169 96.9 3.8E-05 2.4E-08 44.8 10.4 118 1-119 14-169 (169) 44 protein:vir:107864 Length: 150 96.8 2.4E-05 1.5E-08 46.0 9.0 128 2-148 1-150 (150) 45 protein:vir:106583 Length: 105 96.8 2.1E-05 1.3E-08 46.2 8.7 101 1-112 2-105 (105) 46 protein:vir:94507 Length: 113 96.8 3E-05 1.9E-08 45.4 9.4 104 4-114 1-113 (113) 47 protein:vir:80389 Length: 172 96.7 8.1E-05 5E-08 43.0 11.2 115 1-119 14-172 (172) 48 protein:vir:106596 Length: 128 96.7 2E-05 1.2E-08 46.4 7.5 107 1-117 13-128 (128) 49 protein:vir:8104 Length: 170 # 96.6 3.5E-05 2.2E-08 45.0 8.7 95 13-111 1-170 (170) 50 protein:vir:95176 Length: 172 96.6 8E-05 5E-08 43.0 10.2 117 1-119 16-172 (172) 51 protein:vir:9877 Length: 114 # 96.5 5.5E-05 3.4E-08 43.9 9.2 101 1-113 1-114 (114) 52 protein:vir:741 Length: 110 # 96.5 9E-05 5.6E-08 42.8 10.0 101 4-117 1-110 (110) 53 protein:vir:3615 Length: 110 # 96.4 7.8E-05 4.8E-08 43.1 9.2 101 4-117 1-110 (110) 54 protein:vir:98900 Length: 132 96.3 0.00018 1.1E-07 41.1 10.8 111 2-118 1-132 (132) 55 protein:vir:96488 Length: 113 96.1 0.00017 1E-07 41.3 9.5 101 4-115 1-113 (113) 56 protein:vir:4904 Length: 113 # 95.8 0.00018 1.1E-07 41.1 8.6 100 1-115 1-113 (113) 57 protein:vir:43 Length: 131 # N 95.7 0.00072 4.5E-07 37.8 11.5 111 2-118 1-131 (131) 58 protein:vir:1329 Length: 122 # 95.7 7.1E-05 4.4E-08 43.3 5.9 110 2-114 1-122 (122) 59 protein:vir:94955 Length: 170 95.6 0.00056 3.5E-07 38.4 10.5 115 1-119 13-170 (170) 60 protein:vir:80967 Length: 131 95.6 0.00083 5.1E-07 37.5 11.3 111 2-118 1-131 (131) 61 protein:vir:2738 Length: 112 # 95.5 0.00026 1.6E-07 40.2 8.4 100 1-115 3-112 (112) 62 protein:vir:1241 Length: 104 # 94.6 0.00072 4.5E-07 37.8 8.3 102 5-114 1-104 (104) 63 protein:vir:97430 Length: 104 94.6 0.00078 4.8E-07 37.6 8.4 102 5-114 1-104 (104) 64 protein:vir:94492 Length: 104 94.6 0.00078 4.8E-07 37.6 8.4 102 5-114 1-104 (104) 65 protein:vir:95071 Length: 104 94.6 0.00078 4.9E-07 37.6 8.4 102 5-114 1-104 (104) 66 protein:vir:93740 Length: 104 94.5 0.00084 5.2E-07 37.5 8.4 102 5-114 1-104 (104) 67 protein:vir:107119 Length: 104 94.3 0.00085 5.3E-07 37.4 8.0 102 5-114 1-104 (104) 68 protein:vir:105327 Length: 104 94.3 0.00085 5.3E-07 37.4 8.0 102 5-114 1-104 (104) 69 protein:vir:97329 Length: 104 94.1 0.001 6.2E-07 37.0 7.9 102 5-114 1-104 (104) 70 protein:vir:95891 Length: 104 94.1 0.001 6.4E-07 36.9 7.9 102 5-114 1-104 (104) 71 protein:vir:96281 Length: 104 94.1 0.001 6.4E-07 36.9 7.9 102 5-114 1-104 (104) 72 protein:vir:94798 Length: 104 94.1 0.001 6.5E-07 36.9 7.9 102 5-114 1-104 (104) 73 protein:vir:5976 Length: 102 # 94.0 0.0011 6.9E-07 36.8 7.9 100 5-110 1-102 (102) 74 protein:vir:96128 Length: 98 # 93.6 0.0016 9.8E-07 35.9 8.0 97 5-112 1-98 (98) 75 protein:vir:96831 Length: 98 # 92.6 0.0029 1.8E-06 34.5 7.9 97 5-112 1-98 (98) 76 protein:vir:79701 Length: 144 90.8 0.014 8.5E-06 30.8 9.7 111 1-114 1-144 (144) 77 protein:vir:6243 Length: 122 # 90.7 0.0024 1.5E-06 35.0 5.5 110 2-114 1-122 (122) 78 protein:vir:100103 Length: 120 87.3 0.014 8.9E-06 30.7 7.3 101 1-111 1-120 (120) 79 protein:vir:102961 Length: 131 87.3 0.022 1.4E-05 29.6 8.3 97 6-109 1-131 (131) 80 protein:vir:93592 Length: 108 87.2 0.037 2.3E-05 28.4 9.4 96 1-111 1-108 (108) 81 protein:vir:97267 Length: 172 85.3 0.054 3.4E-05 27.5 10.0 115 1-119 15-172 (172) 82 protein:vir:79050 Length: 133 85.2 0.016 1E-05 30.4 6.4 104 1-112 1-133 (133) 83 protein:vir:3160 Length: 198 # 81.7 0.048 3E-05 27.8 7.5 111 1-152 1-198 (198) 84 protein:vir:100245 Length: 113 79.8 0.1 6.2E-05 26.0 8.8 97 1-111 1-113 (113) 85 protein:vir:97069 Length: 115 78.7 0.083 5.2E-05 26.5 7.8 97 4-112 1-115 (115) 86 protein:vir:192 Length: 108 # 78.6 0.11 7E-05 25.8 9.7 104 1-120 3-108 (108) 87 protein:vir:1887 Length: 108 # 78.6 0.11 7E-05 25.8 9.7 104 1-120 3-108 (108) 88 protein:vir:94126 Length: 116 78.2 0.086 5.3E-05 26.4 7.7 108 1-118 1-116 (116) 89 protein:vir:105899 Length: 116 78.2 0.086 5.3E-05 26.4 7.7 108 1-118 1-116 (116) 90 protein:vir:81069 Length: 115 68.5 0.22 0.00014 24.2 7.5 97 4-112 1-115 (115) 91 protein:vir:10365 Length: 115 62.5 0.33 0.00021 23.2 8.8 97 4-112 1-115 (115) 92 protein:vir:486 Length: 107 # 55.3 0.48 0.0003 22.3 8.3 96 1-114 1-107 (107) 93 protein:vir:5256 Length: 119 # 51.6 0.58 0.00036 21.9 7.4 99 4-113 1-119 (119) 94 protein:vir:4512 Length: 107 # 51.5 0.58 0.00036 21.9 8.1 96 3-114 1-107 (107) 95 protein:vir:4788 Length: 130 # 36.2 1.2 0.00074 20.2 9.8 111 2-120 1-130 (130) 96 protein:vir:1384 Length: 92 # 33.7 1.3 0.00083 19.9 6.9 89 5-110 1-92 (92) 97 protein:vir:107614 Length: 96 29.0 1.4 0.00085 19.8 4.9 94 1-124 1-96 (96) 98 protein:vir:102083 Length: 96 29.0 1.4 0.00085 19.8 4.9 94 1-124 1-96 (96) 99 protein:vir:102863 Length: 96 29.0 1.4 0.00085 19.8 4.9 94 1-124 1-96 (96) 100 protein:vir:105005 Length: 96 29.0 1.4 0.00085 19.8 4.9 94 1-124 1-96 (96) 101 protein:vir:4458 Length: 107 # 24.0 2.2 0.0014 18.7 8.6 94 3-110 1-107 (107) No 1 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=100.00 E-value=1.7e-44 Score=260.59 Aligned_cols=147 Identities=35% Similarity=0.617 Sum_probs=131.6 Q ss_pred CCcCCCHHHHHHHhcCCCCHH---HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCcccee Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGD---ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISR 77 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~---E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~e 77 (154) |++|||+|||++||..+|+++ .+++|+++|++||+++|.+.|+.|+ .+..+|++|++||+++|+|.|+||++++|+ T Consensus 1 ~~alasvee~~trl~~~lp~~~~r~~a~a~~vLd~~S~~ar~~~gr~W~-~~~daP~~vr~ivL~aa~R~~~NP~g~~~~ 79 (158) T protein:vir:99 1 MAALVSVEEFTTFLRVPLPEEGSEKYTQMEFLLTLASDWARELSCKPWL-LPADAPVTARGIILAASRREWNNPKRVSYV 79 (158) T ss_pred CcceeeHhhhhhhhcccCChhhhHHHHHHHHHHHHHHHHHHHhcCccCC-CCCcchhHHHHHHHHHHHHHHhcCCceEEe Confidence 999999999999999999855 4455566799999999999999999 677889999999999999999999999999 Q ss_pred eecceeeEeecCC--CcccCHHHHHHHHhhccC-CceeeccccccccccccCCCceeecC--CCCcCCc-cCCCCCcCCc Q lcl|NC_021296. 78 QMGPFNVQYSQPP--DGFFYPAELAILKRFKRS-GGLQTVSTSRGEEGRPWAGKTAFIRY--GDGLFPF-CSEDDGYGDV 151 (154) Q Consensus 78 taG~fs~s~~~sg--g~~lt~aE~~~Lrr~r~~-~g~~sV~~~r~~~~~~~~~~~~~v~~--gg~~~p~-~~~~~g~~~~ 151 (154) ++|+|+++|++++ ++|||++|++.|+||+++ +|+++|+++|+|+.+ .+.|||+ +|||||+ +.+||||| T Consensus 80 ~~G~~~~~~~~~g~~~~ffT~~E~~~L~r~~~s~GG~~~~~ttR~d~~~----~~~yv~v~~~GdpfP~~~~~d~g~g-- 153 (158) T protein:vir:99 80 VKGPQSATFMQSAYPPGFFTDAEEAKLRSYGRSTGNWGVIETYRDDEEQ----LNGYLEVYPHGGLMPVYHPDDIGYG-- 153 (158) T ss_pred eecchhhhcccccCCCcccCHHHHHHHHHhhcccCceeEEEeecCcccc----CCceecccCCCCcccccCccccCCC-- Confidence 9999999999885 459999999999999866 889999999999865 7788886 7999995 56667999 Q ss_pred cCC Q lcl|NC_021296. 152 VPW 154 (154) Q Consensus 152 ~~w 154 (154) ..| T Consensus 154 ~~~ 156 (158) T protein:vir:99 154 GSI 156 (158) T ss_pred ccc Confidence 566 No 2 >protein:vir:98481 Length: 136 # NCBI annotation: ORF3 # Family: family:all:32440 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958281;genbank:gi:41057255;uniprot:Q38596;genbank:GeneID:2732865 Probab=99.95 E-value=2.7e-32 Score=193.67 Aligned_cols=129 Identities=19% Similarity=0.262 Sum_probs=100.8 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHH--HHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceee Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDEL--EQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQ 78 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~--~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~et 78 (154) |.+|||++||++|++++|+++|. ++++++|++||++||.++ |+...+.|.++|+|+|+||+|+|+||+|++|+| T Consensus 1 M~~fAtv~Dl~~rw~~~~~dee~~ra~~~~lL~dAS~~ir~~~----p~~~~~~~~~~~~V~~~~V~R~~~np~G~~s~T 76 (136) T protein:vir:98 1 MAAYATVEDYQARAAVTLPDGSPRRAQVEAYLDDASALMARHI----PTGHTPDPGTLRAICVAVVRRVMANPGGYRQRT 76 (136) T ss_pred CCccCCHHHHHHHhccCCCCchhHHHHHHHHHHHHHHHHHHhC----CCCCCCChhHHHHHHHHHHHHHhhCCCCccccc Confidence 99999999999998888887664 578999999999999875 444455689999999999999999999999999 Q ss_pred ecceeeEeecCCCcccCHHHHHHHHh----hccCCceeeccccccccccccCCCceeecC-CCCcCCc Q lcl|NC_021296. 79 MGPFNVQYSQPPDGFFYPAELAILKR----FKRSGGLQTVSTSRGEEGRPWAGKTAFIRY-GDGLFPF 141 (154) Q Consensus 79 aG~fs~s~~~sgg~~lt~aE~~~Lrr----~r~~~g~~sV~~~r~~~~~~~~~~~~~v~~-gg~~~p~ 141 (154) +|+||.|++..|++|||++|+++|.+ |....+.|||....|.+ .|.++ +||--|- T Consensus 77 aG~ys~s~t~~G~Lylt~~E~~~Lg~~rqr~~~~d~a~si~~~~~~~--------~~~~dp~~~~~~~ 136 (136) T protein:vir:98 77 IGQYAETLGEDGGLYLTEDEKGQLQPPDQTAPDADAAYSLDLDPGTR--------AWVDDPAGCGWPR 136 (136) T ss_pred chhHHHhhhcCCCcccChHHHHHhCCCCCcccccccceecccCCCcC--------CcCCCCCCCCCCC Confidence 99999988778999999999988843 22233567776665544 33333 1221121 No 3 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=99.93 E-value=2e-29 Score=177.94 Aligned_cols=126 Identities=20% Similarity=0.190 Sum_probs=98.4 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccc------hHHHHHHHHHHHHHHHh---CC Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADV------PDDVRAVVLQASRRELK---NP 71 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~------p~~v~~Vv~~~vaR~l~---nP 71 (154) |.+|||++||++| ||+|+++|.+||++||++||++||.++-+...++|... +..+|.|||+||+|+|. |+ T Consensus 1 m~~fATv~Dv~~r-wr~Lt~dE~~ra~~LL~dAS~~iR~~~p~~g~~~~~~~~~~~~~~~~~k~V~~~mV~Ral~~~~d~ 79 (140) T protein:vir:97 1 MGNFATTDDVILL-WRPLSVDELKRANALLKVVSDTLRMEADKVGKDLDKTMVDKPYFVNVIKSVTVDIVARTLMTSTQG 79 (140) T ss_pred CCcCCCHHHHHHH-hcCCCHhHHHHHHHHHHHHHHHHHHhhhhccCCcchhcccCccchhHHHHHHHHHHHHHhcCCCCC Confidence 9999999999998 59999999999999999999999998876666665443 44689999999999984 55 Q ss_pred Cc--cceeeecceeeE--eec-CCCcccCHHHHHHHHhhccCCceeeccccccccccccCCCceeecCCCCcCCccCC Q lcl|NC_021296. 72 DR--VISRQMGPFNVQ--YSQ-PPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRPWAGKTAFIRYGDGLFPFCSE 144 (154) Q Consensus 72 ~g--~~~etaG~fs~s--~~~-sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~~~~~~~~v~~gg~~~p~~~~ 144 (154) .| +.|+|+|+||.| |.+ +|++|||++|+++| +.+++.+..++-+|+.++- +| ||.. T Consensus 80 ~G~tq~S~TaG~ys~S~T~~np~G~lylt~~e~~~L---Gl~~~r~~~i~~~g~~~~~---~~-----------~~~~ 140 (140) T protein:vir:97 80 EPMSQESQSALGYTWSGTYLVPGGGLFIKDNELKRL---GLKKQRYGGIELYGEIKRD---ND-----------YFDR 140 (140) T ss_pred CcceeeeeeccchhheeeeecCCCCceeChHHHHHh---CCCCCceeeecccCccccC---cc-----------cccC Confidence 56 455899999665 655 47899999988655 5566766666668988753 22 2222 No 4 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=99.93 E-value=2.3e-29 Score=177.60 Aligned_cols=117 Identities=24% Similarity=0.260 Sum_probs=93.8 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcc-------cchHHHHHHHHHHHHHHHhCC-- Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPA-------DVPDDVRAVVLQASRRELKNP-- 71 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~-------~~p~~v~~Vv~~~vaR~l~nP-- 71 (154) |.+|||++||++| ||+|+++|.+||++||++||++||.++.+...+++. ..+..+|+|||+||+|+|.+| T Consensus 1 m~~fAtv~Dl~~r-~r~L~~dE~~ra~~LL~dAs~~iR~~~~~~~~~~~~~~~~~~d~~~~~~k~V~~~~V~Ral~~~~~ 79 (132) T protein:vir:94 1 MNPFATVDDLTML-WRPLKGDEKERAEKLLEIVSDTLREEADKVGRDLDVMISEKPSYFSSVVKSVTVDIVARTLMTSTD 79 (132) T ss_pred CCCcCCHHHHHHH-hccCChhHHHHHHHHHHHHHHHHHHHHhhhccccccccCCCCccchhHHHHHHHHHHHHHhcCCCC Confidence 9999999999997 699999999999999999999999886654443332 235678999999999999764 Q ss_pred -Cc--cceeeecceeeE--eec-CCCcccCHHHHHHHHhhccCCceeecccccccc Q lcl|NC_021296. 72 -DR--VISRQMGPFNVQ--YSQ-PPDGFFYPAELAILKRFKRSGGLQTVSTSRGEE 121 (154) Q Consensus 72 -~g--~~~etaG~fs~s--~~~-sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~ 121 (154) .| +.|+|+|+||.| |++ +|++|||++|+++| +.+++.+..++.+|++ T Consensus 80 ~~g~tq~S~TaG~ys~S~T~~np~G~lylt~~e~~~L---Gl~~~r~~~i~~~~~~ 132 (132) T protein:vir:94 80 QEPMTQTTESALGYSVSGSYLVPGGGLFIKNSELSRL---GLKKQRFGVIDFYGND 132 (132) T ss_pred CCCceeeeeecccceeeeeeecCCCCceeChHHHHhh---CCCCCceEEEeecCCC Confidence 33 456899999765 555 47899999988555 6566777777777777 No 5 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=99.93 E-value=4.5e-29 Score=175.97 Aligned_cols=117 Identities=21% Similarity=0.255 Sum_probs=94.8 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCc------ccchHHHHHHHHHHHHHHHhCC--- Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAP------ADVPDDVRAVVLQASRRELKNP--- 71 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~------~~~p~~v~~Vv~~~vaR~l~nP--- 71 (154) |.+|||++||++| ||+|+++|.++|++||++||++||.++.+...+++ ...+.++|+|||+||+|+|.+| T Consensus 1 m~~fAtv~D~~~r-wr~Lt~~E~~ra~~LL~~As~~ir~~~p~~~~~l~~~~~~~~~~~~~~~~V~~~~V~Ral~~~~~~ 79 (131) T protein:vir:95 1 MENFATVEDLKKL-WRALKFDEEKRAEALLEVVSHSLRVEAKKVGKDLDGLVATDPSFTMVVKSVTVDVVARTLMTSTDQ 79 (131) T ss_pred CCccCCHHHHHHH-hcCCCHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccCCccchHHHHHHHHHHHHHHhcCCCCC Confidence 9999999999987 69999999999999999999999998765433333 2345689999999999999755 Q ss_pred Ccc--ceeeecceeeE--eec-CCCcccCHHHHHHHHhhccCCceeecccccccc Q lcl|NC_021296. 72 DRV--ISRQMGPFNVQ--YSQ-PPDGFFYPAELAILKRFKRSGGLQTVSTSRGEE 121 (154) Q Consensus 72 ~g~--~~etaG~fs~s--~~~-sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~ 121 (154) .|+ .|+|+|+||.| |.+ +|++|||++|+++| +.+++.+..++.||++ T Consensus 80 ~G~tq~S~TaG~ys~S~t~~~p~g~lylt~~e~~~L---Gl~~~r~~~i~~~~~~ 131 (131) T protein:vir:95 80 EPMTQVAESALGYSFSGSYLVPGGGLFIKDSELKRL---GLKKQRYGVIDIYGTD 131 (131) T ss_pred CCceeeeeecccceeeeeeecCCCCceeChHHHHHh---CCCCCceeEEeeccCC Confidence 454 45899999765 555 47899999988555 6667777777777877 No 6 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=99.92 E-value=5.4e-29 Score=175.58 Aligned_cols=117 Identities=24% Similarity=0.249 Sum_probs=95.0 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCc-------ccchHHHHHHHHHHHHHHHhCCC- Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAP-------ADVPDDVRAVVLQASRRELKNPD- 72 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~-------~~~p~~v~~Vv~~~vaR~l~nP~- 72 (154) |.+|||++||++| ||+|+++|.++|++||++||++||..+-+...+++ +..+.++|+|+|+||+|+|.||. T Consensus 1 m~~fAtv~Dv~~r-~r~L~~~E~~ra~~lL~dAs~~ir~~~p~~~~~l~a~~~e~~~~~~~~~~~V~~~~V~Ral~~~~~ 79 (132) T protein:vir:16 1 MNPFATVDDLTML-WRPLKGDEKERAEKLLEIVSDSLREEADKVGRDLYAMIAEKPSYFASVVKSVTVDIVARTLMTSTD 79 (132) T ss_pred CCccCCHHHHHHH-hcCCCHhHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccchhHHHHHHHHHHHHHhcCCCC Confidence 9999999999998 59999999999999999999999998754444443 22355789999999999999873 Q ss_pred --c--cceeeecceeeE--eec-CCCcccCHHHHHHHHhhccCCceeecccccccc Q lcl|NC_021296. 73 --R--VISRQMGPFNVQ--YSQ-PPDGFFYPAELAILKRFKRSGGLQTVSTSRGEE 121 (154) Q Consensus 73 --g--~~~etaG~fs~s--~~~-sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~ 121 (154) | +.|+|+|+||.| |.+ +|++|||++|+++| +.+++.|.+++.+|++ T Consensus 80 ~~G~tq~S~TaG~ys~S~t~~~p~G~lylt~~e~~~L---G~~~~r~~~i~~~~~~ 132 (132) T protein:vir:16 80 QEPMTQTTESALGYSVSGSYLVPGGGLFIKNSELSRL---GLKKQRFGVIDFYGND 132 (132) T ss_pred CCCceeeeeeccchheeeeeecCCCcceeChHHHHhh---CCCCCceEEEeecCCC Confidence 3 456899999765 554 47899999998655 6666777777777877 No 7 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=99.92 E-value=6.5e-29 Score=175.12 Aligned_cols=118 Identities=17% Similarity=0.259 Sum_probs=97.7 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCcc---ce- Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRV---IS- 76 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~---~~- 76 (154) -.+|||++||+++|+|+|+++|+++|..||++|||+||+|+++ -..+.++|+.|++||++||+|+|+.|... .| T Consensus 4 ~~alAtvdDv~~~lrr~Lt~dE~~~a~~Ll~eAsdlI~g~l~~--~~vp~~~p~~v~rVvA~ivarAltr~~~~~pe~~S 81 (128) T protein:vir:25 4 CKALATSQDVKRALRRDLTEAEQTDLSELLAEATDLVVGYLHP--YPVPTPTPGPIKRVVASMVAAVLTRPTQILPETQS 81 (128) T ss_pred chhccCHHHHHHHhcCCCCHHHHHHHHHHHhcchheeeeecCC--CCCCCCCCchHHHHHHHHHHHHhhCCCccCCCcee Confidence 3499999999999999999999999999999999999998863 24578899999999999999999877643 22 Q ss_pred eeecceeeEee---cCCCcccCHHHHHHHHhhccCCceeeccccccccccccCCCc Q lcl|NC_021296. 77 RQMGPFNVQYS---QPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRPWAGKT 129 (154) Q Consensus 77 etaG~fs~s~~---~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~~~~~~ 129 (154) -|||||+.+|+ +++|+|||.+||++||+||. |+|||+..- +|+ | T Consensus 82 ~TAgpfs~~ft~~~~~~g~yLTaa~k~~Lrp~R~--~~~sV~l~s---ery----~ 128 (128) T protein:vir:25 82 LTADGFGVTFTPGGNSPGPYLSAALKQRLRPYRT--GMVAVEMGS---ERY----C 128 (128) T ss_pred eecccccccccCCCCCCCceEcHHHHhhcccccc--eeeEeeccc---ccC----C Confidence 38999998774 56899999999999999977 556665432 122 2 No 8 >protein:vir:78254 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491668;genbank:gi:157786492;genbank:GeneID:5625732 Probab=99.91 E-value=2.4e-28 Score=172.01 Aligned_cols=137 Identities=20% Similarity=0.209 Sum_probs=102.0 Q ss_pred CcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccch-----HHHHHHHHHHHHHHHhCCCccce Q lcl|NC_021296. 2 AGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVP-----DDVRAVVLQASRRELKNPDRVIS 76 (154) Q Consensus 2 ~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p-----~~v~~Vv~~~vaR~l~nP~g~~~ 76 (154) -+|||++||++|++|+||++|.++|+++|++||++||.. +|++++.++ ..++.|+|+||+|+|+||+|++| T Consensus 1 ~afAtv~Dve~rw~r~LT~eE~~~ae~lL~dAs~~IR~~----iP~La~~~~dp~~~a~v~~V~~~mV~R~~rnpeG~~S 76 (149) T protein:vir:78 1 MAYAEPSDVVARLGRPLTDDEETQVETFLEDAEIEIRSR----IPDLDDKAEDEDYLKRVIKVEASAVTRLIRNPDGYIG 76 (149) T ss_pred CCcCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHh----ccccccccCCcchhhHHHHHHHHHHHHHhcCCCCeee Confidence 689999999999889999999999999999999999974 466665444 46899999999999999999999 Q ss_pred eeecceeeEe--ec-CCCcccCHHHHHHHHhhccCCceeeccccc---cccccccCCCcee-ecCCCCcCCccCCCCCc Q lcl|NC_021296. 77 RQMGPFNVQY--SQ-PPDGFFYPAELAILKRFKRSGGLQTVSTSR---GEEGRPWAGKTAF-IRYGDGLFPFCSEDDGY 148 (154) Q Consensus 77 etaG~fs~s~--~~-sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r---~~~~~~~~~~~~~-v~~gg~~~p~~~~~~g~ 148 (154) +|.|+||.|. .+ +|++|||++|+++|.. +++.|+|.|-.-- ..-+-+.++...| |-.-..|+ | ||| T Consensus 77 ~T~G~YS~slt~~np~G~LylT~~E~a~LG~-~r~~G~~~i~p~~~~~~~~~~~~~~~~~~~~~~~~~~~-~----~~~ 149 (149) T protein:vir:78 77 ETDGNYSYQLNWRLNTGAIEITDKEWAQLGL-SKNVGVLNVRPKTPLERSGEYPAFGSVEWQVFQQSSPL-Y----WGY 149 (149) T ss_pred eecchhhhhhhccCCCCceeeCHHHHHhhCC-cccccceeecccCccccCCCCCcccceeeeeeeccCcc-c----ccC Confidence 9999999865 33 5899999999988865 4455888876431 1111222222222 21123333 2 455 No 9 >protein:vir:78478 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491587;genbank:gi:157786410;genbank:GeneID:5625630 Probab=99.91 E-value=2.4e-28 Score=172.01 Aligned_cols=137 Identities=20% Similarity=0.209 Sum_probs=102.0 Q ss_pred CcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccch-----HHHHHHHHHHHHHHHhCCCccce Q lcl|NC_021296. 2 AGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVP-----DDVRAVVLQASRRELKNPDRVIS 76 (154) Q Consensus 2 ~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p-----~~v~~Vv~~~vaR~l~nP~g~~~ 76 (154) -+|||++||++|++|+||++|.++|+++|++||++||.. +|++++.++ ..++.|+|+||+|+|+||+|++| T Consensus 1 ~afAtv~Dve~rw~r~LT~eE~~~ae~lL~dAs~~IR~~----iP~La~~~~dp~~~a~v~~V~~~mV~R~~rnpeG~~S 76 (149) T protein:vir:78 1 MAYAEPSDVVARLGRPLTDDEETQVETFLEDAEIEIRSR----IPDLDDKAEDEDYLKRVIKVEASAVTRLIRNPDGYIG 76 (149) T ss_pred CCcCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHHh----ccccccccCCcchhhHHHHHHHHHHHHHhcCCCCeee Confidence 689999999999889999999999999999999999974 466665444 46899999999999999999999 Q ss_pred eeecceeeEe--ec-CCCcccCHHHHHHHHhhccCCceeeccccc---cccccccCCCcee-ecCCCCcCCccCCCCCc Q lcl|NC_021296. 77 RQMGPFNVQY--SQ-PPDGFFYPAELAILKRFKRSGGLQTVSTSR---GEEGRPWAGKTAF-IRYGDGLFPFCSEDDGY 148 (154) Q Consensus 77 etaG~fs~s~--~~-sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r---~~~~~~~~~~~~~-v~~gg~~~p~~~~~~g~ 148 (154) +|.|+||.|. .+ +|++|||++|+++|.. +++.|+|.|-.-- ..-+-+.++...| |-.-..|+ | ||| T Consensus 77 ~T~G~YS~slt~~np~G~LylT~~E~a~LG~-~r~~G~~~i~p~~~~~~~~~~~~~~~~~~~~~~~~~~~-~----~~~ 149 (149) T protein:vir:78 77 ETDGNYSYQLNWRLNTGAIEITDKEWAQLGL-SKNVGVLNVRPKTPLERSGEYPAFGSVEWQVFQQSSPL-Y----WGY 149 (149) T ss_pred eecchhhhhhhccCCCCceeeCHHHHHhhCC-cccccceeecccCccccCCCCCcccceeeeeeeccCcc-c----ccC Confidence 9999999865 33 5899999999988865 4455888876431 1111222222222 21123333 2 455 No 10 >protein:vir:2432 Length: 124 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046834;genbank:gi:9630402;genbank:GeneID:1261576 Probab=99.91 E-value=2.1e-28 Score=172.28 Aligned_cols=116 Identities=20% Similarity=0.188 Sum_probs=97.0 Q ss_pred CcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCccc-----chHHHHHHHHHHHHHHHhCCCccce Q lcl|NC_021296. 2 AGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPAD-----VPDDVRAVVLQASRRELKNPDRVIS 76 (154) Q Consensus 2 ~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~-----~p~~v~~Vv~~~vaR~l~nP~g~~~ 76 (154) -+|||++||++|++|+|+++|.++++.+|++||++||. ++|++++. .+..|+.|+|+||+|+|+||+|++| T Consensus 1 ~~~At~~Dv~~rw~r~Lt~~E~~~ve~lL~dAs~~ir~----r~P~l~~~~~~~~~~~~v~~V~a~~V~R~~rnP~G~~s 76 (124) T protein:vir:24 1 MAYATADDVVTLWAKEPEPEVMALIERRLEQVERMIRR----RIPDLDARVSSDIFRADLIDIEADAVLRLVRNPEGYLS 76 (124) T ss_pred CCCCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHh----cCCCcchhcCCCCChhhHHHHHHHHHHHHhhCCCCcee Confidence 68999999999988999999999999999999999995 56777553 4678999999999999999999999 Q ss_pred eeecceeeEee--c-CCCcccCHHHHHHHHhhccCCceeeccccccccc Q lcl|NC_021296. 77 RQMGPFNVQYS--Q-PPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEG 122 (154) Q Consensus 77 etaG~fs~s~~--~-sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~ 122 (154) +|.|+||.|.+ + +|++|+|++|+++|..- +..|+|+|...----. T Consensus 77 ~T~G~Ys~sl~~~~~~g~Lylt~~E~~~Lg~~-r~~~~~~i~p~~~~~~ 124 (124) T protein:vir:24 77 ETDGAYTYQLQADLSQGKLVILDEEWTTLGVN-RLSRMSTLVPNIVMPT 124 (124) T ss_pred cccchhHHhhhhcccCCceeeCHHHHHhhCcc-cccceeEeecceeeCC Confidence 99999998653 3 48999999999877663 3457888876531111 No 11 >protein:vir:7773 Length: 123 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817607;genbank:gi:29566037;genbank:GeneID:1259231 Probab=99.90 E-value=9.6e-28 Score=168.71 Aligned_cols=115 Identities=17% Similarity=0.216 Sum_probs=93.9 Q ss_pred CcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccch-----HHHHHHHHHHHHHHHhCCCccce Q lcl|NC_021296. 2 AGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVP-----DDVRAVVLQASRRELKNPDRVIS 76 (154) Q Consensus 2 ~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p-----~~v~~Vv~~~vaR~l~nP~g~~~ 76 (154) -+|||++||++|++|+|+++|.++++.+|.+||++||. ++|++++.++ ..++.|+|+||+|+|+||+|++| T Consensus 1 ~~~At~~Dv~ar~~r~LT~~E~~~ve~lL~dAs~~ir~----r~P~l~~~a~d~~~~~~~~~V~~~~V~R~~rnpeG~~s 76 (123) T protein:vir:77 1 MPYATASDVTSRWARQPTDEETALINVRLADVERMIKR----RIPDLATKVTDPDYLEDLKQVEADAVLRLVRNPEGYLS 76 (123) T ss_pred CCcCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHH----hccCcccccCCcchhHHHHHHHHHHHHHHhhCCCCcee Confidence 68999999999999999999999999999999999996 4566664433 57899999999999999999999 Q ss_pred eeecceeeEe--ec-CCCcccCHHHHHHHHhhccCCceeecccccccccc Q lcl|NC_021296. 77 RQMGPFNVQY--SQ-PPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGR 123 (154) Q Consensus 77 etaG~fs~s~--~~-sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~ 123 (154) +|.|+||.+. .+ +|++|+|++|+++|..-+ .++|++...-. ... T Consensus 77 ~T~G~ys~sl~~a~~~g~Lylt~~E~~~Lg~~~--~~~~~i~p~~~-~~~ 123 (123) T protein:vir:77 77 ETDGNYTYMLRSDLASGKLEIFPEEWEILGYRR--SRMTVIVPNPV-MPT 123 (123) T ss_pred cccchhhhhhcccCCCCcceeCHHHHHhhcCCC--CceeEEeecee-cCC Confidence 9999999874 33 589999999998776644 34666554421 111 No 12 >protein:vir:2345 Length: 125 # NCBI annotation: gp15 # Family: family:all:2817 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075282;genbank:gi:12657869;genbank:GeneID:920134 Probab=99.88 E-value=3.7e-26 Score=160.03 Aligned_cols=116 Identities=19% Similarity=0.194 Sum_probs=98.2 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCC------cccchHHHHHHHHHHHHHHHhCCCcc Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDA------PADVPDDVRAVVLQASRRELKNPDRV 74 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~------~~~~p~~v~~Vv~~~vaR~l~nP~g~ 74 (154) |++||+++|++++++|+|+++|..+++.+|.+|+.+|| ++|||+ +.+.+.+++.|+++||+|.++||+|+ T Consensus 1 ma~~A~~eDV~a~w~R~lt~eE~~~V~~~L~~ae~~ir----rriPdL~~r~~~~~~~~~~v~~V~a~~V~Rv~rnPeGy 76 (125) T protein:vir:23 1 MATLATHEDVTAFWARTPTAEEIVLINRRLAQAERMLL----RAIPELLIKASSDPVFRAEVIDIEAEAVLRLVRNHEGY 76 (125) T ss_pred CCcccCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHH----HhcCChhhhhcCCCcchhhHHHHHHHHHHHHhcCCCCc Confidence 99999999999999999999999999999999999999 579987 44567789999999999999999999 Q ss_pred ceeeecceeeEee---cCCCcccCHHHHHHHHhhccCCceeecccccccccc Q lcl|NC_021296. 75 ISRQMGPFNVQYS---QPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGR 123 (154) Q Consensus 75 ~~etaG~fs~s~~---~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~ 123 (154) +|+|.|+||+|+. .+|++|+|++|+++|..-+. |+|.+...-. .+. T Consensus 77 ~seT~g~Yt~~l~~~~~~g~L~it~~E~a~Lg~~~s--~~~vi~p~~~-~p~ 125 (125) T protein:vir:23 77 LSETDGNYTYMLQAQDPNRKLEILPEEWEVLGIVRS--GLGILVPTVV-LPS 125 (125) T ss_pred cccccchhhhhhhccCCCCceeecHHHHHhhccccc--cceEEeecee-cCC Confidence 9999999998764 46899999999988876443 5555544321 111 No 13 >protein:vir:104088 Length: 125 # NCBI annotation: gp20 # Family: family:all:2817 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655599;genbank:gi:109392470;genbank:GeneID:4156956 Probab=99.85 E-value=9e-25 Score=152.42 Aligned_cols=116 Identities=19% Similarity=0.178 Sum_probs=99.1 Q ss_pred CcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCC------cccchHHHHHHHHHHHHHHHhCCCccc Q lcl|NC_021296. 2 AGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDA------PADVPDDVRAVVLQASRRELKNPDRVI 75 (154) Q Consensus 2 ~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~------~~~~p~~v~~Vv~~~vaR~l~nP~g~~ 75 (154) -+|||++||+++++|+|+++|.++++.+|.+||.+||. +|||+ +.+.+.+|+.|+..||+|.++||.|++ T Consensus 1 ma~A~~~Dv~~~w~r~lT~~E~~~v~~~L~~Ae~~Ir~----riP~L~~r~~a~~~~~~~v~~Vea~aV~Rv~rNPeGy~ 76 (125) T protein:vir:10 1 MAYANAQDVVTLWAKEPEPEVMELIERRLAQVERMIKR----RIPNLDLKVAADATFQADLIDIEADAVLRLVRNPEGYI 76 (125) T ss_pred CCcCCHHHHHHHhCCCCCHHHHHHHHHHHHHHHHHHHH----hCCChhhhhhcCCCccccHHHHHHHHHHHHhcCCCccc Confidence 46899999999999999999999999999999999994 78877 445677899999999999999999999 Q ss_pred eeeecceeeEee---cCCCcccCHHHHHHHHhhccCCceeeccccccccc Q lcl|NC_021296. 76 SRQMGPFNVQYS---QPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEG 122 (154) Q Consensus 76 ~etaG~fs~s~~---~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~ 122 (154) |+|.|+||.|+. .+|++|+|++|+++|..-+ ..|+|.|...----. T Consensus 77 s~T~G~Ys~~l~~~~~~g~L~it~~Ew~~Lg~~r-~s~~~~i~p~~~~~~ 125 (125) T protein:vir:10 77 SETDGAYTYQLQTDLSQGRLTILDDEWTTLGVNR-LSRMSVIAPNIVMPT 125 (125) T ss_pred ccccchhHHhhhcccccCceeeCHHHHHhhcccc-ccceeeeecccccCC Confidence 999999998764 4689999999999998744 567888876531111 No 14 >protein:vir:4228 Length: 125 # NCBI annotation: predicted 14.0Kd protein # Family: family:all:2817 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039683;swissprot:sw:q05225;genbank:gi:9625449;uniprot:Q05225;genbank:GeneID:2942926 Probab=99.82 E-value=7e-24 Score=147.53 Aligned_cols=116 Identities=18% Similarity=0.194 Sum_probs=98.7 Q ss_pred CcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCc------ccchHHHHHHHHHHHHHHHhCCCccc Q lcl|NC_021296. 2 AGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAP------ADVPDDVRAVVLQASRRELKNPDRVI 75 (154) Q Consensus 2 ~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~------~~~p~~v~~Vv~~~vaR~l~nP~g~~ 75 (154) -+|||++|+++|++|+|+++|..+++++|.+|+.+|| ++|||++ ...+..|+.|+..||+|+++||.|++ T Consensus 1 m~~A~~eDV~a~w~r~lt~~e~~~v~~~L~~Ae~~Ir----~riPdL~~r~~~~~~~~~~v~~Vea~aV~Rv~RNpeGy~ 76 (125) T protein:vir:42 1 MAYATAEDVVTLWAKEPEPEVMALIERRLQQIERMIK----RRIPDLDVKAAASATFRADLIDIEADAVLRLVRNPEGYL 76 (125) T ss_pred CCcccHhHHHHHhCCCCChHHHHHHHHHHHHHHHHHH----HhCCCchhhhcccCcchhhHHHHHHHHHHHHHhCCCccc Confidence 4689999999999999999999999999999999999 4799884 44577899999999999999999999 Q ss_pred eeeecceeeEee---cCCCcccCHHHHHHHHhhccCCceeeccccccccc Q lcl|NC_021296. 76 SRQMGPFNVQYS---QPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEG 122 (154) Q Consensus 76 ~etaG~fs~s~~---~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~ 122 (154) |+|.|+||+++. .+|.+|+|++|+++|..-. +.|+|.|..+----. T Consensus 77 s~T~G~Ys~~l~~~~~~g~L~it~eEw~~L~p~~-~~g~~~i~P~~~~~~ 125 (125) T protein:vir:42 77 SETDGAYTYQLQADLSQGKLTILDEEWEILGVNS-QKRMAVIVPNVVMPT 125 (125) T ss_pred cccchhHHHhhhcccccCceeeCHHHHHhhCccc-cccceeecccceeCC Confidence 999999999764 4689999999999998754 457887765531110 No 15 >protein:vir:108221 Length: 150 # NCBI annotation: gp11 # Family: family:all:28004 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552340;genbank:gi:160700660;genbank:GeneID:5758941 Probab=99.55 E-value=1.2e-17 Score=113.32 Aligned_cols=130 Identities=15% Similarity=0.120 Sum_probs=84.6 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCc----ccchHHHHHHHHHHHHHHHh-CCC--c Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAP----ADVPDDVRAVVLQASRRELK-NPD--R 73 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~----~~~p~~v~~Vv~~~vaR~l~-nP~--g 73 (154) |.+||+|++|+++ ||+|+.+|+.+|+.||++||++||.. ||-.. .+.+...+.|++++|+|+|. -|+ | T Consensus 4 ~~pFadv~~lea~-WrpLt~~E~~~Ae~LL~~As~~IR~~----~Pa~a~a~l~~dd~~A~~Vs~~vVk~Am~~~~e~~G 78 (150) T protein:vir:10 4 VTPFIDVSQFEAM-FRPLGDGERLLAEVLLKAAAIRIRDR----VAAAGRAPLEPDDAMAILVSFEVTRDAMPPIPEMAG 78 (150) T ss_pred CccccchhhhHhh-hcccChhHHHHHHHHHHHHHHHHhhc----ccccCCCCCCCCcchhHHHHHHHHHHhccccccccc Confidence 7799999999996 99999999999999999999999964 33322 12345689999999999995 344 4 Q ss_pred c--ceeeecceee--EeecC-CCcccCHHHHHHHHhhccCCceeeccccccccc---cccCCCceeecCCCCcCCcc-CC Q lcl|NC_021296. 74 V--ISRQMGPFNV--QYSQP-PDGFFYPAELAILKRFKRSGGLQTVSTSRGEEG---RPWAGKTAFIRYGDGLFPFC-SE 144 (154) Q Consensus 74 ~--~~etaG~fs~--s~~~s-gg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~---~~~~~~~~~v~~gg~~~p~~-~~ 144 (154) + .+.|+|+||. ||+++ +.++||..||+.| +++.+....- .-.+++.+-- .|.|.- +. T Consensus 79 ~ss~S~T~G~rses~T~snPag~L~ft~~~k~lL----------Gis~ta~P~~~~~~~df~~~~~~----~~~~~~~~~ 144 (150) T protein:vir:10 79 RTQYSITTDDRTEQATMATAAGLLDFNERHWSLL----------GISATAGPEYGGMGGDFGQLGRA----NPYPIVIGS 144 (150) T ss_pred cchhhhccccccccccccchhhhhhhhHHHHHHh----------CCCccCCccccCCCcchhhhcCC----CCcceEecC Confidence 4 4568999976 57775 7899999998554 2232221110 0001111111 233322 11 Q ss_pred C-CCcC Q lcl|NC_021296. 145 D-DGYG 149 (154) Q Consensus 145 ~-~g~~ 149 (154) + |-.| T Consensus 145 ~~~~~~ 150 (150) T protein:vir:10 145 DADWLG 150 (150) T ss_pred CccccC Confidence 1 1122 No 16 >protein:vir:101652 Length: 188 # NCBI annotation: gp15 # Family: family:all:11114 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654770;genbank:gi:109302768;genbank:GeneID:4156086 Probab=98.69 E-value=1.9e-10 Score=73.90 Aligned_cols=103 Identities=24% Similarity=0.354 Sum_probs=79.9 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCC-------------------------------- Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDA-------------------------------- 48 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~-------------------------------- 48 (154) |. | .++|++.+ +|+++ .|+..|+.||+++|.+||..+.-. T Consensus 1 ~~-~--~~~la~~~--~~da~---~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~LP~~Pvv~i~~Ve~~~~ 72 (188) T protein:vir:10 1 MT-F--AQQLADAF--PEDAD---DAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSLLPSIPVVEISKVEGYLP 72 (188) T ss_pred Cc-h--hhhHHHhc--CCCcc---hHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCceeeccCcceeeeEEEEEee Confidence 42 3 34776653 35554 444569999999999987654300 Q ss_pred ------------------------------------------c-----------ccchHHHHHHHHHHHHHHHhCCCccc Q lcl|NC_021296. 49 ------------------------------------------P-----------ADVPDDVRAVVLQASRRELKNPDRVI 75 (154) Q Consensus 49 ------------------------------------------~-----------~~~p~~v~~Vv~~~vaR~l~nP~g~~ 75 (154) | .++|++|..++|++++|++.||..+. T Consensus 73 ~G~~~~~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHGy~evP~eiv~lv~d~A~~~~~np~~L~ 152 (188) T protein:vir:10 73 TGNGMDWVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHGYNPVPDELIDVAIRLAREYQSNPELLV 152 (188) T ss_pred CCcccccccccccccccceeeecccccCcccccccccccccCcceEEEEEecCCCcccHHHHHHHHHHHHHHhcCcccce Confidence 1 14578899999999999999999999 Q ss_pred eeeecceeeEeecCCCcccCHHHHHHHHhhccCCce Q lcl|NC_021296. 76 SRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGL 111 (154) Q Consensus 76 ~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~ 111 (154) |++.|+||.+|...++.-+++.++++|+||....-. T Consensus 153 q~~vG~~S~tfa~~~~~sl~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:10 153 SKQVGEIERRFGSVAGTSLSKADQAILDRYVIATLA 188 (188) T ss_pred eeecCceeeecccccCCcccchhHHhhccccccccC Confidence 999999999999878878999999999999864433 No 17 >protein:vir:7857 Length: 188 # NCBI annotation: gp14 # Family: family:all:11114 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817464;genbank:gi:29565893;genbank:GeneID:1259086 Probab=98.69 E-value=1.9e-10 Score=73.90 Aligned_cols=103 Identities=24% Similarity=0.354 Sum_probs=79.9 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCC-------------------------------- Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDA-------------------------------- 48 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~-------------------------------- 48 (154) |. | .++|++.+ +|+++ .|+..|+.||+++|.+||..+.-. T Consensus 1 ~~-~--~~~la~~~--~~da~---~v~lAL~~As~~VR~~~g~~i~~V~~dt~~id~~Gg~~~LP~~Pvv~i~~Ve~~~~ 72 (188) T protein:vir:78 1 MT-F--AQQLADAF--PEDAD---DAATALSWAKSQVEGYCGRKFDLVEDDVAIVDPYCGSSLLPSIPVVEISKVEGYLP 72 (188) T ss_pred Cc-h--hhhHHHhc--CCCcc---hHHHHHHHHHHHHHHHhCCcceeeeeeeeeeecCCCceeeccCcceeeeEEEEEee Confidence 42 3 34776653 35554 444569999999999987654300 Q ss_pred ------------------------------------------c-----------ccchHHHHHHHHHHHHHHHhCCCccc Q lcl|NC_021296. 49 ------------------------------------------P-----------ADVPDDVRAVVLQASRRELKNPDRVI 75 (154) Q Consensus 49 ------------------------------------------~-----------~~~p~~v~~Vv~~~vaR~l~nP~g~~ 75 (154) | .++|++|..++|++++|++.||..+. T Consensus 73 ~G~~~~~v~~r~y~~~g~~~~l~~trg~pg~~~~r~~~WPw~p~~VtVTytHGy~evP~eiv~lv~d~A~~~~~np~~L~ 152 (188) T protein:vir:78 73 TGNGMDWVELTNYWFKRDTGLIFDTTGLPGSEWSTGHTWPWLPGSLRVTYTHGYNPVPDELIDVAIRLAREYQSNPELLV 152 (188) T ss_pred CCcccccccccccccccceeeecccccCcccccccccccccCcceEEEEEecCCCcccHHHHHHHHHHHHHHhcCcccce Confidence 1 14578899999999999999999999 Q ss_pred eeeecceeeEeecCCCcccCHHHHHHHHhhccCCce Q lcl|NC_021296. 76 SRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGL 111 (154) Q Consensus 76 ~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~ 111 (154) |++.|+||.+|...++.-+++.++++|+||....-. T Consensus 153 q~~vG~~S~tfa~~~~~sl~~~~~~il~ry~l~~~~ 188 (188) T protein:vir:78 153 SKQVGEIERRFGSVAGTSLSKADQAILDRYVIATLA 188 (188) T ss_pred eeecCceeeecccccCCcccchhHHhhccccccccC Confidence 999999999999878878999999999999864433 No 18 >protein:vir:99922 Length: 165 # NCBI annotation: gp9 # Family: family:all:7267 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655526;genbank:gi:109392296;genbank:GeneID:4157091 Probab=98.43 E-value=3e-09 Score=67.27 Aligned_cols=139 Identities=17% Similarity=0.226 Sum_probs=94.9 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCc-cceeee Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDR-VISRQM 79 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g-~~~eta 79 (154) =+..-+.+||.-+ ...+.++|+++++++.++++..+-= .-+.+..-++..|.|...++.|-=..-.| ++|+|| T Consensus 9 p~~ii~~eDl~Pf-----~~i~~~ka~~mI~da~A~A~~vAPC-i~~~~f~~~~aAKaIlrgAiLRW~e~GSGAit~~Ta 82 (165) T protein:vir:99 9 PEPLLTAEDLAPF-----ATIPKAKADEMIEDALGMAEVHAPC-INDPGFAHRRAAKAILRGAILRWNEAGAGAATTKTA 82 (165) T ss_pred cceeeehhhcccc-----ccCCHHHHHHHHhhhhhhhhhhccc-cCCCCcccHHHHHHHHHHhhhhhhcccCceeeeccc Confidence 2345567777543 3344579999999999999986532 23344456889999999998887665555 466899 Q ss_pred cceeeEee--cCCCcccCHHHHHHHHhhc----cCCceeeccccccccccccCCC-----------ceeecCCCCcCCcc Q lcl|NC_021296. 80 GPFNVQYS--QPPDGFFYPAELAILKRFK----RSGGLQTVSTSRGEEGRPWAGK-----------TAFIRYGDGLFPFC 142 (154) Q Consensus 80 G~fs~s~~--~sgg~~lt~aE~~~Lrr~r----~~~g~~sV~~~r~~~~~~~~~~-----------~~~v~~gg~~~p~~ 142 (154) |||.+|+. ++....|.++|...|+++= ..+|+|+|.+.--. ..+++. |+-|+.+| .|+| T Consensus 83 GPf~qT~DtRs~r~~mfwPSEItqLqklC~~~g~~~~AFsIDt~p~g--~v~Hs~~Cs~~fGg~CSCGavl~~~--gplw 158 (165) T protein:vir:99 83 GIYGQTVDTRQPRKAMFFPSEIDQLRKLCRPDDDNGGAFSIDLLPQE--TVTHAEICSIYFGGGCSCGAILTQG--LPLY 158 (165) T ss_pred ccceeeeccccccccccChhhHHHHHHHhcCCCCCCcceeeecccCC--CcccccccceeecCcccchhhhccC--Cccc Confidence 99999984 3456678889999999984 33689999987421 111111 11233222 6889 Q ss_pred CCCCCcC Q lcl|NC_021296. 143 SEDDGYG 149 (154) Q Consensus 143 ~~~~g~~ 149 (154) +...|+- T Consensus 159 e~~~~~~ 165 (165) T protein:vir:99 159 EKNNGWA 165 (165) T ss_pred cccCCCC Confidence 8877666 No 19 >protein:vir:8189 Length: 151 # NCBI annotation: gp9 # Family: family:all:7267 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817982;genbank:gi:29566416;genbank:GeneID:2700970 Probab=98.33 E-value=4e-09 Score=66.65 Aligned_cols=135 Identities=19% Similarity=0.265 Sum_probs=91.1 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCc-cceeee Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDR-VISRQM 79 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g-~~~eta 79 (154) |...-+++||.- +-.+...|..++++|-++++..+-=.-.|.+..-++..|.|...++.|-=..-.| .+|+|| T Consensus 1 m~~iik~eDL~~------~~i~e~~a~~mI~da~a~A~~vAPCi~~dp~f~~~~aAKaIlrgAiLRW~e~GSGait~~ta 74 (151) T protein:vir:81 1 MTEIIKAADLPD------DIAANAMAAVWVDGANARASRVAPCLAADPSDDQLAEAKLILIGAVMRWSQAGSGALQSQTM 74 (151) T ss_pred CccccccccCCc------cccchhhHHHHhhcchhhhhhhcccccCCCCccchHHHHHHHHHhhhhhhcccCceeeeccc Confidence 888888888832 2245577888999999988876532222333445788999999998887655555 467899 Q ss_pred cceeeEee--cCCCcccCHHHHHHHHhh---ccCCceeeccccccccccccCCCceeecCC----------CCcCCccCC Q lcl|NC_021296. 80 GPFNVQYS--QPPDGFFYPAELAILKRF---KRSGGLQTVSTSRGEEGRPWAGKTAFIRYG----------DGLFPFCSE 144 (154) Q Consensus 80 G~fs~s~~--~sgg~~lt~aE~~~Lrr~---r~~~g~~sV~~~r~~~~~~~~~~~~~v~~g----------g~~~p~~~~ 144 (154) |||.+||- ++....|.++|...|+++ +..+++|+|.+.+-.. +++..-.|-.| |+ |.|+. T Consensus 75 Gp~~qT~DTRs~r~~~fwPSEI~qLqklC~~~~~g~AFsIdt~p~~~---~Hs~~Cs~~fGg~CSCGa~l~g~--Pl~e~ 149 (151) T protein:vir:81 75 GPYGVTFDTRQRGGFNLWPSEITQLQDICKNGAESKAFAVDTVACGN---YHSPICSVYFGGTCSCGAVLAGQ--PIYEQ 149 (151) T ss_pred cccccccccccCCCcccChhhHHHHHHHhccCCCCcceEEeecccCC---ccccchheeecCccccccccccC--ccccc Confidence 99999973 455666688999999988 3456799999776432 12111111111 45 77877 Q ss_pred CCCcCCccC Q lcl|NC_021296. 145 DDGYGDVVP 153 (154) Q Consensus 145 ~~g~~~~~~ 153 (154) + | T Consensus 150 ~-------~ 151 (151) T protein:vir:81 150 E-------P 151 (151) T ss_pred C-------C Confidence 6 4 No 20 >protein:vir:80668 Length: 153 # NCBI annotation: gp7 # Family: family:all:7267 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285583;genbank:gi:148727089;genbank:GeneID:5247039 Probab=98.32 E-value=9e-09 Score=64.70 Aligned_cols=138 Identities=18% Similarity=0.232 Sum_probs=91.6 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCC-Cc-cceee Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNP-DR-VISRQ 78 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP-~g-~~~et 78 (154) |.---+.+||.-+- .++ ..++++++.|+.++++..+-= .-+.+..-++..|.|...|+.|-=..- .| .+|+| T Consensus 1 m~v~i~~~Dl~pF~--dI~---~~k~~ami~D~~a~A~~vAPC-i~~~~f~~~~aAKaIlrgAiLRW~e~G~SGait~~t 74 (153) T protein:vir:80 1 MGIILKPEDIEPFA--DIP---REKLEAMIADVEAVAVSVAPC-IAKPDFKYKDAAKAILRRALLRWNDTGVSGQVQYES 74 (153) T ss_pred Cceeechhhccccc--cCC---HHHHHHHHHhhhhhhhhhccc-cCCCCcccHHHHHHHHHHHhhhhhhcCcccceeeec Confidence 99888999997652 344 478999999999999986522 224445568899999999988875544 44 57789 Q ss_pred ecceeeEee-cCCCcccCHHHHHHHHhhc----cCCceeecccc-ccccc------cccCCCceeec-CCCCcCCccCC Q lcl|NC_021296. 79 MGPFNVQYS-QPPDGFFYPAELAILKRFK----RSGGLQTVSTS-RGEEG------RPWAGKTAFIR-YGDGLFPFCSE 144 (154) Q Consensus 79 aG~fs~s~~-~sgg~~lt~aE~~~Lrr~r----~~~g~~sV~~~-r~~~~------~~~~~~~~~v~-~gg~~~p~~~~ 144 (154) ||||.+|+. .++-..|.++|...|+++= ..+++|+|.++ ++.-- .++-+.|.--. .-|+-.|.|+- T Consensus 75 aGpf~qT~dtrs~r~lfwPSEItqLqklC~~~~~~g~Af~id~t~~~~v~Hs~~Cs~~fGg~CSCGa~l~g~~gplwe~ 153 (153) T protein:vir:80 75 AGPFAQTTRSNTPTNLLWPSEIAALKKLCEGDGGAGKAFTITPTMRSSVNHSEVCSTVWGEGCSCGSDINGYAGPLWEI 153 (153) T ss_pred cccceeeeccCCceeccChhhHHHHHHHhcCCCCCcceeEeecCCCCccccccccceeecCccccchhhcccCcccccC Confidence 999999874 3455677889999999984 33469999976 33211 01011111111 02344566666 No 21 >protein:vir:81255 Length: 180 # NCBI annotation: gp8 # Family: family:all:3238 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456738;genbank:gi:157168381;uniprot:Q9MBJ7;genbank:GeneID:5580378 Probab=97.98 E-value=9.1e-08 Score=59.18 Aligned_cols=105 Identities=18% Similarity=0.160 Sum_probs=79.8 Q ss_pred CC--cCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCC-C----------------cc----------- Q lcl|NC_021296. 1 MA--GLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPD-A----------------PA----------- 50 (154) Q Consensus 1 M~--~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d-~----------------~~----------- 50 (154) |. +..|++|+-+..+..+..+ ..++.+|+.|++.+|.|||.+... . |. T Consensus 1 ~~~p~~l~p~~~~~~~~g~~~~~--~~~q~~l~aA~aavRr~cGwhv~pv~~~t~~ldg~G~~~l~LPt~~vvsV~sV~~ 78 (180) T protein:vir:81 1 MQPPHGLTPEILRTYPGGHLLSK--DLTQEHVDAVVATVRKLCGWHVFPVATTEYSFPWRGDPEFLVPTKRLVSVESVTC 78 (180) T ss_pred CCCCccCCcchhhhhhccccCCc--hhhHHHHHHHHHHHHHHhCCcccceeeeEEEEecCCCeeEeCCCCcceeeeeEEE Confidence 65 8999999988776655443 345889999999999999876421 1 11 Q ss_pred --------------------------------------------cchHHHHHHHHHHHHHHHhCC-CccceeeecceeeE Q lcl|NC_021296. 51 --------------------------------------------DVPDDVRAVVLQASRRELKNP-DRVISRQMGPFNVQ 85 (154) Q Consensus 51 --------------------------------------------~~p~~v~~Vv~~~vaR~l~nP-~g~~~etaG~fs~s 85 (154) .+| .+.+|++.|++|+.+.+ .+..|++.|++| T Consensus 79 dG~~v~~~~~~~~~~~~~G~l~r~~G~~~rg~~~V~Vt~~hGye~vP-~~~aVi~~~a~ra~~s~~~~v~~~tvG~~S-- 155 (180) T protein:vir:81 79 GDLSIPNEDIVFYPYGEVNLLRRVHGTPWRVARPMTVTMTHGYEDAP-GLVGVIAQMLTRAFTSTGGGDGNLTVGNMS-- 155 (180) T ss_pred CCeeeCCccceecccCCCCeeEecCCccccccceEEEEEEeCCCCCc-hHHHHHHHHHHHhccccccccccceeccee-- Confidence 012 24689999999999876 456788999877 Q ss_pred eecCCCcccCHHHHHHHHhhccCCc Q lcl|NC_021296. 86 YSQPPDGFFYPAELAILKRFKRSGG 110 (154) Q Consensus 86 ~~~sgg~~lt~aE~~~Lrr~r~~~g 110 (154) |..+++.-|+++|+++|.|||...= T Consensus 156 ~~~~~~~~~~~~e~aiLdrYrl~~~ 180 (180) T protein:vir:81 156 YGLSTGITPKSSEWLIIDQYRLHPV 180 (180) T ss_pred eccccCCCccHHHHHHHHhhhccCC Confidence 4677888899999999999997653 No 22 >protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834 Probab=97.87 E-value=3e-07 Score=56.38 Aligned_cols=126 Identities=12% Similarity=0.033 Sum_probs=77.2 Q ss_pred CCcCCCHHHHHHHhcCC----CC--------HHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHH Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQT----FE--------GDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRREL 68 (154) Q Consensus 1 M~~~ATvdDl~arlgr~----L~--------~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l 68 (154) |. |||.+||.+|+|.. |+ .-+.++.+..|++||+.|.+|.+++..=+-.++|..++.+||+++.=.| T Consensus 1 M~-YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~lPl~~vP~~L~~~a~dIA~Y~L 79 (138) T protein:vir:79 1 MS-YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANL 79 (138) T ss_pred CC-CCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHH Confidence 98 99999999998743 22 2245678899999999999999988753335678899999999987777 Q ss_pred hCCCccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccccCCCceeecCCCCcCCccCCCC Q lcl|NC_021296. 69 KNPDRVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRPWAGKTAFIRYGDGLFPFCSEDD 146 (154) Q Consensus 69 ~nP~g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~~~~~~~~v~~gg~~~p~~~~~~ 146 (154) .+-... .+.. ...| ++-++.|+.-+. |.+++.......... .++..-+. . +-+.|+.|| T Consensus 80 ~~~~~~-~e~i---~~rY---------~~Ai~~L~~Ia~--Gk~~Lg~~~~~~~~~-~~~~~~~~-~--~~r~F~Rd~ 138 (138) T protein:vir:79 80 HIVLKE-ENPV---YKTA---------EHLRKLLSGIAN--GKLSLALDADGKPAP-VANTVQIS-E--GRNDWGADW 138 (138) T ss_pred hcCCCC-cHHH---HHHH---------HHHHHHHHHHhc--CcccCCCCCCCcCCC-CCCceeee-c--CCCCCCCCC Confidence 542211 0100 0001 223444555444 666665443222222 22322232 2 226889998 No 23 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=97.87 E-value=3e-07 Score=56.38 Aligned_cols=126 Identities=12% Similarity=0.033 Sum_probs=77.2 Q ss_pred CCcCCCHHHHHHHhcCC----CC--------HHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHH Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQT----FE--------GDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRREL 68 (154) Q Consensus 1 M~~~ATvdDl~arlgr~----L~--------~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l 68 (154) |. |||.+||.+|+|.. |+ .-+.++.+..|++||+.|.+|.+++..=+-.++|..++.+||+++.=.| T Consensus 1 M~-YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~lPl~~vP~~L~~~a~dIA~Y~L 79 (138) T protein:vir:99 1 MS-YCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANL 79 (138) T ss_pred CC-CCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHH Confidence 98 99999999998743 22 2245678899999999999999988753335678899999999987777 Q ss_pred hCCCccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccccCCCceeecCCCCcCCccCCCC Q lcl|NC_021296. 69 KNPDRVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRPWAGKTAFIRYGDGLFPFCSEDD 146 (154) Q Consensus 69 ~nP~g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~~~~~~~~v~~gg~~~p~~~~~~ 146 (154) .+-... .+.. ...| ++-++.|+.-+. |.+++.......... .++..-+. . +-+.|+.|| T Consensus 80 ~~~~~~-~e~i---~~rY---------~~Ai~~L~~Ia~--Gk~~Lg~~~~~~~~~-~~~~~~~~-~--~~r~F~Rd~ 138 (138) T protein:vir:99 80 HIVLKE-ENPV---YKTA---------EHLRKLLSGIAN--GKLSLALDADGKPAP-VANTVQIS-E--GRNDWGADW 138 (138) T ss_pred hcCCCC-cHHH---HHHH---------HHHHHHHHHHhc--CcccCCCCCCCcCCC-CCCceeee-c--CCCCCCCCC Confidence 542211 0100 0001 223444555444 666665443222222 22322232 2 226889998 No 24 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=97.85 E-value=2.8e-07 Score=56.49 Aligned_cols=126 Identities=10% Similarity=0.026 Sum_probs=77.0 Q ss_pred CCcCCCHHHHHHHhcCC----CC--------HHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHH Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQT----FE--------GDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRREL 68 (154) Q Consensus 1 M~~~ATvdDl~arlgr~----L~--------~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l 68 (154) |. |||.+||.+|+|.. |+ .-+.++.+..|++||+.|.+|.+++..=+-.++|..++.+||+++.=.| T Consensus 1 M~-Y~T~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~~eIdgyL~~RY~lPl~~vP~~L~~~a~dIA~Y~L 79 (138) T protein:vir:10 1 MS-YCTQADLVEQYGEASIRQLSDRVNKPATTIDPAVVAQAIADADAEIDLHLHARYQLPLAQVPVVLKRVACVLAFANL 79 (138) T ss_pred CC-cCCHHHHHHhcCHHHHHHHhcccCCCcCccCHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHH Confidence 98 99999999997654 22 2245678999999999999999888664345678899999999987777 Q ss_pred hCCCccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccccCCCceeecCCCCcCCccCCCC Q lcl|NC_021296. 69 KNPDRVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRPWAGKTAFIRYGDGLFPFCSEDD 146 (154) Q Consensus 69 ~nP~g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~~~~~~~~v~~gg~~~p~~~~~~ 146 (154) .+-... ++.. ...| ++-++.|+..+. |.+++...-.+...+ .++..-|. . +-.+|+.|| T Consensus 80 ~~~~~~-~e~~---~~rY---------~~Ai~~L~~Ia~--G~~~Lg~~~~~~~~~-~~~~~~~~-s--~~r~Fg~d~ 138 (138) T protein:vir:10 80 HTQVKD-DHPA---ILDA---------ERKRKLLGGISS--GKLSLALTSSGTPAP-IANTVQIS-S--QRNDFGGTW 138 (138) T ss_pred hcCCCC-ChHH---HHHH---------HHHHHHHHHHhc--CcccCCCCCCcccCC-CCCceeee-c--CCccCCCCC Confidence 432111 1100 0011 233444555444 666665443322222 22222222 1 234788887 No 25 >protein:vir:105823 Length: 189 # NCBI annotation: gp7 # Family: family:all:6971 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655768;genbank:gi:109522091;genbank:GeneID:4157631 Probab=97.75 E-value=2.8e-07 Score=56.48 Aligned_cols=116 Identities=27% Similarity=0.351 Sum_probs=84.8 Q ss_pred CcCCCHHHHHHHhcCC----CCHHHHHHHHHHHHHHHHHHHHhhCCCCCCC---------------ccc----------- Q lcl|NC_021296. 2 AGLASIQDLQTLMSQT----FEGDELEQAQLVLDIVSSWARVVSGRAWPDA---------------PAD----------- 51 (154) Q Consensus 2 ~~~ATvdDl~arlgr~----L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~---------------~~~----------- 51 (154) --+||++|+.+.||-+ |+++++.|.+.+|+.+|+...-++||..... |++ T Consensus 1 m~las~~dva~algl~~~~~lt~~q~~rv~g~l~rvs~~fqr~~gr~~t~ga~~vra~~v~grv~lp~~~~~~~~vt~~~ 80 (189) T protein:vir:10 1 MLLATADDVAAALGLPSAAALTPEQSSRVDGVLGRVSDTFQRVTGRVFTTGATQVRAQVVNGRVWLPGVVDEVEAVTLTG 80 (189) T ss_pred CcccchhhHHHhhCCcchhhcChhhhhHHHHHHHHHHHHHhhhhcceeeccceEEEEEEeeeeEecCCCcccceeeeecC Confidence 3479999999999976 8889999999999999999988877642211 111 Q ss_pred ------------------------------------chHHHHHHHHHHHHHHHh-CCCccce----eeecceeeEee--c Q lcl|NC_021296. 52 ------------------------------------VPDDVRAVVLQASRRELK-NPDRVIS----RQMGPFNVQYS--Q 88 (154) Q Consensus 52 ------------------------------------~p~~v~~Vv~~~vaR~l~-nP~g~~~----etaG~fs~s~~--~ 88 (154) .|+.+..-+.+|++|-|+ .|....| -|||+|..... . T Consensus 81 g~~~~~~~~g~yvdvtrngc~~~tg~i~ivey~~~~~p~~~~~~vaa~~arhltv~pgs~~s~~~~ltag~f~qr~a~wv 160 (189) T protein:vir:10 81 GEEVDFNQDGNYVDVTRNGCSLVTGTVVIVEYVGGGVPDSVTEFVAAVAARHLTVTPGSVSSQAVSLTAGPFTQRNAEWV 160 (189) T ss_pred CceeeeeecCcEEEeecCCcceeeccEEEEEecCCCCchHHHHHHHHHHhhceeecCCCcccceeeeccchhhhhhhhhh Confidence 144556667889999996 5665543 48999988765 4 Q ss_pred CCCcccCHHHHHHHHhhccCCceeecccc Q lcl|NC_021296. 89 PPDGFFYPAELAILKRFKRSGGLQTVSTS 117 (154) Q Consensus 89 sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~ 117 (154) |+.-.||++|++.-|+|+-..-...|--. T Consensus 161 s~t~~ft~~el~~a~~~~~p~p~i~ihrl 189 (189) T protein:vir:10 161 SGTAVFTRDELEDAKRFANPAPTITIHRL 189 (189) T ss_pred cccceechhhHHHHhhhcCCCCceEEeeC Confidence 78999999999999999864422222111 No 26 >protein:vir:102606 Length: 189 # NCBI annotation: gp7 # Family: family:all:6971 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655003;genbank:gi:109392193;genbank:GeneID:4157228 Probab=97.75 E-value=2.8e-07 Score=56.48 Aligned_cols=116 Identities=27% Similarity=0.351 Sum_probs=84.8 Q ss_pred CcCCCHHHHHHHhcCC----CCHHHHHHHHHHHHHHHHHHHHhhCCCCCCC---------------ccc----------- Q lcl|NC_021296. 2 AGLASIQDLQTLMSQT----FEGDELEQAQLVLDIVSSWARVVSGRAWPDA---------------PAD----------- 51 (154) Q Consensus 2 ~~~ATvdDl~arlgr~----L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~---------------~~~----------- 51 (154) --+||++|+.+.||-+ |+++++.|.+.+|+.+|+...-++||..... |++ T Consensus 1 m~las~~dva~algl~~~~~lt~~q~~rv~g~l~rvs~~fqr~~gr~~t~ga~~vra~~v~grv~lp~~~~~~~~vt~~~ 80 (189) T protein:vir:10 1 MLLATADDVAAALGLPSAAALTPEQSSRVDGVLGRVSDTFQRVTGRVFTTGATQVRAQVVNGRVWLPGVVDEVEAVTLTG 80 (189) T ss_pred CcccchhhHHHhhCCcchhhcChhhhhHHHHHHHHHHHHHhhhhcceeeccceEEEEEEeeeeEecCCCcccceeeeecC Confidence 3479999999999976 8889999999999999999988877642211 111 Q ss_pred ------------------------------------chHHHHHHHHHHHHHHHh-CCCccce----eeecceeeEee--c Q lcl|NC_021296. 52 ------------------------------------VPDDVRAVVLQASRRELK-NPDRVIS----RQMGPFNVQYS--Q 88 (154) Q Consensus 52 ------------------------------------~p~~v~~Vv~~~vaR~l~-nP~g~~~----etaG~fs~s~~--~ 88 (154) .|+.+..-+.+|++|-|+ .|....| -|||+|..... . T Consensus 81 g~~~~~~~~g~yvdvtrngc~~~tg~i~ivey~~~~~p~~~~~~vaa~~arhltv~pgs~~s~~~~ltag~f~qr~a~wv 160 (189) T protein:vir:10 81 GEEVDFNQDGNYVDVTRNGCSLVTGTVVIVEYVGGGVPDSVTEFVAAVAARHLTVTPGSVSSQAVSLTAGPFTQRNAEWV 160 (189) T ss_pred CceeeeeecCcEEEeecCCcceeeccEEEEEecCCCCchHHHHHHHHHHhhceeecCCCcccceeeeccchhhhhhhhhh Confidence 144556667889999996 5665543 48999988765 4 Q ss_pred CCCcccCHHHHHHHHhhccCCceeecccc Q lcl|NC_021296. 89 PPDGFFYPAELAILKRFKRSGGLQTVSTS 117 (154) Q Consensus 89 sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~ 117 (154) |+.-.||++|++.-|+|+-..-...|--. T Consensus 161 s~t~~ft~~el~~a~~~~~p~p~i~ihrl 189 (189) T protein:vir:10 161 SGTAVFTRDELEDAKRFANPAPTITIHRL 189 (189) T ss_pred cccceechhhHHHHhhhcCCCCceEEeeC Confidence 78999999999999999864422222111 No 27 >protein:vir:7991 Length: 189 # NCBI annotation: gp7 # Family: family:all:6971 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817345;genbank:gi:29565773;genbank:GeneID:1258986 Probab=97.72 E-value=3.4e-07 Score=56.03 Aligned_cols=116 Identities=26% Similarity=0.330 Sum_probs=84.8 Q ss_pred CcCCCHHHHHHHhcCC----CCHHHHHHHHHHHHHHHHHHHHhhCCCCC------------------CCcc--------- Q lcl|NC_021296. 2 AGLASIQDLQTLMSQT----FEGDELEQAQLVLDIVSSWARVVSGRAWP------------------DAPA--------- 50 (154) Q Consensus 2 ~~~ATvdDl~arlgr~----L~~~E~~~A~~lL~~aS~lir~~~~~~~~------------------d~~~--------- 50 (154) --+||++|+.+.||-+ |+++++.|.+.+|+.+|+...-++||... +.+. T Consensus 1 m~las~~dva~algl~~~~~lt~~q~~rv~g~l~rvs~~fqr~~gr~~t~ga~~vra~~v~grv~lp~~~~~~~~vt~~~ 80 (189) T protein:vir:79 1 MLLATADDVAAALGLPSAAALTPEQSSRVDGVLGRVSDTFQRVTGRVFTTGATRVRAQVVNGRVWLPGVVDEVEAVTLTG 80 (189) T ss_pred CcccchhhHHHHcCCcchhhcChhhhhHHHHHHHHHHHHHhhhhcceeeccceEEEEEEeeeeEEcCCCcccceeeeecC Confidence 3479999999999976 88899999999999999999888776322 1111 Q ss_pred -----------------------------------cchHHHHHHHHHHHHHHHh-CCCccce----eeecceeeEee--c Q lcl|NC_021296. 51 -----------------------------------DVPDDVRAVVLQASRRELK-NPDRVIS----RQMGPFNVQYS--Q 88 (154) Q Consensus 51 -----------------------------------~~p~~v~~Vv~~~vaR~l~-nP~g~~~----etaG~fs~s~~--~ 88 (154) ..|+.+..-+.+|++|-|+ .|....| -|||+|..... . T Consensus 81 g~~~~~~~~g~yvdvtrngc~~~tg~i~ivey~~~~~p~~~~~~vaa~~arhltv~pgs~~s~~~~ltag~f~qr~a~wv 160 (189) T protein:vir:79 81 GEEVDFNQDGNYVDVTRNGCSLVTGTVVIVEYVGGGVPDSVTEFVAAVAARHLTVTPGSVSSQAVSLTAGPFTQRNAEWV 160 (189) T ss_pred CceeeeeecCcEEEeecCCcceeeccEEEEEecCCCCchHHHHHHHHHHhhceeecCCCcccceeeeccchhhhhhhhhh Confidence 0144556667889999996 5665543 48999988765 4 Q ss_pred CCCcccCHHHHHHHHhhccCCceeecccc Q lcl|NC_021296. 89 PPDGFFYPAELAILKRFKRSGGLQTVSTS 117 (154) Q Consensus 89 sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~ 117 (154) |+.-.||++|++.-|+|+-..-...|--. T Consensus 161 s~t~~ft~~el~~a~~~~~p~p~i~ihrl 189 (189) T protein:vir:79 161 SGTAVFTRDELEDAKRFANPAPTITIHRL 189 (189) T ss_pred cccceechhhHHHHhhhcCCCCceEEecC Confidence 78999999999999999864422222111 No 28 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=97.46 E-value=1.8e-06 Score=52.04 Aligned_cols=130 Identities=15% Similarity=0.154 Sum_probs=76.1 Q ss_pred CCcCCCHHHHHHHhcCC----CCH-------HHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQT----FEG-------DELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK 69 (154) Q Consensus 1 M~~~ATvdDl~arlgr~----L~~-------~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~ 69 (154) |. |||.+||.+++|.. |+. -..++++..|++||+.|.+|.+++..=+-.++|..++.+||+++.=.|. T Consensus 1 M~-Y~T~~Dl~~~~ge~~l~~Lt~d~~~~g~~d~~~i~~Al~dA~~eIdgyL~~RY~lPl~~~P~~L~~~a~dIA~Y~L~ 79 (141) T protein:vir:19 1 MN-YATVNDLCARYTRTRLDILTRPKTADGQPDDAVAEQALADASAFIDGYLAARFVLPLTVVPSLLKRQCCVVAWFYLN 79 (141) T ss_pred CC-cCCHHHHHHhcCHHHHHHHhcCCCCccccCHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHh Confidence 88 99999999998742 331 1346788999999999999999887643456788999999999877664 Q ss_pred CCCccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccccCCCceeecCCCCcCCccCCCCCcC Q lcl|NC_021296. 70 NPDRVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRPWAGKTAFIRYGDGLFPFCSEDDGYG 149 (154) Q Consensus 70 nP~g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~~~~~~~~v~~gg~~~p~~~~~~g~~ 149 (154) +-. .++.. ...| ++=++.|++.+. |..++.........+..++...+..+ -++|+.+ .=| T Consensus 80 ~~~--~~e~i---~~rY---------~~Ai~~L~~Ia~--Gk~~Lg~~~~~~~~~~~~~~~~~~~~---~r~f~r~-~~G 139 (141) T protein:vir:19 80 ESQ--PTEQI---TATY---------RDTVRWLEQVRD--GKTDPGVESRTAASPEGEDLVQVQSD---PPVFSRK-QKG 139 (141) T ss_pred cCC--CChHH---HHHH---------HHHHHHHHHHhc--CccccCCCCCCCCCCCCCceeEeecC---CcccCcc-ccc Confidence 321 01100 0011 222334444444 66666533222222223333344322 2677774 333 Q ss_pred Ccc Q lcl|NC_021296. 150 DVV 152 (154) Q Consensus 150 ~~~ 152 (154) .+ T Consensus 140 -~~ 141 (141) T protein:vir:19 140 -FI 141 (141) T ss_pred -CC Confidence 22 No 29 >protein:vir:99848 Length: 172 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164077;genbank:gi:56692609;genbank:GeneID:3192576 Probab=97.35 E-value=3.9e-06 Score=50.25 Aligned_cols=129 Identities=14% Similarity=0.065 Sum_probs=75.3 Q ss_pred CCcCCCHHHHHHHhcCC----CCH---------------------------------HHHHHHHHHHHHHHHHHHHhhCC Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQT----FEG---------------------------------DELEQAQLVLDIVSSWARVVSGR 43 (154) Q Consensus 1 M~~~ATvdDl~arlgr~----L~~---------------------------------~E~~~A~~lL~~aS~lir~~~~~ 43 (154) |.-|||++||.+|+|.. |+. -..++++..|++||+.|.+|... T Consensus 1 ~~mYaT~~dl~~r~g~~el~qLt~~~~~~~~~~~l~d~~~~~~~~~~~~~~~~~~g~~d~~~i~~Al~dA~aeIDgYL~~ 80 (172) T protein:vir:99 1 MAVYITLPELAERPGAEELSQAATPRPLQAVDSELLDALLRGLPVDRWTPEEIEVGHATVEVINSAVSDAQGYIDGFLQR 80 (172) T ss_pred CcccccHHHHHhhcCHHHHHHHhccccccCCHHHHHHHhhcchhhhhcccccccccccCHHHHHHHHHHHHHHHHHHHhc Confidence 99999999999998732 111 12467888999999999999977 Q ss_pred C-CCCCcccchHHHHHHHHHHHHHHH-hC-CCcc-ceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeecccccc Q lcl|NC_021296. 44 A-WPDAPADVPDDVRAVVLQASRREL-KN-PDRV-ISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRG 119 (154) Q Consensus 44 ~-~~d~~~~~p~~v~~Vv~~~vaR~l-~n-P~g~-~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~ 119 (154) + ..-+=.++|..++.+||+++.=.| .+ |.+. .++. -...| ++=++.|+..+. |.+++...-. T Consensus 81 R~Y~lPL~~vP~~L~~~a~dIArY~L~~~r~~~~~~~e~---v~~rY---------~~Ai~~L~~Ia~--Gk~~Lg~~~~ 146 (172) T protein:vir:99 81 RGYSLPLAKRYGVVTGWTRAIARYLLHQDRLGPGAEKDP---IVRDY---------RDALKFLQLIAE--GKFSLGPDDP 146 (172) T ss_pred ccccCCCcccchHHHHHHHHHHHHHHHhccCCcccCCHH---HHHHH---------HHHHHHHHHHhc--CccccCCCCC Confidence 6 443235688999999999998444 33 2211 1111 00111 222334444443 6666654322 Q ss_pred ccccccCCCceeecCCCCcCCccCCCC--Cc Q lcl|NC_021296. 120 EEGRPWAGKTAFIRYGDGLFPFCSEDD--GY 148 (154) Q Consensus 120 ~~~~~~~~~~~~v~~gg~~~p~~~~~~--g~ 148 (154) +. +..+....|..+ -..|+.+- || T Consensus 147 ~~--~~~~~~~~v~~~---~r~F~rd~L~gf 172 (172) T protein:vir:99 147 LT--PPGGGVPQVLAP---ARTFSHDTLKDY 172 (172) T ss_pred CC--CCCCCceeeecC---CCccChhhccCC Confidence 21 223333444422 24667654 66 No 30 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=97.23 E-value=1e-05 Score=47.99 Aligned_cols=118 Identities=18% Similarity=0.198 Sum_probs=78.1 Q ss_pred CCcCCCHHHHHHHh---cCCCCHHHHHHHHHHHHHHHHHHHHh----hCC--------CCCCC----------cccchHH Q lcl|NC_021296. 1 MAGLASIQDLQTLM---SQTFEGDELEQAQLVLDIVSSWARVV----SGR--------AWPDA----------PADVPDD 55 (154) Q Consensus 1 M~~~ATvdDl~arl---gr~L~~~E~~~A~~lL~~aS~lir~~----~~~--------~~~d~----------~~~~p~~ 55 (154) -..|+|++|..+.. |..+..++.++ +.+|--|++.|-.+ .|+ .||.. ...+|.. T Consensus 14 anSYvt~~ea~aY~~~rg~~~~~dd~~~-e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~~~~~~IP~~ 92 (169) T protein:vir:95 14 ADSYVSLEDGRALAAKYGLELPEDDIAA-EASLRNGAVYVGLFESQMCGRRVSANQALAFPRTGIDLHGFPQPSNVIPSL 92 (169) T ss_pred ccccccHHHHHHHHHHcCCcCCCCHHHH-HHHHHHHHHHhhccccccccccCCcchhhccccCCceecccccccccchHH Confidence 45899999998765 44554444443 44455699888753 232 46633 3457899 Q ss_pred HHHHHHHHHHHHHhCCCcc--------cee-eecceeeEeecC---CCcccCHHHHHHHHhhc-cCCceeecccccc Q lcl|NC_021296. 56 VRAVVLQASRRELKNPDRV--------ISR-QMGPFNVQYSQP---PDGFFYPAELAILKRFK-RSGGLQTVSTSRG 119 (154) Q Consensus 56 v~~Vv~~~vaR~l~nP~g~--------~~e-taG~fs~s~~~s---gg~~lt~aE~~~Lrr~r-~~~g~~sV~~~r~ 119 (154) |+.=+|..+.+++.+|..+ .++ ..|+=+++|..+ ++.....+=...|++|- .++|.|+|-..|| T Consensus 93 V~~A~~elA~~~~~g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~~~~~~~a~~~LL~p~l~g~~g~~~i~~~rg 169 (169) T protein:vir:95 93 VIQAQVMAAVEYGAGTDVRGSTDGREVQTERVEGAVTVSYFKNGYSGGTVSITAADDALRPLLCGSNNAYSFNVFRG 169 (169) T ss_pred HHHHHHHHHHHHHcCccccCCCCccceeeeeeccceeEeecCCCCcCccccHHHHHHhhhhhcccCCCcceeeeecC Confidence 9999999999999876532 223 338888888532 23233334345799996 4567899999999 No 31 >protein:vir:9928 Length: 118 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795690;genbank:gi:28876458;genbank:GeneID:1258013 Probab=97.18 E-value=1.2e-05 Score=47.59 Aligned_cols=115 Identities=13% Similarity=0.161 Sum_probs=80.1 Q ss_pred CCcCCCHHHHHHHhcCCCC-HHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCcccee Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFE-GDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISR 77 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~-~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~e 77 (154) |..=-..++|+.+||.+.+ ..+-++.+.+|+.|++.|+.+++.....-...+|+.+..|++.++-..+. .-.|.+|+ T Consensus 1 md~~~~L~~vK~~lgI~~~D~~~D~lL~~~i~~a~~~i~~~l~~~~~~~~~eiP~~l~~iv~evav~ryNR~g~EG~~S~ 80 (118) T protein:vir:99 1 MGDKQLIDDIKLFIGISKGDGAQDELITLAIYESKERVLAKLNEYSETEITKIPDRLRFIVRDVAIKRFNRINSEGAVED 80 (118) T ss_pred CchhhHHHHHHHHhCCCCCchhhHHHHHHHHHHHHHHHHHHhccccccchhhhhHHHHHHHHHHHHHHhcCcCCccccee Confidence 9986779999999997654 34567999999999999999998543222345899999999998888874 46688999 Q ss_pred eecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccccCCCceee Q lcl|NC_021296. 78 QMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRPWAGKTAFI 132 (154) Q Consensus 78 taG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~~~~~~~~v 132 (154) |.++.|.||.+...-| ...|++|+..... .. .+..+|+ T Consensus 81 SeeG~S~sf~~d~~ey-----~~~l~~~~~~~~~----~~--------~g~v~Fi 118 (118) T protein:vir:99 81 SEEGKTFKWDSYLKEY-----ESTLRSAAIGKVY----SG--------KGVARFI 118 (118) T ss_pred ecCCeeeeeccCchhH-----HHHHHHHhhhccc----Cc--------CcceeeC Confidence 9999999995433333 2335666422110 00 1122333 No 32 >protein:vir:99517 Length: 124 # NCBI annotation: putative protein # Family: family:all:372 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958539;genbank:gi:41179321;genbank:GeneID:2717155 Probab=97.12 E-value=1.9e-05 Score=46.44 Aligned_cols=113 Identities=12% Similarity=0.201 Sum_probs=75.3 Q ss_pred CCc--CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccce Q lcl|NC_021296. 1 MAG--LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVIS 76 (154) Q Consensus 1 M~~--~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~ 76 (154) |+. -.++++|+.+||.+ +..+-++.+.+|+.|++.|+.++|. .+.+|+.+..|+..+|-..+. .-.|.+| T Consensus 1 m~~~~~~~Le~vK~~LgI~-d~~~D~lL~~lI~~a~~~i~~~l~~-----~e~iP~~L~~Iv~evavkryNR~g~EG~~S 74 (124) T protein:vir:99 1 MNDVLDDQLKKLKTALQLT-DTKHDDLLKLYLEDATDFLKLRLSI-----TGVIPTEMLAIVRGAAVKKFNRFKNEGMAS 74 (124) T ss_pred CCcchHHHHHHHHHHhCCC-CcchhHHHHHHHHHHHHHHHHhcCC-----cccchhHHHHHHHHHHHHHhcccCCcccce Confidence 763 34699999999976 3333467999999999999999874 245889998888888877763 4668899 Q ss_pred eeecceeeEeecC-CCcccCHHHHHHHHhhccCC---ceee-cccc--ccccccc Q lcl|NC_021296. 77 RQMGPFNVQYSQP-PDGFFYPAELAILKRFKRSG---GLQT-VSTS--RGEEGRP 124 (154) Q Consensus 77 etaG~fs~s~~~s-gg~~lt~aE~~~Lrr~r~~~---g~~s-V~~~--r~~~~~~ 124 (154) ++.++.|.||.++ +.-| ...|++|+... |... |..+ |++-+-- T Consensus 75 ~SeeG~S~sf~d~d~~~y-----~~~L~~y~~~~~~~g~~~fi~~~~~~~~~~~~ 124 (124) T protein:vir:99 75 YSQDGESITFASSDFDEW-----EDEINQWRKDHTGMNKGMWVNPYEIRQNGRAN 124 (124) T ss_pred eeeCceeeeecccChhhH-----HHHHHHHhhccCcCCceeeecCccccCCCCCC Confidence 9999999999542 3333 23366775332 1111 1122 2222210 No 33 >protein:vir:97145 Length: 110 # NCBI annotation: ORF049 # Family: family:all:372 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239728;genbank:gi:66394913;genbank:GeneID:5130878 Probab=97.07 E-value=1.6e-05 Score=46.89 Aligned_cols=102 Identities=10% Similarity=0.178 Sum_probs=73.4 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeeecc Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQMGP 81 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~etaG~ 81 (154) .++.++|+.+||.+ +..+-++.+.+|+.|++.|..+++- +. ..+|+.+..|++.+|-..+. .-.|.+|++.++ T Consensus 1 M~~L~~vK~~lgI~-d~~~D~lL~~ii~~a~~~i~~~l~~---~~-~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:97 1 MTTLADVKKRIGLK-DEKQDEQLEEIIKSCESQLLSMLPI---EV-EQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhCCC-CCchhHHHHHHHHHHHHHHHHHhcc---ch-hhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 67899999999975 3445568999999999999999862 22 34899999999998877774 567899999999 Q ss_pred eeeEeec-CCCcccCHHHHHHHHhhccCC-----ceeecc Q lcl|NC_021296. 82 FNVQYSQ-PPDGFFYPAELAILKRFKRSG-----GLQTVS 115 (154) Q Consensus 82 fs~s~~~-sgg~~lt~aE~~~Lrr~r~~~-----g~~sV~ 115 (154) .|.||.+ -+.-|. ..|++|+... |.+... T Consensus 76 ~S~sf~d~d~~~y~-----~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:97 76 RSNAYELNDFKEYE-----AIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eeeeecccccchHH-----HHHHHHHhhcCCCCCceeeeC Confidence 9999954 355552 2356665221 222211 No 34 >protein:vir:99796 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004309;genbank:gi:122891763;genbank:GeneID:4712351 Probab=97.07 E-value=1.6e-05 Score=46.89 Aligned_cols=102 Identities=10% Similarity=0.178 Sum_probs=73.4 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeeecc Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQMGP 81 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~etaG~ 81 (154) .++.++|+.+||.+ +..+-++.+.+|+.|++.|..+++- +. ..+|+.+..|++.+|-..+. .-.|.+|++.++ T Consensus 1 M~~L~~vK~~lgI~-d~~~D~lL~~ii~~a~~~i~~~l~~---~~-~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:99 1 MTTLADVKKRIGLK-DEKQDEQLEEIIKSCESQLLSMLPI---EV-EQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhCCC-CCchhHHHHHHHHHHHHHHHHHhcc---ch-hhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 67899999999975 3445568999999999999999862 22 34899999999998877774 567899999999 Q ss_pred eeeEeec-CCCcccCHHHHHHHHhhccCC-----ceeecc Q lcl|NC_021296. 82 FNVQYSQ-PPDGFFYPAELAILKRFKRSG-----GLQTVS 115 (154) Q Consensus 82 fs~s~~~-sgg~~lt~aE~~~Lrr~r~~~-----g~~sV~ 115 (154) .|.||.+ -+.-|. ..|++|+... |.+... T Consensus 76 ~S~sf~d~d~~~y~-----~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:99 76 RSNAYELNDFKEYE-----AIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eeeeecccccchHH-----HHHHHHHhhcCCCCCceeeeC Confidence 9999954 355552 2356665221 222211 No 35 >protein:vir:9311 Length: 110 # NCBI annotation: phi Mu50B-like protein # Family: family:all:372 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803289;genbank:gi:29028599;genbank:GeneID:1258047 Probab=97.07 E-value=1.6e-05 Score=46.89 Aligned_cols=102 Identities=10% Similarity=0.178 Sum_probs=73.4 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeeecc Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQMGP 81 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~etaG~ 81 (154) .++.++|+.+||.+ +..+-++.+.+|+.|++.|..+++- +. ..+|+.+..|++.+|-..+. .-.|.+|++.++ T Consensus 1 M~~L~~vK~~lgI~-d~~~D~lL~~ii~~a~~~i~~~l~~---~~-~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:93 1 MTTLADVKKRIGLK-DEKQDEQLEEIIKSCESQLLSMLPI---EV-EQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhCCC-CCchhHHHHHHHHHHHHHHHHHhcc---ch-hhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 67899999999975 3445568999999999999999862 22 34899999999998877774 567899999999 Q ss_pred eeeEeec-CCCcccCHHHHHHHHhhccCC-----ceeecc Q lcl|NC_021296. 82 FNVQYSQ-PPDGFFYPAELAILKRFKRSG-----GLQTVS 115 (154) Q Consensus 82 fs~s~~~-sgg~~lt~aE~~~Lrr~r~~~-----g~~sV~ 115 (154) .|.||.+ -+.-|. ..|++|+... |.+... T Consensus 76 ~S~sf~d~d~~~y~-----~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:93 76 RSNAYELNDFKEYE-----AIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eeeeecccccchHH-----HHHHHHHhhcCCCCCceeeeC Confidence 9999954 355552 2356665221 222211 No 36 >protein:vir:96221 Length: 110 # NCBI annotation: ORF044 # Family: family:all:372 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239573;genbank:gi:66395333;genbank:GeneID:5132767 Probab=97.07 E-value=1.6e-05 Score=46.89 Aligned_cols=102 Identities=10% Similarity=0.178 Sum_probs=73.4 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeeecc Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQMGP 81 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~etaG~ 81 (154) .++.++|+.+||.+ +..+-++.+.+|+.|++.|..+++- +. ..+|+.+..|++.+|-..+. .-.|.+|++.++ T Consensus 1 M~~L~~vK~~lgI~-d~~~D~lL~~ii~~a~~~i~~~l~~---~~-~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:96 1 MTTLADVKKRIGLK-DEKQDEQLEEIIKSCESQLLSMLPI---EV-EQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhCCC-CCchhHHHHHHHHHHHHHHHHHhcc---ch-hhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 67899999999975 3445568999999999999999862 22 34899999999998877774 567899999999 Q ss_pred eeeEeec-CCCcccCHHHHHHHHhhccCC-----ceeecc Q lcl|NC_021296. 82 FNVQYSQ-PPDGFFYPAELAILKRFKRSG-----GLQTVS 115 (154) Q Consensus 82 fs~s~~~-sgg~~lt~aE~~~Lrr~r~~~-----g~~sV~ 115 (154) .|.||.+ -+.-|. ..|++|+... |.+... T Consensus 76 ~S~sf~d~d~~~y~-----~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:96 76 RSNAYELNDFKEYE-----AIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eeeeecccccchHH-----HHHHHHHhhcCCCCCceeeeC Confidence 9999954 355552 2356665221 222211 No 37 >protein:vir:78849 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285363;genbank:gi:148717891;genbank:GeneID:5246980 Probab=97.07 E-value=1.6e-05 Score=46.89 Aligned_cols=102 Identities=10% Similarity=0.178 Sum_probs=73.4 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeeecc Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQMGP 81 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~etaG~ 81 (154) .++.++|+.+||.+ +..+-++.+.+|+.|++.|..+++- +. ..+|+.+..|++.+|-..+. .-.|.+|++.++ T Consensus 1 M~~L~~vK~~lgI~-d~~~D~lL~~ii~~a~~~i~~~l~~---~~-~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:78 1 MTTLADVKKRIGLK-DEKQDEQLEEIIKSCESQLLSMLPI---EV-EQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhCCC-CCchhHHHHHHHHHHHHHHHHHhcc---ch-hhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 67899999999975 3445568999999999999999862 22 34899999999998877774 567899999999 Q ss_pred eeeEeec-CCCcccCHHHHHHHHhhccCC-----ceeecc Q lcl|NC_021296. 82 FNVQYSQ-PPDGFFYPAELAILKRFKRSG-----GLQTVS 115 (154) Q Consensus 82 fs~s~~~-sgg~~lt~aE~~~Lrr~r~~~-----g~~sV~ 115 (154) .|.||.+ -+.-|. ..|++|+... |.+... T Consensus 76 ~S~sf~d~d~~~y~-----~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:78 76 RSNAYELNDFKEYE-----AIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eeeeecccccchHH-----HHHHHHHhhcCCCCCceeeeC Confidence 9999954 355552 2356665221 222211 No 38 >protein:vir:103957 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873994;genbank:gi:118430769;genbank:GeneID:4525451 Probab=97.07 E-value=1.6e-05 Score=46.89 Aligned_cols=102 Identities=10% Similarity=0.178 Sum_probs=73.4 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeeecc Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQMGP 81 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~etaG~ 81 (154) .++.++|+.+||.+ +..+-++.+.+|+.|++.|..+++- +. ..+|+.+..|++.+|-..+. .-.|.+|++.++ T Consensus 1 M~~L~~vK~~lgI~-d~~~D~lL~~ii~~a~~~i~~~l~~---~~-~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:10 1 MTTLADVKKRIGLK-DEKQDEQLEEIIKSCESQLLSMLPI---EV-EQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhCCC-CCchhHHHHHHHHHHHHHHHHHhcc---ch-hhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 67899999999975 3445568999999999999999862 22 34899999999998877774 567899999999 Q ss_pred eeeEeec-CCCcccCHHHHHHHHhhccCC-----ceeecc Q lcl|NC_021296. 82 FNVQYSQ-PPDGFFYPAELAILKRFKRSG-----GLQTVS 115 (154) Q Consensus 82 fs~s~~~-sgg~~lt~aE~~~Lrr~r~~~-----g~~sV~ 115 (154) .|.||.+ -+.-|. ..|++|+... |.+... T Consensus 76 ~S~sf~d~d~~~y~-----~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:10 76 RSNAYELNDFKEYE-----AIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eeeeecccccchHH-----HHHHHHHhhcCCCCCceeeeC Confidence 9999954 355552 2356665221 222211 No 39 >protein:vir:96390 Length: 110 # NCBI annotation: ORF048 # Family: family:all:372 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239650;genbank:gi:66395410;genbank:GeneID:5132866 Probab=97.07 E-value=1.6e-05 Score=46.89 Aligned_cols=102 Identities=10% Similarity=0.178 Sum_probs=73.4 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeeecc Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQMGP 81 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~etaG~ 81 (154) .++.++|+.+||.+ +..+-++.+.+|+.|++.|..+++- +. ..+|+.+..|++.+|-..+. .-.|.+|++.++ T Consensus 1 M~~L~~vK~~lgI~-d~~~D~lL~~ii~~a~~~i~~~l~~---~~-~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:96 1 MTTLADVKKRIGLK-DEKQDEQLEEIIKSCESQLLSMLPI---EV-EQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhCCC-CCchhHHHHHHHHHHHHHHHHHhcc---ch-hhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 67899999999975 3445568999999999999999862 22 34899999999998877774 567899999999 Q ss_pred eeeEeec-CCCcccCHHHHHHHHhhccCC-----ceeecc Q lcl|NC_021296. 82 FNVQYSQ-PPDGFFYPAELAILKRFKRSG-----GLQTVS 115 (154) Q Consensus 82 fs~s~~~-sgg~~lt~aE~~~Lrr~r~~~-----g~~sV~ 115 (154) .|.||.+ -+.-|. ..|++|+... |.+... T Consensus 76 ~S~sf~d~d~~~y~-----~~l~~y~~~~~~~~kG~v~Fl 110 (110) T protein:vir:96 76 RSNAYELNDFKEYE-----AIIDNYFNARTRTKKGRAVFF 110 (110) T ss_pred eeeeecccccchHH-----HHHHHHHhhcCCCCCceeeeC Confidence 9999954 355552 2356665221 222211 No 40 >protein:vir:95774 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950592;genbank:gi:119953787;genbank:GeneID:5076844 Probab=96.93 E-value=1.9e-05 Score=46.44 Aligned_cols=108 Identities=11% Similarity=0.091 Sum_probs=74.5 Q ss_pred CCcCCCHHHHHHHhcCCCCH-HHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCcccee Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEG-DELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISR 77 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~-~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~e 77 (154) |..- ++++|+.+||.+.++ .+-.+.+.+|+.|++.|+.++|. ..+|+.+..|+..++-+.+. .-.|.+|+ T Consensus 1 md~~-~L~~vK~~LgI~~~D~~~D~lL~~ii~~a~~~i~~~l~~------~~iP~~L~~Iv~ev~vkryNR~g~EG~~S~ 73 (115) T protein:vir:95 1 MDTT-QLEKIKRRLGIPADDDKEDKLLEDLVEDAETYFKLLTSS------AVVDSKYHFMIEAVVYKLYGRKGSEGVTSE 73 (115) T ss_pred Cchh-HHHHHHHHhCCCCCCchhhHHHHHHHHHHHHHHHHhcCc------hhcchhHHHHHHHHHHHHhcCCCcccccee Confidence 9875 899999999987665 35579999999999999999974 35788888888888776663 46688999 Q ss_pred eecceeeEeecC---CCcccCHHHHHHHHhhccC-Cceeecccc Q lcl|NC_021296. 78 QMGPFNVQYSQP---PDGFFYPAELAILKRFKRS-GGLQTVSTS 117 (154) Q Consensus 78 taG~fs~s~~~s---gg~~lt~aE~~~Lrr~r~~-~g~~sV~~~ 117 (154) +.++.|.||... ..-| .+++...+..+.. +....|... T Consensus 74 S~eG~S~tf~~nD~~f~eY--~~~l~~~~~~~~~~~~~G~v~Fl 115 (115) T protein:vir:95 74 TVDGYSVTYQEWDNLFKPY--MAILNKDFGLDGSVREKGKVMFL 115 (115) T ss_pred ecCceeeeccccccccchh--HHHHHHHHhccCCccCCcceeeC Confidence 999999999432 3333 2344333332211 111112222 No 41 >protein:vir:79074 Length: 150 # NCBI annotation: gp10, conserved hypothetical protein # Family: family:all:348 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111210;genbank:gi:134288797;genbank:GeneID:4960748 Probab=96.93 E-value=1.5e-05 Score=47.04 Aligned_cols=128 Identities=12% Similarity=0.188 Sum_probs=75.5 Q ss_pred CcCCCHHHHHHHhcCC----CCH-------------HHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHH Q lcl|NC_021296. 2 AGLASIQDLQTLMSQT----FEG-------------DELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQAS 64 (154) Q Consensus 2 ~~~ATvdDl~arlgr~----L~~-------------~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~v 64 (154) =+|||.+||.+|+|.. |++ -..++.+..|++||+.|.+|.+++..-+-.++|..++.+||+++ T Consensus 1 M~Y~T~~Dl~~r~ge~~l~~Ltd~~~~~~~~~~~~~~d~~~i~~Al~dA~~~IDgyL~~RY~lPl~~vP~~L~~~a~dIA 80 (150) T protein:vir:79 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESAVRQAEEIVDAHLRGRYNLPLSPVPTVIKDVTVNLA 80 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccccccccccccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHH Confidence 3699999999998732 221 13467889999999999999998876444578899999999999 Q ss_pred HHHHhC--CC-ccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccccCCCceeecCCCCcCCc Q lcl|NC_021296. 65 RRELKN--PD-RVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRPWAGKTAFIRYGDGLFPF 141 (154) Q Consensus 65 aR~l~n--P~-g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~~~~~~~~v~~gg~~~p~ 141 (154) .=.|-. +. +..++. -...| ++-++.|+.-+. |.+++...-+. ..+ ..+...|..+ -++ T Consensus 81 ~Y~L~~~~~~~~~~~e~---v~~rY---------~~Ai~~L~~Ia~--Gk~~Lg~~~~~-~~~-~~~~~~v~~~---~r~ 141 (150) T protein:vir:79 81 RHWLYARRPEGAALPDT---VSQTF---------KASMHMLEKIRD--NKLTIGDPSGP-ATP-EPGEMKVRAR---RRQ 141 (150) T ss_pred HHHHHhcccCCCCCCHH---HHHHH---------HHHHHHHHHHhc--CccccCCCCcc-CCC-CCCceeeecC---CCc Confidence 766632 21 111111 00011 223344444444 55666543222 222 2233444432 346 Q ss_pred cCCCC--Cc Q lcl|NC_021296. 142 CSEDD--GY 148 (154) Q Consensus 142 ~~~~~--g~ 148 (154) |+.+- || T Consensus 142 f~r~~l~g~ 150 (150) T protein:vir:79 142 FDADLLERF 150 (150) T ss_pred cChhhccCC Confidence 67654 55 No 42 >protein:vir:3970 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663678;genbank:gi:21716115;genbank:GeneID:951203 Probab=96.87 E-value=3e-05 Score=45.37 Aligned_cols=101 Identities=16% Similarity=0.279 Sum_probs=71.1 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeeecc Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQMGP 81 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~etaG~ 81 (154) .+++++|+.++|.+ .+ ++.+.+|++|.+.|+.++++.. ..+|+.+..|++++|-..+. .-.|.+|++.++ T Consensus 1 M~iL~~vK~~lgi~--~D--~lL~~li~~a~~~i~~~l~~~~----~~iP~~l~~iv~evav~ryNR~g~EG~~S~SeeG 72 (110) T protein:vir:39 1 MAITDDLKKLLGGS--SD--ERLEVIEKRTRERLLLILSSNI----KEVPPELEYVVLDVSLKRFNRIGQEGMQSYSQEG 72 (110) T ss_pred CchHHHHHHhcCCC--hh--HHHHHHHHHHHHHHHHHhCCCh----hhhhhHHHHHHHHHHHHHhccccccccceeecCC Confidence 56799999999964 33 5899999999999999998532 23788888888888877774 466889999999 Q ss_pred eeeEeec-CCCcccCHHHHHHHHhhccC------Cceeecccc Q lcl|NC_021296. 82 FNVQYSQ-PPDGFFYPAELAILKRFKRS------GGLQTVSTS 117 (154) Q Consensus 82 fs~s~~~-sgg~~lt~aE~~~Lrr~r~~------~g~~sV~~~ 117 (154) .|.||.+ -..-|. ..|++|+.. .+...|..+ T Consensus 73 ~S~sf~~~d~~~y~-----~~l~~y~~~~~~~~~~~~g~~~f~ 110 (110) T protein:vir:39 73 LSMTFSESDFDEYA-----DEIESWRKSKETEGDKKIGRFRLY 110 (110) T ss_pred eeeeecccCcchhH-----HHHHHHhhhccccccCcceeeeeC Confidence 9999953 344442 235666522 122223222 No 43 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=96.86 E-value=3.8e-05 Score=44.81 Aligned_cols=118 Identities=18% Similarity=0.202 Sum_probs=76.4 Q ss_pred CCcCCCHHHHHHHh---cCCCCHHHHHHHHHHHHHHHHHHHHh----hCC--------CCCCCc----------ccchHH Q lcl|NC_021296. 1 MAGLASIQDLQTLM---SQTFEGDELEQAQLVLDIVSSWARVV----SGR--------AWPDAP----------ADVPDD 55 (154) Q Consensus 1 M~~~ATvdDl~arl---gr~L~~~E~~~A~~lL~~aS~lir~~----~~~--------~~~d~~----------~~~p~~ 55 (154) -..|+|++|..+.. |..+..++.+ .+.+|--|++.|-.+ .|+ .||..+ ..+|.. T Consensus 14 anSYvtv~~a~aY~~~rg~~~~~d~~~-~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~~~~~~IP~~ 92 (169) T protein:vir:78 14 ADSYVSLEDGRALAAKYGLELPEDDTA-AEAALRNGAVYVGLFESQMCGRRVSANQALAFPRTGVTLHGFPQPSNVIPPL 92 (169) T ss_pred ccccccHHHHHHHHHHcCCcCCCChHH-HHHHHHHHHHHhhhccccceeeeCCcccccccccCCceecccccccccchHH Confidence 45899999988764 4445444333 444455699888753 222 355332 357889 Q ss_pred HHHHHHHHHHHHHhCCCcc--------ceeee-cceeeEeecC---CCcccCHHHHHHHHhhc-cCCceeecccccc Q lcl|NC_021296. 56 VRAVVLQASRRELKNPDRV--------ISRQM-GPFNVQYSQP---PDGFFYPAELAILKRFK-RSGGLQTVSTSRG 119 (154) Q Consensus 56 v~~Vv~~~vaR~l~nP~g~--------~~eta-G~fs~s~~~s---gg~~lt~aE~~~Lrr~r-~~~g~~sV~~~r~ 119 (154) |+.=+|..+.+++.++.-. .+|.. |+=+++|..+ ++.....+=...|++|- .++|.|+|-..|| T Consensus 93 v~~A~~elA~~~~~g~~~~~~~~~~~v~~e~v~G~i~veY~~~~~~~~~~~~~~~~~LL~p~l~~~~g~~~i~~~rg 169 (169) T protein:vir:78 93 VIQAQVMAAVEYGAGTDVRGSTDGREVQTERVEGAVTVSYFKNGYSGGTVSITTADDALRPLLCGSNNAYSFNVFRG 169 (169) T ss_pred HHHHHHHHHHHHhcCcccCCCCCcceeEEEEecCceeEeecCCCCCCCcccHHHHHHHhhhhcccCCCcceeeeecC Confidence 9999999999998765322 33444 7888888532 23233334346799996 4467899999999 No 44 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=96.83 E-value=2.4e-05 Score=45.96 Aligned_cols=128 Identities=11% Similarity=0.166 Sum_probs=74.5 Q ss_pred CcCCCHHHHHHHhcCC----CC-------------HHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHH Q lcl|NC_021296. 2 AGLASIQDLQTLMSQT----FE-------------GDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQAS 64 (154) Q Consensus 2 ~~~ATvdDl~arlgr~----L~-------------~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~v 64 (154) =+|||.+||.+|+|.. |+ .-..++++..|++||+.|.+|.+++..-+-.++|..++.+||+++ T Consensus 1 M~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgYL~~RY~lPl~~vP~~L~~~a~dIA 80 (150) T protein:vir:10 1 MRYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAHLRGRYNLPLSPVPTVIKDVTVNLA 80 (150) T ss_pred CCcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHH Confidence 3699999999998732 22 123467889999999999999998875433568899999999999 Q ss_pred HHHHhC--CC-ccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccccCCCceeecCCCCcCCc Q lcl|NC_021296. 65 RRELKN--PD-RVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRPWAGKTAFIRYGDGLFPF 141 (154) Q Consensus 65 aR~l~n--P~-g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~~~~~~~~v~~gg~~~p~ 141 (154) .=.|.+ +. +..++.. ...| ++=++.|+.-+. |.+++...- +...+ ..+...|..+ -+. T Consensus 81 rY~L~~~~~~~~~~~e~v---~~rY---------~~Ai~~L~~Ia~--Gk~~Lg~~~-~~~~~-~~~~~~v~~~---~r~ 141 (150) T protein:vir:10 81 RHWLYARRPEGAALPDTV---SQTF---------KASMHMLEKIRD--NKLTIGDPS-GPATP-EPGEMKVRAR---RRQ 141 (150) T ss_pred HHHHHhcccccCCCCHHH---HHHH---------HHHHHHHHHHhc--CcccCCCCC-CCCCC-CCceeeeecC---CCc Confidence 766632 21 1111110 0011 222334444444 556655432 22222 2233334322 346 Q ss_pred cCCCC--Cc Q lcl|NC_021296. 142 CSEDD--GY 148 (154) Q Consensus 142 ~~~~~--g~ 148 (154) |+.+- || T Consensus 142 f~r~~l~gf 150 (150) T protein:vir:10 142 FDADLLERF 150 (150) T ss_pred cChhhccCC Confidence 66654 55 No 45 >protein:vir:106583 Length: 105 # NCBI annotation: putative protein # Family: family:all:6481 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958586;genbank:gi:41179246;genbank:GeneID:2717119 Probab=96.82 E-value=2.1e-05 Score=46.20 Aligned_cols=101 Identities=8% Similarity=-0.022 Sum_probs=76.9 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceee Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQ 78 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~et 78 (154) |..=--++.++.+.+.+ +..+-+....+|++|.+.|..|+++ ..+|+.+..+++++|...+. +-.|.+|++ T Consensus 2 ~~~~~~~e~ik~L~~~~-d~~~DelL~~lieda~~~vl~y~nr------~~ip~~l~~~v~evav~~fNR~G~EG~tS~S 74 (105) T protein:vir:10 2 LNVDQLTEIVSALSTRL-ENVNNALLTELVKESIAQVLDYTGQ------KKLVGSMDIYVKKLAVINYNRLGIEGETQRS 74 (105) T ss_pred CchHHHHHHHHHHhccC-CCchhHHHHHHHHHHHHHHHHHcCC------cccchhHHHHHHHHHHHHhcccCCcccceee Confidence 66555666777666543 5566779999999999999999976 25788888998888777774 457899999 Q ss_pred ecceeeEeecCCCcccCHHHHHHHHhhccCC-cee Q lcl|NC_021296. 79 MGPFNVQYSQPPDGFFYPAELAILKRFKRSG-GLQ 112 (154) Q Consensus 79 aG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~ 112 (154) .|+.|.||-+ .++ +.-+..|++||..+ +.| T Consensus 75 egGvS~sy~~---~~~-~~~~~~l~~yR~~~v~~~ 105 (105) T protein:vir:10 75 EGGITNYLET---GIP-KDIRQGLNSYRIAKVKKL 105 (105) T ss_pred cCCeeeeeec---cCc-HHHHHHHHHHhhhcccCC Confidence 9999999954 222 34557799999654 677 No 46 >protein:vir:94507 Length: 113 # NCBI annotation: putative DNA packaging protein # Family: family:all:372 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223891;genbank:gi:62327103;genbank:GeneID:5075523 Probab=96.81 E-value=3e-05 Score=45.36 Aligned_cols=104 Identities=10% Similarity=0.184 Sum_probs=73.9 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeeecc Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQMGP 81 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~etaG~ 81 (154) ..++++|+.+||.. +..+-++.+.+|+.|++.++.++|....+ ...+|+.+..|++.++-+.+. +-.|.+|.+.++ T Consensus 1 M~~L~~vK~~lgi~-d~~~D~lL~~iI~~a~~~i~~~l~~~~~~-~~~iP~~l~~Iv~evavkryNR~g~EG~~S~SeeG 78 (113) T protein:vir:94 1 MALLDSIKLRIGIE-DTKQDDLLTDIISDVQARVLAYVNQDGLV-QSELPNGLDFVIKDVTIRIYNKIGDEGKESSSEGN 78 (113) T ss_pred CchHHHHHHHhCCC-CCchhhHHHHHHHHHHHHHHHHhCCccch-hhhhhhHHHHHHHHHHHHHhcccCCccceeeecCc Confidence 67899999999964 33344689999999999999999853221 346899999999998877774 577899999999 Q ss_pred eeeEeec--CCCcccCHHHHHHHHhhcc-----CCceeec Q lcl|NC_021296. 82 FNVQYSQ--PPDGFFYPAELAILKRFKR-----SGGLQTV 114 (154) Q Consensus 82 fs~s~~~--sgg~~lt~aE~~~Lrr~r~-----~~g~~sV 114 (154) .|.||-+ ...-| ...|++|+. ++|++=+ T Consensus 79 ~S~sf~~~~df~~y-----~~~l~~~~~~~~~~~~g~rF~ 113 (113) T protein:vir:94 79 VSNTWDTPADLSEY-----SDVLDVYRKSYKRRSAGMRFI 113 (113) T ss_pred eeeeecCccchhhH-----HHHHHHHHhhccCCCCCceeC Confidence 9999954 23444 223555542 1222212 No 47 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=96.72 E-value=8.1e-05 Score=43.02 Aligned_cols=115 Identities=17% Similarity=0.230 Sum_probs=79.3 Q ss_pred CCcCCCHHHHHHHh---cCCCCHHHHHHHHHHHHHHHHHHHHh----hCC--------CCCCCc----------ccchHH Q lcl|NC_021296. 1 MAGLASIQDLQTLM---SQTFEGDELEQAQLVLDIVSSWARVV----SGR--------AWPDAP----------ADVPDD 55 (154) Q Consensus 1 M~~~ATvdDl~arl---gr~L~~~E~~~A~~lL~~aS~lir~~----~~~--------~~~d~~----------~~~p~~ 55 (154) -..|+|++|+.+.. |.++++++. +.+|-.|++.|-.+ .|+ .||..+ +.+|.. T Consensus 14 anSYvt~~~a~aY~~~rg~~~~~d~~---e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~g~~~~~~~IP~~ 90 (172) T protein:vir:80 14 ANTYAGADFVIAYAQARGVTVDADEA---ERLILEAMDYIESFRRRWKGERNTREQGLTWPRHDAVVDGFVIPSDVIPKE 90 (172) T ss_pred ccccccHHHHHHHHHHcCCCcCHHHH---HHHHHHHHHHHhhccCccccccCCccccccccccCcccCcccccccchhHH Confidence 46899999987654 888888754 55667899999873 222 255432 457889 Q ss_pred HHHHHHHHHHHHHhCCCc--------cceeeecceeeEeecCC-Cc----------ccCHHHHHHHHhhccCCceeeccc Q lcl|NC_021296. 56 VRAVVLQASRRELKNPDR--------VISRQMGPFNVQYSQPP-DG----------FFYPAELAILKRFKRSGGLQTVST 116 (154) Q Consensus 56 v~~Vv~~~vaR~l~nP~g--------~~~etaG~fs~s~~~sg-g~----------~lt~aE~~~Lrr~r~~~g~~sV~~ 116 (154) |+.-+|..+.+++.++.. ..++..|+=+.+|..++ ++ -++ .=...|++|-.++|-+++.. T Consensus 91 v~~A~~elA~~~~~g~~~~~~~~~~~v~~ekVG~i~~eY~~~~~~~~~~~~~~~~~~~~-~v~~LL~p~l~~~gg~~~~~ 169 (172) T protein:vir:80 91 LQSAVAAAVIEQVNGFELQQSQDQWAVRIEKVDVIEVQYAAGGGGQSASANAPMKPTFP-KIDALLNPLLVGDGGLFLVA 169 (172) T ss_pred HHHHHHHHHHHHhcCCccCcCCCCceeeEEeccceEEeeecccCccccccccCCccchH-HHHHHHhhhhcCCCCeeeee Confidence 999999999988865332 34467888888885321 11 222 22457999977777788889 Q ss_pred ccc Q lcl|NC_021296. 117 SRG 119 (154) Q Consensus 117 ~r~ 119 (154) -|| T Consensus 170 vrg 172 (172) T protein:vir:80 170 VRG 172 (172) T ss_pred ecC Confidence 998 No 48 >protein:vir:106596 Length: 128 # NCBI annotation: ORF042 # Family: family:all:372 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239495;genbank:gi:66395254;genbank:GeneID:4555750 Probab=96.66 E-value=2e-05 Score=46.35 Aligned_cols=107 Identities=8% Similarity=0.119 Sum_probs=72.3 Q ss_pred CC---cCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccc Q lcl|NC_021296. 1 MA---GLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVI 75 (154) Q Consensus 1 M~---~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~ 75 (154) |. =..++++|+.+||..-+ .+-++.+.+|++|++.|+.|++... ..+|+.+..|+.++|-..+. +-.|.+ T Consensus 13 ~~~~~~m~~Le~vK~~LgI~d~-~~D~lL~~lI~~a~~~i~~~l~~~~----~~iP~~L~~Iv~evaVkryNR~g~EG~~ 87 (128) T protein:vir:10 13 LNSGEVMNYLDDVKSRIGLNDN-EQDKQLNSIINNVAAELLSRLPVDT----ISIPDKLQFIVVEVSTKRYNRIGAEGMS 87 (128) T ss_pred ecHHHHHHHHHHHHHHhCCCCc-chhhHHHHHHHHHHHHHHHHcCCCh----hhhhhhHHHHHHHHHHHHhcccCccCcc Confidence 11 12468899999997532 3346899999999999999998532 24788888888888777663 466899 Q ss_pred eeeecceeeEeec-CCCcccCHHHHHHHHhhccCC---ceeecccc Q lcl|NC_021296. 76 SRQMGPFNVQYSQ-PPDGFFYPAELAILKRFKRSG---GLQTVSTS 117 (154) Q Consensus 76 ~etaG~fs~s~~~-sgg~~lt~aE~~~Lrr~r~~~---g~~sV~~~ 117 (154) |++.+++|.||.. ..+-|- ..|++|+... +...|..+ T Consensus 88 S~SeeG~S~tf~dnd~~~Y~-----~~L~~y~~~~~~~~kG~v~F~ 128 (128) T protein:vir:10 88 TDSQDGRSNTFERNDFEEYQ-----SIIDALYPKLDSSERGSVNFY 128 (128) T ss_pred eeeeCceeeeeccCCcchhH-----HHHHHHHhhccCCCCCceeeC Confidence 9999999999954 355552 2356665321 12222222 No 49 >protein:vir:8104 Length: 170 # NCBI annotation: gp8 # Family: family:all:3238 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817685;genbank:gi:29566116;genbank:GeneID:1259310 Probab=96.65 E-value=3.5e-05 Score=45.01 Aligned_cols=95 Identities=20% Similarity=0.181 Sum_probs=65.2 Q ss_pred HhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCC-CCC-cc---------------------------------------- Q lcl|NC_021296. 13 LMSQTFEGDELEQAQLVLDIVSSWARVVSGRAW-PDA-PA---------------------------------------- 50 (154) Q Consensus 13 rlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~-~d~-~~---------------------------------------- 50 (154) .-|.-.++ ..++.+|+.||+.+|.|||.+. |.. ++ T Consensus 1 ~~~~~a~~---~~~q~~l~aA~a~vR~~cGwhv~P~v~d~t~~ldg~G~~vl~LPt~pvvsV~sV~~~G~~l~~~~~~~~ 77 (170) T protein:vir:81 1 MRGQFADN---TEAQAAIDAVLAAARRWCGWHVSPVIIDDVMEVDGPGGRVLSLPTLNLVSVKSVVELGYALDVSTLDRS 77 (170) T ss_pred CcccccCc---hHHHHHHHHHHHHHHHHhCCcccceecccEEEEeCCCCeeEECCCCcceeeEEEEECCeeecCccceee Confidence 11222232 4678889999999999999653 211 00 Q ss_pred -------------------------------cchHHHHHHHHHHHHHHHh-CCCccce-eeecceeeEeecCCCcccCHH Q lcl|NC_021296. 51 -------------------------------DVPDDVRAVVLQASRRELK-NPDRVIS-RQMGPFNVQYSQPPDGFFYPA 97 (154) Q Consensus 51 -------------------------------~~p~~v~~Vv~~~vaR~l~-nP~g~~~-etaG~fs~s~~~sgg~~lt~a 97 (154) ++|+.+..|++.|++|+.. ||.+..+ -..+.+|++|. +++.-+.++ T Consensus 78 ~~~glL~r~~G~~~~~~~~V~VT~tHGy~~~~apd~~~~vi~~~a~r~~~s~~~~~l~~~~~~~vs~~~~-~~~~s~~~~ 156 (170) T protein:vir:81 78 RRKGTLTKPYGRWTARDGAIVVTATHGFTETEAADWRRAVVQLVGRRAQTSRPSADLKRKKVDDVEYEWF-ETAVSVDAE 156 (170) T ss_pred cCCceEEecCCccccccceEEEEEEeCCCCCccchHHHHHHHHHHHHhhccCCcccceeeeccceeeeec-ccccccCHH Confidence 2356688999999999996 6887444 45667888876 567778899 Q ss_pred HHHHHHhhccCCce Q lcl|NC_021296. 98 ELAILKRFKRSGGL 111 (154) Q Consensus 98 E~~~Lrr~r~~~g~ 111 (154) |++.|.|||...-= T Consensus 157 ~~~iL~~Yrl~~~p 170 (170) T protein:vir:81 157 LSAVFSPFRILPSP 170 (170) T ss_pred HHHhhhhcccCCCC Confidence 99999999963211 No 50 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=96.56 E-value=8e-05 Score=43.03 Aligned_cols=117 Identities=20% Similarity=0.257 Sum_probs=77.0 Q ss_pred CCcCCCHHHHHHHh---cCCCCHHHHHHHHHHHHHHHHHHHHh----hCC--------CCCCC----------cccchHH Q lcl|NC_021296. 1 MAGLASIQDLQTLM---SQTFEGDELEQAQLVLDIVSSWARVV----SGR--------AWPDA----------PADVPDD 55 (154) Q Consensus 1 M~~~ATvdDl~arl---gr~L~~~E~~~A~~lL~~aS~lir~~----~~~--------~~~d~----------~~~~p~~ 55 (154) -..|+|++|+.+.+ +..+..++.++ +.+|-.|++.|-.+ .|+ .||.. .+.+|.. T Consensus 16 anSYvtv~ea~aY~~~rg~~~~~~~~~k-e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~~~~v~~~~IP~~ 94 (172) T protein:vir:95 16 ANSYVSVADARIYASNRGVELPLDDDEL-AAMLIRSTDYLEAQACRFQGKPTSTTQALQWPRTGVFLNEDEVPSNVIPKS 94 (172) T ss_pred ccccccHHHHHHHHHhcCCcCCCChHHH-HHHHHHHHHHhhccCCceeeeecCCcccccCCcCCcccCcccccccchhHH Confidence 46899999998765 33333333344 55556699999753 221 35543 3357889 Q ss_pred HHHHHHHHHHHHHhCCC---------ccceeeecceeeEeecC---CC-cccCHHHHHHHHhhc--cCCceeecccccc Q lcl|NC_021296. 56 VRAVVLQASRRELKNPD---------RVISRQMGPFNVQYSQP---PD-GFFYPAELAILKRFK--RSGGLQTVSTSRG 119 (154) Q Consensus 56 v~~Vv~~~vaR~l~nP~---------g~~~etaG~fs~s~~~s---gg-~~lt~aE~~~Lrr~r--~~~g~~sV~~~r~ 119 (154) |+.-+|..+.+++.+++ ++.++..|+=+.+|..+ +. .-+. +=...|++|. .+++.|++-+.|= T Consensus 95 V~~A~~elA~~~~~~~~~~~~~~~~~~vk~~kVG~I~veY~~~~~~~~~~~~~-~v~~LL~p~l~~~~~~~~~~r~~r~ 172 (172) T protein:vir:95 95 LIAAQVQLTMAINAGFDLQPNVSPQDYVTREKVGPIETEYADPLSVGIMPTFT-AANALLAPLFGECASNKFALRTIRV 172 (172) T ss_pred HHHHHHHHHHHHHcCccccccCCcccceeEEeccceEEeeccCCCCCCcccHH-HHHHHHhhhhcccCCcceeeEEEeC Confidence 99999999998887753 12456789999988542 22 3333 4445789995 4567888888885 No 51 >protein:vir:9877 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:2716 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795639;genbank:gi:28876402;genbank:GeneID:1257933 Probab=96.54 E-value=5.5e-05 Score=43.94 Aligned_cols=101 Identities=7% Similarity=0.059 Sum_probs=72.9 Q ss_pred CCcCC--CHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccce Q lcl|NC_021296. 1 MAGLA--SIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVIS 76 (154) Q Consensus 1 M~~~A--TvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~ 76 (154) |+..- +.++|+.+||.. +..+-++.+.+|+.|++.++.++|. ..+|+.+..|+..++-+.+. .-.|.+| T Consensus 1 m~~~~~~~L~~vK~~Lgi~-d~~~D~lL~~ii~~~~~~i~~~l~~------~~iP~~L~~Iv~ev~vkryNR~g~EG~~S 73 (114) T protein:vir:98 1 MDETKQAIIDRVRVRLADE-TSLKEELLEELTQTAIDRINLKVGD------VVFNPLFNSIAVDVVVKMYRRMYFEGIDT 73 (114) T ss_pred CchhHHHHHHHHHHHhCCC-CCchhhHHHHHHHHHHHHHHHhhCc------cccchHHHHHHHHHHHHHhcccCccccce Confidence 98764 699999999975 4445588999999999999999873 36788888888887766663 4578999 Q ss_pred eeecceeeEeec-CCCcccCHHHHHHHHhhcc------CC--ceee Q lcl|NC_021296. 77 RQMGPFNVQYSQ-PPDGFFYPAELAILKRFKR------SG--GLQT 113 (154) Q Consensus 77 etaG~fs~s~~~-sgg~~lt~aE~~~Lrr~r~------~~--g~~s 113 (154) ++.+++|.||.. -..-|. ++ |++|+. ++ ++|= T Consensus 74 ~S~eG~S~tf~dndf~ey~--~~---l~~y~~~~~~~~~g~~v~Fl 114 (114) T protein:vir:98 74 EKADTISTKFIENVLAEYG--EE---LASYKKDRLAILNKKVVRFL 114 (114) T ss_pred eeccceeeeeeccccchhH--HH---HHHHHhhhhhhhcCceeecC Confidence 999999999954 345552 22 444432 11 2222 No 52 >protein:vir:741 Length: 110 # NCBI annotation: unknown # Family: family:all:372 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108718;genbank:gi:13487840;genbank:GeneID:920873 Probab=96.49 E-value=9e-05 Score=42.77 Aligned_cols=101 Identities=15% Similarity=0.268 Sum_probs=71.0 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeeecc Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQMGP 81 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~etaG~ 81 (154) .+++++|+.+||.+ .+ ++.+.+|+.|.+.|..++|.... .+|+.+..|++++|-..+. .-.|.+|++.++ T Consensus 1 M~~L~~vK~~lgi~--~D--~lL~~li~~a~~~i~~~l~~~~~----~iP~~l~~iv~evav~ryNR~g~EG~~S~SeeG 72 (110) T protein:vir:74 1 MAITYEIKKLLGGS--SD--ERLEIIEKRTRERLLLILGSDLK----EVPPELEYVVLDVSLKRFNRIGQEGMQSYSQEG 72 (110) T ss_pred ChHHHHHHHHcCCC--hh--HHHHHHHHHHHHHHHHHhCCChh----hhhHHHHHHHHHHHHHHhcccCccccceeecCC Confidence 67899999999965 33 58999999999999999874322 4788888888888877774 456889999999 Q ss_pred eeeEeec-CCCcccCHHHHHHHHhhccCC------ceeecccc Q lcl|NC_021296. 82 FNVQYSQ-PPDGFFYPAELAILKRFKRSG------GLQTVSTS 117 (154) Q Consensus 82 fs~s~~~-sgg~~lt~aE~~~Lrr~r~~~------g~~sV~~~ 117 (154) .|.||.+ -+.-| ...|++|+... +...|..+ T Consensus 73 ~S~sf~~~d~~~y-----~~~l~~y~~~~~~~~~~~~~~~~f~ 110 (110) T protein:vir:74 73 LSMTFSESDFDEY-----ADEIESRRKSKETEGDKKIGRFRLY 110 (110) T ss_pred eeeeecccchhhH-----HHHHHHHHhhccccccCcceeeeeC Confidence 9999954 23333 23356665321 22222222 No 53 >protein:vir:3615 Length: 110 # NCBI annotation: ORF38 # Family: family:all:372 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112701;genbank:gi:13786569;genbank:GeneID:921067 Probab=96.40 E-value=7.8e-05 Score=43.12 Aligned_cols=101 Identities=17% Similarity=0.304 Sum_probs=71.4 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeeecc Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQMGP 81 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~etaG~ 81 (154) .+++++|+.++|.+. + ++.+.+|++|.+.|+.++++.. ..+|+.+..|++++|-..+. .-.|.+|++.++ T Consensus 1 M~~L~~vK~~lg~~~--D--~lL~~li~~a~~~i~~~~~~~~----~eiP~~l~~iv~evav~ryNR~g~EG~~S~SeeG 72 (110) T protein:vir:36 1 MAITDDLKMLLGGSL--D--ERLEVIEKRTRDRLLLILGSDI----KEVPPELEYVVLDVSLKRFNRIGQEGMQSYSQEG 72 (110) T ss_pred ChhHHHHHhhcCCCh--h--HHHHHHHHHHHHHHHHHhCCCh----hhhhhHHHHHHHHHHHHHhccccccccceeecCC Confidence 678999999998643 2 4999999999999999998632 24788888888888877764 466889999999 Q ss_pred eeeEeec-CCCcccCHHHHHHHHhhccCC------ceeecccc Q lcl|NC_021296. 82 FNVQYSQ-PPDGFFYPAELAILKRFKRSG------GLQTVSTS 117 (154) Q Consensus 82 fs~s~~~-sgg~~lt~aE~~~Lrr~r~~~------g~~sV~~~ 117 (154) .|.||.+ -.+-|. ..|++|+... +...|..+ T Consensus 73 ~S~sf~~~d~~~y~-----~~l~~y~~~~~~~~~~~~g~~~f~ 110 (110) T protein:vir:36 73 LSMTFSESDFDEYA-----DEIESWRKSRETEGDKKIGRFRLY 110 (110) T ss_pred ceeeecccCcchHH-----HHHHHHHhhhccccCCcceeeeeC Confidence 9999954 234442 2356665321 22222222 No 54 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=96.33 E-value=0.00018 Score=41.15 Aligned_cols=111 Identities=10% Similarity=0.002 Sum_probs=66.8 Q ss_pred CcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCC-----CcccchHHHHHHHHHHHHHHHhC------ Q lcl|NC_021296. 2 AGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPD-----APADVPDDVRAVVLQASRRELKN------ 70 (154) Q Consensus 2 ~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d-----~~~~~p~~v~~Vv~~~vaR~l~n------ 70 (154) -+|+|.+...+..|..+++++ .+.+|..||+.|...+..+... ++....+.||-.+|..+.-.-.+ T Consensus 1 M~Y~t~~~Y~~~~G~~i~e~~---F~~l~~rAs~~ID~iT~~ri~~~~~~~d~~~~~~~vk~A~c~qiey~~~~G~~sae 77 (132) T protein:vir:98 1 MPYLTYEEFMDLNGRDIDDKK---FEKLLPKASAIIDGVTGHFYQKVDMEKDNAWRVNQFKLALCAQIEYFDALGATTFE 77 (132) T ss_pred CCCCCHHHHHhhcCCCCCHHH---HHHHHHHHHHHHHHHhcccccCCCccccChHHHHHHHHHHHHHHHHHHhccchhhh Confidence 478999999997787777653 6778899999999998877542 22233455766666655533222 Q ss_pred --CCccceeeecceeeEeecC-------CCcccCHH-HHHHHHhhccCCceeeccccc Q lcl|NC_021296. 71 --PDRVISRQMGPFNVQYSQP-------PDGFFYPA-ELAILKRFKRSGGLQTVSTSR 118 (154) Q Consensus 71 --P~g~~~etaG~fs~s~~~s-------gg~~lt~a-E~~~Lrr~r~~~g~~sV~~~r 118 (154) -....+.+.|.+|+||..+ +....+.. =..-|++.++ +|.=...+ T Consensus 78 ~~~~~~~S~svG~~Svs~~s~~~~~~~~~~~~~~~~~a~~~L~~tGL---LyrGV~~~ 132 (132) T protein:vir:98 78 EINNSPQTFQAGRTSVSNASRYNPSGANESKPLVAEDVYIYLQGTGL---LFQGVKTW 132 (132) T ss_pred hccCccceeeeCcEEEEeeccCCcccccccccchHHHHHHHHhhcCC---ccccCCCC Confidence 2225678999999998532 12222222 2234555553 22211111 No 55 >protein:vir:96488 Length: 113 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238494;genbank:gi:66391770;genbank:GeneID:5176910 Probab=96.09 E-value=0.00017 Score=41.29 Aligned_cols=101 Identities=10% Similarity=0.164 Sum_probs=70.0 Q ss_pred CCCHHHHHHHhcCCC--CHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceeee Q lcl|NC_021296. 4 LASIQDLQTLMSQTF--EGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQM 79 (154) Q Consensus 4 ~ATvdDl~arlgr~L--~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~eta 79 (154) ..+.++++-+++... +..+-.+.+.+|+.|++.++.++|. ..+|+.+..|+..++-+.+. .-.|.+|++. T Consensus 1 M~~L~~~K~l~~ik~~~~~~~D~lL~~ii~~a~~~i~~~l~~------~~iP~~L~~Iv~evavkryNR~g~EG~~S~S~ 74 (113) T protein:vir:96 1 MMALDKDKVIKNVSVDLNTDDDVLLKILLERVVNHFKSEYGV------EEIDDKLAFIFEDCVIKRFNRRGAEGAKSESV 74 (113) T ss_pred CchhHHHHHHhcCCCCCCCchhHHHHHHHHHHHHHHHHHhcc------cccchhHHHHHHHHHHHHhcCCCccccceecc Confidence 457778887777653 3345678999999999999999974 35788988888888877774 5778999999 Q ss_pred cceeeEeec---CCCcccCHHHHHHHHhhcc-----CCceeecc Q lcl|NC_021296. 80 GPFNVQYSQ---PPDGFFYPAELAILKRFKR-----SGGLQTVS 115 (154) Q Consensus 80 G~fs~s~~~---sgg~~lt~aE~~~Lrr~r~-----~~g~~sV~ 115 (154) +++|.||.. -...| .++ |++|+. +.|.+... T Consensus 75 eG~S~sf~d~~~df~eY--~~~---l~~~~~~~~~~~~G~v~Fl 113 (113) T protein:vir:96 75 DGHSMSYYDNENEFKPY--DDM---LQRLYGTSGQSKEGEVLFL 113 (113) T ss_pred Cceeeeecccccccchh--HHH---HHHHHhhcCCCCCceeeeC Confidence 999999953 24444 233 344431 12222222 No 56 >protein:vir:4904 Length: 113 # NCBI annotation: gp113 # Family: family:all:372 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056682;genbank:gi:9635017;genbank:GeneID:1262667 Probab=95.84 E-value=0.00018 Score=41.13 Aligned_cols=100 Identities=15% Similarity=0.224 Sum_probs=68.3 Q ss_pred CCcCC---CHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccc Q lcl|NC_021296. 1 MAGLA---SIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVI 75 (154) Q Consensus 1 M~~~A---TvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~ 75 (154) |.++. +.++|+.++|. +++ ++.+.+|+.|++.++.++|. ..+|+.+..|++.++-+.+. +-.|.+ T Consensus 1 m~~l~~~~~L~~vK~~lgi--~dD--~lL~~li~~a~~~i~~~l~~------~~iP~~l~~Iv~evavkryNR~g~EG~~ 70 (113) T protein:vir:49 1 MMALDKEKVIQNVSVDLNI--NDD--NLLGILLERIVNHFKAEYGV------DEVDDNLAFIFEDCLVKRFNRRGAEGAR 70 (113) T ss_pred CcchhHHHHHHHHHHhcCC--Chh--HHHHHHHHHHHHHHHHHhCc------cccchHHHHHHHHHHHHHhcccCccccc Confidence 66554 46777777774 333 57999999999999999874 35789999999888877774 566899 Q ss_pred eeeecceeeEeec---CCCcccCHHHHHHHHhhcc-----CCceeecc Q lcl|NC_021296. 76 SRQMGPFNVQYSQ---PPDGFFYPAELAILKRFKR-----SGGLQTVS 115 (154) Q Consensus 76 ~etaG~fs~s~~~---sgg~~lt~aE~~~Lrr~r~-----~~g~~sV~ 115 (154) |++.++.|.||.. ....| .++ |++|+. +.|.+... T Consensus 71 S~SeeG~S~sf~d~~~df~eY--~~~---l~~~~~~~~~~~~G~v~Fl 113 (113) T protein:vir:49 71 SESIDGHSMSYYDNENEFDPY--DNM---LQRLYGTSGQAKEGEVLFL 113 (113) T ss_pred eeecCceeeeecccccccchh--HHH---HHHHHhhcCCCCCcceeeC Confidence 9999999999953 23334 223 344431 11222222 No 57 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=95.73 E-value=0.00072 Score=37.81 Aligned_cols=111 Identities=9% Similarity=-0.017 Sum_probs=68.6 Q ss_pred CcCCCHHHHHHHh-cCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCC-----cccchHHHHHHHHHHHHHHHh------ Q lcl|NC_021296. 2 AGLASIQDLQTLM-SQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDA-----PADVPDDVRAVVLQASRRELK------ 69 (154) Q Consensus 2 ~~~ATvdDl~arl-gr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~-----~~~~p~~v~~Vv~~~vaR~l~------ 69 (154) =+|+|.+..++.+ |-++++++ -..+|..||+.|..++..+.... ++..++.||-.||..+.-.-. T Consensus 1 M~Y~d~~~Y~~~y~g~~i~e~~---F~~l~~rAs~~ID~~T~~ri~~~~~~~~~~~~~~~vk~A~c~q~e~~~~~g~~s~ 77 (131) T protein:vir:43 1 MPYTTLEFYNDEYAGEHLEQDE---FDKLLKHAERKIDSVTFYRIRKGGIESFSEFIQHQIQLATCNQIEYFKEAGGTSE 77 (131) T ss_pred CCCCCHHHHHHhhCCCCCCHhH---HHHHHHHHHHHHHHHhcccccccCccccchhhHHHHHHHHHHHHHHHHHhHHHhh Confidence 4789999998877 55566654 45778999999999988775532 234567788777777754432 Q ss_pred CCC-ccceeeecceeeEeecCCCcc------cCHHH-HHHHHhhccCCceeeccccc Q lcl|NC_021296. 70 NPD-RVISRQMGPFNVQYSQPPDGF------FYPAE-LAILKRFKRSGGLQTVSTSR 118 (154) Q Consensus 70 nP~-g~~~etaG~fs~s~~~sgg~~------lt~aE-~~~Lrr~r~~~g~~sV~~~r 118 (154) ++. +..+++.|.+|++|...+..- .+..+ ..-|.+.++ +|.=..+| T Consensus 78 ~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGL---lyrGV~~~ 131 (131) T protein:vir:43 78 LAVSKPDNVSIGRTSISDSNFASTATSLNSGLIGSDVRSYLAHTGL---LYNGVGVR 131 (131) T ss_pred hhccccCeeecCceEEeecccccchhhhchhhhHHHHHHHHhccCC---eecCCCCC Confidence 233 467889999999986432111 11111 122344333 33333333 No 58 >protein:vir:1329 Length: 122 # NCBI annotation: gp37 # Family: family:all:11657 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047928;swissprot:trembl:q9zxb0;genbank:gi:9631146;uniprot:Q9ZXB0;genbank:GeneID:2715909 Probab=95.72 E-value=7.1e-05 Score=43.34 Aligned_cols=110 Identities=20% Similarity=0.235 Sum_probs=65.4 Q ss_pred CcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHH----HHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh-----CCC Q lcl|NC_021296. 2 AGLASIQDLQTLMSQTFEGDELEQAQLVLDIVS----SWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK-----NPD 72 (154) Q Consensus 2 ~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS----~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~-----nP~ 72 (154) -+|||+++|.+.=| |+++ .-....+|.+|- ..+.+|||+.|-...+|.|++++--+-..++..+. -|+ T Consensus 1 mayatieelraldg--ldds-alfsdellsdaidfsvetveaycgrkwdtaedptpetirwcvrtlarqyvldhvsripd 77 (122) T protein:vir:13 1 MAYATIEELRALDG--LDDS-ALFSDELLSDAIDFSVETVEAYCGRKWDTAEDPTPETIRWCVRTLARQYVLDHVSRIPD 77 (122) T ss_pred CcchhhhhhhhhcC--ccch-hhhhhhhhhhhhhhhhhhhhhhhCcccCCcCCCChhHHHHHHHHHHHHHHHHHhhhcch Confidence 46899999998644 5543 122344555554 45788999999988899999997766666655543 366 Q ss_pred ccceeeecceeeEeecCCCcccC---HHHHHHHHhhccCCceeec Q lcl|NC_021296. 73 RVISRQMGPFNVQYSQPPDGFFY---PAELAILKRFKRSGGLQTV 114 (154) Q Consensus 73 g~~~etaG~fs~s~~~sgg~~lt---~aE~~~Lrr~r~~~g~~sV 114 (154) ...|-+.-=-|.+..+.||.|-- ++--+.|+-||.+--..-+ T Consensus 78 ralqlqsefgsiqlaqaggnwrptslpevnaklnlyrvrlpfifm 122 (122) T protein:vir:13 78 RALQLQSEFGSIQLAQAGGNWRPTSLPEVNAKLNLYRVRLPFIFM 122 (122) T ss_pred hhhhhhhcccceeeeccCCCcccCcccccccceeeeeeecceeeC Confidence 54442221115567777887732 2222445555543211111 No 59 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=95.61 E-value=0.00056 Score=38.40 Aligned_cols=115 Identities=16% Similarity=0.259 Sum_probs=77.5 Q ss_pred CCcCCCHHHHHHHh--------cCCCCHHHHHHHHHHHHHHHHHHHHh---hCC--------CCCCC----------ccc Q lcl|NC_021296. 1 MAGLASIQDLQTLM--------SQTFEGDELEQAQLVLDIVSSWARVV---SGR--------AWPDA----------PAD 51 (154) Q Consensus 1 M~~~ATvdDl~arl--------gr~L~~~E~~~A~~lL~~aS~lir~~---~~~--------~~~d~----------~~~ 51 (154) -..|+|++|..+.. |.+++++ ..+.+|-.|++.|-.. .|+ .||-. .+. T Consensus 13 AnSYvtv~ea~aY~~~r~~~~~w~~~~~~---~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~~dg~~~~~~~ 89 (170) T protein:vir:94 13 ANSYVTVAEANSYFDGSYGRPLWTSASED---EKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNAVIGGMTLSQVS 89 (170) T ss_pred ccceecHHHHHHHHHhhccccccCCCCHH---HHHHHHHHHHHHhccccccccccCCcchhhcccccCcccCccccccch Confidence 57899999998852 3345544 4455668888888542 222 25532 246 Q ss_pred chHHHHHHHHHHHHHHHhCCCcc-------ceeeecceeeEeecC--CCcccCHHHHHHHHhhc-----cCCceeecccc Q lcl|NC_021296. 52 VPDDVRAVVLQASRRELKNPDRV-------ISRQMGPFNVQYSQP--PDGFFYPAELAILKRFK-----RSGGLQTVSTS 117 (154) Q Consensus 52 ~p~~v~~Vv~~~vaR~l~nP~g~-------~~etaG~fs~s~~~s--gg~~lt~aE~~~Lrr~r-----~~~g~~sV~~~ 117 (154) +|..|+.-+|..+.+++.++..+ .+++.|+=+++|..+ +-+-++.- ...|++|- .+.+.+++... T Consensus 90 IP~~V~~Aq~elA~~~~~~~~~~~~~~~~v~~~kVG~i~veY~~~~~~~~~~~~v-~~LL~p~l~~~~~g~~~~~~~~~~ 168 (170) T protein:vir:94 90 IPVKVKIAVFELAYFMLESGAALSFADQTIDSVKVGTIRVEFTKNSTDAGLPTFV-EAMLSGFGSPVLYGSNAARSIDLV 168 (170) T ss_pred hhHHHHHHHHHHHHHHHhCcccCcccccceeeEecceeEEEecCCCCCCccHHHH-HHHhhhhhccccccccccceeeee Confidence 89999999999999999876643 446789999998632 22333432 45688885 33478889999 Q ss_pred cc Q lcl|NC_021296. 118 RG 119 (154) Q Consensus 118 r~ 119 (154) || T Consensus 169 r~ 170 (170) T protein:vir:94 169 RA 170 (170) T ss_pred cC Confidence 98 No 60 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=95.58 E-value=0.00083 Score=37.49 Aligned_cols=111 Identities=9% Similarity=-0.015 Sum_probs=69.1 Q ss_pred CcCCCHHHHHHHh-cCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCC-----cccchHHHHHHHHHHHHHHHh------ Q lcl|NC_021296. 2 AGLASIQDLQTLM-SQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDA-----PADVPDDVRAVVLQASRRELK------ 69 (154) Q Consensus 2 ~~~ATvdDl~arl-gr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~-----~~~~p~~v~~Vv~~~vaR~l~------ 69 (154) =+|+|.+..++.. |-++.+++ -..+|..||+.|..++..+.... .+..++.||-.||..+.-.-. T Consensus 1 M~Y~d~~~Y~~~y~G~~i~e~~---F~~l~~rAs~~ID~~T~~ri~~~~~d~~~~~~~~~vk~A~c~q~e~~~~~g~~~~ 77 (131) T protein:vir:80 1 MPYTTLEFYTNEYAGEHLEQDE---FAKLLKHAERKIDSVTFYRIRKSGIEAFSEFIQHQIQLATCNQIEYFKEAGGTSE 77 (131) T ss_pred CCCCCHHHHHHhhCCCCCchhH---HHHHHHHHHHHHHHHhcccccccccccCchhHHHHHHHHHHHHHHHHHHhhhhhh Confidence 3689999998765 55567654 45788999999999988776533 134567787778777754433 Q ss_pred C-CCccceeeecceeeEeecCC------CcccCHHH-HHHHHhhccCCceeeccccc Q lcl|NC_021296. 70 N-PDRVISRQMGPFNVQYSQPP------DGFFYPAE-LAILKRFKRSGGLQTVSTSR 118 (154) Q Consensus 70 n-P~g~~~etaG~fs~s~~~sg------g~~lt~aE-~~~Lrr~r~~~g~~sV~~~r 118 (154) + ..+..+++.|.+|++|...+ +...+..+ ..-|.+.++ +|.=..+| T Consensus 78 ~~~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~~~L~~TGL---lyrGV~~~ 131 (131) T protein:vir:80 78 LAVSKPDNVSIGRTSISDSNFASTATSLNSGLVGSDVRSYLAHTGL---LYNGVGVR 131 (131) T ss_pred hcccccCeeeeCceEEeeccccchhhhhhhhhhHHHHHHHHhccCC---eecCCCCC Confidence 2 34567899999999986432 11212111 123444443 33333333 No 61 >protein:vir:2738 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695111;genbank:gi:23455880;genbank:GeneID:955641 Probab=95.52 E-value=0.00026 Score=40.21 Aligned_cols=100 Identities=11% Similarity=0.173 Sum_probs=71.7 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--CCCccceee Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--NPDRVISRQ 78 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--nP~g~~~et 78 (154) |...-..++|+.++|.. ++ ++.+.+|+.|++.++.+++. ..+|+.+..|++.++-+.+. +-.|.+|++ T Consensus 3 l~~~~~L~~iK~~lg~~--dD--~lL~~ii~~a~~~i~~~l~~------~~iP~~l~~Iv~evavkryNR~g~EG~~S~S 72 (112) T protein:vir:27 3 LDKDKVIKNVSVDLNTN--DD--ALLKILLERVVNHFKSEYGV------EEIDDKLAFIFEDCVIKRFNRRGAEGAKSES 72 (112) T ss_pred chhHHHHHHHHhhcCCC--hh--HHHHHHHHHHHHHHHHhcCc------cccchhHHHHHHHHHHHHhcccCccccceee Confidence 67888999999998853 33 47999999999999999874 25789998998888877774 467999999 Q ss_pred ecceeeEeec---CCCcccCHHHHHHHHhhcc-----CCceeecc Q lcl|NC_021296. 79 MGPFNVQYSQ---PPDGFFYPAELAILKRFKR-----SGGLQTVS 115 (154) Q Consensus 79 aG~fs~s~~~---sgg~~lt~aE~~~Lrr~r~-----~~g~~sV~ 115 (154) .++.|.||.. -...| .++ |++|+. +.|.+... T Consensus 73 eeG~S~sf~d~~~df~~Y--~~~---l~~~~~~~~~~~~G~v~Fl 112 (112) T protein:vir:27 73 VDGHSMSYYDNENEFKPY--DDM---LQRLYGTSGQAKEGEVLFL 112 (112) T ss_pred cCceeeeecccccchhhh--HHH---HHHHHhhcCCCCCceeeeC Confidence 9999999953 23444 223 344421 12322222 No 62 >protein:vir:1241 Length: 104 # NCBI annotation: similar to phage Spp1 gp15 (product required for head morphogenesis) # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510940;genbank:gi:17426274;genbank:GeneID:927373 Probab=94.63 E-value=0.00072 Score=37.82 Aligned_cols=102 Identities=16% Similarity=0.288 Sum_probs=70.9 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+... .+.+.|..|+--|+.+++ . .-+..+++++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~~F--~~~~lP~gVkkfvAe~ik-y-~~~~NissRsMgtVS 76 (104) T protein:vir:12 1 MDAKDVKMINGLSLNDSSDDEQIEYLIEEYKSVAEDYCNQKF--DDKEVPSGVKKFIAECIK-F-GTTGNISARTMGTVS 76 (104) T ss_pred CCHHHHHHHhCCCCCCCccHHHHHHHHHHHHHHHHHHhCCCC--CCccCCccHHHHHHHHHh-h-CCCCCccccccccee Confidence 45789998888888741 223367777777888899998776 235689999998888888 2 336678889999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCC-ceeec Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSG-GLQTV 114 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~sV 114 (154) +||... +-+.=.+-|++||+-+ +-+-| T Consensus 77 YTy~T~----iP~~i~~~L~PYRrlrw~~~~~ 104 (104) T protein:vir:12 77 YTYVTD----IPSSAYAYLLPYRKLSWGKRYV 104 (104) T ss_pred eechhh----hhHHHHHhhhhhhhhcccccCC Confidence 999642 1223346688887532 22233 No 63 >protein:vir:97430 Length: 104 # NCBI annotation: ORF051 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240751;genbank:gi:66396455;genbank:GeneID:5133786 Probab=94.59 E-value=0.00078 Score=37.63 Aligned_cols=102 Identities=15% Similarity=0.276 Sum_probs=70.9 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+... .+.+.|..|+--|+.+++ . .-+..+++++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~~F--~~~~lP~gVkkfvAe~ik-y-~~~~NissRsMgtVS 76 (104) T protein:vir:97 1 MDAKDVKMINGLSLNDSSNDEQIEYLIEEYKGVAEDYCNQKF--DDKEVPSGVKKFIAECIK-F-GTTGNISARTMGTVS 76 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhCCCC--CCccCCccHHHHHHHHHh-h-CCCCCccccccccee Confidence 45789998888888742 223367777777888899998776 235689999998888888 2 336678889999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCC-ceeec Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSG-GLQTV 114 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~sV 114 (154) +||... +-+.=.+-|++||+-+ +-+-| T Consensus 77 YTy~T~----iP~~i~~~L~PYRrlrw~~~~~ 104 (104) T protein:vir:97 77 YTYVTD----IPSSAYAYLLPYRKLSWGKRYV 104 (104) T ss_pred eechhh----hhHHHHHhhhhhhhhcccccCC Confidence 999642 1223346688887532 22233 No 64 >protein:vir:94492 Length: 104 # NCBI annotation: ORF049 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240678;genbank:gi:66396380;genbank:GeneID:5133756 Probab=94.59 E-value=0.00078 Score=37.63 Aligned_cols=102 Identities=15% Similarity=0.276 Sum_probs=70.9 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+... .+.+.|..|+--|+.+++ . .-+..+++++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~~F--~~~~lP~gVkkfvAe~ik-y-~~~~NissRsMgtVS 76 (104) T protein:vir:94 1 MDAKDVKMINGLSLNDSSNDEQIEYLIEEYKGVAEDYCNQKF--DDKEVPSGVKKFIAECIK-F-GTTGNISARTMGTVS 76 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhCCCC--CCccCCccHHHHHHHHHh-h-CCCCCccccccccee Confidence 45789998888888742 223367777777888899998776 235689999998888888 2 336678889999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCC-ceeec Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSG-GLQTV 114 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~sV 114 (154) +||... +-+.=.+-|++||+-+ +-+-| T Consensus 77 YTy~T~----iP~~i~~~L~PYRrlrw~~~~~ 104 (104) T protein:vir:94 77 YTYVTD----IPSSAYAYLLPYRKLSWGKRYV 104 (104) T ss_pred eechhh----hhHHHHHhhhhhhhhcccccCC Confidence 999642 1223346688887532 22233 No 65 >protein:vir:95071 Length: 104 # NCBI annotation: ORF050 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240825;genbank:gi:66394717;genbank:GeneID:5133865 Probab=94.59 E-value=0.00078 Score=37.61 Aligned_cols=102 Identities=16% Similarity=0.284 Sum_probs=70.9 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+... .+.+.|..|+--|+.+++ . .-+..+++++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~~F--~~~~lP~gVkkfvAe~ik-y-~~~~NissRsMgtVS 76 (104) T protein:vir:95 1 MDAKDVKMINGLSLNDSSNDEQIEYLIEEYKSVAEDYCNQKF--DDKEVPSGVKKFIAECIK-F-GTTGNISARTMGTVS 76 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhCCCC--CCccCCccHHHHHHHHHh-h-CCCCCccccccccee Confidence 45789998888888742 223367777777888899998776 235689999998888888 2 336678889999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCC-ceeec Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSG-GLQTV 114 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~sV 114 (154) +||... +-+.=.+-|++||+-+ +-+-| T Consensus 77 YTy~T~----iP~~i~~~L~PYRrlrw~~~~~ 104 (104) T protein:vir:95 77 YTYVTD----IPSSAYAYLMPYRKLSWGKRYV 104 (104) T ss_pred eechhh----hhHHHHHhhhhhhhhcccccCC Confidence 999642 1223346688887532 22233 No 66 >protein:vir:93740 Length: 104 # NCBI annotation: ORF047 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240461;genbank:gi:66396159;genbank:GeneID:5133509 Probab=94.52 E-value=0.00084 Score=37.45 Aligned_cols=102 Identities=16% Similarity=0.279 Sum_probs=70.8 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+... .+.+.|..|+--|+.+++ . .-+..+++++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~~F--~~~~lP~gVkkfvAe~ik-y-~~~~NissRsMgtVS 76 (104) T protein:vir:93 1 MDAKDVKMINGLSLNDSSNDEQIDYLIEEYKSVAEDYCNQKF--DDKEVPSGVKKFIAECIK-F-GTTGNISARTMGTVS 76 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhCCCC--CCccCCccHHHHHHHHHh-h-CCCCCccccccccee Confidence 45789998888888742 223366777777888899998776 235689999998888888 2 336678889999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCC-ceeec Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSG-GLQTV 114 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~sV 114 (154) +||... +-+.=.+-|++||+-+ +-+-| T Consensus 77 YTy~T~----iP~~i~~~L~PYRrlrw~~~~~ 104 (104) T protein:vir:93 77 YTYVTD----IPSSAYAYLLPYRKLSWGKRYV 104 (104) T ss_pred eechhh----hhHHHHHhhhhhhhhcccccCC Confidence 999642 1223346688887532 22233 No 67 >protein:vir:107119 Length: 104 # NCBI annotation: conserved phage protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950608;genbank:gi:119953688;genbank:GeneID:4643128 Probab=94.34 E-value=0.00085 Score=37.43 Aligned_cols=102 Identities=17% Similarity=0.259 Sum_probs=70.8 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+... .+...|..|+--|..+++ .++ +..+.+++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~F--~~~~lP~gV~~fvA~~ik-y~~-~~NissRSMGtVS 76 (104) T protein:vir:10 1 MNAQDVKLLNNLSLDDTSNDETIELLIEKYLNVAEEYCNQTF--NRKSLPSNVEKFIANCIK-QGT-TSNISSRTMGTVS 76 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCCC--CCCCCCccHHHHHHHHHh-hcC-CCCccccccccee Confidence 46789998888888742 223367777777888899998775 235788999888888877 444 6688899999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCC-ceeec Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSG-GLQTV 114 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~sV 114 (154) +||... +-+.=.+-|++||+-+ +-+-| T Consensus 77 yTy~t~----iP~~i~~~L~PYRklr~~~~~~ 104 (104) T protein:vir:10 77 YTFVTD----LPKETYGYLKPFRRLRWTGYHV 104 (104) T ss_pred ecccch----hHHHHHHhhhhhhhhccccccC Confidence 998542 1233446788887533 22223 No 68 >protein:vir:105327 Length: 104 # NCBI annotation: putative head morphogenesis protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950671;genbank:gi:119967841;genbank:GeneID:4643206 Probab=94.34 E-value=0.00085 Score=37.43 Aligned_cols=102 Identities=17% Similarity=0.259 Sum_probs=70.8 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+... .+...|..|+--|..+++ .++ +..+.+++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~F--~~~~lP~gV~~fvA~~ik-y~~-~~NissRSMGtVS 76 (104) T protein:vir:10 1 MNAQDVKLLNNLSLDDTSNDETIELLIEKYLNVAEEYCNQTF--NRKSLPSNVEKFIANCIK-QGT-TSNISSRTMGTVS 76 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCCC--CCCCCCccHHHHHHHHHh-hcC-CCCccccccccee Confidence 46789998888888742 223367777777888899998775 235788999888888877 444 6688899999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCC-ceeec Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSG-GLQTV 114 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~sV 114 (154) +||... +-+.=.+-|++||+-+ +-+-| T Consensus 77 yTy~t~----iP~~i~~~L~PYRklr~~~~~~ 104 (104) T protein:vir:10 77 YTFVTD----LPKETYGYLKPFRRLRWTGYHV 104 (104) T ss_pred ecccch----hHHHHHHhhhhhhhhccccccC Confidence 998542 1233446788887533 22223 No 69 >protein:vir:97329 Length: 104 # NCBI annotation: ORF048 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240613;genbank:gi:66396311;genbank:GeneID:5133685 Probab=94.11 E-value=0.001 Score=37.02 Aligned_cols=102 Identities=16% Similarity=0.263 Sum_probs=70.2 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+... .+...|..|+--|..+++ ..+ +..+.+++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~F--~~~~lP~gV~~fvA~~ik-y~~-~~NissRSMGtVS 76 (104) T protein:vir:97 1 MDTKDVKMINGLSLNDSSNDEQIEYLIEEYKSVAEDYCNQKF--DDKAVPSGVKKFIAECIK-FGT-TGNISARTMGTVS 76 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCCC--CCCCCCccHHHHHHHHHh-hCC-CCCccccccccee Confidence 46789998888888742 223367777777888899998775 235788999888888877 333 6678889999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCC-ceeec Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSG-GLQTV 114 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~sV 114 (154) +||... +-+.=.+-|++||+-+ +-+-| T Consensus 77 Yty~t~----iP~~i~~~LkPYRklr~~~~~~ 104 (104) T protein:vir:97 77 YTYVTD----IPSSAYAYLMPYRKLSWGKRYV 104 (104) T ss_pred ecccch----hHHHHHHhhhhhhhhcccccCC Confidence 998542 1233446688887532 22223 No 70 >protein:vir:95891 Length: 104 # NCBI annotation: ORF051 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240387;genbank:gi:66396087;genbank:GeneID:5133402 Probab=94.06 E-value=0.001 Score=36.94 Aligned_cols=102 Identities=16% Similarity=0.260 Sum_probs=70.0 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+... .+...|..|+--|..+++ ..+ +..+.+++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~F--~~~~lP~gV~~fvA~~ik-y~~-~~NissRSMGtVS 76 (104) T protein:vir:95 1 MDAKDVKMINGLSLNDSSNDEQIKYLIEEYKSVAEDYCNQKF--DDKAVPSGVKKFIAECIK-FGT-TGNISARTMGTVS 76 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCCC--CCCCCCccHHHHHHHHHh-hCC-CCCccccccccee Confidence 45789998888888742 222366777777788899998775 235788999888888877 333 6678889999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCC-ceeec Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSG-GLQTV 114 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~sV 114 (154) +||... +-+.=.+-|++||+-+ +-+-| T Consensus 77 YTy~t~----iP~~i~~~LkPYRklr~~~~~~ 104 (104) T protein:vir:95 77 YTYVTD----IPSSAYAYLLPYRKLSWGKRYV 104 (104) T ss_pred ecccch----hHHHHHHhhhhhhhhcccccCC Confidence 998542 1233446688887532 22223 No 71 >protein:vir:96281 Length: 104 # NCBI annotation: ORF047 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240313;genbank:gi:66396008;genbank:GeneID:5133358 Probab=94.06 E-value=0.001 Score=36.94 Aligned_cols=102 Identities=16% Similarity=0.260 Sum_probs=70.0 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+... .+...|..|+--|..+++ ..+ +..+.+++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~F--~~~~lP~gV~~fvA~~ik-y~~-~~NissRSMGtVS 76 (104) T protein:vir:96 1 MDAKDVKMINGLSLNDSSNDEQIKYLIEEYKSVAEDYCNQKF--DDKAVPSGVKKFIAECIK-FGT-TGNISARTMGTVS 76 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCCC--CCCCCCccHHHHHHHHHh-hCC-CCCccccccccee Confidence 45789998888888742 222366777777788899998775 235788999888888877 333 6678889999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCC-ceeec Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSG-GLQTV 114 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~sV 114 (154) +||... +-+.=.+-|++||+-+ +-+-| T Consensus 77 YTy~t~----iP~~i~~~LkPYRklr~~~~~~ 104 (104) T protein:vir:96 77 YTYVTD----IPSSAYAYLLPYRKLSWGKRYV 104 (104) T ss_pred ecccch----hHHHHHHhhhhhhhhcccccCC Confidence 998542 1233446688887532 22223 No 72 >protein:vir:94798 Length: 104 # NCBI annotation: ORF043 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240538;genbank:gi:66396233;genbank:GeneID:5133578 Probab=94.05 E-value=0.001 Score=36.92 Aligned_cols=102 Identities=16% Similarity=0.260 Sum_probs=70.2 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+... .+...|..|+--|..+++ ..+ +..+.+++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCn~~F--~~~~lP~gVk~fvA~~ik-y~~-~~NissRSMGtVS 76 (104) T protein:vir:94 1 MDTKDVKMINGLSLNDSSNDEQIEYLIEEYKSVAEDYCNQKF--DDKAVPSGVKKFIAECIK-FGT-TGNISARTMGTVS 76 (104) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHhcCCCC--CCCCCCccHHHHHHHHHh-hCC-CCCccccccccee Confidence 46789998888888742 223367777777888899998775 235788999888888877 333 6678889999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCC-ceeec Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSG-GLQTV 114 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~-g~~sV 114 (154) +||... +-+.=.+-|++||+-+ +-+-| T Consensus 77 YTy~T~----iP~~i~~~LkPYRklr~~~~~~ 104 (104) T protein:vir:94 77 YTYITD----IPSSAYAYLMPYRKLSWGKRYV 104 (104) T ss_pred ecccch----hHHHHHHhhhhhhhhcccccCC Confidence 998542 1233446688887532 22223 No 73 >protein:vir:5976 Length: 102 # NCBI annotation: hypothetical protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690676;genbank:geneid:6329129;genbank:gi:22855070;uniprot:Q38584;genbank:GeneID:955305 Probab=93.95 E-value=0.0011 Score=36.77 Aligned_cols=100 Identities=16% Similarity=0.237 Sum_probs=68.2 Q ss_pred CCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCC--cccchHHHHHHHHHHHHHHHhCCCccceeeecce Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDA--PADVPDDVRAVVLQASRRELKNPDRVISRQMGPF 82 (154) Q Consensus 5 ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~--~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~f 82 (154) -.++||+-.|+.+-+ +.-+..+.+++.--..++.||+...-+- ..+.|..|+--|..+++=.++ |..+++++.|.- T Consensus 1 Md~~~VK~ll~i~~~-s~d~~i~~lip~y~e~aedyCN~~F~dkdg~~~lP~gVkkfvAe~ik~y~~-~~nissRsMgtV 78 (102) T protein:vir:59 1 MDIQRVKRLLSITND-KHDEYLTEMVPLLVEFAKDECHNPFIDKDGNESIPSGVLIFVAKAAQFYMT-NAGLTGRSMDTV 78 (102) T ss_pred CChHHhhhhhcCCCC-ccHHHHHHHHHHHHHHHHHHhCCccccccccccCCccHHHHHHHHHHhcCC-CCCcccccccce Confidence 457899887766543 2334455567777788899998876432 246788888877777765554 478889999999 Q ss_pred eeEeecCCCcccCHHHHHHHHhhccCCc Q lcl|NC_021296. 83 NVQYSQPPDGFFYPAELAILKRFKRSGG 110 (154) Q Consensus 83 s~s~~~sgg~~lt~aE~~~Lrr~r~~~g 110 (154) |+||... +-+.=.+-|++||+-.. T Consensus 79 SYty~T~----iP~~i~~~L~PyRrl~~ 102 (102) T protein:vir:59 79 SYNFATE----IPSTILKKLNPYRKMAR 102 (102) T ss_pred eeechhh----hhHHHHHHhhHHHhhcC Confidence 9999642 12233466888886432 No 74 >protein:vir:96128 Length: 98 # NCBI annotation: ORF050 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240080;genbank:gi:66395776;genbank:GeneID:5133109 Probab=93.55 E-value=0.0016 Score=35.93 Aligned_cols=97 Identities=18% Similarity=0.216 Sum_probs=71.0 Q ss_pred CCHHHHHHHhcCCCCHHH-HHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGDE-LEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~E-~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.=|.+++.++ -+..+.+++.--..++.||+... + ++.|..|+--|+.+++= .-+..+++++.|.-| T Consensus 1 Md~~dVK~ln~~~i~~~~~d~~~~~li~~y~e~aedyCN~~F-~--k~lP~gVkkfiAe~iky--~~~~nissRsMgtVS 75 (98) T protein:vir:96 1 MEPKEVKQLNLMPIEDTSNDDVLGDLIKFYKGIAEEYCNKTF-E--APYPFGVRKFIAECIKY--GTNSNVSSRTMGTVS 75 (98) T ss_pred CchHHhHHhhcccCCCcchHHHHHHHHHHHHHHHHHHhCCcc-c--ccCCccHHHHHHHHHhh--CCCCCccccccccee Confidence 456899865466677654 67778888888889999998765 3 56899999988888882 336678889999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCCcee Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSGGLQ 112 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~ 112 (154) +||... +-+.=.+-|++||+-+ | T Consensus 76 Yty~T~----iP~~i~~~L~PyRrlr--w 98 (98) T protein:vir:96 76 YTFVTD----LPKATYRHLKPFRRLR--W 98 (98) T ss_pred eechhh----hhHHHHHHhhhhhhcc--C Confidence 998642 1233446788988743 3 No 75 >protein:vir:96831 Length: 98 # NCBI annotation: ORF052 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240159;genbank:gi:66395852;genbank:GeneID:5133172 Probab=92.55 E-value=0.0029 Score=34.54 Aligned_cols=97 Identities=12% Similarity=0.169 Sum_probs=69.6 Q ss_pred CCHHHHHHHhcCCCCHH-HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceeeeccee Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGD-ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQMGPFN 83 (154) Q Consensus 5 ATvdDl~arlgr~L~~~-E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~etaG~fs 83 (154) -.++||+-.++.++++. .-+..+.+++.--..++.||+...- ++.|..|+--|+.+++ ...| ..+++++.|.-| T Consensus 1 Md~~dVK~l~~~~~~d~~~D~~~~~li~~y~e~aedyCN~~F~---~~lP~gVkkfvAe~ik-y~~~-~nissRsMgtVS 75 (98) T protein:vir:96 1 MDALDVKMLNGTRIDDVSNDDVINKLILAYKQVAEEYCNQVFG---DPLPGGVKKFIAECIK-YGVS-GNIASRSMGTVS 75 (98) T ss_pred CCHHHHHHHhCCCCCCcccHHHHHHHHHHHHHHHHHHhCCccc---ccCCccHHHHHHHHHh-hccc-CCccccccccee Confidence 46789998888888742 2233677777778888999987653 4689999998888888 4444 567888999999 Q ss_pred eEeecCCCcccCHHHHHHHHhhccCCcee Q lcl|NC_021296. 84 VQYSQPPDGFFYPAELAILKRFKRSGGLQ 112 (154) Q Consensus 84 ~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~ 112 (154) +||... +-+.=.+-|++||+-+ | T Consensus 76 Yty~T~----iP~~i~~~L~PyRrlr--w 98 (98) T protein:vir:96 76 YTYVTD----VPSSMYKYLKPYRKLR--W 98 (98) T ss_pred eechhh----hhHHHHHHhhhhhhcc--C Confidence 998642 1233446788988743 3 No 76 >protein:vir:79701 Length: 144 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285884;genbank:gi:148750841;genbank:GeneID:5220403 Probab=90.77 E-value=0.014 Score=30.80 Aligned_cols=111 Identities=11% Similarity=0.092 Sum_probs=60.1 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCC--------CCCCc-----ccchHHHHHHHHHHHHHH Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRA--------WPDAP-----ADVPDDVRAVVLQASRRE 67 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~--------~~d~~-----~~~p~~v~~Vv~~~vaR~ 67 (154) |.||=|-++.+..-+..+.+++ =+.+|..|+++|-..|+-- .-+.. ..++-.++.|=.++++-. T Consensus 1 ~~pYLTy~ef~~lg~~~~~~d~---F~kllk~A~~~ID~~T~y~~~~y~~~~i~~d~~~d~~~~~~~r~~~vKkA~a~QI 77 (144) T protein:vir:79 1 MKPYLTTSDFEKLGYELKKPDN---FGKLLKSATVLINQICSYYDPAFAYHDLEADSQADPDSYLFRQAMAFKKAVALEM 77 (144) T ss_pred CCcccchhhhhhhCCCCcchhh---hhhHHHHHHHHhhhhhhhhccccccccccccccccchhhhhHHHHHHHHHHHHHH Confidence 9999999999886565554443 6678899999998887531 11101 112233333322222222 Q ss_pred --Hh--------C--CCccceeeecceeeEeecCCCc------c-cCHHHHHHHHhhccC-Cceeec Q lcl|NC_021296. 68 --LK--------N--PDRVISRQMGPFNVQYSQPPDG------F-FYPAELAILKRFKRS-GGLQTV 114 (154) Q Consensus 68 --l~--------n--P~g~~~etaG~fs~s~~~sgg~------~-lt~aE~~~Lrr~r~~-~g~~sV 114 (154) |. + -....+.|+|..|+++.+++.. . +...=..-|.+.+.. +|.-|| T Consensus 78 eY~~~~G~~sa~e~~~~~~~S~svGrtsvs~~~~~~~s~t~~~~~v~~~a~~yL~~tGLLYrGV~s~ 144 (144) T protein:vir:79 78 LFLEDSGYSSAYDVAQGALNSFTVGHTSMSLNPSAGQNLTVGSTGVVKSAYDLLGRYGLLFSGVASL 144 (144) T ss_pred HHHHHcCCcchhhhhcCccceeEecceEEeecCCCccccccccccccHHHHHHHhhcCccccccccC Confidence 11 1 2223456899999887543221 2 222333446676653 455555 No 77 >protein:vir:6243 Length: 122 # NCBI annotation: gp37 # Family: family:all:11657 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813697;swissprot:trembl:q859c0;genbank:gi:29366757;uniprot:Q859C0;genbank:GeneID:1258898 Probab=90.69 E-value=0.0024 Score=34.95 Aligned_cols=110 Identities=20% Similarity=0.215 Sum_probs=61.8 Q ss_pred CcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHH----HHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh-----CCC Q lcl|NC_021296. 2 AGLASIQDLQTLMSQTFEGDELEQAQLVLDIVS----SWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK-----NPD 72 (154) Q Consensus 2 ~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS----~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~-----nP~ 72 (154) -+|||+++|.+.-|. +++ .-.-..+|.+|- ..+..|||..|-....|.|+.++--+-..++..+. -|+ T Consensus 1 mayatieelralegi--dda-slfpdellsdaidfsvetvevycgqkwdtaenptpevirwcvrtlarqyvldhvsripd 77 (122) T protein:vir:62 1 MAYATIEELRALEGI--DDA-SLFPDELLSDAIDFSVETVEVYCGQKWDTAENPTPEVIRWCVRTLARQYVLDHVSRIPD 77 (122) T ss_pred CccchhhhhHhhccc--ccc-ccchhhhhhhhhhhhhhhhhhhcCcccCCcCCCchHHHHHHHHHHHHHHHHHHhhhcch Confidence 468999999987553 332 112233444444 45788999999887788899887665555555542 366 Q ss_pred ccceeeecceeeEeecCCCccc---CHHHHHHHHhhccCCceeec Q lcl|NC_021296. 73 RVISRQMGPFNVQYSQPPDGFF---YPAELAILKRFKRSGGLQTV 114 (154) Q Consensus 73 g~~~etaG~fs~s~~~sgg~~l---t~aE~~~Lrr~r~~~g~~sV 114 (154) ...|-+.-=-|.+..+.||.|- +++--+.|+-||.+--..-+ T Consensus 78 ralqlqsefgsiqlaqaggtwrptslpevnaklnlyrvrlpfifm 122 (122) T protein:vir:62 78 RALQLQSEFGSIQLAQAGGTWRPTSLPEVNAKLNLYRVRLPFIFM 122 (122) T ss_pred hhhhhhhcccceeeeccCCccccCcCcccccceeeeEeecceeeC Confidence 5444221111456667777662 22222445555543211111 No 78 >protein:vir:100103 Length: 120 # NCBI annotation: gp7 # Family: family:all:363 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945037;genbank:gi:38707897;genbank:GeneID:2744150 Probab=87.34 E-value=0.014 Score=30.69 Aligned_cols=101 Identities=14% Similarity=0.109 Sum_probs=66.4 Q ss_pred CC---cCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCc----------------ccchHHHHHHHH Q lcl|NC_021296. 1 MA---GLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAP----------------ADVPDDVRAVVL 61 (154) Q Consensus 1 M~---~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~----------------~~~p~~v~~Vv~ 61 (154) |+ ++.|+++++.-|...-+ ++-...+.+|+-|.+.+..++|+.+.... ..+|..++.-++ T Consensus 1 ~~~~m~~vtL~e~K~hLRvd~d-~DD~lI~~~i~AA~~~v~~~~~r~l~~~~~~~~~~~~~~~~~~~~~~~~~~i~~AvL 79 (120) T protein:vir:10 1 MADQTPIVSLEVALAHLREDAG-VADDLIKIYIGAATQSASDYVDRKLYANDAEMQAAVADATAGADPIVANDAIRAAIL 79 (120) T ss_pred CCCCCCccCHHHHHHHcCCCCC-cchHHHHHHHHHHHHHHHHHhCCcccccccccchhhhccccccccccCCHHHHHHHH Confidence 66 67899999999987644 44556688999999999999998764321 125777888888 Q ss_pred HHHHHHHhCCCccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCce Q lcl|NC_021296. 62 QASRRELKNPDRVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGL 111 (154) Q Consensus 62 ~~vaR~l~nP~g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~ 111 (154) -.+.-...|......-+ ......-|.+ =...|.+||..-|. T Consensus 80 llvg~~YenRe~~~~~~---~~~~~~lP~~------v~~Ll~~yR~~~gv 120 (120) T protein:vir:10 80 LTIGKLYAFREDVVSGA---SASVTELPSG------AKSLLFPYRVGLGV 120 (120) T ss_pred HHHHHHHhchhhhhhcc---cccccccCHH------HHHHHHHhhhccCC Confidence 88888888877653311 1101111111 12347888875555 No 79 >protein:vir:102961 Length: 131 # NCBI annotation: hypothetical protein # Family: family:all:26777 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945287;genbank:gi:39653722;uniprot:Q708M5;genbank:GeneID:2672875 Probab=87.27 E-value=0.022 Score=29.63 Aligned_cols=97 Identities=19% Similarity=0.175 Sum_probs=63.1 Q ss_pred CHHHHHH---------HhcCCCCHH--HHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhC---- Q lcl|NC_021296. 6 SIQDLQT---------LMSQTFEGD--ELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKN---- 70 (154) Q Consensus 6 TvdDl~a---------rlgr~L~~~--E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~n---- 70 (154) -++.|+. .+|...++. +....+..|+++...|..||+ +++ +|+.+.-|++.||--.+.+ T Consensus 1 ~~~~lkq~~~~~~~~~~l~~~~d~~~kD~~vl~faie~v~~~IlnycN--ike----iP~~Le~v~~~maiDll~~e~~~ 74 (131) T protein:vir:10 1 MIQELKQDNTMYLISCVRKMRQDNYFKDMEVLHYALTQAENEILNYIH--QDS----VPGRLENVWIDMTNDLLDKVKEQ 74 (131) T ss_pred ChhhhhhhhhhhhhhhhhccccccccchHHHHHHHHHHHHHHHhhhcC--Ccc----cchhhHHHHHHHHHHHHhhhccc Confidence 4566655 445433321 333468899999999999996 444 5777888888888776642 Q ss_pred ---------CCc-cceeeecceeeEeecCCC---------cccCHHHHHHHHhhccCC Q lcl|NC_021296. 71 ---------PDR-VISRQMGPFNVQYSQPPD---------GFFYPAELAILKRFKRSG 109 (154) Q Consensus 71 ---------P~g-~~~etaG~fs~s~~~sgg---------~~lt~aE~~~Lrr~r~~~ 109 (154) +++ +.|=+-|+.|++|..+.. .|++ .=++.|++||+-- T Consensus 75 ~~k~~~i~~~~g~VsSI~eGDTsIsf~s~t~~~qrl~~~~s~l~-~Y~~qL~~yRRL~ 131 (131) T protein:vir:10 75 SVLAEKAGADDFSVKSIKMGDTTIEKVSPYEMIQRMKQVPSSLE-RYKRQLNRFRKLL 131 (131) T ss_pred ccccccccccccceeeeeecceeeeccCCccHHHHHHHHHHHHh-hhHHHHhhhcccC Confidence 223 334478999999853322 3444 3366889998744 No 80 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=87.22 E-value=0.037 Score=28.45 Aligned_cols=96 Identities=10% Similarity=0.174 Sum_probs=65.2 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCC---C-----CcccchHHHHHHHHHHHHHHHhCCC Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWP---D-----APADVPDDVRAVVLQASRRELKNPD 72 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~---d-----~~~~~p~~v~~Vv~~~vaR~l~nP~ 72 (154) |-.+.|+++++.-|...-+ +|-+..+.+|.-|++.|..|.+.... + .+.+.|..+|.-|+-.+.=...|.. T Consensus 1 mm~~vtLeevK~hLRId~d-~dD~li~~~i~aA~~~v~~~l~~~~~~~~~~~~~~~~~~~~~~i~~AvLlLv~~~YenRe 79 (108) T protein:vir:93 1 MTALLTLEEIKAHLRVDHD-ADDDMLMDKVRQATAVLLAYIQGSRDKVIREDGELIPGEALTRMKGAAMRLTGMLYRNPD 79 (108) T ss_pred CCcCCCHHHHHHHcCCCCC-cChHHHHHHHHHHHHHHHHHhccccccccccccccccccCChHHHHHHHHHHHHHHhccc Confidence 9999999999999876443 45667788999999999999864321 1 1234567788888889998999987 Q ss_pred ccceee----ecceeeEeecCCCcccCHHHHHHHHhhccCCce Q lcl|NC_021296. 73 RVISRQ----MGPFNVQYSQPPDGFFYPAELAILKRFKRSGGL 111 (154) Q Consensus 73 g~~~et----aG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~ 111 (154) ..+..+ --||++ ...|.+||...=+ T Consensus 80 ~~~~~~~~~~elP~~v--------------~~Ll~~~R~p~~~ 108 (108) T protein:vir:93 80 LAEREELLQGELPFSV--------------SVLIYDLRCPTVL 108 (108) T ss_pred cccccccccccCCHHH--------------HHHHHHccccccC Confidence 664321 123332 2235666654323 No 81 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=85.26 E-value=0.054 Score=27.53 Aligned_cols=115 Identities=23% Similarity=0.342 Sum_probs=66.6 Q ss_pred CCcCCCHHHHHHHh---cCCCCHHHHHHHHHHHHHHHHHHHHh---hC-C--------CCCCC---------cccchHHH Q lcl|NC_021296. 1 MAGLASIQDLQTLM---SQTFEGDELEQAQLVLDIVSSWARVV---SG-R--------AWPDA---------PADVPDDV 56 (154) Q Consensus 1 M~~~ATvdDl~arl---gr~L~~~E~~~A~~lL~~aS~lir~~---~~-~--------~~~d~---------~~~~p~~v 56 (154) -..|+|++|+.+.. +..+.+.+-..-+++|--|++.|-.. .| | .||.. ...+|..| T Consensus 15 AnSYvtv~~a~aY~~~rg~~~~a~~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRtg~~d~~~~~~~~IP~~v 94 (172) T protein:vir:97 15 ANAYISVEEFKTYHTDRGNSFAGSTDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEV 94 (172) T ss_pred ccccccHHHHHHHHHhcCcccCCCCcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccCCCCCCcccccccccHHH Confidence 57899999988754 34444333333455566799888642 22 1 14421 13468889 Q ss_pred HHHHHHHHHHHHhCCCcc-------------ceeeecceeeEeecCC----CcccCHHHHHHHHhhc--cCCceeecccc Q lcl|NC_021296. 57 RAVVLQASRRELKNPDRV-------------ISRQMGPFNVQYSQPP----DGFFYPAELAILKRFK--RSGGLQTVSTS 117 (154) Q Consensus 57 ~~Vv~~~vaR~l~nP~g~-------------~~etaG~fs~s~~~sg----g~~lt~aE~~~Lrr~r--~~~g~~sV~~~ 117 (154) +.=+|..+.++|.+|-.. ..+..|+=+..|...+ +..-..+=...|++++ +++|. .- T Consensus 95 ~~A~~elA~~al~~~l~~d~~~~~~~~~v~~kr~kvg~i~~~y~~~~~~~~~~p~~~~v~aLL~p~gl~~~~~~----~~ 170 (172) T protein:vir:97 95 KEACAEYALRALAAELNPDPERNASGVAVLSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSGGT----LL 170 (172) T ss_pred HHHHHHHHHHHHhcccccccccccccccceeeeeeecceeeEeeccCCCCCccccHHHHHHHHhhhccccCcce----ec Confidence 999999999999775321 1234577777764322 1222234456688865 33343 34 Q ss_pred cc Q lcl|NC_021296. 118 RG 119 (154) Q Consensus 118 r~ 119 (154) || T Consensus 171 r~ 172 (172) T protein:vir:97 171 RG 172 (172) T ss_pred cC Confidence 55 No 82 >protein:vir:79050 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:6416 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110727;genbank:gi:134287344;genbank:GeneID:4955224 Probab=85.15 E-value=0.016 Score=30.41 Aligned_cols=104 Identities=9% Similarity=0.106 Sum_probs=66.7 Q ss_pred CCc--CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh--C------ Q lcl|NC_021296. 1 MAG--LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK--N------ 70 (154) Q Consensus 1 M~~--~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~--n------ 70 (154) |.. +.++.+.-.-++-.++++.....+.+++.+...|..||+ + ..+|..+.-|+++||.-.+. | T Consensus 1 ~~~~i~e~i~~~Lk~~~~~~~~~d~~iL~fa~e~~~n~I~N~cN--i----~eiP~~L~~v~~~mai~~fl~~kk~~~~~ 74 (133) T protein:vir:79 1 MGNNIIDDIEKRLESFGYILKDGDKWLIDFVREKIENIIKLDCN--I----KTMPIELKEIEADMIVGEFLFTKKNMGQL 74 (133) T ss_pred CCchHHHHHHHHHHHhCCCCCccchHHHHHHHHHHHHHHhhhcC--h----hhcchhHHHHHHHHHHHHHHhcccccCCC Confidence 663 333444433446666666667778899999999999996 3 34788899999988877653 1 Q ss_pred C-Cc------cceeeecceeeEeecCCCc------------ccCHHHHHHHHhhccCCcee Q lcl|NC_021296. 71 P-DR------VISRQMGPFNVQYSQPPDG------------FFYPAELAILKRFKRSGGLQ 112 (154) Q Consensus 71 P-~g------~~~etaG~fs~s~~~sgg~------------~lt~aE~~~Lrr~r~~~g~~ 112 (154) | .+ +.|=+-|+.|++|....|. ||++..++.|.+||+-+ | T Consensus 75 ~l~~~D~~~~v~sIkeGDTsv~f~~~~~s~t~eq~l~s~i~~L~~~~k~~l~~yRkLr--W 133 (133) T protein:vir:79 75 DIESINFEAVEKSISEGDTKVDFAIGSGSQTPEQRFDSLIAYLTAYGKNKILTFRCLR--W 133 (133) T ss_pred CcccccchhhhhheecccceeecccCCCccchhHHHHHHHHHHhhcccchhhcccccc--C Confidence 3 22 2444779999998643332 44444455566666532 3 No 83 >protein:vir:3160 Length: 198 # NCBI annotation: unknown # Family: family:all:28414 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665931;genbank:gi:22091117;genbank:GeneID:951344 Probab=81.73 E-value=0.048 Score=27.81 Aligned_cols=111 Identities=13% Similarity=0.115 Sum_probs=47.0 Q ss_pred CCcCCC------HHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcc------------------------ Q lcl|NC_021296. 1 MAGLAS------IQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPA------------------------ 50 (154) Q Consensus 1 M~~~AT------vdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~------------------------ 50 (154) |.--.+ +=| +++|+.+ .+++-..++.||+-||+++..++|+.....+- T Consensus 1 ~~~~~~~~~~~~~~d-Kk~Lgv~-~d~~D~~le~LIa~ASa~~E~~~Gr~L~a~d~t~d~~r~~G~g~~~L~LPq~PV~s 78 (198) T protein:vir:31 1 MPLEPSDVESELPFD-AKAFGWS-EEKFKSELETYIAAATETVEKWINTTLEPETVTRDLSRPSHVDGHDLPMPSRPVQD 78 (198) T ss_pred CCCCcccccccCchh-hHhhccc-ccchhhHHHHHHHHHHHHHHHhhCcEeccccceeccCcccCCCcceeecCCCCcce Confidence 431111 012 6778865 34566789999999999999998863211000 Q ss_pred -------------------------------------------------------cchHHHHHHHHHHHHHHHh--CCCc Q lcl|NC_021296. 51 -------------------------------------------------------DVPDDVRAVVLQASRRELK--NPDR 73 (154) Q Consensus 51 -------------------------------------------------------~~p~~v~~Vv~~~vaR~l~--nP~g 73 (154) .+|.+|+.-|+..|+-.++ +-.| T Consensus 79 VsSV~iD~~~~~g~~v~~~dy~l~~~~~~~~~G~~r~~~p~~~rnV~V~y~AGye~VPeDikeAVI~lv~~~~~e~~~~G 158 (198) T protein:vir:31 79 VVSVTIDTDRAMGRDVDEDDYWVEETHLELKPGADRKSWPTDRRCITVEWEYGYEEVPESPKKAIIRLVRARLRAINAEG 158 (198) T ss_pred eEEEEEecCccccccccchhhhhhhhhhhhcccccccccccccceEEEEeecCccccchHHHHHHHHHHHHHHhhhhccc Confidence 0122222222222222221 2223 Q ss_pred cceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccccCCCceeecCCCCcCCccCCCCCcCCcc Q lcl|NC_021296. 74 VISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRPWAGKTAFIRYGDGLFPFCSEDDGYGDVV 152 (154) Q Consensus 74 ~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~~~~~~~~v~~gg~~~p~~~~~~g~~~~~ 152 (154) +.|.|.++.|+||.. - ...|+.-.+..-=|+---.||+|+ T Consensus 159 i~s~T~~gesvSy~~-~--------------------------------------~e~~~~~~~~~~~~~~~~~~~~~~ 198 (198) T protein:vir:31 159 ISSDTIMGDSISYDP-E--------------------------------------DEVVLAARKDVAGFEAPSYYGGVE 198 (198) T ss_pred ceeeeecCcceeecC-c--------------------------------------ccchhhhhhhhccccCcccccCCC Confidence 344444444444331 0 111111111111122223445455 No 84 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=79.82 E-value=0.1 Score=26.05 Aligned_cols=97 Identities=21% Similarity=0.237 Sum_probs=64.7 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCC----------------cccchHHHHHHHHHHH Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDA----------------PADVPDDVRAVVLQAS 64 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~----------------~~~~p~~v~~Vv~~~v 64 (154) |. +.|+++++.-|...-+ +|-...+.+|+-|++.+..++++.-.+. +.++|..++.-|+-.+ T Consensus 1 M~-~vtLee~K~hLRvd~d-~dD~lI~~li~AA~~~ve~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AvLllv 78 (113) T protein:vir:10 1 MA-LVELKLALGFVRANAG-VEDDVVQMLLDAATQSAVDYLNRQVFETEDAMTTAIEAGTAGQNPMVVNAAIRAAILKIT 78 (113) T ss_pred CC-CCCHHHHHHHcCCCCC-cchHHHHHHHHHHHHHHHHHhCccccccccccccccccccccccccccChHHHHHHHHHH Confidence 76 8999999999976544 4567778999999999999998753221 2236778888888888 Q ss_pred HHHHhCCCccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCce Q lcl|NC_021296. 65 RRELKNPDRVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGL 111 (154) Q Consensus 65 aR~l~nP~g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~ 111 (154) .-...|-...+. | +.+ .-|.+ =...|.+||.-.|. T Consensus 79 ~~~Y~nRe~~~~---~--~~~-~lP~~------v~~Ll~~yR~~~g~ 113 (113) T protein:vir:10 79 AELYANREDTAF---G--PIT-ELPLN------ARALLRPHRIIPGV 113 (113) T ss_pred HHHHhhhhhhch---h--hhh-ccCHH------HHHHHHHhhhhcCC Confidence 888887543222 1 111 01111 13347888876666 No 85 >protein:vir:97069 Length: 115 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453566;genbank:gi:84662601;genbank:GeneID:5142484 Probab=78.74 E-value=0.083 Score=26.50 Aligned_cols=97 Identities=12% Similarity=0.140 Sum_probs=64.2 Q ss_pred CCCHHHHHHHhcCCCCH--HHHHHHHHHHHHHHHHHHHhhCCCCCCCcc----------------cchHHHHHHHHHHHH Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEG--DELEQAQLVLDIVSSWARVVSGRAWPDAPA----------------DVPDDVRAVVLQASR 65 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~--~E~~~A~~lL~~aS~lir~~~~~~~~d~~~----------------~~p~~v~~Vv~~~va 65 (154) .-|+++++.-|....+. ++-...+.+|.-|++.+..++++.+.+... .+|+.||.-++-.+. T Consensus 1 mvtLee~K~hLRid~d~~d~DDali~~~i~AA~~~v~~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AiLllvg 80 (115) T protein:vir:97 1 MITLAMMQRHLQAELYEDDERDYVMQQLLPAARESAELFLNRKLYDVQADMLADQVLGVDPSDQLLITRTVEQAILLTVG 80 (115) T ss_pred CCCHHHHHHHcCCCCCCCchhhHHHHHHHHHHHHHHHHHhCCcccchhhcccccccccCCCcccccCCHHHHHHHHHHHH Confidence 88999999988765542 235577899999999999999987642211 157788888888888 Q ss_pred HHHhCCCccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCcee Q lcl|NC_021296. 66 RELKNPDRVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQ 112 (154) Q Consensus 66 R~l~nP~g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~ 112 (154) -...|-..++. | +.+ .-|.+ =...|.+||.-+|.- T Consensus 81 ~~Y~NRE~v~~---~--~~~-elP~~------~~~LL~pyR~~~Gv~ 115 (115) T protein:vir:97 81 EWYSSREQVWI---K--GAG-LVTSS------AQNLLHPYRKFAGVR 115 (115) T ss_pred HHHhccccccc---c--ccc-ccCHH------HHHHHHHHHhhcCCC Confidence 88888654332 2 111 01222 123478888766654 No 86 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=78.63 E-value=0.11 Score=25.79 Aligned_cols=104 Identities=13% Similarity=0.094 Sum_probs=67.3 Q ss_pred CC--cCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceee Q lcl|NC_021296. 1 MA--GLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQ 78 (154) Q Consensus 1 M~--~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~et 78 (154) |. .+.|+++++..|...- ++|-...+.+|.-|.+.|..++++.-.+.+..+|..+|.-|+-.+.-...|=..++. T Consensus 3 ~~~M~~vtLee~K~hLRid~-dddD~lI~~~i~AA~~~v~~~~~~~~~~~~~~~p~~ik~AiLllv~~~YenRE~~~~-- 79 (108) T protein:vir:19 3 IDVLDVISLSLFKQQIEFEE-DDRDELITLYAQAAFDYCMRWCDEPAWKVAADIPAAVKGAVLLVFADMFEHRTAQSE-- 79 (108) T ss_pred CCcccccCHHHHHHHcCCCC-CcchHHHHHHHHHHHHHHHHHhCCcccccccccchHHHHHHHHHHHHHHhccccccc-- Confidence 22 5699999999997654 345567789999999999999998754556678999998888888888877543211 Q ss_pred ecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccc Q lcl|NC_021296. 79 MGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGE 120 (154) Q Consensus 79 aG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~ 120 (154) ++.+ .++ .=+..|.+||.-.|.- .++.|. T Consensus 80 -~~~~----~~~------~~~~LL~pYR~~~g~~--~~~~~~ 108 (108) T protein:vir:19 80 -VQLY----ENA------AAERMMFIHRNWRGKA--ESEEGS 108 (108) T ss_pred -chhh----hhH------HHHHHHHHHHhcCCCC--CcccCC Confidence 1111 111 1234577777544430 111111 No 87 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=78.63 E-value=0.11 Score=25.79 Aligned_cols=104 Identities=13% Similarity=0.094 Sum_probs=67.3 Q ss_pred CC--cCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceee Q lcl|NC_021296. 1 MA--GLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQ 78 (154) Q Consensus 1 M~--~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~et 78 (154) |. .+.|+++++..|...- ++|-...+.+|.-|.+.|..++++.-.+.+..+|..+|.-|+-.+.-...|=..++. T Consensus 3 ~~~M~~vtLee~K~hLRid~-dddD~lI~~~i~AA~~~v~~~~~~~~~~~~~~~p~~ik~AiLllv~~~YenRE~~~~-- 79 (108) T protein:vir:18 3 IDVLDVISLSLFKQQIEFEE-DDRDELITLYAQAAFDYCMRWCDEPAWKVAADIPAAVKGAVLLVFADMFEHRTAQSE-- 79 (108) T ss_pred CCcccccCHHHHHHHcCCCC-CcchHHHHHHHHHHHHHHHHHhCCcccccccccchHHHHHHHHHHHHHHhccccccc-- Confidence 22 5699999999997654 345567789999999999999998754556678999998888888888877543211 Q ss_pred ecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccc Q lcl|NC_021296. 79 MGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGE 120 (154) Q Consensus 79 aG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~ 120 (154) ++.+ .++ .=+..|.+||.-.|.- .++.|. T Consensus 80 -~~~~----~~~------~~~~LL~pYR~~~g~~--~~~~~~ 108 (108) T protein:vir:18 80 -VQLY----ENA------AAERMMFIHRNWRGKA--ESEEGS 108 (108) T ss_pred -chhh----hhH------HHHHHHHHHHhcCCCC--CcccCC Confidence 1111 111 1234577777544430 111111 No 88 >protein:vir:94126 Length: 116 # NCBI annotation: ORF041 # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240236;genbank:gi:66395926;genbank:GeneID:5133295 Probab=78.23 E-value=0.086 Score=26.43 Aligned_cols=108 Identities=20% Similarity=0.273 Sum_probs=62.4 Q ss_pred CCcCCCHHHHHHHhcCCC---CHHH-HHHHHHHHHHHHHHHHHhhCCCCC-CCcccchHHHHHHHHHHHHHHHhCCC--- Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTF---EGDE-LEQAQLVLDIVSSWARVVSGRAWP-DAPADVPDDVRAVVLQASRRELKNPD--- 72 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L---~~~E-~~~A~~lL~~aS~lir~~~~~~~~-d~~~~~p~~v~~Vv~~~vaR~l~nP~--- 72 (154) |.- -+||+-.++..+ +.+| .+-...+...--+ ++.||+...- |...+.|..|+--|+.+++=.| .|. T Consensus 1 ~~~---~~DVk~ln~k~~~~~tsD~~d~~l~ev~~~l~~-A~dyCnn~F~~dg~~~lP~gVkkFVA~~iky~~-~p~t~~ 75 (116) T protein:vir:94 1 MTL---YEDVKLLLKKNGVEVKSDEEEIFKMEVDGILED-VRDITNNDFMKDGQVIYPYSIKKYVADVLEYYQ-RPEVKK 75 (116) T ss_pred Cch---HHHHHHHhcCCCCCcccchHHHHHHhhHHHHHH-HHHHhcCcccccCCccCcchhHHHHHHHHHhhc-cccccc Confidence 544 466665544333 3333 2222222222222 7788887753 3456789999998888877655 344 Q ss_pred ccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccc Q lcl|NC_021296. 73 RVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSR 118 (154) Q Consensus 73 g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r 118 (154) ++++.|+|.=|+||...-- +.-.+-|++||+-+ -......| T Consensus 76 nlssRSMGTVSYty~Te~P----~~~~~~L~PyRklr-w~~~~~~~ 116 (116) T protein:vir:94 76 NLKSRSMGTVSYTYNDGVP----DYISGVLNRYKRAK-FHPFKPIR 116 (116) T ss_pred Ccccccccceeeeccccch----HHHHHhhhhhhhcc-cCCCCCCC Confidence 7888999999999854321 23446688887643 11111222 No 89 >protein:vir:105899 Length: 116 # NCBI annotation: head completion protein # Family: family:all:789 # ACLAME annotation(s): phi:0000067 - phage portal associated protein # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004377;genbank:gi:122891832;genbank:GeneID:4712370 Probab=78.23 E-value=0.086 Score=26.43 Aligned_cols=108 Identities=20% Similarity=0.273 Sum_probs=62.4 Q ss_pred CCcCCCHHHHHHHhcCCC---CHHH-HHHHHHHHHHHHHHHHHhhCCCCC-CCcccchHHHHHHHHHHHHHHHhCCC--- Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTF---EGDE-LEQAQLVLDIVSSWARVVSGRAWP-DAPADVPDDVRAVVLQASRRELKNPD--- 72 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L---~~~E-~~~A~~lL~~aS~lir~~~~~~~~-d~~~~~p~~v~~Vv~~~vaR~l~nP~--- 72 (154) |.- -+||+-.++..+ +.+| .+-...+...--+ ++.||+...- |...+.|..|+--|+.+++=.| .|. T Consensus 1 ~~~---~~DVk~ln~k~~~~~tsD~~d~~l~ev~~~l~~-A~dyCnn~F~~dg~~~lP~gVkkFVA~~iky~~-~p~t~~ 75 (116) T protein:vir:10 1 MTL---YEDVKLLLKKNGVEVKSDEEEIFKMEVDGILED-VRDITNNDFMKDGQVIYPYSIKKYVADVLEYYQ-RPEVKK 75 (116) T ss_pred Cch---HHHHHHHhcCCCCCcccchHHHHHHhhHHHHHH-HHHHhcCcccccCCccCcchhHHHHHHHHHhhc-cccccc Confidence 544 466665544333 3333 2222222222222 7788887753 3456789999998888877655 344 Q ss_pred ccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccc Q lcl|NC_021296. 73 RVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSR 118 (154) Q Consensus 73 g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r 118 (154) ++++.|+|.=|+||...-- +.-.+-|++||+-+ -......| T Consensus 76 nlssRSMGTVSYty~Te~P----~~~~~~L~PyRklr-w~~~~~~~ 116 (116) T protein:vir:10 76 NLKSRSMGTVSYTYNDGVP----DYISGVLNRYKRAK-FHPFKPIR 116 (116) T ss_pred Ccccccccceeeeccccch----HHHHHhhhhhhhcc-cCCCCCCC Confidence 7888999999999854321 23446688887643 11111222 No 90 >protein:vir:81069 Length: 115 # NCBI annotation: p10 # Family: family:all:363 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285680;genbank:gi:148727188;genbank:GeneID:5247114 Probab=68.47 E-value=0.22 Score=24.18 Aligned_cols=97 Identities=12% Similarity=0.155 Sum_probs=64.5 Q ss_pred CCCHHHHHHHhcCCCCH--HHHHHHHHHHHHHHHHHHHhhCCCCCCCcc----------------cchHHHHHHHHHHHH Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEG--DELEQAQLVLDIVSSWARVVSGRAWPDAPA----------------DVPDDVRAVVLQASR 65 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~--~E~~~A~~lL~~aS~lir~~~~~~~~d~~~----------------~~p~~v~~Vv~~~va 65 (154) .-|+++++.-|....+. ++-...+.+|.-|...+..++++....... .+|+.||.-|+-.+. T Consensus 1 ivtLee~K~HlRid~dd~deDD~li~~~i~AA~~~v~~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AiLllvg 80 (115) T protein:vir:81 1 MITLAMVQRHLQAELYEDDERDYVMQQLLPAARESAELFINRKLYDTQADMLADQAAGVDPAGQLLITRTVEQAILLTLG 80 (115) T ss_pred CCCHHHHHHHcCCCCCCCccchHHHHHHHHHHHHHHHHHhCCccccccccccccccccCCCCcccccCHHHHHHHHHHHH Confidence 78999999998776543 346677899999999999999987532211 157778888888888 Q ss_pred HHHhCCCccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCcee Q lcl|NC_021296. 66 RELKNPDRVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQ 112 (154) Q Consensus 66 R~l~nP~g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~ 112 (154) -...|-..++..+ .+ .-|.+ =+..|++||.-.|.- T Consensus 81 ~~Y~NRE~v~~~~-----~~-elP~~------~~~LL~pyR~~~g~~ 115 (115) T protein:vir:81 81 EWYSSREQVWTKG-----AG-LVTSS------AQNLLHPYRKFAGVR 115 (115) T ss_pred HHHhccchhcchh-----hh-hcCHH------HHHHHHHHHhhcCCC Confidence 8888855433211 11 11222 133478888766553 No 91 >protein:vir:10365 Length: 115 # NCBI annotation: conserved phage protein # Family: family:all:363 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858957;genbank:gi:32128422;genbank:GeneID:2648387 Probab=62.49 E-value=0.33 Score=23.20 Aligned_cols=97 Identities=13% Similarity=0.172 Sum_probs=62.8 Q ss_pred CCCHHHHHHHhcCCC--CHHHHHHHHHHHHHHHHHHHHhhCCCCCCCc----------------ccchHHHHHHHHHHHH Q lcl|NC_021296. 4 LASIQDLQTLMSQTF--EGDELEQAQLVLDIVSSWARVVSGRAWPDAP----------------ADVPDDVRAVVLQASR 65 (154) Q Consensus 4 ~ATvdDl~arlgr~L--~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~----------------~~~p~~v~~Vv~~~va 65 (154) .-|+++++.-|.... +++|-...+.+|+-|++.+..++++...... ..+|+.++.=++-.+. T Consensus 1 mvtLe~~K~hLRid~~d~d~dD~li~~~i~AA~~~v~~~~~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AiLLlvg 80 (115) T protein:vir:10 1 MITLAMVQRHLQAELYEDDERDYVMQQLLPAARESAELFINRKLYDTQADMLADQAAGVDPAGQLLITRTVEQAILLTVG 80 (115) T ss_pred CCCHHHHHHHcCCCCCCCchhhHHHHHHHHHHHHHHHHHhCCcccccccccccccccccCCcccccCChHHHHHHHHHHH Confidence 789999999997754 3456778899999999999999987643211 1256677777777777 Q ss_pred HHHhCCCccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCcee Q lcl|NC_021296. 66 RELKNPDRVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQ 112 (154) Q Consensus 66 R~l~nP~g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~ 112 (154) -...|-...+. | +.+ .-|.+. ...|++||+-+|.- T Consensus 81 ~~Y~nRe~~~~---~--~~~-elP~~v------~~LL~pyR~~~gv~ 115 (115) T protein:vir:10 81 EWYANREQVWV---K--GVG-LVTSSA------QNLLHPYRKFAGVR 115 (115) T ss_pred HHHhcchhccc---c--hhh-hcCHHH------HHHHHHHHhcCCCC Confidence 77777543322 1 111 112221 23477777666543 No 92 >protein:vir:486 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543093;swissprot:trembl:q8w626;genbank:gi:18249905;uniprot:Q8W626;genbank:GeneID:929692 Probab=55.33 E-value=0.48 Score=22.32 Aligned_cols=96 Identities=11% Similarity=0.132 Sum_probs=60.7 Q ss_pred CCcCCCHHHHHHHhcCCCC-HHHHHHHHHHHHHHHHHHHHhhCCCCCCCcc----------cchHHHHHHHHHHHHHHHh Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFE-GDELEQAQLVLDIVSSWARVVSGRAWPDAPA----------DVPDDVRAVVLQASRRELK 69 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~-~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~----------~~p~~v~~Vv~~~vaR~l~ 69 (154) |- .|.++++.-|...-+ .+|-...+.+|.-|++.|..++|+.+.+... .+|+.+|.-|+-.+.-... T Consensus 1 M~--vtL~e~K~hLRid~D~~ddD~li~~~i~aA~~~i~~~~~r~l~~~~~~~~~~~~~~~~~~~~ik~Avlllv~~~Y~ 78 (107) T protein:vir:48 1 ML--LKEEEIKSHLRLDDGLYSDGDFLKLLAQAVQKRTETYLNRKLYAPEETIPEDDPDGMHLTDDVRLAMLMLVSHFYE 78 (107) T ss_pred CC--CCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHHhccccccccccccccCccccccchhHHHHHHHHHHHHHh Confidence 54 899999999876433 2466778899999999999999987543321 1467788888888888888 Q ss_pred CCCccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeec Q lcl|NC_021296. 70 NPDRVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTV 114 (154) Q Consensus 70 nP~g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV 114 (154) |....+..+. ..-|.+ . ...|.+||. |++ T Consensus 79 NRe~v~~~~~------~~iP~~---v---~~LL~~yR~----~~l 107 (107) T protein:vir:48 79 NRSTITDVEK------LETPMS---F---RWLAGPYRI----VPL 107 (107) T ss_pred hhhhhccccc------cccCHH---H---HHHHHHhhc----cCC Confidence 8765433211 011222 1 122455542 111 No 93 >protein:vir:5256 Length: 119 # NCBI annotation: hypothetical protein # Family: family:all:664 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852761;genbank:gi:31544036;uniprot:Q7Y5T9;genbank:GeneID:2753553 Probab=51.64 E-value=0.58 Score=21.90 Aligned_cols=99 Identities=21% Similarity=0.321 Sum_probs=54.1 Q ss_pred CCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHh-----------CCC Q lcl|NC_021296. 4 LASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELK-----------NPD 72 (154) Q Consensus 4 ~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~-----------nP~ 72 (154) +.|+++..++. =++.+---++.+..|++|...+- ..+|-+.- + +.+-+ .++-.|. .+. T Consensus 1 m~t~~~Fr~~~-PeF~~~pd~~i~~~l~~A~~~l~---~~~~g~~~----~--~~~~L-~~AH~l~l~~~~~~~~g~~~g 69 (119) T protein:vir:52 1 MPLTEDFLLRY-TEFGKTDAKRIGLFLSDAQAEVS---KVQWGKLY----D--RGVMA-LTAHLLKLSADAEISGGAANR 69 (119) T ss_pred CCcHHHHHHhh-hhccCCCHHHHHHHHHHHHHhhC---CcCCchHH----H--HHHHH-HHHHHHHhhhhhhcccccccc Confidence 67888888875 23332223577888888877774 23554321 1 12222 2222221 133 Q ss_pred ccceeeecceeeEeecCC-----Cccc--CH--HHHHHHHhhccCCceee Q lcl|NC_021296. 73 RVISRQMGPFNVQYSQPP-----DGFF--YP--AELAILKRFKRSGGLQT 113 (154) Q Consensus 73 g~~~etaG~fs~s~~~sg-----g~~l--t~--aE~~~Lrr~r~~~g~~s 113 (154) .++|.+.|..|+||..+. ..|| |. .|-..|+|.-+.+|+.. T Consensus 70 ~v~S~s~G~vSvS~~~~~~~~~~~~w~~~T~YG~~y~~L~r~~g~Gg~Va 119 (119) T protein:vir:52 70 NLASESAGELSVSYTAPISANGSDDFYQLTAYGQEYLRLRRLIGVGVMVA 119 (119) T ss_pred ceeeeeecceeeeeeccccCCcchhhhhcCHHHHHHHHHHHHhcCCCcCC Confidence 457789999999985332 2332 22 35555665555566655 No 94 >protein:vir:4512 Length: 107 # NCBI annotation: unknown # Family: family:all:363 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599038;genbank:gi:19548996;genbank:GeneID:935218 Probab=51.48 E-value=0.58 Score=21.88 Aligned_cols=96 Identities=15% Similarity=0.134 Sum_probs=59.2 Q ss_pred cCCCHHHHHHHhcCCCC-HHHHHHHHHHHHHHHHHHHHhhCCCCCCCc----------ccchHHHHHHHHHHHHHHHhCC Q lcl|NC_021296. 3 GLASIQDLQTLMSQTFE-GDELEQAQLVLDIVSSWARVVSGRAWPDAP----------ADVPDDVRAVVLQASRRELKNP 71 (154) Q Consensus 3 ~~ATvdDl~arlgr~L~-~~E~~~A~~lL~~aS~lir~~~~~~~~d~~----------~~~p~~v~~Vv~~~vaR~l~nP 71 (154) =+.|+++++.-|...-+ .+|-...+.+|.-|.+.|..++|+...+.. ..+|..++.-++-.+.-...|- T Consensus 1 M~vtL~e~K~hLRId~D~~ddD~lI~~~i~AA~~~i~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~AvLllv~~~Y~NR 80 (107) T protein:vir:45 1 MLLKMEEIKLQLRLDDDFSDEDELLELLGKAAQSRTENFLNRKLYATADDRPADDPDGLVISDDVKLALLLLVSHFYENR 80 (107) T ss_pred CCCCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHHhccccccccccccccccccccCChhHHHHHHHHHHHHHhhh Confidence 35799999998876433 256677889999999999999998653221 1146778888888888888886 Q ss_pred CccceeeecceeeEeecCCCcccCHHHHHHHHhhccCCceeec Q lcl|NC_021296. 72 DRVISRQMGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTV 114 (154) Q Consensus 72 ~g~~~etaG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV 114 (154) ...+..+. + .-|.+. ++ .|.+||.- +| T Consensus 81 e~~~~~~~--~----~lp~~v---~~---Ll~~~R~~----~~ 107 (107) T protein:vir:45 81 STVTDVEK--M----ELPMSF---NW---LVAPYRLI----PL 107 (107) T ss_pred hhccccch--h----ccchHH---HH---HHHHHhhc----CC Confidence 54321111 0 112221 12 24444431 11 No 95 >protein:vir:4788 Length: 130 # NCBI annotation: hypothetical protein # Family: family:all:1399 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150168;swissprot:trembl:q94m43;genbank:gi:15088779;uniprot:Q94M43;genbank:GeneID:955972 Probab=36.20 E-value=1.2 Score=20.18 Aligned_cols=111 Identities=12% Similarity=0.069 Sum_probs=57.6 Q ss_pred CcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCC---CCcccchHHHHHHHHHHHHHHH-hCCC----- Q lcl|NC_021296. 2 AGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWP---DAPADVPDDVRAVVLQASRREL-KNPD----- 72 (154) Q Consensus 2 ~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~---d~~~~~p~~v~~Vv~~~vaR~l-~nP~----- 72 (154) -+|.|.++.....+. +. +--+.||..|+++|-.+|+...- +.+.+.+..++.|=.++++-.. .+-. T Consensus 1 M~YlT~eey~el~~~--~~---~~F~kl~k~A~~~ID~~t~~~y~~~~~~~~~~~~r~~~vK~A~a~QieY~~~~G~~s~ 75 (130) T protein:vir:47 1 MTYLTQEEFDELDFD--EV---TDFEKLAKRAKIAIDLYTNGIYQKDIDFEKEIAYRKSAVKLAMAFQIAYLDASGIMSA 75 (130) T ss_pred CCCCchhhHhhcCCC--Ch---hhHHHHHHHHHHHHHHHhcccccccCCccCcchHHHHHHHHHHHHHHHHHHHhccccc Confidence 478999999986554 22 23899999999999999875432 1122333334333333332221 1211 Q ss_pred ----ccceeeecceeeEeecCC-----CcccCHHH-HHHHHhhccCCceeeccccccc Q lcl|NC_021296. 73 ----RVISRQMGPFNVQYSQPP-----DGFFYPAE-LAILKRFKRSGGLQTVSTSRGE 120 (154) Q Consensus 73 ----g~~~etaG~fs~s~~~sg-----g~~lt~aE-~~~Lrr~r~~~g~~sV~~~r~~ 120 (154) ...+-|.|-.|+++...+ .++....+ ..-|.+.+. ++|+=..+ +- T Consensus 76 ~~~~~~~S~svGrtSis~~~~~~~~~~~~~~vs~da~~~L~~tGL--~Ly~GV~y-d~ 130 (130) T protein:vir:47 76 DDKQLANSVSIGRTSISYSTSQSTLAGQRFNLSMDAENALRQAGF--SLVVGVAY-DR 130 (130) T ss_pred hhccCcceeeecceeeecCcCccccccCCccccHHHHHHHHhccc--ccccCCCc-cC Confidence 223447888888875321 13333222 223555554 23332222 11 No 96 >protein:vir:1384 Length: 92 # NCBI annotation: Gp7 protein # Family: family:all:316 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612836;genbank:gi:20065970;genbank:GeneID:935785 Probab=33.69 E-value=1.3 Score=19.89 Aligned_cols=89 Identities=12% Similarity=0.096 Sum_probs=57.4 Q ss_pred CCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCccceee---ecc Q lcl|NC_021296. 5 ASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISRQ---MGP 81 (154) Q Consensus 5 ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~et---aG~ 81 (154) -|+++++.-|...-+ +|-...+.+|+-|...|+.++++.. ..+..++..++-.+.-...|-...+..+ .-| T Consensus 1 vtLeevK~~LRID~d-dDD~lI~~~i~aA~~~i~~~~~~~~-----~~~~~~~~Avlllv~~~YenR~~~~~~~~~~~ip 74 (92) T protein:vir:13 1 MDLRELKEYLRIDFE-EDDILLRSLLLAAEEYLYNAGIKRD-----YKKSLYSLAIKILVKHWYDNRDCVVAGNVNNKLE 74 (92) T ss_pred CCHHHHHHHcCCCCC-cchHHHHHHHHHHHHHHHhhccccc-----cchhHHHHHHHHHHHHhHhccccccccchhhhhh Confidence 999999999976533 4556779999999999999887532 3466788888999999998876554322 123 Q ss_pred eeeEeecCCCcccCHHHHHHHHhhccCCc Q lcl|NC_021296. 82 FNVQYSQPPDGFFYPAELAILKRFKRSGG 110 (154) Q Consensus 82 fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g 110 (154) |+++ +-+..||-+.--.| T Consensus 75 ~~v~-----------sll~~lR~~~~~~~ 92 (92) T protein:vir:13 75 YSLN-----------AILTQLRYCGDDNG 92 (92) T ss_pred HHHH-----------HHHHHhhhccCCCC Confidence 3331 11122222221112 No 97 >protein:vir:107614 Length: 96 # NCBI annotation: conserved phage protein # Family: family:all:316 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338189;genbank:gi:77020184;genbank:GeneID:3703745 Probab=28.95 E-value=1.4 Score=19.82 Aligned_cols=94 Identities=10% Similarity=0.139 Sum_probs=55.8 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCcccee--e Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISR--Q 78 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~e--t 78 (154) |- .|+++++..|...-+ |-...+.+|+-|...|...+|+.+.+ .+..++..|+-.|+-...|=...+.. . T Consensus 1 M~--vtLee~K~~LRID~D--dD~lI~~~i~aA~~~i~~~~g~~~~e----~~~~~k~Avl~lv~~~YenR~~~~~~~~~ 72 (96) T protein:vir:10 1 ML--VTLEEAKEWIRVDGD--DDPTITMLIKAAELYIYKATGKTFTQ----TNEDAKLLCLFLVADWYGNRLLVGEKASE 72 (96) T ss_pred Cc--CCHHHHHHHcCCCCc--hhHHHHHHHHHHHHHHHHhhCCCCCC----CcchHHHHHHHHHHHHHhhhhhccccccc Confidence 54 899999999987654 33478899999999999999876543 34568888888888888774332211 1 Q ss_pred ecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccc Q lcl|NC_021296. 79 MGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRP 124 (154) Q Consensus 79 aG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~ 124 (154) --||+++ .+|..||..+ .+.. |. + T Consensus 73 ~ip~~v~--------------sli~qLr~~~--~~~~----e~--~ 96 (96) T protein:vir:10 73 KIRTIVQ--------------SMILQLQYAS--EPQE----ER--K 96 (96) T ss_pred hhhHHHH--------------HHHHHHhhcC--Cccc----cc--C Confidence 1233332 1122222211 0000 00 0 No 98 >protein:vir:102083 Length: 96 # NCBI annotation: DNA packaging protein # Family: family:all:316 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512316;genbank:gi:89152485;genbank:GeneID:3953076 Probab=28.95 E-value=1.4 Score=19.82 Aligned_cols=94 Identities=10% Similarity=0.139 Sum_probs=55.8 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCcccee--e Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISR--Q 78 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~e--t 78 (154) |- .|+++++..|...-+ |-...+.+|+-|...|...+|+.+.+ .+..++..|+-.|+-...|=...+.. . T Consensus 1 M~--vtLee~K~~LRID~D--dD~lI~~~i~aA~~~i~~~~g~~~~e----~~~~~k~Avl~lv~~~YenR~~~~~~~~~ 72 (96) T protein:vir:10 1 ML--VTLEEAKEWIRVDGD--DDPTITMLIKAAELYIYKATGKTFTQ----TNEDAKLLCLFLVADWYGNRLLVGEKASE 72 (96) T ss_pred Cc--CCHHHHHHHcCCCCc--hhHHHHHHHHHHHHHHHHhhCCCCCC----CcchHHHHHHHHHHHHHhhhhhccccccc Confidence 54 899999999987654 33478899999999999999876543 34568888888888888774332211 1 Q ss_pred ecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccc Q lcl|NC_021296. 79 MGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRP 124 (154) Q Consensus 79 aG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~ 124 (154) --||+++ .+|..||..+ .+.. |. + T Consensus 73 ~ip~~v~--------------sli~qLr~~~--~~~~----e~--~ 96 (96) T protein:vir:10 73 KIRTIVQ--------------SMILQLQYAS--EPQE----ER--K 96 (96) T ss_pred hhhHHHH--------------HHHHHHhhcC--Cccc----cc--C Confidence 1233332 1122222211 0000 00 0 No 99 >protein:vir:102863 Length: 96 # NCBI annotation: conserved phage protein # Family: family:all:316 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338138;genbank:gi:77020236;genbank:GeneID:3703772 Probab=28.95 E-value=1.4 Score=19.82 Aligned_cols=94 Identities=10% Similarity=0.139 Sum_probs=55.8 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCcccee--e Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISR--Q 78 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~e--t 78 (154) |- .|+++++..|...-+ |-...+.+|+-|...|...+|+.+.+ .+..++..|+-.|+-...|=...+.. . T Consensus 1 M~--vtLee~K~~LRID~D--dD~lI~~~i~aA~~~i~~~~g~~~~e----~~~~~k~Avl~lv~~~YenR~~~~~~~~~ 72 (96) T protein:vir:10 1 ML--VTLEEAKEWIRVDGD--DDPTITMLIKAAELYIYKATGKTFTQ----TNEDAKLLCLFLVADWYGNRLLVGEKASE 72 (96) T ss_pred Cc--CCHHHHHHHcCCCCc--hhHHHHHHHHHHHHHHHHhhCCCCCC----CcchHHHHHHHHHHHHHhhhhhccccccc Confidence 54 899999999987654 33478899999999999999876543 34568888888888888774332211 1 Q ss_pred ecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccc Q lcl|NC_021296. 79 MGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRP 124 (154) Q Consensus 79 aG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~ 124 (154) --||+++ .+|..||..+ .+.. |. + T Consensus 73 ~ip~~v~--------------sli~qLr~~~--~~~~----e~--~ 96 (96) T protein:vir:10 73 KIRTIVQ--------------SMILQLQYAS--EPQE----ER--K 96 (96) T ss_pred hhhHHHH--------------HHHHHHhhcC--Cccc----cc--C Confidence 1233332 1122222211 0000 00 0 No 100 >protein:vir:105005 Length: 96 # NCBI annotation: putative DNA packaging protein phage # Family: family:all:316 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459970;genbank:gi:85701385;genbank:GeneID:3882146 Probab=28.95 E-value=1.4 Score=19.82 Aligned_cols=94 Identities=10% Similarity=0.139 Sum_probs=55.8 Q ss_pred CCcCCCHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHhhCCCCCCCcccchHHHHHHHHHHHHHHHhCCCcccee--e Q lcl|NC_021296. 1 MAGLASIQDLQTLMSQTFEGDELEQAQLVLDIVSSWARVVSGRAWPDAPADVPDDVRAVVLQASRRELKNPDRVISR--Q 78 (154) Q Consensus 1 M~~~ATvdDl~arlgr~L~~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~~~p~~v~~Vv~~~vaR~l~nP~g~~~e--t 78 (154) |- .|+++++..|...-+ |-...+.+|+-|...|...+|+.+.+ .+..++..|+-.|+-...|=...+.. . T Consensus 1 M~--vtLee~K~~LRID~D--dD~lI~~~i~aA~~~i~~~~g~~~~e----~~~~~k~Avl~lv~~~YenR~~~~~~~~~ 72 (96) T protein:vir:10 1 ML--VTLEEAKEWIRVDGD--DDPTITMLIKAAELYIYKATGKTFTQ----TNEDAKLLCLFLVADWYGNRLLVGEKASE 72 (96) T ss_pred Cc--CCHHHHHHHcCCCCc--hhHHHHHHHHHHHHHHHHhhCCCCCC----CcchHHHHHHHHHHHHHhhhhhccccccc Confidence 54 899999999987654 33478899999999999999876543 34568888888888888774332211 1 Q ss_pred ecceeeEeecCCCcccCHHHHHHHHhhccCCceeeccccccccccc Q lcl|NC_021296. 79 MGPFNVQYSQPPDGFFYPAELAILKRFKRSGGLQTVSTSRGEEGRP 124 (154) Q Consensus 79 aG~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g~~sV~~~r~~~~~~ 124 (154) --||+++ .+|..||..+ .+.. |. + T Consensus 73 ~ip~~v~--------------sli~qLr~~~--~~~~----e~--~ 96 (96) T protein:vir:10 73 KIRTIVQ--------------SMILQLQYAS--EPQE----ER--K 96 (96) T ss_pred hhhHHHH--------------HHHHHHhhcC--Cccc----cc--C Confidence 1233332 1122222211 0000 00 0 No 101 >protein:vir:4458 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700381;genbank:gi:23505453;genbank:GeneID:955660 Probab=24.02 E-value=2.2 Score=18.67 Aligned_cols=94 Identities=14% Similarity=0.096 Sum_probs=59.9 Q ss_pred cCCCHHHHHHHhcCCCC-HHHHHHHHHHHHHHHHHHHHhhCCCCCCCcc----------cchHHHHHHHHHHHHHHHhCC Q lcl|NC_021296. 3 GLASIQDLQTLMSQTFE-GDELEQAQLVLDIVSSWARVVSGRAWPDAPA----------DVPDDVRAVVLQASRRELKNP 71 (154) Q Consensus 3 ~~ATvdDl~arlgr~L~-~~E~~~A~~lL~~aS~lir~~~~~~~~d~~~----------~~p~~v~~Vv~~~vaR~l~nP 71 (154) =+.|+++++.-|...-+ .+|-...+.+|+-|...|..++|+.-.+... .+|+.++.-++-.+.-...|. T Consensus 1 M~vtLee~K~hLRId~D~~dDD~lI~~~i~AA~~~i~~~~~r~l~~~~~~~~~~~~~~~~~~~~~~~AiLllv~~~Y~NR 80 (107) T protein:vir:44 1 MLLSVEEIKAQLRLDEDFEADERYLQLLARAVQKRTETYLNRKLYAPDETIPDSDPDGLLLQDDIRLGMLMLISHFYENR 80 (107) T ss_pred CCCCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHhhcCccccccccccccccccccchhhHHHHHHHHHHHHHhhh Confidence 36799999999876433 2456778899999999999999976432221 146678887888888888887 Q ss_pred Cccceeee--cceeeEeecCCCcccCHHHHHHHHhhccCCc Q lcl|NC_021296. 72 DRVISRQM--GPFNVQYSQPPDGFFYPAELAILKRFKRSGG 110 (154) Q Consensus 72 ~g~~~eta--G~fs~s~~~sgg~~lt~aE~~~Lrr~r~~~g 110 (154) ...+..+. -||++ ...|.+||.-=+ T Consensus 81 e~~~~~~~~~lP~~v--------------~~Ll~~yR~~p~ 107 (107) T protein:vir:44 81 SSVTEVEKLDMPQSF--------------GWLVGPYRYFPQ 107 (107) T ss_pred hhhccccccccCHHH--------------HHHHHHhhhcCC Confidence 65432111 12222 122455543222 Done!