Query lcl|NC_011270.1_cdsid_YP_002224375.1 [gene=126] [protein=gp126] [protein_id=YP_002224375.1] [location=68374..70119] Match_columns 581 No_of_seqs 304 out of 494 Neff 8.0 Searched_HMMs 1612 Date Thu Nov 7 14:36:49 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_121 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_121_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:107310 Length: 581 100.0 6E-156 4E-159 871.5 55.7 581 1-581 1-581 (581) 2 protein:vir:7653 Length: 581 # 100.0 5E-155 3E-158 866.3 55.9 581 1-581 1-581 (581) 3 protein:vir:96586 Length: 587 100.0 9.6E-71 6E-74 404.4 37.5 533 1-571 1-587 (587) 4 protein:vir:99306 Length: 587 100.0 5.7E-67 3.5E-70 383.8 43.8 523 1-571 1-587 (587) 5 protein:vir:80779 Length: 569 100.0 6.8E-68 4.2E-71 388.8 36.6 508 1-571 1-569 (569) 6 protein:vir:80488 Length: 562 100.0 6.2E-67 3.8E-70 383.6 41.5 515 1-571 1-562 (562) 7 protein:vir:95741 Length: 587 100.0 2.1E-66 1.3E-69 380.7 40.5 525 1-571 1-587 (587) 8 protein:vir:63742 Length: 562 100.0 2.7E-66 1.7E-69 380.0 40.3 500 1-571 1-562 (562) 9 protein:vir:100829 Length: 607 100.0 1.6E-64 9.9E-68 370.3 39.0 522 1-580 7-607 (607) 10 protein:vir:105470 Length: 451 100.0 2E-59 1.2E-62 342.4 30.9 428 116-565 1-451 (451) 11 protein:vir:102957 Length: 437 100.0 2.1E-58 1.3E-61 336.8 32.7 418 116-565 1-437 (437) 12 protein:vir:102819 Length: 648 100.0 1.4E-55 8.8E-59 321.3 36.4 535 1-569 1-648 (648) 13 protein:vir:78986 Length: 436 100.0 8.5E-56 5.3E-59 322.5 33.8 415 101-565 1-436 (436) 14 protein:vir:102359 Length: 356 100.0 3.7E-50 2.3E-53 291.6 31.0 322 172-564 1-356 (356) 15 protein:vir:98824 Length: 774 100.0 4.3E-46 2.7E-49 269.3 38.4 546 1-573 138-774 (774) 16 protein:vir:108052 Length: 660 100.0 4.6E-42 2.9E-45 247.2 35.3 549 1-581 28-652 (660) 17 protein:vir:79798 Length: 717 100.0 8E-42 4.9E-45 245.9 34.9 499 1-566 132-717 (717) 18 protein:vir:106984 Length: 743 100.0 2.2E-41 1.4E-44 243.5 36.5 540 1-578 132-743 (743) 19 protein:vir:104477 Length: 749 100.0 1E-40 6.4E-44 239.8 38.6 544 1-580 133-749 (749) 20 protein:vir:106427 Length: 679 100.0 4.5E-41 2.8E-44 241.8 36.0 551 1-581 28-670 (679) 21 protein:vir:98263 Length: 664 100.0 1.2E-40 7.3E-44 239.5 37.8 541 1-581 1-664 (664) 22 protein:vir:7206 Length: 659 # 100.0 2.2E-40 1.3E-43 238.0 37.2 488 1-580 81-659 (659) 23 protein:vir:100539 Length: 663 100.0 3.1E-40 1.9E-43 237.2 37.8 516 1-581 81-662 (663) 24 protein:vir:101187 Length: 663 100.0 5.2E-40 3.2E-43 235.9 38.1 544 1-581 28-653 (663) 25 protein:vir:103456 Length: 659 100.0 5.4E-40 3.4E-43 235.8 38.0 549 6-580 1-659 (659) 26 protein:vir:104858 Length: 729 100.0 8E-40 5E-43 234.9 37.7 554 1-575 86-729 (729) 27 protein:vir:5833 Length: 742 # 100.0 1.3E-39 8.2E-43 233.7 38.2 529 1-572 102-742 (742) 28 protein:vir:6894 Length: 660 # 100.0 1.1E-39 6.9E-43 234.1 35.4 557 1-577 1-660 (660) 29 protein:vir:101804 Length: 663 100.0 1.2E-38 7.5E-42 228.5 40.6 557 6-581 1-662 (663) 30 protein:vir:5711 Length: 396 # 100.0 3.3E-41 2.1E-44 242.5 26.3 372 175-581 1-389 (396) 31 protein:vir:80984 Length: 666 100.0 1.2E-39 7.3E-43 234.0 33.9 524 1-581 67-656 (666) 32 protein:vir:1845 Length: 392 # 100.0 1.6E-41 9.9E-45 244.3 23.6 361 175-577 1-392 (392) 33 protein:vir:98553 Length: 395 100.0 1.3E-40 8.1E-44 239.2 26.8 373 139-573 1-395 (395) 34 protein:vir:6594 Length: 666 # 100.0 2.2E-39 1.3E-42 232.6 33.4 541 1-581 28-656 (666) 35 protein:vir:79141 Length: 391 100.0 6.4E-41 4E-44 241.0 24.9 360 168-574 1-391 (391) 36 protein:vir:79181 Length: 390 100.0 2.6E-41 1.6E-44 243.1 22.6 363 168-577 1-390 (390) 37 protein:vir:78206 Length: 390 100.0 8.8E-41 5.5E-44 240.2 25.5 357 168-573 1-390 (390) 38 protein:vir:103993 Length: 390 100.0 8.8E-41 5.5E-44 240.2 25.5 357 168-573 1-390 (390) 39 protein:vir:6079 Length: 396 # 100.0 1E-40 6.4E-44 239.8 25.6 370 139-581 1-389 (396) 40 protein:vir:79092 Length: 477 100.0 3.4E-39 2.1E-42 231.5 31.4 418 132-578 1-477 (477) 41 protein:vir:5663 Length: 671 # 100.0 2.5E-38 1.5E-41 226.7 36.1 535 1-577 81-671 (671) 42 protein:vir:1996 Length: 495 # 100.0 6.3E-39 3.9E-42 230.0 31.5 430 103-566 1-495 (495) 43 protein:vir:96740 Length: 388 100.0 7.1E-40 4.4E-43 235.2 25.5 361 168-575 1-388 (388) 44 protein:vir:1172 Length: 391 # 100.0 3.8E-40 2.3E-43 236.7 23.1 362 168-577 1-391 (391) 45 protein:vir:4517 Length: 498 # 100.0 2.1E-40 1.3E-43 238.2 21.1 425 85-569 1-498 (498) 46 protein:vir:4463 Length: 498 # 100.0 1.1E-40 6.8E-44 239.7 19.2 430 85-569 1-498 (498) 47 protein:vir:489 Length: 498 # 100.0 8.6E-40 5.3E-43 234.8 23.6 425 85-569 1-498 (498) 48 protein:vir:2035 Length: 396 # 100.0 3.5E-39 2.2E-42 231.4 23.1 379 139-578 1-396 (396) 49 protein:vir:100323 Length: 393 100.0 4.3E-38 2.7E-41 225.4 27.0 362 168-574 1-393 (393) 50 protein:vir:107865 Length: 477 100.0 1.7E-37 1.1E-40 222.1 28.5 422 103-578 1-477 (477) 51 protein:vir:10336 Length: 386 100.0 4.5E-38 2.8E-41 225.3 24.4 358 168-581 1-385 (386) 52 protein:vir:2344 Length: 397 # 99.7 5.7E-19 3.6E-22 120.6 11.2 148 1-155 218-397 (397) 53 protein:vir:103168 Length: 641 99.5 1.8E-14 1.1E-17 95.9 25.7 435 1-468 99-641 (641) 54 protein:vir:101326 Length: 529 99.5 4.6E-13 2.9E-16 88.2 29.6 466 72-566 1-529 (529) 55 protein:vir:3788 Length: 376 # 99.3 2.3E-11 1.4E-14 78.9 28.5 332 185-570 1-376 (376) 56 protein:vir:3751 Length: 376 # 99.3 7.9E-11 4.9E-14 76.0 29.4 332 185-570 1-376 (376) 57 protein:vir:276 Length: 369 # 99.3 5.5E-11 3.4E-14 76.8 27.9 337 178-569 1-369 (369) 58 protein:vir:78782 Length: 370 99.1 4.2E-10 2.6E-13 72.0 25.3 337 188-577 1-370 (370) 59 protein:vir:95263 Length: 450 99.0 7.3E-09 4.5E-12 65.2 26.6 373 168-567 1-450 (450) 60 protein:vir:80052 Length: 331 98.9 1.3E-09 8E-13 69.3 21.5 308 171-566 1-331 (331) 61 protein:vir:5260 Length: 502 # 98.9 1.5E-08 9.2E-12 63.5 25.4 420 101-566 1-502 (502) 62 protein:vir:3165 Length: 426 # 97.8 1.3E-05 8.4E-09 47.3 17.9 380 168-566 1-426 (426) 63 protein:vir:99586 Length: 507 96.6 0.00049 3.1E-07 38.7 26.2 373 167-565 1-507 (507) 64 protein:vir:96104 Length: 504 96.4 0.00067 4.1E-07 38.0 23.9 388 167-580 1-504 (504) 65 protein:vir:106730 Length: 501 94.7 0.0037 2.3E-06 33.9 28.4 390 1-581 53-500 (501) 66 protein:vir:94073 Length: 494 92.8 0.01 6.2E-06 31.5 28.4 401 129-581 1-494 (494) 67 protein:vir:107720 Length: 515 92.1 0.013 8.1E-06 30.9 29.9 411 1-565 1-515 (515) 68 protein:vir:78611 Length: 501 89.2 0.028 1.7E-05 29.1 33.2 389 1-581 53-500 (501) 69 protein:vir:101576 Length: 501 81.0 0.089 5.5E-05 26.3 28.6 366 168-581 1-500 (501) 70 protein:vir:3636 Length: 501 # 71.5 0.2 0.00012 24.5 30.1 404 103-581 1-500 (501) No 1 >protein:vir:107310 Length: 581 # NCBI annotation: gp123 # Family: family:all:1196 # MgeID: mge:1550 # MgeName: Catera # Cross-refs: genbank:acc:YP_656131;genbank:gi:109393333;genbank:GeneID:4156791 Probab=100.00 E-value=6.2e-156 Score=871.48 Aligned_cols=581 Identities=100% Similarity=1.431 Sum_probs=563.1 Q ss_pred CeeccccccCCCcccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEEEEEEeccccceeEEEEeC Q lcl|NC_011270. 1 MAIDFSQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALVGEPTGGSFKLSLA 80 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~~~~~~GtF~l~~~ 80 (581) |+|+++.+++++.++.++++|..+.++..+..+....|.+||.+++|+.|.||+.++.+|+|+|++.+.+++|+|+|+|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~i~g~~~g~~g~~~s~~~~p~~~~~~e~q~v~~~~~~t~GtFtLsf~ 80 (581) T protein:vir:10 1 MAIDFSQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALVGEPTGGSFKLSLA 80 (581) T ss_pred CeeeeccccccchhhhhccccccceeeeeccccccccccccccccccccccCCCCCCccceEEEEEEecCCCceEEEEeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeccccccCCCHHHHHHHHHhcCCCCcceEEEEcCCCceEEEEecCCccccccccceeccCCCceEEEEEcccccceee Q lcl|NC_011270. 81 GEPTGNIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMN 160 (581) Q Consensus 81 g~~T~~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~g~~w~Vtf~g~~~~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~ 160 (581) |++|++|||||||++||+|||+|++|+.++|+|+++.|..|+|||.|+++.|.++...|++|.++.|+|.+..+|.++++ T Consensus 81 G~tT~~I~~~asa~~v~~AL~~L~~i~~~~v~v~g~~g~~~~VtF~g~~~~l~~~~~~lt~g~~~~vtV~~~~~g~~~~~ 160 (581) T protein:vir:10 81 GEPTGNIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMN 160 (581) T ss_pred ceecccccccCCHHHHHHHHhccCCCCcceEEEECCCCceEEEEEcCCccceeeeeceecCCCceeEEEeccccCccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999888 Q ss_pred eccccccccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccc Q lcl|NC_011270. 161 RALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHE 240 (581) Q Consensus 161 ~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e 240 (581) ...+..++.....++.+...+.+.++++|..........+++.+...|+.++...+.++..+++++.+++|+|.||..++ T Consensus 161 ~~~s~~gi~~~~~~l~~~~~~~~~~~gsd~~~~~~~~~~~~~~~~~~D~~t~~~~~~g~~~~~~~v~~~~~~~~d~~~~~ 240 (581) T protein:vir:10 161 RALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHE 240 (581) T ss_pred ccccccccccccccccccccCcceeccccceeeecccCccccccccccceeeeeeecccccccceEEEEEEEeecCCcce Confidence 88877788889999999988999999999999888888889999999999999899999899999999999999999999 Q ss_pred eeEeccCcchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCC Q lcl|NC_011270. 241 VIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGT 320 (581) Q Consensus 241 ~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t 320 (581) ++.|+++++..++++++.+.+|+..+++++.++++++|+++..+++++++++++++++||.+||++||+++++.|++|++ T Consensus 241 ~v~~~~~~~~~~~~~~~~~~~g~~~~~~t~~~~~~~tn~~~~~l~~gvd~~g~tvt~~dy~~Al~ale~~~~~~ivv~~t 320 (581) T protein:vir:10 241 VIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGT 320 (581) T ss_pred eEEeecCcchhhhhhhhhhccCccccchhhhheeeeecccceeEEeeccCCCCccchHHHHHHHHHHhcCCceEEEEeCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHH Q lcl|NC_011270. 321 GAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMA 400 (581) Q Consensus 321 ~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~A 400 (581) +++++|+++++||++|+++++++|+++|+++.......+.++++++.+||+|+++++|+++.+++..+.....+|+|++| T Consensus 321 ~~~~v~a~l~ahv~~~s~~~~~~ravigV~g~~~~~~~~~~~~~a~~~n~~Rvvlv~p~~~~~~g~~~~~~v~lp~y~~A 400 (581) T protein:vir:10 321 GAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMA 400 (581) T ss_pred CCHHHHHHHHHHHHHHHhccCCcEEEEEecCCCCCccHHHHHHhhccCCCceEEEEecCceeecCcccCceeccchhhHH Confidence 99999999999999999999999999999888888888889999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccchhcccccccccCcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCCCcccceEEeehhhH Q lcl|NC_011270. 401 AAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQD 480 (581) Q Consensus 401 a~vAgl~a~~~~~~slt~~~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~td~~~~~i~v~R~~d 480 (581) |++||++++.++++||||++|+|+.++..+|+++|+++|+++|+++|++.+++++||+|||||+.++++|++|++||++| T Consensus 401 A~vAGl~a~~~~~~slT~~~i~gi~~l~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~~~i~~iR~~D 480 (581) T protein:vir:10 401 AAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQD 480 (581) T ss_pred HHHHHHhhccccccCcccccccccccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEeeeecCCCCCcceeeeeehhhh Confidence 99999999999999999999999999999999999999999999999998999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEEeecCCCEEEEEEEEEecCceeEEEE Q lcl|NC_011270. 481 VMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQIERQPDVIEVRYEWRPAYPLNYIVV 560 (581) Q Consensus 481 ~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~ 560 (581) |+.++||+.+++++||||||++++|++||++|++||++||++|+|++|++.+++++++++|+++|+|+++|++||||||| T Consensus 481 ~v~~~ir~~~~~~~fIG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~~~~~~~~~~~~~~~d~v~V~i~v~Pv~~i~~I~v 560 (581) T protein:vir:10 481 VMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQIERQPDVIEVRYEWRPAYPLNYIVV 560 (581) T ss_pred HHHHHHHHHhhhhcCCCcccCHHHHHHHHHHHHHHHHHHHhcCcccCCccceeeeeecCCCEEEEEEEEEecccceEEEE Confidence 99999999998889999999999999999999999999999999999999899999999999999999999999999999 Q ss_pred EEEEEeccceEEEEEeecccC Q lcl|NC_011270. 561 RYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 561 ~~~~~~~tg~~~~~~~~~~~~ 581 (581) |+||.||||+||++||||||| T Consensus 561 ti~~~p~~~~~~~~~~~~~~~ 581 (581) T protein:vir:10 561 RYSIAPETGDITSTIEGTTSF 581 (581) T ss_pred EEEEecCCCceEEEEeccccC Confidence 999999999999999999999 No 2 >protein:vir:7653 Length: 581 # NCBI annotation: gp124 # Family: family:all:1196 # MgeID: mge:148 # MgeName: Bxz1 # Cross-refs: genbank:acc:NP_818197;genbank:gi:29566631;genbank:GeneID:1259809 Probab=100.00 E-value=5.5e-155 Score=866.29 Aligned_cols=581 Identities=100% Similarity=1.430 Sum_probs=561.7 Q ss_pred CeeccccccCCCcccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEEEEEEeccccceeEEEEeC Q lcl|NC_011270. 1 MAIDFSQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALVGEPTGGSFKLSLA 80 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~~~~~~GtF~l~~~ 80 (581) |+|||++++.+++|+++++++..+.+...|....++.|..||.+++|+.|.||..++++|+|+|+|.+++++|+|+|+|+ T Consensus 1 ~~~~~~~~~~~~~~t~~~~~~~~g~~~~~~~~~~i~g~~~g~~g~~~s~r~~p~~~~~~evq~v~~~~~~t~G~ftLt~~ 80 (581) T protein:vir:76 1 MAIDFSQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALVGEPTGGSFKLSLA 80 (581) T ss_pred CcccccccccchhhhhhccccccCcceeeeeeeeecccccccccccceeeecCCCCCCCceEEEEEeecCCcceEEEEeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeccccccCCCHHHHHHHHHhcCCCCcceEEEEcCCCceEEEEecCCccccccccceeccCCCceEEEEEcccccceee Q lcl|NC_011270. 81 GEPTGNIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMN 160 (581) Q Consensus 81 g~~T~~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~g~~w~Vtf~g~~~~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~ 160 (581) |++|++||||||+.+||+|||+|++|+.++|+|+++.+..|+|+|.|+++.|..+...|++|.++.++|.+.+.|.++.+ T Consensus 81 g~tT~~I~~~asa~~v~~AL~~L~~i~~~~v~vtg~~~~~~~V~F~g~~~~~~~~~~~ltg~~~~~~~V~~~~~G~~~~~ 160 (581) T protein:vir:76 81 GEPTGNIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDNPDLNIASEQTGVPAMN 160 (581) T ss_pred ceeccccccCCCHHHHHHHHhhccCCCCceEEEEcCCCceEEEEEcCCccceeEeeeeeecCCcceeEEEEEecCcCCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999998888 Q ss_pred eccccccccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccc Q lcl|NC_011270. 161 RALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHE 240 (581) Q Consensus 161 ~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e 240 (581) .......++...+.+.....+....+++++.+.+...+.+++.+..+++.++.+.+++++.++.++.+++|++.||..++ T Consensus 161 ~~l~~~g~~~~~~~~~~s~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~g~~~~~~~i~~~~~~~~D~~~~~ 240 (581) T protein:vir:76 161 RALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHE 240 (581) T ss_pred ceeeeccccccccceeecCCcceeeecccccceeeccCcccceeeeeeeeeeEeecccccccceeEEEEEEEeecCCccc Confidence 77666777777788888888889999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEeccCcchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCC Q lcl|NC_011270. 241 VIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGT 320 (581) Q Consensus 241 ~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t 320 (581) ++.|+++++..++++++.+.+|+..+++++.+.++++|++...+++++++++++++++||++||++||+++++.|++|++ T Consensus 241 ~v~~~~~~~~~~~~~~~~~~~g~~~~e~~~~~~~~~t~~~~~~l~~gvd~~g~tvt~~dy~~aL~ale~~~~~~ivvp~t 320 (581) T protein:vir:76 241 VIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGT 320 (581) T ss_pred eEEEecccccccceeeehhhcCccccchhhhhheeeccccceEEEeeecCCCCccchHHHHHHHHHHhcCCeEEEEEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHH Q lcl|NC_011270. 321 GAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMA 400 (581) Q Consensus 321 ~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~A 400 (581) ++++||+++++||++|+++++++|+++|+++.......+.+++++..+||+|+++++|+++.+++..+.....+|+|++| T Consensus 321 ~~~~i~a~l~ahv~~~s~~~~~~ra~igv~g~~~~~~~~~~~~~a~~~ns~Rvvlv~p~~~~~~g~~~~~~~~lp~~~~A 400 (581) T protein:vir:76 321 GAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMA 400 (581) T ss_pred CChHHHHHHHHHHHHHHhccCCceEEEEeeCCCCCchHHHHHHhhcccCCCcEEEEEcCceEeccccCCcceecchhhhh Confidence 99999999999999999999999999999888888888888999999999999999999999999989999999999999 Q ss_pred HHHHHHhhccchhcccccccccCcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCCCcccceEEeehhhH Q lcl|NC_011270. 401 AAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQD 480 (581) Q Consensus 401 a~vAgl~a~~~~~~slt~~~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~td~~~~~i~v~R~~d 480 (581) |++||++++.++++||||++|+|+.++..+|+++|+++|+++|+++|++.+++++||+|||||+.++++|++|++||++| T Consensus 401 A~vAG~~a~~~~~~slT~~~i~g~~~~~~~~s~~e~e~ll~~Gv~~l~~~~~~~v~Iv~gItT~~s~~~~k~i~viR~~D 480 (581) T protein:vir:76 401 AAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQD 480 (581) T ss_pred hhHHhhhhccccccCcccccccccccccccCCHHHHHHHHhCCeEEEEEecCCeEEEEEeeecCCCCCccceeeehhhhH Confidence 99999999999999999999999999999999999999999999999998999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEEeecCCCEEEEEEEEEecCceeEEEE Q lcl|NC_011270. 481 VMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQIERQPDVIEVRYEWRPAYPLNYIVV 560 (581) Q Consensus 481 ~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~ 560 (581) |++++||+.+++++||||||++++|++||++|++||++||++|+|++|++.++++.++++|+++|+|+++|++||||||| T Consensus 481 ~v~~~vr~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~g~I~g~~~~~~~~~~~~~d~v~V~i~v~Pv~~ie~I~v 560 (581) T protein:vir:76 481 VMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQIERQPDVIEVRYEWRPAYPLNYIVV 560 (581) T ss_pred HHHHHHHHHHhhhcCCCcccChHHHHHHHHHHHHHHHHHHhcCcccCcccceeeEEecCCCEEEEEEEEEecccceEEEE Confidence 99999999998889999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEeccceEEEEEeecccC Q lcl|NC_011270. 561 RYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 561 ~~~~~~~tg~~~~~~~~~~~~ 581 (581) |+||.|+||+||++||||||| T Consensus 561 t~~~~p~~~~~~~~~~~~~~~ 581 (581) T protein:vir:76 561 RYSIAPETGDITSTIEGTTSF 581 (581) T ss_pred EEEEeeCCCceEEEEeccccC Confidence 999999999999999999999 No 3 >protein:vir:96586 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238553;genbank:gi:66391266;genbank:GeneID:5130368 Probab=100.00 E-value=9.6e-71 Score=404.44 Aligned_cols=533 Identities=18% Similarity=0.166 Sum_probs=308.0 Q ss_pred Ceecc---ccccCCCcccccCcccccccccccCceeeEEEecCCCCc-e----------eeeeE-----------EcCcC Q lcl|NC_011270. 1 MAIDF---SQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQT-Y----------RESIR-----------INPDT 55 (581) Q Consensus 1 ~~~~~---~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~-~----------~~~~~-----------~~~~~ 55 (581) |||++ ++|+-|++|.+...+...+.++...+.+.....+-+-.. . ...++ ..|.. T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~~g~~~~~~~~~~~~~~~~~~g~G~l~~ai~~a~~~~~ 80 (587) T protein:vir:96 1 MAKDIFPRRPIQRPHASIEVDSSGIGGSASNSEKILCLIGKAEGGEPNTVYQVRNYAQAKSVFRSGELLDAIELAWGSNP 80 (587) T ss_pred CeeeeeCCCcccCCceEEEEecCCccCCCCCCCceEEEEEEecCCCCceeEEEcChHHHHHhhcCCcHHHHHHHHhccCc Confidence 99999 999999999987655555555555555555554433111 0 00000 01110 Q ss_pred C-ceeeEEEEEEeccccceeEEEEeCceeccccccCCCHHHHHHHHHh--cCCCCcceEEEEcC--------CCceEEEE Q lcl|NC_011270. 56 G-ETITTQILALVGEPTGGSFKLSLAGEPTGNIPFNATQGQVQSALRA--LPNVEDDEVTVLGD--------PGGPWTVT 124 (581) Q Consensus 56 ~-~~~evq~v~~~~~~~~GtF~l~~~g~~T~~i~~~asa~~v~~aLe~--l~~i~~~~V~~~~~--------~g~~w~Vt 124 (581) . .--++--+.+ ..+..++ |+..|.+...-.+.+.+.+++.+|+. .++.....+..... .|....|+ T Consensus 81 ~~g~~~~~a~rv-~~~~~a~--~~~~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~~~~~~~~~~~~~~n~G~v~~i~ 157 (587) T protein:vir:96 81 QYTAGKILAMRV-EDAKASQ--LEKGGLRVTSKIFGSVSNDIQVALEKNTITDSLRLRVVFQKDNYQEVFDNLGNIFSIN 157 (587) T ss_pred CCCceEEEEEec-CCCccce--eecccccccccccCCCCceEEEEEEeccCCCccceEEEEecCCceeeccccCceEEEE Confidence 0 0112222222 2333333 33444544555566667777777763 34443333322211 12222333 Q ss_pred ecCCcc--ccccccceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEeccc-eeEEeeccccc Q lcl|NC_011270. 125 FTKAVA--ALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDY-VVTRVNAGEDG 201 (581) Q Consensus 125 f~g~~~--~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~-~v~~v~~~~dg 201 (581) +.|+-. .+++.+..++. ....+.+....... ....|.+.....+..+-.+- .+...++..-+ T Consensus 158 y~g~~~~a~~~~~~~~~~~-~A~~l~l~gg~~~v--------------~~yrl~~g~~~~~~~~~~~~~~~~~~tAky~g 222 (587) T protein:vir:96 158 YKGEGEKATFSVEKDKETQ-EAKRLVLKVDEKEV--------------KAYELNGGAYSFTNEIITDINELPDFEAKLSP 222 (587) T ss_pred ecccccceeEeeccCcccc-eeeeeEEEecCceE--------------EEEEeCCCchhhhhhhhhhhccccceEEEeec Confidence 332211 11111111100 00000000000000 00011110000000000000 00000000000 Q ss_pred ccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhh--------hccccccceee-- Q lcl|NC_011270. 202 EANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDE--------AGNVQSEITLC-- 271 (581) Q Consensus 202 ~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~--------~g~~~~~i~~~-- 271 (581) ..+..+ +++. .+ . ....+.....+ |..+.........+.+...++....... .....+.+... T Consensus 223 ~~~n~~---~v~v-~d-~-~~~~~~k~~~~-y~~t~~~di~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~ 295 (587) T protein:vir:96 223 FGDKNL---ESRK-LD-E-ATDVDIKGKAV-YVKAVFGDIENQTQYNQYVKFEQLPEQASEPSDVEVHAETESATVTATS 295 (587) T ss_pred ccCcee---EEEe-ec-c-ccccccceEEE-eehhhhhhhhhhhccccceeeccccchhhhhhcccccccccceeeeecc Confidence 000000 0000 00 0 00000000110 1111111111111111111110000000 00000001100 Q ss_pred eeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCC Q lcl|NC_011270. 272 AQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDG 351 (581) Q Consensus 272 ~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~ 351 (581) ....+.+.....++++.++ ....+|+++|++|+.++++ +++++++++++|+++++||++|+++++++++++|... T Consensus 296 ~~~~~~~~~~~aLtGG~dG----~~~~~y~~~l~ale~~~~~-~i~~~t~d~ai~~~l~a~vk~~r~~gk~~~aVlg~~~ 370 (587) T protein:vir:96 296 KPKAIEPFELTKLSGGTNG----EPPTSWSAKLEKFKNEGGY-YIVPLTDRQSVHSEVATFVKNRSDAGEPMRAIVGGGT 370 (587) T ss_pred cccccccccceeeecCCCC----CCcccHHHHHHHHhhCCcE-EEEecCCCHHHHHHHHHHHHHHHhCCCeEEEEecCCC Confidence 1111222222334433222 2346899999999998765 5677888999999999999999999999999998765 Q ss_pred CCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccccCcccccccC Q lcl|NC_011270. 352 SVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQ 431 (581) Q Consensus 352 ~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~ 431 (581) . ...++.+++++.+|++|++++++++....+ ......+|+|++|||+||++|++++++||||+++++ .++..+| T Consensus 371 ~---~~~~~~~~~a~~~n~e~vi~v~~~~~~~~~--~~~~~~~~~~~~aa~vAG~~Ag~~~~~S~T~~~~~~-~~v~~~~ 444 (587) T protein:vir:96 371 S---ETKEKLFGRQAILNNPRVALVANSGKFVMG--NGRILQAPAYMVASAVAGLVSGLDIGESITFKPLFV-NSLDKVY 444 (587) T ss_pred C---CCHHHHHHHHhhcCCCcEEEEecceEEecC--CCceeeechhhHHHHHHHHHhcCccccCccceeeec-ccccccC Confidence 4 456677889999999999999998876554 234567999999999999999999999999999996 6899999 Q ss_pred CHHHHHHHHhCCcEEEEEeCCCeEE---EEEeeeccCC--CcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHH Q lcl|NC_011270. 432 RDGEKSRESSEGLMVIEKTPRNLVH---VRHGVTTDPT--SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIV 506 (581) Q Consensus 432 t~~e~~~l~~~Gv~~l~~~~~~~v~---i~~~itT~~t--d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~ 506 (581) +++|+++|+++|+++|++.+++.++ +++++||++. +..|++|+++|++|+|.++||+.++ +.|||||||+++|. T Consensus 445 t~~e~~~~i~~G~~~l~~~~~~~~~v~~~vnsitT~t~~~~~~~~~i~virv~D~i~~di~~~~~-~~yiGk~nn~~~r~ 523 (587) T protein:vir:96 445 ESEELDELNENGIITIEFVRNRMTTMFRIVDDVTTFPDKNDPVKSEMALGEANDFLVSELKILLE-EQYIGTRTINTSAS 523 (587) T ss_pred CHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhHHHHHHHHHHHHHHHH-hcCCccccCHHHHH Confidence 9999999999999999987776554 4566777654 5679999999999999999999885 67999999999999 Q ss_pred HHHHHHHHHHHHHHhCCceeCCccceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceE Q lcl|NC_011270. 507 QVKASAEAALVWLVDNNIIRGYRNLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDI 571 (581) Q Consensus 507 ~ik~~i~~~L~~l~~~gaI~~~~~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~ 571 (581) +||++|++||++|+++|+|++|+++++ +...++|+++|++.++|++||||||+|++|+++|=+- T Consensus 524 ~v~~~i~~~L~~l~~~g~I~~~~~~dv-~v~~~~D~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:96 524 QIKDFVQSYLGRKKRDNEIQDFPPEDV-QVIIEGNEARISLTIFPIRALKKISVSLVYRQQTLQA 587 (587) T ss_pred HHHHHHHHHHHHHHhCCcccCCCccce-EEEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 999999999999999999999987655 4556789999999999999999999999999998544 No 4 >protein:vir:99306 Length: 587 # NCBI annotation: putative major tail sheath protein # Family: family:all:2449 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024479;genbank:gi:48696438;genbank:GeneID:2948034 Probab=100.00 E-value=5.7e-67 Score=383.77 Aligned_cols=523 Identities=16% Similarity=0.125 Sum_probs=287.9 Q ss_pred Ceecc---ccccCCCcccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEEEEEEec--------c Q lcl|NC_011270. 1 MAIDF---SQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALVG--------E 69 (581) Q Consensus 1 ~~~~~---~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~~--------~ 69 (581) |||+. +++.-|++|.+...+--.+.+....+.+....... .|+..++++|+-.. + T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~--------------~G~~~~~~~~~~~~~~~~~~~~g 66 (587) T protein:vir:99 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAE--------------GGEPNTVYELRNYSQAKRLFRSG 66 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEec--------------CCccceeEEeccHHHHHHHhcCc Confidence 99987 78888999887542212122222222222222221 12222333332111 0 Q ss_pred ccceeEEEEe------Cceecccccc-CCCHHHHHHHHHhcCCCCcceEEEE--cCCCceEEEEec-CCccc---ccc-- Q lcl|NC_011270. 70 PTGGSFKLSL------AGEPTGNIPF-NATQGQVQSALRALPNVEDDEVTVL--GDPGGPWTVTFT-KAVAA---LTK-- 134 (581) Q Consensus 70 ~~~GtF~l~~------~g~~T~~i~~-~asa~~v~~aLe~l~~i~~~~V~~~--~~~g~~w~Vtf~-g~~~~---l~~-- 134 (581) .+-.--.+.| ++..--.+.. +++++. -.++.+.++-. |.+|+..+|.+. +..+. +.+ T Consensus 67 ~l~~~~~~a~~~~~~~g~~~~~~~rv~~~~~a~--------~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~ 138 (587) T protein:vir:99 67 ELLDAIELAWGSNPNYTAGRILAMRIEDAKPAS--------AEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIF 138 (587) T ss_pred chHHHHHHHhccccCCCceEEEEEEcCCCceeE--------EEecCeEEEEeeccccccceEEEEccCCCCcceeEEEEE Confidence 0000000011 0100000011 111110 01122222211 223333344332 11110 000 Q ss_pred -c-cceec---cCCCceEEEEEcccccceee-------eccccccccc-----eeeeeccccc---------------cc Q lcl|NC_011270. 135 -D-VTGLT---GGDDPDLNIASEQTGVPAMN-------RALAKKGIKT-----DTIRVVNPNS---------------GQ 182 (581) Q Consensus 135 -~-~~~l~---~g~~~~v~v~~~~~g~~~~~-------~~~~~~~~~~-----~~~~l~~~~~---------------~~ 182 (581) + ...-+ -|.-+.+........ +.+. ..+....++. ....|.+... .+ T Consensus 139 ~~~~~~~~~~~~g~v~~i~y~g~~~~-a~~~v~~~~~t~~a~~~~l~~g~~~v~~yrL~~g~~~~~~~~~~~i~~~~~~t 217 (587) T protein:vir:99 139 QDDRFNEVYDNIGNIFTIKYKGEEAN-ATFSVEHDEETQKASRLVLKVGDQEVKSYDLTGGAYDYTNAIITDINQLPDFE 217 (587) T ss_pred ecccceeeeeeccceeeEEeeccccc-ceeeEeecCcceeeeeeeeecCCceeEEEEecCCchHHHHHHHhhhcccccee Confidence 0 00000 000000100000000 0000 0000000000 0000000000 00 Q ss_pred eeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhc Q lcl|NC_011270. 183 VYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAG 262 (581) Q Consensus 183 ~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g 262 (581) +...+........... ++ -....+.+....++... .+. ..+........+++............+. .+ T Consensus 218 Aky~~~~~~~i~~~~~-~~--~~~~~v~~~~~~v~a~~---~D~----~~~~~~~~~~~~~~~~g~~~~~~~~~~~--~~ 285 (587) T protein:vir:99 218 AKLSPFGDKNLESSKL-DK--IENANIKDKAVYVKAVF---GDL----EKQTAYNGIVSFEQLNAEGEVPSNVEVE--AG 285 (587) T ss_pred EEeeccCCceeEeecc-cc--cccceeeeeeeeeehhc---cce----eeecccceeeeeeecccccchhhhhhhh--hc Confidence 0000000000000000 00 00000111111111000 000 0111112222222322221111111111 11 Q ss_pred ccccccee-eeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHHHHHHHHHHHhcCCC Q lcl|NC_011270. 263 NVQSEITL-CAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKY 341 (581) Q Consensus 263 ~~~~~i~~-~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~ 341 (581) ...+.... .....+.+.....++++.++ ....+|+++|++|+.++++ +++|+++++++|+++++||++|+++++ T Consensus 286 ~~~~~~~~~~~~~~~a~~~~t~LtGG~dG----~~~~sy~~al~ale~~~~~-~i~~~t~d~~i~a~l~a~vk~~r~~g~ 360 (587) T protein:vir:99 286 EESATVTATSPIKTIEPFELTKLKGGTNG----EPPATWADKLDKFAHEGGY-YIVPLSSKQSVHAEVASFVKERSDAGE 360 (587) T ss_pred cccceeeeeccccceecccceeeecCCCC----CccccHHHHHHHHhhCCcE-EEEecCCCHHHHHHHHHHHHHHHhCCC Confidence 11111111 11112223333334444333 2346899999999997765 567788899999999999999999999 Q ss_pred cEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccc Q lcl|NC_011270. 342 ERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVI 421 (581) Q Consensus 342 ~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l 421 (581) +|++++|...+ ...++.+++++.+|++|+++++++++.... ......+|++++|||+||++|++++++||||+++ T Consensus 361 ~~~aVlg~~~~---~~~~~~~~~a~~~n~e~vi~v~~~~~~~~~--dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i 435 (587) T protein:vir:99 361 PMRAIVGGGFN---ESKEQLFGRQASLSNPRVSLVANSGTFVMD--DGRKNHVPAYMVAVALGGLASGLEIGESITFKPL 435 (587) T ss_pred cEEEEecCCCC---CCHHHHHHHhhhcCCCcEEEEeccceEecC--CCceeeechHHHHHHHHHHHhcCchhcCccceee Confidence 99999987654 356677899999999999999998765422 1234679999999999999999999999999999 Q ss_pred cCcccccccCCHHHHHHHHhCCcEEEEEeCCC---eEEEEEeeeccCC--CcccceEEeehhhHHHHHHHHHHHhhhcCC Q lcl|NC_011270. 422 RGFSGPAEVQRDGEKSRESSEGLMVIEKTPRN---LVHVRHGVTTDPT--SLHTREWNIIGQQDVMVYRIRDYLDADGLI 496 (581) Q Consensus 422 ~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~---~v~i~~~itT~~t--d~~~~~i~v~R~~d~i~~~ir~~~~~~~fi 496 (581) + +.++.++|+++|+++|+++|+++|++.+++ .++|+++|||++. ++.|++|++||++|++.++||+.++ +.|| T Consensus 436 ~-~~~v~~~~t~~e~e~li~~Gvl~l~~~~~~~~~~vriv~~ItT~t~~~~~~~~~i~viRv~D~i~~di~~~~~-~~yi 513 (587) T protein:vir:99 436 R-VSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLE-DQFI 513 (587) T ss_pred e-cccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeeceeeccCCCCchhhhhhhhhhHHHHHHHHHHHHH-hhCC Confidence 8 578999999999999999999999987654 4789999999864 6779999999999999999999985 6799 Q ss_pred CccCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceE Q lcl|NC_011270. 497 GMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDI 571 (581) Q Consensus 497 G~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~ 571 (581) |||||+++|..||++|.+||++||++|+|++|++++++ .+.++|+++|++.++|++||||||+|++|+|+|=+- T Consensus 514 Gk~Nn~~~r~~i~~~i~~~L~~l~~~gaI~~~~~~dv~-v~~~~d~~~v~~~v~Pv~~mekIy~tv~~~~~~~~~ 587 (587) T protein:vir:99 514 GTRTINTSASIIKDFIQSYLGRKKRDNEIQDFPAEDVQ-VIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred ccccchHHHHHHHHHHHHHHHHHHhCCcccCCCccceE-EEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 99999999999999999999999999999999876654 456778999999999999999999999999998554 No 5 >protein:vir:80779 Length: 569 # NCBI annotation: putative tail sheath protein # Family: family:all:2449 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504132;genbank:gi:158079319;genbank:GeneID:5666428 Probab=100.00 E-value=6.8e-68 Score=388.84 Aligned_cols=508 Identities=18% Similarity=0.162 Sum_probs=281.8 Q ss_pred Ceecc---ccccCCCcccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEEEEEE--------ecc Q lcl|NC_011270. 1 MAIDF---SQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILAL--------VGE 69 (581) Q Consensus 1 ~~~~~---~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~--------~~~ 69 (581) |||++ ++++-|++|.+...+-.-+......+.+.....+.+ |+.+++++|+- .++ T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~~~~~~~~~~~~~~~~~ig~a~~--------------G~~~~~~~~~~~~~~~~~f~~g 66 (569) T protein:vir:80 1 MAVEQFPRKKVSRPHTEITVDTSGIGGSSSSSDKTLMLVGSAKG--------------GKPDTVYRFRNYQQAKQVLRSG 66 (569) T ss_pred CeeeeecCCccccCceEEEEecCCCcCCCCCCceeEEEEEEeCC--------------CCCceeEEecCHHHHHHHhcCC Confidence 99987 567778888764322111222222222222333222 22223333311 111 Q ss_pred ccce------------------eEEEEeC----------ceeccccccCCCHHHHHHHHHhcCCCCcceEEEE--cC--- Q lcl|NC_011270. 70 PTGG------------------SFKLSLA----------GEPTGNIPFNATQGQVQSALRALPNVEDDEVTVL--GD--- 116 (581) Q Consensus 70 ~~~G------------------tF~l~~~----------g~~T~~i~~~asa~~v~~aLe~l~~i~~~~V~~~--~~--- 116 (581) .+-- -|.+.-+ |.+-..-.+.+.+.+++.+|+.-..-+.-..++. .. T Consensus 67 ~l~~a~~~a~~~~~~~~~~~~~~~~~rv~~a~~a~~~~~~~~~~a~~~g~~~n~i~v~l~~~~~~~~~~~~v~~~~~~~~ 146 (569) T protein:vir:80 67 DLLDAIELAWNASDVNTASAGDILAVRVEDAKNATLTKGGLTFASTIYGVDANEIQVALEDNNLTHTKRLTVAFSKDGYK 146 (569) T ss_pred chhHHHHhhccCccccccCceEEEEEEcCCCeeeeeeccceeeeeeeccCCCceEEEEEecCcCCcceeeEEeeecCCCc Confidence 1000 0111111 1111111112222233333322111111111111 00 Q ss_pred -----CCceEEEEecCCccccccccceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEeccce Q lcl|NC_011270. 117 -----PGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYV 191 (581) Q Consensus 117 -----~g~~w~Vtf~g~~~~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~ 191 (581) -|....|++.|.-. .+.+++.......++..-. ....+.....+.+...+ T Consensus 147 ~~~~~ig~v~si~ytg~~~-------------~a~~~~~~~~~~~~a~~l~----------~~~g~~~~~~~~v~~~~-- 201 (569) T protein:vir:80 147 KVFDNLGKIFSIQYKGSEA-------------QANFTIAQDSISKKATTLT----------LNVGSEPESTTEVMKYE-- 201 (569) T ss_pred cccccccceeeEEEeeccc-------------cceEEeecCcCcceeEEEE----------EEecCCcceeEEEEeec-- Confidence 01112222222111 1111111000000000000 00000000000000000 Q ss_pred eEEeecccccccC------cceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhh-ccc Q lcl|NC_011270. 192 VTRVNAGEDGEAN------TRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEA-GNV 264 (581) Q Consensus 192 v~~v~~~~dg~~~------~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~-g~~ 264 (581) ...+...+.. ....-++.+.....+.... .. ..+.....+...+.+... ...++..... .+. T Consensus 202 ---~~~~~~~~~~~lv~~~~~~~~f~a~~~~~~~~~~~-----~~-~~d~~~~~~~~t~~~~~~--~~~~di~~~~~~~~ 270 (569) T protein:vir:80 202 ---LGQGVYSETNVLVSAINSLPDWEAKFFPIGDKNLP-----TD-ALEAVTKVDVKTEAVFVG--ALAGDIAKQLEYND 270 (569) T ss_pred ---cCCccchhhhhhhhhcCCccCceEEEEecCCCcce-----eh-hccchhheeccccceeee--hhHHHHHHhhcCCc Confidence 0000000000 0000000000000000000 00 000000000000000000 0001000000 000 Q ss_pred cccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEE Q lcl|NC_011270. 265 QSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERR 344 (581) Q Consensus 265 ~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~ 344 (581) .-.+.......+.+.+...++++.++ ..+.+|+++|++|+.++++ +++|+++++++|+++++||++|++++++|+ T Consensus 271 ~v~~~~~~~~~l~~~~~~~LtGG~dG----~~~~~~~~~l~~le~~~~~-~i~~~t~d~av~~~l~a~vkr~r~~g~~~~ 345 (569) T protein:vir:80 271 YVTVAVDATKPVEDFELTNLTGGSDG----TAPESWANKFPLLANEGGY-YLVPLTDKQAVHSEALAFVKDRTDNGDPMR 345 (569) T ss_pred eEEEEecCCcceeeecceeecCCCCC----CccchHHHHHHHHhhCCcE-EEEecCCChHHHHHHHHHHHHHHhCCCcEE Confidence 00111111122223333333333322 3456899999999987765 567788899999999999999999999999 Q ss_pred EEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccccCc Q lcl|NC_011270. 345 AILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGF 424 (581) Q Consensus 345 avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~ 424 (581) +++|..... ..++.+++++.+|++|+++++|++..++. ......+|++++|||+||++|++++++||||++++ + T Consensus 346 aVvg~~~~~---~~~~~~~~a~~~n~e~vv~v~~~~~~~~~--~g~~~~~~~~~~aa~vAG~~A~~~~~~S~T~k~i~-~ 419 (569) T protein:vir:80 346 IIVGGGTNE---TVEESITRATNLRDPRASLVGFSGTRKMD--DGRLLKLPGYMMASQIAGIASGLEVGEAITFKHFN-V 419 (569) T ss_pred EEecCCCCC---CHHHHHHHHhhcCCCeEEEEecCceeecC--CCcceeechhhHHHHHHHHHhcCccccCccceeec-c Confidence 999876654 45667889999999999999998766543 22346799999999999999999999999999998 5 Q ss_pred ccccccCCHHHHHHHHhCCcEEEEEeCCCeEE---EEEeeeccCC--CcccceEEeehhhHHHHHHHHHHHhhhcCCCcc Q lcl|NC_011270. 425 SGPAEVQRDGEKSRESSEGLMVIEKTPRNLVH---VRHGVTTDPT--SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMP 499 (581) Q Consensus 425 ~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~---i~~~itT~~t--d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~ 499 (581) .++.++|+++|+++|+++|+++|++.+++.++ +++++||++. ++.|++|++||++|+|.++||+.++ +.||||| T Consensus 420 ~~i~~~lt~~e~~~li~~G~~~l~~~~~~~~~v~~~vn~itT~t~~~~~~~~~i~viRv~D~i~~dir~~~~-~~yiGk~ 498 (569) T protein:vir:80 420 TSVDRVFESSQLDMLNESGVISIEFVRNRTLTAFRVVQDVTTYNDKSDPVKNEMSVGEANDFLVSELKIELD-NNFIGTK 498 (569) T ss_pred ccccccCCHHHHHHHHhCCeEEEEEecCceEEEEEEeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHH-hhcCccc Confidence 78999999999999999999999988776554 4567777764 6779999999999999999999875 6799999 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceE Q lcl|NC_011270. 500 IYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDI 571 (581) Q Consensus 500 n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~ 571 (581) ||+.+|.+||++|++||++||++|+|++|+++++ +...++|+++|+++++|++||||||||+++.|+|=+- T Consensus 499 nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~dv-~v~~~~d~~~v~~~v~Pv~~~ekI~~ti~~~~~~~~~ 569 (569) T protein:vir:80 499 VIDTSASLIKNFIQSFLDNKKRAREIQDYTPEEV-QVVLEGDVASISMTVMPIRSLNKITVQLVYKQQILTA 569 (569) T ss_pred CChhHHHHHHHHHHHHHHHHHhCCcccCCCccce-EEEecCCEEEEEEEEEEcccccEEEEEEEEeeeeecC Confidence 9999999999999999999999999999988766 4557789999999999999999999999999997443 No 6 >protein:vir:80488 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468473;genbank:gi:157325048;genbank:GeneID:5601446 Probab=100.00 E-value=6.2e-67 Score=383.57 Aligned_cols=515 Identities=17% Similarity=0.139 Sum_probs=296.8 Q ss_pred Ceecc---ccccCCCcccccCcccccccccccCceeeEEEecCCCCceeee-----------------------eEEcCc Q lcl|NC_011270. 1 MAIDF---SQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRES-----------------------IRINPD 54 (581) Q Consensus 1 ~~~~~---~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-----------------------~~~~~~ 54 (581) |||+. ++|+-|++|.+-.-+-.-+..+...+.+.....+.+-.. +.- ...+|. T Consensus 1 ~~~~~~~~~~~~~pgv~~~~~~s~~~~~~~~~~~~~~~ig~a~~G~~-~~~~~~~~~~~~~~~f~~g~l~~~i~~a~~~~ 79 (562) T protein:vir:80 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATGGKP-NAVYKVRNYSQAKSVFRSGELLDAIERAWNPG 79 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCcccCCCCCCceEEEEEEeCCCCc-ceeEEEccHHHHHHHhcCCChHHHHHHhcccc Confidence 99998 889999999874322222222222222222222221100 000 000110 Q ss_pred C-CceeeEEEEEEeccccceeEEEEeCceeccccccCCCHHHHHHHHHhcCCCCcceEEEEcCC----------CceEEE Q lcl|NC_011270. 55 T-GETITTQILALVGEPTGGSFKLSLAGEPTGNIPFNATQGQVQSALRALPNVEDDEVTVLGDP----------GGPWTV 123 (581) Q Consensus 55 ~-~~~~evq~v~~~~~~~~GtF~l~~~g~~T~~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~----------g~~w~V 123 (581) . ..--++.-|.+ ..+..++.++ +|-+-....+-+.+..++.+||.=+..+.-..++.... |....| T Consensus 80 ~~~g~~~~~~~rv-~~a~~a~~~~--~~~~~~~~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ev~~~~g~v~~i 156 (562) T protein:vir:80 80 EGTGAGDILAMRV-EEAKEATFEA--EGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVNQVYDNLGSIFSI 156 (562) T ss_pred cccCceEEEEEEc-CCCCcceEEe--cceEEEEeecccCCCceEEEEecCCCCCCcceEEEecCCcceEEeeccCceeee Confidence 0 00113333333 2334444332 23332333344455566666653222222223322222 222333 Q ss_pred EecCCcc--ccccccceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEeccceeEEeeccccc Q lcl|NC_011270. 124 TFTKAVA--ALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDG 201 (581) Q Consensus 124 tf~g~~~--~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg 201 (581) .+.|... .+++.+..-+ +....+. ...|... .....|.+........+-. . T Consensus 157 ~y~g~~~~a~~~i~~~~~~-~~a~~l~---~~~g~~~-----------v~~~~l~~g~~~~~~~l~~-----~------- 209 (562) T protein:vir:80 157 KYKGTEASATFTVAVDPVT-FKATKLT---LKAGDKT-----------VKEYDLGSGAYAETNVLIS-----D------- 209 (562) T ss_pred eeccccccceeEEEecCcc-ceEEEEE---EecCCcc-----------eeEEEeCCCccchhhhhhh-----h------- Confidence 3322211 1111000000 0000000 0001000 0000010000000000000 0 Q ss_pred ccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhccccccce---eeeeeeecC Q lcl|NC_011270. 202 EANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEIT---LCAQLAITN 278 (581) Q Consensus 202 ~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i~---~~~~~~~~~ 278 (581) .+....+ +-. +.+...............+.+..+..... ....++... ....+++. ......+.+ T Consensus 210 -i~~~~~~-tAk---y~g~~~n~i~~~~~d~~~~~~~kt~~~~v-----~~~~~d~~~--~~~~n~~v~~~~~~~~~la~ 277 (562) T protein:vir:80 210 -INNLPDF-EAK---FFPIGDKNLTTDNFDAQIDVDIKTKEAYV-----KAVGGDIEK--QTAYNGYVEFEFDRSKEIAN 277 (562) T ss_pred -hccccce-EEE---ecccCCceeeecccccchhhhcccceeee-----eehhhhhhh--cccccceEEEEeccCccccc Confidence 0000000 000 00000000000000000000000000000 000000000 00111111 111123334 Q ss_pred CcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhH Q lcl|NC_011270. 279 GASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPS 358 (581) Q Consensus 279 g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~ 358 (581) .+...++++.++ ..+.+|+++|++|++++++ +++++++++++|+++++||++|+++++++++++|...+. .. T Consensus 278 ~~~~~LtGG~dG----~~~~~~~dal~~Le~~~~~-~i~~~t~d~ai~~~~~a~vkr~r~~g~~~~aVvg~~~~~---~~ 349 (562) T protein:vir:80 278 FPLTKLTGGDNG----TIPESWADKFSYFANEGGY-YLVPLTSKQAVHAEALQFVRDCSYNGNPMRVFVGGGIGE---SM 349 (562) T ss_pred cceeeeeCCCCC----CccccHHHHHHHHHhCCcE-EEEecCCChHHHHHHHHHHHHHHhCCCeEEEEecCCCCC---CH Confidence 444445544443 2345799999999987765 567788999999999999999999999999999876543 46 Q ss_pred HHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccccCcccccccCCHHHHHH Q lcl|NC_011270. 359 ATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGEKSR 438 (581) Q Consensus 359 ~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~t~~e~~~ 438 (581) ++.+++++.+|++|+++++|++..... ......+|++++|||+||++|++++++||||+++++ .++..+|+++|+++ T Consensus 350 ~~~~~~a~~~n~e~vv~v~~~~~~~~~--~~~~~~~~~~~~aa~vAGl~Ag~~~~~S~T~~~i~~-~~v~~~lt~~e~~~ 426 (562) T protein:vir:80 350 EQLFTRAIGLQNERAGLIGFSGTVKMD--DGRSLKMPGYMFAAQVAGLTCGLEIGEAITFKNIAI-ETLDTIYEGSQLDQ 426 (562) T ss_pred HHHHHHhhhcCCCeEEEEecCeeEECC--CCceeeechhHHHHHHHHHHhcCccccCccceeecc-ccccccCCHHHHHH Confidence 677889999999999999998765433 334567899999999999999999999999999996 57999999999999 Q ss_pred HHhCCcEEEEEeCCCeE---EEEEeeeccCC--CcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHH Q lcl|NC_011270. 439 ESSEGLMVIEKTPRNLV---HVRHGVTTDPT--SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAE 513 (581) Q Consensus 439 l~~~Gv~~l~~~~~~~v---~i~~~itT~~t--d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~ 513 (581) |+++|+++|++.+++.+ ++++++||+.. ++.|++|++||++|+|.++||+.++ +.|||||||+++|..||++|. T Consensus 427 li~~G~l~l~~~~~~~v~~~riv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~-~~yIGk~Nn~~~r~~v~~~i~ 505 (562) T protein:vir:80 427 LNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLD-NEYIGTKIIDTSASLVKNFVQ 505 (562) T ss_pred HHhCCeEEEEEecCCcEEEEEeeccceeccCCCCchhhhhhhhHHHHHHHHHHHHHHH-hcCCccccChHHHHHHHHHHH Confidence 99999999998777655 45678888864 7889999999999999999999885 679999999999999999999 Q ss_pred HHHHHHHhCCceeCCccceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceE Q lcl|NC_011270. 514 AALVWLVDNNIIRGYRNLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDI 571 (581) Q Consensus 514 ~~L~~l~~~gaI~~~~~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~ 571 (581) +||++|+++|+|++|++++++ ...++|+++|++.++|++||||||+|++|+|+|=+- T Consensus 506 ~~L~~l~~~gaI~~~~~~dv~-v~~~~d~~~v~~~v~Pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:80 506 SFLDRKKLAKEIQDYSPEEVQ-VVIEGDIARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred HHHHHHHhCCcccCCCccceE-EEecCCEEEEEEEEEEcccceEEEEEEEEEeeeecC Confidence 999999999999999987764 556889999999999999999999999999997443 No 7 >protein:vir:95741 Length: 587 # NCBI annotation: ORF011 # Family: family:all:2449 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240910;genbank:gi:66394960;genbank:GeneID:5132682 Probab=100.00 E-value=2.1e-66 Score=380.68 Aligned_cols=525 Identities=17% Similarity=0.146 Sum_probs=285.2 Q ss_pred Ceecc---ccccCCCcccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEEEEEEec--------c Q lcl|NC_011270. 1 MAIDF---SQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALVG--------E 69 (581) Q Consensus 1 ~~~~~---~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~~--------~ 69 (581) |||+. +++.-|++|.+..-+--.+......+.+.......+ |+..++++++-.. + T Consensus 1 ~a~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~~g~a~~--------------G~~~~~~~~~~~~~~~~~~~~g 66 (587) T protein:vir:95 1 MAVEPFPRRPITRPHASIEVDTSGIGGSAGSSEKVFCLIGQAEG--------------GEPNTVYELRNYSQAKRLFRSG 66 (587) T ss_pred CcccccCCcccccCceEEEEecCCccccCCCCCceEEEEEEeCC--------------CCCceeEEeccHHHHHHHhcCc Confidence 99987 788889998875422121222222222222222221 2223333332111 0 Q ss_pred ccceeEEEEe-----Cc-eecccccc-CCCHHHHHHHHHhcCCCCcceEEE--EcCCCceEEEEec-CCccc---ccc-- Q lcl|NC_011270. 70 PTGGSFKLSL-----AG-EPTGNIPF-NATQGQVQSALRALPNVEDDEVTV--LGDPGGPWTVTFT-KAVAA---LTK-- 134 (581) Q Consensus 70 ~~~GtF~l~~-----~g-~~T~~i~~-~asa~~v~~aLe~l~~i~~~~V~~--~~~~g~~w~Vtf~-g~~~~---l~~-- 134 (581) .+-..-.+.| +| ..--.+.. +++++. -+++.+.++- -|.+|+..+|.+. +..|. +++ T Consensus 67 ~l~~~~~~a~~~~~~~g~~~~~~~rv~~~~~a~--------~~~~~l~~~a~~~G~~gN~i~v~~~~~~~~~~~~~~~~~ 138 (587) T protein:vir:95 67 ELLDAIELAWGSNPNYTAGRILAMRIEDAKPAS--------AEIGGLKITSKIYGNVANNIQVGLEKNTLSDSLRLRVIF 138 (587) T ss_pred chHHHHHHHhccccCCCceEEEEEEcCCCceeE--------EEecCeEEEEecccccccceEEEEecCCCCCceeEEEEE Confidence 0000000111 11 10000111 111110 0112222221 1233444444332 11110 000 Q ss_pred -c-cceec---cCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEeccceeEEeeccccccc------ Q lcl|NC_011270. 135 -D-VTGLT---GGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEA------ 203 (581) Q Consensus 135 -~-~~~l~---~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~------ 203 (581) + ....+ -|.-..+........ +.+..-......+.....+- .. ......|++..+..... T Consensus 139 ~~~~~~~~~~~~g~v~si~y~g~~~~-~~~~v~~~~~t~~a~~~~l~-~g-------~~~v~~yrL~~g~~~~~~~~~~~ 209 (587) T protein:vir:95 139 QDDRFNEVYDNIGNIFTIKYKGEEAN-ATFSVEHDEETQKASRLVLK-VG-------DQEVKSYDLTGGAYDYTNAIITD 209 (587) T ss_pred ecccceeeeeeccceeeeeeeccccc-cceeeeecccceeeeeeeee-cC-------CceEEEEEecCCchHHHHHHHHh Confidence 0 00000 000000000000000 00000000000000000000 00 00001111111100000 Q ss_pred -Ccce------------eeee-eeeeeecccccc-cceeE---E-EEeecCCcccceeEeccCcchhhhhhhhhhhhccc Q lcl|NC_011270. 204 -NTRD------------DLYT-IQRVVDGGHIDP-GDIVQ---L-SYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNV 264 (581) Q Consensus 204 -~~~~------------~~~t-i~~~vd~~~~d~-~~~~~---~-s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~ 264 (581) +... ++.+ ..+.+.+..... ...+. . ...+........+.+..... ..........+.. T Consensus 210 in~~~~~tAky~g~~~~~i~~~~~~~~~~~~v~~~~~~v~a~~~d~~~~~~~~~~v~~~~~~g~~--~~~~~~~~~~~~~ 287 (587) T protein:vir:95 210 INQLPDFEAKLSPFGDKNLESSKLDKIENANIKDKAVYVKAVFGDLEKQTAYNGIVSFEQLNAEG--EVPSNVEVEAGEE 287 (587) T ss_pred hccccceEEEEecccCceeEEeecCcccccceehhhhhhhhhhcceeeeeeceeeeeeecccccc--eeccchhhhhccc Confidence 0000 0000 000110000000 00000 0 00011111111111111110 0000000000000 Q ss_pred cccce-eeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcE Q lcl|NC_011270. 265 QSEIT-LCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYER 343 (581) Q Consensus 265 ~~~i~-~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~ 343 (581) .+... ........+...+.++++.++ ....+|+++|++|+.++++ +++|+++++++|+++++||++|++++++| T Consensus 288 ~a~~~~~~~~~~~a~~~~t~LtGG~dG----~~~~~y~~~l~ale~~~~~-~i~~~t~d~~v~a~l~a~vk~~~~~g~~~ 362 (587) T protein:vir:95 288 SATVTATSPIKTIEPFELTKLKGGTNG----EPPATWADKLDKFAHEGGY-YIVPLSSKQSVHAEVASFVKERSDAGEPM 362 (587) T ss_pred chheeccccccceeccceeeeecCCCC----CCcccHHHHHHHHHhCCcE-EEEecCCCHHHHHHHHHHHHHHHhCCCcE Confidence 00000 011112222333334444333 2346899999999997765 55778889999999999999999999999 Q ss_pred EEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccccC Q lcl|NC_011270. 344 RAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRG 423 (581) Q Consensus 344 ~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g 423 (581) ++++|.... .+.++.+++++.+|++|++++++++..... ......+|++++|||+||++|++++++||||++++ T Consensus 363 ~aVvg~~~~---~~~~~~~~~a~~~n~ervi~v~~~~~~~~~--dg~~~~~~~~~~aa~vAGl~Ag~~~~~SlT~~~i~- 436 (587) T protein:vir:95 363 RAIVGGGFN---ESKEQLFGRQESLSNPRVSLVANSGTFVMD--DGRKNHVPAYMVAVALGGLASGLEIGESITFKPLR- 436 (587) T ss_pred EEEEcCCCC---CCHHHHHHHHhhcCCCcEEEecccceEecC--CCceeeechHHHHHHHHHHHhcCchhcCccceeee- Confidence 999987654 356677899999999999999998764422 12335689999999999999999999999999998 Q ss_pred cccccccCCHHHHHHHHhCCcEEEEEeCCC---eEEEEEeeeccCC--CcccceEEeehhhHHHHHHHHHHHhhhcCCCc Q lcl|NC_011270. 424 FSGPAEVQRDGEKSRESSEGLMVIEKTPRN---LVHVRHGVTTDPT--SLHTREWNIIGQQDVMVYRIRDYLDADGLIGM 498 (581) Q Consensus 424 ~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~---~v~i~~~itT~~t--d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~ 498 (581) +.++.++|+++|+++|+++|+++|++.+++ .++|+++|||++. ++.|++|++||++|+|.++||+.++ +.|||| T Consensus 437 ~~~v~~~~t~~e~e~ai~~Gvl~l~~~~~~~~~~vriv~~itT~t~~~d~~~~~i~viRv~D~i~~dir~~~~-~~~iGk 515 (587) T protein:vir:95 437 VSSLDQIYESIDLDELNENGIISIEFVRNRTNTFFRIVDDVTTFNDKSDPVKAEMAVGEANDFLVSELKVQLE-DQFIGT 515 (587) T ss_pred cccccccCCHHHHHHHHhCCeEEEEEecCCcceEEEEeecceeccCCCCcchhhhhhhhhHHHHHHHHHHHHH-hhCCcc Confidence 568999999999999999999999987655 3788999999864 6789999999999999999999985 679999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceE Q lcl|NC_011270. 499 PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDI 571 (581) Q Consensus 499 ~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~ 571 (581) |||+.+|..||++|.+||++||++|+|++|+++++ +.+.++|+++|++.++|++||||||||++|.++|=+- T Consensus 516 ~nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~dv-~v~~~~d~~~v~~~v~Pv~~mekI~vt~~~~~~~~~~ 587 (587) T protein:vir:95 516 RTINTSASIIKDFIQSYLGRKKRDNEIQDFPAEDV-QVIVEGNEARISMTVYPIRSFKKISVSLVYKQQTLQA 587 (587) T ss_pred ccchHHHHHHHHHHHHHHHHHHhCCcccCCCccce-EEEecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 99999999999999999999999999999988665 4456778999999999999999999999999998544 No 8 >protein:vir:63742 Length: 562 # NCBI annotation: Tsh # Family: family:all:2449 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547629;genbank:GeneID:3783475 Probab=100.00 E-value=2.7e-66 Score=380.03 Aligned_cols=500 Identities=17% Similarity=0.152 Sum_probs=287.2 Q ss_pred Ceecc---ccccCCCcccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEEEEEEe---------- Q lcl|NC_011270. 1 MAIDF---SQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALV---------- 67 (581) Q Consensus 1 ~~~~~---~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~---------- 67 (581) |||+. ++|.-|++|.+-.-+-.-+..+...+.+.......+ |+.+++++|+-. T Consensus 1 ~~~~~~~~~~~~~Pgv~~~~~~s~~~~~~~~~~~~~~~iG~a~~--------------G~~~~~~~~~~~~~~~~~fg~g 66 (562) T protein:vir:63 1 MAIEIYPRKPVSRPHTEISVDTSGIGGSSSGSEKILCLVGSATG--------------GKPNAVYKVRNYSQAKSVFRSG 66 (562) T ss_pred CeeeeeCCCcccCCceEEEEecCCCcccCCCCCceEEEEEEeCC--------------CCCceeEEEccHHHHHHHhcCC Confidence 99985 678889999875422222222222222222222221 122233333211 Q ss_pred --------------------------ccccceeEEEEeCceeccccccCCCHHHHHHHHHhcCCCCcceEEEEcCC---- Q lcl|NC_011270. 68 --------------------------GEPTGGSFKLSLAGEPTGNIPFNATQGQVQSALRALPNVEDDEVTVLGDP---- 117 (581) Q Consensus 68 --------------------------~~~~~GtF~l~~~g~~T~~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~---- 117 (581) ..+..++.++ +|-+-..-.+-+-+..++.+||.=+..+.-..++.... T Consensus 67 ~l~~~i~~a~~~~~~~g~~~~~~~rv~~a~~a~~~~--~~~~~~a~~~g~~~n~i~v~~~~~~~~~~~~~~v~~~~~~~~ 144 (562) T protein:vir:63 67 ELLDAIERAWNPGEGTGAGDILAMRVEEAKEATFEA--EGVKVSSTIYGADANDIQVALEDNTITGTKRLSIVFAKERVN 144 (562) T ss_pred chHHHHHHhccccccCCceEEEEEEcCCCccceeEe--cceeEEEeecccCCCeEEEEEecCCCCCCcceEEEecCCCcc Confidence 1122222211 11111111222222233333332111111111111110 Q ss_pred ------CceEEEEecCCccccccccceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEeccce Q lcl|NC_011270. 118 ------GGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYV 191 (581) Q Consensus 118 ------g~~w~Vtf~g~~~~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~ 191 (581) |....|++.|... .+.+++...... .. .....+.. + ..... T Consensus 145 ev~~~~g~V~~i~y~g~~~-------------~~~~~v~~~~~~-----~~-------a~~l~~~~---g-----~~~v~ 191 (562) T protein:vir:63 145 QVYDNLGSIFSIKYKGTEA-------------SATFTVAVDPVT-----FK-------ATKLTLKA---G-----DKTVK 191 (562) T ss_pred hhhhhccceeeeeeecccc-------------cceEEEEecCcc-----ee-------EEEEEeec---C-----Cccee Confidence 1112222221110 000111000000 00 00000000 0 00001 Q ss_pred eEEeecccccccCcc-eee---eeeeeeeecccccccceeEEEEee-cCCcccceeEeccCcchhhhhhhhhhhhccccc Q lcl|NC_011270. 192 VTRVNAGEDGEANTR-DDL---YTIQRVVDGGHIDPGDIVQLSYRY-TDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQS 266 (581) Q Consensus 192 v~~v~~~~dg~~~~~-~~~---~ti~~~vd~~~~d~~~~~~~s~~~-~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~ 266 (581) .+++..+........ ... ..+.... .+..... +.... +......+..+.... ....++.. ..+..+ T Consensus 192 ~~~L~~g~~~~~~~l~~~in~~~~~~aky-~~~~gn~----i~~~~~d~~~~~~vkt~~~~v--~t~~~d~~--~~~~~~ 262 (562) T protein:vir:63 192 EYDLGSGAYAETNVLISDINNLPDFEAKF-FPIGDKN----LTTDNFDAQIDVDIKTKEAYV--KAVGGDIE--KQTAYN 262 (562) T ss_pred EEEecCCccchhHHHHHhhccccceEEEe-eccCCce----eeeeccccccccchhhhhhhh--hhhhhhhh--hccccc Confidence 111111110000000 000 0000000 0000000 00000 000001111110000 00000000 011112 Q ss_pred cce---eeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcE Q lcl|NC_011270. 267 EIT---LCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYER 343 (581) Q Consensus 267 ~i~---~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~ 343 (581) ++. ......+.+.....++++.++ ..+.+|+++|++|+.++++ +++|+++++++|+++++||++|++++++| T Consensus 263 ~~v~~~~~~~~~la~~~~~~LtGG~dG----t~~~~~~~al~ale~~~~~-~i~~~t~d~av~~~l~a~vkr~~~~g~~~ 337 (562) T protein:vir:63 263 GYVDFEFDRSKEIANFPLTKLTGGDNG----TIPESWADKFSYFANEGGY-YLVPLTSKQAVHAEALQFVRDCSYNGNPM 337 (562) T ss_pred ceeeeeeccccceecccceeeecCCCC----CchhhHHHHHHHHHhCCcE-EEEecCCCHHHHHHHHHHHHHHHhCCCcE Confidence 111 111223344444455544443 2345799999999987765 66788999999999999999999999999 Q ss_pred EEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccccC Q lcl|NC_011270. 344 RAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRG 423 (581) Q Consensus 344 ~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g 423 (581) ++++|..... +.++.+++++.+|++|++++++++..... ......+|++++|||+||++|++++++||||++++ T Consensus 338 ~aVlg~~~~~---~~~~~~~~a~~~n~ervv~v~~~~~~~~~--~~~~~~~~~~~~aa~vAGl~A~~~~~~SlT~~~i~- 411 (562) T protein:vir:63 338 RVFVGGGIGE---SMEQLFTRAIGLQNERAGLIGFSGTVKMD--DGRSLKMPGYMFAAQVAGLTCGLEIGEAITFKNIA- 411 (562) T ss_pred EEEecCCCCC---CHHHHHHHhhhcCCCcEEEEecCeeEECC--CCceeeechhHHHHHHHHHhhcCchhcCccceeec- Confidence 9999876543 45667889999999999999998754433 33456799999999999999999999999999998 Q ss_pred cccccccCCHHHHHHHHhCCcEEEEEeCCCeEE---EEEeeeccCC--CcccceEEeehhhHHHHHHHHHHHhhhcCCCc Q lcl|NC_011270. 424 FSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVH---VRHGVTTDPT--SLHTREWNIIGQQDVMVYRIRDYLDADGLIGM 498 (581) Q Consensus 424 ~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~---i~~~itT~~t--d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~ 498 (581) +.++..+|+++|+++|+++|+++|++.+++.++ +++++||+.. ++.|++|++||++|+|.++||+.++ +.|||| T Consensus 412 ~~~v~~~~t~~e~~~li~~Gv~~l~~~~~~~v~~~~iv~~itT~t~~~~~~~~ki~viRv~D~i~~dir~~~~-~~yiGk 490 (562) T protein:vir:63 412 IETLDTIYEGSQLDQLNESGIITAEFVRNRAVTNFRIVDDVTTFNDKTDPVKSEIGVGEANDFLVSELKISLD-NEYIGT 490 (562) T ss_pred cccccccCCHHHHHHHHhCCeEEEEEecCCcEEEEEeeccceecCCCCCchhhhhhhhHHHHHHHHHHHHHHH-hcCCcc Confidence 678999999999999999999999987776554 5677888754 6789999999999999999999885 679999 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceE Q lcl|NC_011270. 499 PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDI 571 (581) Q Consensus 499 ~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~ 571 (581) |||+++|.+||++|.+||++|+++|+|++|++++++ ...++|+++|++.++|++||||||+|++|+|+|=+- T Consensus 491 ~Nn~~~r~~v~~~i~~~L~~l~~~gaI~~~~~~dv~-v~~~~d~~~v~~~v~pv~~mekIy~ti~~~~~~~~~ 562 (562) T protein:vir:63 491 KIIDTSASLVKNFVQSFLDRKKLAKEIQDYSPEEVQ-VVIEGDVARISLTVFPIRSMKKIEVSLVYRQQILTA 562 (562) T ss_pred ccChHHHHHHHHHHHHHHHHHHhCCcccCCCccceE-EEecCCEEEEEEEEEEcccceEEEEEEEEeeeeecC Confidence 999999999999999999999999999999987764 456789999999999999999999999999997443 No 9 >protein:vir:100829 Length: 607 # NCBI annotation: hypothetical protein # Family: family:all:2449 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164738;genbank:gi:56693151;genbank:GeneID:3197462 Probab=100.00 E-value=1.6e-64 Score=370.35 Aligned_cols=522 Identities=18% Similarity=0.172 Sum_probs=287.0 Q ss_pred Ceecc------ccccCCCcccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEEEEEEeccc---c Q lcl|NC_011270. 1 MAIDF------SQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALVGEP---T 71 (581) Q Consensus 1 ~~~~~------~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~~~~---~ 71 (581) -+-+| ++|+-|++|.....+-.........+.+.....+. .|+.+++.+++-...+ . T Consensus 7 ~~~~~~~~~~~~~~~~pgv~~~~~~~~~~~~~~~~~~~~~~iG~a~--------------~G~~~~~~~~~~~~~a~~~f 72 (607) T protein:vir:10 7 SAESYKRIYPLFYDSRPHVETNFDDSRLSNTASDSAKNIFMLGSAT--------------NGDPTKVYEIRTSQQATKIF 72 (607) T ss_pred chhhHHHHhCCCCccCCceEEEEecCcCcCCCCCCcceEEEEEEeC--------------CCCCceEEEEcchhHHHHhh Confidence 22222 45666777665331111111222222222222222 2233344333321110 0 Q ss_pred -ce----eEEEEeC----------------------------ceeccccccCCCHHHHHHHHH-hcCCCCcceEE----- Q lcl|NC_011270. 72 -GG----SFKLSLA----------------------------GEPTGNIPFNATQGQVQSALR-ALPNVEDDEVT----- 112 (581) Q Consensus 72 -~G----tF~l~~~----------------------------g~~T~~i~~~asa~~v~~aLe-~l~~i~~~~V~----- 112 (581) +| -+.|.|. |.....=.+.+.+.+++-+|+ .+++-....+. T Consensus 73 ~~g~l~~a~~~a~~~~~~~~~g~~~~~~~rv~~~~~a~~~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~~~~~~~d~ 152 (607) T protein:vir:10 73 GSGDLVDGIKLAFDPTGNSVTNGGTVYALRVDNAKQASLVKDGLTFTSSIFGTNANQVSVALDNDVFGVPRITVNYSPDN 152 (607) T ss_pred cCcchHHHHHHhhccccCCccCCceEEEEeCCCccccceecccccccccccccCCCceEEEEEecCCCccceeEEeeccc Confidence 11 1222221 111011111122222222221 11111111110 Q ss_pred ---EEcCCCceEEEEecCCccccccccceeccCCCceEEEEEcccccceeeeccccccccceee----eecc-------- Q lcl|NC_011270. 113 ---VLGDPGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTI----RVVN-------- 177 (581) Q Consensus 113 ---~~~~~g~~w~Vtf~g~~~~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~----~l~~-------- 177 (581) +-.+-|+.+.|.|+|.. ..+.++|.-..+|.+-.-......+....+. .|.. T Consensus 153 ~~~~~~n~g~~~~i~y~g~~-------------~~a~~~v~~~~~g~~~~lt~~~~~~~~~~~~V~~~~l~~~~~~t~~~ 219 (607) T protein:vir:10 153 YERTYTNIGQMFSITYSGKS-------------ASAGYTVSHDTDGKAILLTLGSGDSIDKLTNVATFDLTMSKYDTIAK 219 (607) T ss_pred ceeeeeeccceeecccCccc-------------ccccceeeecCCCceeEEEecCCCccceeeeeecccccccccchHHH Confidence 00111233333333321 2222333322222211000000000000000 0000 Q ss_pred ----ccc---cceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcch Q lcl|NC_011270. 178 ----PNS---GQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDI 250 (581) Q Consensus 178 ----~~~---~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~ 250 (581) ... -.+...+......+ .-+.....+.+..-...++....|. ..+.+...+..+.+.+.... T Consensus 220 l~~din~~~~~~A~~~g~~~i~tk----y~d~~~~~i~V~~~~~iv~a~~~D~-------~~~~~~~~~~~~t~~~~~~~ 288 (607) T protein:vir:10 220 LMQAISATPNFSASVVGSPSVNTS----YLDEVTSPVDVKTAPAVVTAKIGDA-------ISKLGYDPYVVVTQTSNNKP 288 (607) T ss_pred HHHHhhcCCceEEEEecccceeee----ccccccceeEEEEeeeeechhhhhh-------hhcccccceEEeeecccchh Confidence 000 00111111110000 0000000111111011111111110 11122222222222222221 Q ss_pred hhhhhhhhhhh-ccccccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHHHH Q lcl|NC_011270. 251 QDFYGPAFDEA-GNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALV 329 (581) Q Consensus 251 ~~~~~~a~~~~-g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~~l 329 (581) .. .+.... ..............+.+.+...++++.++ ..+.+|+++|++|+.++++ +++++++++++|+++ T Consensus 289 ~~---~~~~~~~~~~~~~~~~~~~~~~a~~a~~~LtGGtdG----~~~~ty~dal~aLe~~e~~-~i~~~t~d~ai~~~l 360 (607) T protein:vir:10 289 IV---NGVSAGTGSATASVTTAPESFPANFDTAFLTGGSTG----DVPVSWADKFNGAIGNNVY-YIIPLTSEENIHAEL 360 (607) T ss_pred hh---hhhhccccceeeeeeccccccccccceeeeeCCCCC----CchhhHHHHHHHHhhcCce-EEEecCCCHHHHHHH Confidence 11 111110 00000011111122223333334433222 3456899999999987755 567788899999999 Q ss_pred HHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhc Q lcl|NC_011270. 330 QQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVS 409 (581) Q Consensus 330 ~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~ 409 (581) ++||++|+++++++++++|..... ..++.+++++.+|++|+++++|++...+. .....+|++++|||+||++|+ T Consensus 361 ~a~vkr~~~~g~~~~aVlg~~~~~---t~~~~~t~a~~~N~ervv~V~~~~~~~~~---G~~~~~~~~~~Aa~vAGl~Ag 434 (607) T protein:vir:10 361 QAFIDEQHVLGYNYHAFVGGGFAE---PLEQILSRQVNINDSRFGLVGQSGHVQEG---GESVHVPAYLMAAYVGGLSSS 434 (607) T ss_pred HHHHHHHHhCCCcEEEEecCCCCC---CHHHHHHHHHhhCCCcEEEEecCeeEeeC---CcceeccHHHHHHHHHHHHhc Confidence 999999999999999999876543 56677889999999999999998765432 344679999999999999999 Q ss_pred cchhcccccccccCcccccccCCHHHHHHHHhCCcEEEEEeCCC----eEEEEEeeeccCC--CcccceEEeehhhHHHH Q lcl|NC_011270. 410 AIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRN----LVHVRHGVTTDPT--SLHTREWNIIGQQDVMV 483 (581) Q Consensus 410 ~~~~~slt~~~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~----~v~i~~~itT~~t--d~~~~~i~v~R~~d~i~ 483 (581) +++++||||++++ +.++..+|+++|+++|+++|+++|++.+++ .+||+++|||++. +..|++|+++|++|+|. T Consensus 435 ~~~~~SlT~k~i~-~~~v~~~lt~~e~e~ai~~Gv~~l~~~~~~~~~~~vrIv~~ItT~t~~~~~~~~~i~viRv~D~i~ 513 (607) T protein:vir:10 435 LGVAVPITNKKLA-LVDLDQNFSGDDLNTLNQNGVIGIEHLVNRNATGGYYIVQDVSTNTVSSSHVDGSLYLGELTDFLF 513 (607) T ss_pred CccccCcccceec-cccccccCCHHHHHHHHhCCeEEEEEccCccccceEEEeeeeeeccCCCCcchheeehhhhHHHHH Confidence 9999999999997 568999999999999999999999876543 6899999999875 67899999999999999 Q ss_pred HHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHH--HHhCCceeCCccceeEEeecCCCEEEEEEEEEecCceeEEEEE Q lcl|NC_011270. 484 YRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVW--LVDNNIIRGYRNLKARQIERQPDVIEVRYEWRPAYPLNYIVVR 561 (581) Q Consensus 484 ~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~--l~~~gaI~~~~~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~ 561 (581) ++||+.++ +.||||+||+..|.++|..+.++|+. |++.|+|++|++++++ ...++|+++|++.++|+++|||||+| T Consensus 514 ~dir~~~~-~~yIGk~nnd~~~~~vk~~i~~~L~~~~l~~~gaI~df~~edv~-v~~~~D~v~v~~~v~Pv~~iekIyvt 591 (607) T protein:vir:10 514 DNLRFVLR-DTYIGSNIRSTSADDIKSTVASYLYSEMNNDDGLIVDFSESDIV-VTISGTVVYIQFAVAPTQEIKNIVVS 591 (607) T ss_pred HHHHHHHh-hcCCcccCCcchHHHHHHHHHHHHHHHHHHhcCceeCCCccccE-EeeCCCEEEEEEEEEEcccceEEEEE Confidence 99999885 67999999999999999999999965 5557999999876654 44577999999999999999999999 Q ss_pred EEEEeccceEEEEEeeccc Q lcl|NC_011270. 562 YSIAPETGDITSTIEGTTS 580 (581) Q Consensus 562 ~~~~~~tg~~~~~~~~~~~ 580 (581) +++.|+|= |+. ..||. T Consensus 592 v~v~~~~~--~~~-~~~~~ 607 (607) T protein:vir:10 592 GTYSNYSA--TSE-DNTTK 607 (607) T ss_pred EEEEEEEE--eec-cCCCC Confidence 99999963 222 22333 No 10 >protein:vir:105470 Length: 451 # NCBI annotation: putative phage sheath tail protein # Family: family:all:632 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529880;genbank:gi:90592620;genbank:GeneID:3974534 Probab=100.00 E-value=2e-59 Score=342.40 Aligned_cols=428 Identities=13% Similarity=0.051 Sum_probs=258.5 Q ss_pred CCCceEEEEecCCccccccc----cce--eccCCCceEEEEEccccccee-eecccccc----ccceeeeecccccccee Q lcl|NC_011270. 116 DPGGPWTVTFTKAVAALTKD----VTG--LTGGDDPDLNIASEQTGVPAM-NRALAKKG----IKTDTIRVVNPNSGQVY 184 (581) Q Consensus 116 ~~g~~w~Vtf~g~~~~l~~~----~~~--l~~g~~~~v~v~~~~~g~~~~-~~~~~~~~----~~~~~~~l~~~~~~~~~ 184 (581) =+|..|.-+ .--+|...++ +.. ...+..-.+.+-....+.|.- -......+ ++.............+- T Consensus 1 magg~~~~~-~K~~PGvYi~~~~~~~~~~~~~~~~~~~~i~~~~~~g~~~~v~i~~~~d~~~~fG~~~~~~~~~~~~~~~ 79 (451) T protein:vir:10 1 MAGGTWKAQ-DKRRPGTYINVVGNGQREAASSLGRVLLIRDKGLGWGKNGVIEVEANSDFTKKLGTTLDDPSLTALKETL 79 (451) T ss_pred CCceeeccc-eeecCceEEEEeccCcceeeccCCcEEEEEeeecCCCCcccEEeecHHHHHHHcCCcccchhHHHHHHHh Confidence 233344211 0112222111 000 001111112221111111100 00000000 00000000000000000 Q ss_pred eEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhccc Q lcl|NC_011270. 185 VLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNV 264 (581) Q Consensus 185 vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~ 264 (581) ..+...+.+++..+.........+..++. ..+.|.......+.+.-+..++..+++..+.+...+..+.+......... T Consensus 80 ~g~~~v~~yrl~~g~~a~~t~~~~~~~~~-Aky~G~~Gn~i~v~v~~~~~d~~~~~v~t~~g~~~vd~qtv~~~~~~el~ 158 (451) T protein:vir:10 80 KGASKVLVLNPNEGTAATLTKEGLPWTVT-ANYPGEKGNQITVSVEVSPADQNAATVSTIFGTKLVDEQSIKFNELDKFK 158 (451) T ss_pred cCCcEEEEEEcCCCceEEEEeecCceEEE-EeeCCcCCceEEEEEecccCCcCceEEEEEECCeEEEEEEeeccchhhcc Confidence 01222334444322111110000001111 22333333223333333344444444444332222111100000000001 Q ss_pred cccce---eeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCC-CcHHHHHHHHHHHHHHhc-C Q lcl|NC_011270. 265 QSEIT---LCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGT-GAQPIQALVQQHVSAQSN-N 339 (581) Q Consensus 265 ~~~i~---~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t-~~~~i~~~l~~~v~~~~~-~ 339 (581) .++.. ...+..........++.+..+...+.++.+|.++|++||..+++.++||.. +++++|+++.+||+|||+ + T Consensus 159 ~nd~V~a~~~~~g~~~~~~~~~l~~~~~gg~~~~~~~~~~~~l~~~e~~~~n~l~~~~~~~~~~i~~~~~a~ik~~r~~~ 238 (451) T protein:vir:10 159 GNDYITAKVVEEGSSKPVAFTNVSGTLTGGTTTESNKVESLLNDALENEEYAVVTTAGFEPSSNMNKLVVEAVKRLRENE 238 (451) T ss_pred CCceEEEEecccccccceeeeecccccccccccCCccchHHHHHHhccceeeEEEEccCCCchHHHHHHHHHHHHHHHhc Confidence 11111 000100100111111222223334457789999999999999988888765 467899999999999987 4 Q ss_pred CCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccc Q lcl|NC_011270. 340 KYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRK 419 (581) Q Consensus 340 ~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~ 419 (581) +++.+++++..+. ..+|+++++++.++.. ..++..++++++++|+||++|++++++|+||+ T Consensus 239 g~~~~aVl~~~~~-------------~~~d~egiinv~n~~~------~~dg~~~~~~~~~~~vAG~~Ag~~~~~S~T~~ 299 (451) T protein:vir:10 239 GRKVRGVIPTDAD-------------TTYNYEGISTVVNGYT------LSDGTNVDVKDATGYFAGISASADVATSLTYF 299 (451) T ss_pred CCeEEEEecCccC-------------CCCCCcceEEeecceE------ecCceeechhhhHHHHHHHHcccccccCccce Confidence 7777888874322 2368999999988764 33567899999999999999999999999999 Q ss_pred cccCcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCC-----CcccceEEeehhhHHHHHHHHHHHhhhc Q lcl|NC_011270. 420 VIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT-----SLHTREWNIIGQQDVMVYRIRDYLDADG 494 (581) Q Consensus 420 ~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~t-----d~~~~~i~v~R~~d~i~~~ir~~~~~~~ 494 (581) +++|+.++..+|+++|+++|+++|+++|++.++++|||++||||+++ +..|++|+++|++|++.++|++.++ +. T Consensus 300 ~~~~~~~v~~~~t~~e~~~~i~~G~lvl~~~~g~~v~i~~~INTltt~~~~k~~~~~ki~vir~~D~i~~di~~~~~-~~ 378 (451) T protein:vir:10 300 EVEDAVSAYPKFDNEKTIKALDAGQIVFTTRPGQRVVIEQDINSLHKFTAEKPQAFSKNRVIRTLDEIATNTENTFE-RT 378 (451) T ss_pred ecCCceeeeeeCCHHHHHHHHhCCeEEEEEEcCCeEEEEEccccceecCCCCCcchhhhhHHHHHHHHHHHHHHHhh-hc Confidence 99999999999999999999999999998888889999999999865 5679999999999999999999875 67 Q ss_pred CCCc-cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEEee-cCCCEEEEEEEEEecCceeEEEEEEEEE Q lcl|NC_011270. 495 LIGM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQIE-RQPDVIEVRYEWRPAYPLNYIVVRYSIA 565 (581) Q Consensus 495 fiG~-~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~~~-~~~~~~~v~i~v~pv~~~e~I~~~~~~~ 565 (581) |||| +|+.++|.+|+++|++||++|+++|+|++|++.+++... ...+.+++++.++|+++|||||+++.+. T Consensus 379 yiGk~~N~~~gr~~~~~~i~~yl~~l~~~g~i~~~~~~d~~v~~~~~~~~v~v~~~v~pvdame~iy~t~~v~ 451 (451) T protein:vir:10 379 YLGNVGNNAAGRDLFKADRIAYLTSLQNRNMIQSFANTDITVEAGNDMDSIVVNLAVTPVDAMEKLYMTMVVR 451 (451) T ss_pred cceecCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCccceEEeecCCCCEEEEEEEEEEEeeeeeEEEEEEEc Confidence 9999 799999999999999999999999999999877765554 6789999999999999999999999988 No 11 >protein:vir:102957 Length: 437 # NCBI annotation: sheath tail protein # Family: family:all:632 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945292;genbank:gi:39653727;uniprot:Q708M0;genbank:GeneID:2672871 Probab=100.00 E-value=2.1e-58 Score=336.80 Aligned_cols=418 Identities=13% Similarity=0.105 Sum_probs=252.9 Q ss_pred CCCceEEEEecCCccccccccc-----eeccC-CCceEEEEEcccccc--eeeeccc---cccccceeeeecccccccee Q lcl|NC_011270. 116 DPGGPWTVTFTKAVAALTKDVT-----GLTGG-DDPDLNIASEQTGVP--AMNRALA---KKGIKTDTIRVVNPNSGQVY 184 (581) Q Consensus 116 ~~g~~w~Vtf~g~~~~l~~~~~-----~l~~g-~~~~v~v~~~~~g~~--~~~~~~~---~~~~~~~~~~l~~~~~~~~~ 184 (581) =.|..|.-+= --+|...++.. ...++ .+...-+.....|.. ++.-... ..-++..............- T Consensus 1 m~gg~~~~~~-k~~PGvYi~~~~~~~~~i~~~~~~~~a~~~~~~~Gp~~~~~~i~s~~d~~~~fG~~~~~~~~~~~~~~~ 79 (437) T protein:vir:10 1 MAGGIWKRQN-KVRPGAYINVKSKDIAMTRLGGDGVVTVPLALSFGQSKKLMKIRRGEDLFKKLGYEQESPQLLLLNEAF 79 (437) T ss_pred CCcceecccc-eecCceeEEEecCCcceeeccCCcEEEEEEEecCCCCceeEEEecHHHHHHHcCCccchhHHHHHHHHh Confidence 3333443110 11222221100 00011 111111222222221 0000000 00000000000000000000 Q ss_pred eEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhccc Q lcl|NC_011270. 185 VLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNV 264 (581) Q Consensus 185 vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~ 264 (581) ..+...+++++..+...... .-+..++. ..+.|.......+.+.-...++..++...+..........+...+. .. T Consensus 80 ~g~~~~~~~R~~~g~~a~~t-l~~~~~~~-A~~~G~~gn~i~v~v~~~~~d~~~~~v~~~~~~~~~d~~~v~~~~~--~~ 155 (437) T protein:vir:10 80 KRVSEVLLYRLNTGEKANVS-LSDNVTAQ-AKYSGVRGNDITVTVKTNVDDPSSFDVVTFLDTVVMDLQTVKVLAD--LK 155 (437) T ss_pred cCCCEEEEEECCCCceeeEe-eccceEEE-eccCCcccceeEEEEeeccCCccceEEEEecCcceeeeeehhhhhh--hh Confidence 11222334444322111000 00111111 1223332222223333333344444444332211110100111110 00 Q ss_pred cccce-eeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcE Q lcl|NC_011270. 265 QSEIT-LCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYER 343 (581) Q Consensus 265 ~~~i~-~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~ 343 (581) .++.. ......+.+.....++++. +++++++||.++|++||..+++.++|| +.++++|+++++||++|+++++.+ T Consensus 156 ~n~~v~~~~~~~l~~~a~~~LtGG~---dg~~t~~dy~~al~~le~~~~n~l~~~-~~d~~~~t~~~~~ik~~r~~~g~~ 231 (437) T protein:vir:10 156 NNALVEFSGTGELQPVAGAKLTGGT---DGAISTQDYLEYFKALETVEFNYMALP-VEDASIKKAAINFIKRMREDEGLG 231 (437) T ss_pred hhcccccccccccccccceeeeccc---cCCCChhHHHHHHHHhccCcceEEEec-CCChhHHHHHHHHHHHHHhccCce Confidence 11111 1111222233333344333 345678899999999999887755555 567889999999999999876666 Q ss_pred EEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccccC Q lcl|NC_011270. 344 RAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRG 423 (581) Q Consensus 344 ~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g 423 (581) ++.+.... ..|+++++++.++... .++..++++++++|+||++|++++++|+||++++| T Consensus 232 ~~~V~~~~---------------~~d~e~Iin~~n~~~~------~~~~~~~~~~~~a~vAG~~Ag~~~~~S~t~~~~~~ 290 (437) T protein:vir:10 232 AQLVVADS---------------DADSEAVINVKNGVIL------SDKTVIDKTKATVWVAAASANAGVEKSLTYEKYED 290 (437) T ss_pred EEEEeCCC---------------CCCCceEEEeecceee------cCcceechhhHHHHHHHHhccCccccCccccccCC Confidence 65554321 2378889998887653 34567899999999999999999999999999999 Q ss_pred cccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCC-----CcccceEEeehhhHHHHHHHHHHHhhhcCCCc Q lcl|NC_011270. 424 FSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT-----SLHTREWNIIGQQDVMVYRIRDYLDADGLIGM 498 (581) Q Consensus 424 ~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~t-----d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~ 498 (581) +.++..+|+++|+++|+++|+++|+ +++++|+|+|||||+++ +..|++|+++|++|++.++||+.++ +.|||| T Consensus 291 ~~~v~~~~t~~e~~~~i~~G~~vl~-~~~~~v~i~~gInTltt~~~~~~~~~~ki~vir~~D~i~~di~~~~~-~~yiGk 368 (437) T protein:vir:10 291 SVDVVGRLSHTETEDALLKGQFVFT-ARRGRAVVEQDINSHVSFTIEKNQDFRKNRILRTLDDIVNDTRYAFS-EYFLGK 368 (437) T ss_pred cccccccCCHHHHHHHHhCCcEEEE-EeCCeEEEEEccccccccCCCCCchhhhhhHHHHHHHHHHHHHHHHH-hccccc Confidence 9999999999999999999999996 56789999999999875 6679999999999999999999885 679999 Q ss_pred -cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEEe-ecCCCEEEEEEEEEecCceeEEEEEEEEE Q lcl|NC_011270. 499 -PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQI-ERQPDVIEVRYEWRPAYPLNYIVVRYSIA 565 (581) Q Consensus 499 -~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~~-~~~~~~~~v~i~v~pv~~~e~I~~~~~~~ 565 (581) +|++++|.+|+++|++||++|+++|+|++|+..+++.. .+.++.+++++.++|+++|||||+++.+. T Consensus 369 ~~N~~~~r~~~~~~i~~yl~~l~~~g~I~~~~~~d~~v~~~~~~~~v~v~~~v~~~dame~iy~ti~v~ 437 (437) T protein:vir:10 369 VSNNEDGRQAFKANRIRYFKDLEARGAIEDFKVEDIEVLRGELKESVVVNVKVKPVDSMEKLYMTVTVE 437 (437) T ss_pred cCCCHHHHHHHHHHHHHHHHHHHhCCCccCCCceeEEeecCCCCCEEEEEEEEEEeeeeeeEEEEEEec Confidence 79999999999999999999999999999998887654 36789999999999999999999999888 No 12 >protein:vir:102819 Length: 648 # NCBI annotation: tail sheath protein # Family: family:all:2449 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874082;genbank:gi:118197689;genbank:GeneID:4496011 Probab=100.00 E-value=1.4e-55 Score=321.29 Aligned_cols=535 Identities=16% Similarity=0.134 Sum_probs=259.4 Q ss_pred Ceecc----ccccCCCcccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEEEEEEeccc---cce Q lcl|NC_011270. 1 MAIDF----SQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALVGEP---TGG 73 (581) Q Consensus 1 ~~~~~----~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~~~~---~~G 73 (581) ||++. .+++.|++|.+-.-+---........-+....... .++.++..+|+-.++. .+| T Consensus 1 ma~~~yf~~~~~~~PGVyvee~~sg~~~i~gv~tsva~fvG~a~--------------~Gp~~~p~~v~s~~~~~~~fgg 66 (648) T protein:vir:10 1 MAISVYFDGKLIKQLGAYVKTDLSAVKQINGVGTGIVALLGLAE--------------GGETYKPYRLTSFAEAVSIFKG 66 (648) T ss_pred CeeeeeeCCCCccCCceEEEEeccccccccCCCCceEEEEEeeC--------------CCCCceeEEecCHHHHHHHhcC Confidence 99876 57888999987431111011111111111111111 1222232222211100 000 Q ss_pred e-----------------EEEEeCceeccccccCCCHHHHH-HHHH--h-cCCC--CcceEEEE---cCCCceEEEEe-- Q lcl|NC_011270. 74 S-----------------FKLSLAGEPTGNIPFNATQGQVQ-SALR--A-LPNV--EDDEVTVL---GDPGGPWTVTF-- 125 (581) Q Consensus 74 t-----------------F~l~~~g~~T~~i~~~asa~~v~-~aLe--~-l~~i--~~~~V~~~---~~~g~~w~Vtf-- 125 (581) . |.+..++. +++.+. ..|+ + .++- ..+.+++. +.....|.++. T Consensus 67 g~l~~av~~~F~nGg~~~~~vRv~~~---------~~a~~~~~~~~~~a~~~g~~gn~i~~~v~~~~~~~~~~~~l~v~~ 137 (648) T protein:vir:10 67 GPLLEHIKAAFIGGAGEVVAVRIGNP---------TTASVSIPVAQNTSDTSPANLNFVSYEASTRSNQIYVSFDLDENF 137 (648) T ss_pred ccHHHHHHHHHhCCCcEEEEEEcCCC---------cccceecceeEEeecccCCCCCceEEEEEEcCCCcCceeEEEEEe Confidence 0 11111111 111000 0000 0 0000 01121121 11222333322 Q ss_pred cCCc--cc-c--cc--ccceec-cCCCceEEEEEcccccceeeeccccccccceeeee----c-cc-cccceeeE----e Q lcl|NC_011270. 126 TKAV--AA-L--TK--DVTGLT-GGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRV----V-NP-NSGQVYVL----G 187 (581) Q Consensus 126 ~g~~--~~-l--~~--~~~~l~-~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l----~-~~-~~~~~~vt----g 187 (581) ..+. .. + .+ ....-+ .+..+++++..... .+........++.....+ . .+ ....+..+ - T Consensus 138 ~~~~~~~d~~v~~i~~~~~~y~gt~~~~t~~v~~~~~---~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~v~~~~~~~ 214 (648) T protein:vir:10 138 TSANEADDTIIFTIYQKHPDFSVTRETFTFPRKFTTP---TVLVKRGSTLFFVDRSIVNAALAAGPAFQTALINLLKEQL 214 (648) T ss_pred cCCCcccceeEEEeccCCCcccccceecccccccccc---ccccccccceeecCccchhhhhccCccchhhhhhchhhhh Confidence 1111 00 0 00 000000 01111111111100 000000000000000000 0 00 00000000 0 Q ss_pred ccceeEEeecccccccCcceeeee---eeeeeecccccccceeEEEEee---------cCCcccceeEeccCcch-hhhh Q lcl|NC_011270. 188 TDYVVTRVNAGEDGEANTRDDLYT---IQRVVDGGHIDPGDIVQLSYRY---------TDPNYHEVIRFTDPDDI-QDFY 254 (581) Q Consensus 188 td~~v~~v~~~~dg~~~~~~~~~t---i~~~vd~~~~d~~~~~~~s~~~---------~~~~~~e~~~~~d~~~~-~~~~ 254 (581) .+....+.....+ .. ..+... ......+..........+..+. .+....+.-.+.+..+. .... T Consensus 215 ~~~~~~~~~~~s~--~~-~~d~~~~~~~~~a~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~tp~~~~~~~~~~~~ 291 (648) T protein:vir:10 215 QPTDVVQIFDASD--TN-PVDIPLGLFVYEVLYGGLFGFTKSRLVKTSFGTVDDLLSNPLLFNLSATPFFDGSDYQDYTS 291 (648) T ss_pred hhhhhheeccccc--cc-ccccccccccccccchhhhcCCcchhhhhhhccccccccccceecccccccccccceeeeec Confidence 0000000000000 00 000000 0000000000000000000000 00000000000000000 0000 Q ss_pred ----hhhhhhhcccccccee--------eeee-eecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEe--- Q lcl|NC_011270. 255 ----GPAFDEAGNVQSEITL--------CAQL-AITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVA--- 318 (581) Q Consensus 255 ----~~a~~~~g~~~~~i~~--------~~~~-~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~--- 318 (581) .+.+.......+.+.. ..++ .+.+|........--......++.||+++|++|++++.+.++.+ T Consensus 292 ~~~~~~~~~v~~~~~~~l~~~~~~p~~~~~~~t~L~GGtdG~~p~s~~~~~~~~~~~d~~d~l~~~~~~~~~~ivp~~~~ 371 (648) T protein:vir:10 292 LSDPANWFAKDAYTINHLVDTTINPHILATRIFSLSGGTNGDDGTGYYQTAVSNYINIWSQGLATLEEEEVNFVIPAYKF 371 (648) T ss_pred cccccceeeeeccchhhcccccccCcccccccceecccccCCCcccccccccccchhhHHHHhhhccCCCceEEEeeccc Confidence 0000000000111100 0001 12222211111000011122378899999999999887644431 Q ss_pred ---------CCCcHHHHHHHHHHHHHHhcC-----CCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEc------ Q lcl|NC_011270. 319 ---------GTGAQPIQALVQQHVSAQSNN-----KYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISP------ 378 (581) Q Consensus 319 ---------~t~~~~i~~~l~~~v~~~~~~-----~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~------ 378 (581) .++.++||+++.+|++.|+.+ +...++++|+.++++.... +.+-+...+|++|....+. T Consensus 372 ~~~~~~~~~lt~~q~i~a~a~shv~~~s~~~~~~~r~~~~~~vg~~~~es~~~s-e~~~~~~~~~~~~a~~~~~d~~~~~ 450 (648) T protein:vir:10 372 TNVTQLNDRLTIFKGIASTFLSHVQTMSQVNRRKARVGVFGLPAPSPNESVTAS-EYLYNRNILNTISAMFGGTDRAQAV 450 (648) T ss_pred ccccccccccCCccchHHHHHHHHHHhhhccccccccCeEEEeCCCCchhHHHH-HHHhhhhcccccceeeeecCCceEE Confidence 688899999999999999733 2334666666655543333 3344555677766433222 Q ss_pred ----CeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccccCcc-cccccCCHHHHHHHHhCCcEEEEEeCCC Q lcl|NC_011270. 379 ----SSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFS-GPAEVQRDGEKSRESSEGLMVIEKTPRN 453 (581) Q Consensus 379 ----~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~-~~~~~~t~~e~~~l~~~Gv~~l~~~~~~ 453 (581) ++..+++ . .....+|+|++||++||+++++++++|||||+|+++. ++..+|+++|+++|+++|++||++.+++ T Consensus 451 ~~~~~~~~~~~-~-G~~~~~p~~~~Aa~VAGl~a~l~~~~s~T~k~i~~~~id~~~~~t~~qld~L~~~Gv~~ie~~~~~ 528 (648) T protein:vir:10 451 VFPFYSNVFND-E-GKVELLGGEFFASYVAGMHANREPQDSITFLPISGIGAEPLYNWTYTQKDDLISNRVLFVEKVKTS 528 (648) T ss_pred eecccceeECC-C-CcEEecchhhHHHHHHhhhhccccccCcccceeeccccccccCCCHHHHHHHhcCCcEEEEEecCC Confidence 2322221 1 1123489999999999999999999999999999763 3446999999999999999999988764 Q ss_pred ----eEEEEEeeeccCC--CcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeC Q lcl|NC_011270. 454 ----LVHVRHGVTTDPT--SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRG 527 (581) Q Consensus 454 ----~v~i~~~itT~~t--d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~ 527 (581) .+||++||||+.. +..|++|+++|+.||+.+.||+.++ +.|||+||++..|.+||+.|.+||.+++++++|++ T Consensus 529 ~~~~~~rvv~gITT~~~~~~~~~~eisv~ri~D~l~~~vr~~l~-~~fIG~~n~~~~~~~ik~~i~~~L~~~~~~~~I~~ 607 (648) T protein:vir:10 529 FGGIVYRIHHNPTTWLGPVTQGFQEFVLRRIDDFLQSYVYKNLQ-EQFIGRKSYGRKTENDIKVYTEALLSNLVGKQIVA 607 (648) T ss_pred cceeeEEEeccceeecCCCCcceeeeeeeehhhHHHHHHHHHHh-hhcCcccccHHHHHHHHHHHHHHHhhHhhcCcccC Confidence 5889999999985 6778999999999999999999886 78999999999999999999999999999999999 Q ss_pred CccceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccc Q lcl|NC_011270. 528 YRNLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETG 569 (581) Q Consensus 528 ~~~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg 569 (581) |++.++++..+ .|+++|+|+++|++|||||++++++++.-- T Consensus 608 y~~~~v~~~~~-~~vv~V~~~v~Pv~~i~~I~vti~it~~~~ 648 (648) T protein:vir:10 608 YKDVKVTSNED-KTVYYVEFFYQPVTEIKFILVTMKVTFDLE 648 (648) T ss_pred cccceEEEEec-CCEEEEEEEEEecceeeEEEEEEEEEeccC Confidence 99999987654 599999999999999999999999988655 No 13 >protein:vir:78986 Length: 436 # NCBI annotation: putative sheath tail protein # Family: family:all:632 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110731;genbank:gi:134287348;genbank:GeneID:4955160 Probab=100.00 E-value=8.5e-56 Score=322.49 Aligned_cols=415 Identities=12% Similarity=0.049 Sum_probs=257.3 Q ss_pred HhcCCCCcceEEEEcCCCceEEEEecCCccc-cccccce-----eccCCCceEEEEEccccc------ceeeeccccccc Q lcl|NC_011270. 101 RALPNVEDDEVTVLGDPGGPWTVTFTKAVAA-LTKDVTG-----LTGGDDPDLNIASEQTGV------PAMNRALAKKGI 168 (581) Q Consensus 101 e~l~~i~~~~V~~~~~~g~~w~Vtf~g~~~~-l~~~~~~-----l~~g~~~~v~v~~~~~g~------~~~~~~~~~~~~ 168 (581) .+|.+-.-..-..-.++ .-+.|...... ......+ +...=+|.-.+.....+. ..+........+ T Consensus 1 ~~magg~~~~~~K~~PG---~Y~n~~~~~~~~~~~~~rGi~a~p~~~~wGp~~~v~~i~~~~~~~~~~~~~G~~~~~~~~ 77 (436) T protein:vir:78 1 MALGGGTFVTQNKVLPG---SYINFVSATRATSSLSDRGIVAMPLELDWGIDEEVFQVTSDDFEKYSTKYFGYDYTHEKL 77 (436) T ss_pred CcccceeeccceeecCc---eEEEEEecCcceeeccCCeEEEEEEEecCCCCceeEEeecccchHHHHHHhcCccchHHH Confidence 22222100000000111 11222211000 0000000 000111111111111110 000000000000 Q ss_pred cceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCc Q lcl|NC_011270. 169 KTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPD 248 (581) Q Consensus 169 ~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~ 248 (581) + .+.. +-......+.|+...+... ... +. -.-+.+.+.....+.+.-...++..+++..+.... T Consensus 78 ~----~l~~-----~~~~~~tv~~yrl~~G~~a---~~~-v~---~Aky~g~~gn~i~v~v~~~~~d~~~~dv~~~~g~~ 141 (436) T protein:vir:78 78 K----GLRD-----LFKNIRLGYFYKLNKGVKA---SCS-IA---TARCSGIRGNDLKVIVTTNIDDNAKFDVVTLLDNK 141 (436) T ss_pred H----HHHH-----HhcCCCEEEEEECCCccee---eee-ee---eeecCCCCCcEEEEEecccccccCceEEEEEecch Confidence 0 0000 0001122334554322221 111 11 12233333322333444445566666666665443 Q ss_pred chhhhhhhhhhhhccccccc-eeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHH Q lcl|NC_011270. 249 DIQDFYGPAFDEAGNVQSEI-TLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQA 327 (581) Q Consensus 249 ~~~~~~~~a~~~~g~~~~~i-~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~ 327 (581) .+....+..++.. ..|+. .......+.+.+...++++.++ .+++++||.++|++||..+++.++||. .++++|+ T Consensus 142 ~~d~~~~~~~~~l--~~n~~V~~~~~g~la~~a~~~LtGG~dG--~~~T~~dy~~al~~le~~~fn~l~~~~-~d~~~~~ 216 (436) T protein:vir:78 142 KVDTQIAKVITEL--QDNDYVTWKKEATLEATAGLTFTNGTNG--EAVTGTEYQAFLDKIESYSFNALGCLA-TTAEIKS 216 (436) T ss_pred hhhhhhHHHHhhc--cCCceEEEEecccccccceeeeeccccc--cccchHHHHHHHHHHcccceeEEEecC-CChHHHH Confidence 3222222222111 11221 1111223344444445544332 346889999999999999877555555 6889999 Q ss_pred HHHHHHHHHhcC-CCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHH Q lcl|NC_011270. 328 LVQQHVSAQSNN-KYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGK 406 (581) Q Consensus 328 ~l~~~v~~~~~~-~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl 406 (581) ++.+||+|||++ +++..+++. ++...|++.++++.++. ++..++++++++|+||+ T Consensus 217 ~~~a~ikr~re~~g~~~~aV~~---------------~~~~~d~EgIInv~n~v---------~g~~~~~~~~~a~vAG~ 272 (436) T protein:vir:78 217 LFVEFTKRMRDKVGAKFQTVLY---------------KKNDADYEGVVSVENKI---------KDTGLLESSLIYWTTGA 272 (436) T ss_pred HHHHHHHHHHhhcCCeEEEEec---------------CCCCCCCceEEEeeccc---------CCceechhHHHHHHHHH Confidence 999999999975 555556653 23456888999987642 35568899999999999 Q ss_pred hhccchhcccccccccCcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCC-----CcccceEEeehhhHH Q lcl|NC_011270. 407 SVSAIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT-----SLHTREWNIIGQQDV 481 (581) Q Consensus 407 ~a~~~~~~slt~~~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~t-----d~~~~~i~v~R~~d~ 481 (581) +|++++++|+||++++++.++..+|+++|+++++++|+++|++ ++++|+|++||||+++ +..|++|+++|++|+ T Consensus 273 ~Ag~~~~~S~T~~~~~~~~~v~~~~t~~e~~~ai~~G~lvl~~-d~~~v~I~~~VNTltt~~~~k~~~~~kI~vir~~D~ 351 (436) T protein:vir:78 273 IAGCDINKSNTNKRYDGEFDVDVNYTQIHLEEALKTGKFIFHK-VGDEVHVLEDINTFVSFTDEKNDDFSSNQSVRVLDQ 351 (436) T ss_pred HhcCccccCccceecCccccccccCCHHHHHHHHhCCeEEEEE-eCCeEEEEEccccceecCCCCCcchhhhhHHHHHHH Confidence 9999999999999999999999999999999999999999975 5678999999999864 557999999999999 Q ss_pred HHHHHHHHHhhhcCCCc-cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEEee-cCCCEEEEEEEEEecCceeEEE Q lcl|NC_011270. 482 MVYRIRDYLDADGLIGM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQIE-RQPDVIEVRYEWRPAYPLNYIV 559 (581) Q Consensus 482 i~~~ir~~~~~~~fiG~-~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~~~-~~~~~~~v~i~v~pv~~~e~I~ 559 (581) +.++||+.++ +.|||| +|+.++|.+|+++|++||++|+++|+|++|++.+++... ...+.+++++.++|+++||+|| T Consensus 352 i~~di~~~~~-~~yiGKv~N~~dgr~~l~~~i~~yl~~L~~~g~I~~f~~~Dv~v~~~~~~~~v~v~~~v~pvdamekiy 430 (436) T protein:vir:78 352 IANDIATLFN-TKYLGEVPNDKSGRISFWNDVVKHHEQLQNMRAIEDFKADDVSVEPGSDKKTVVVSDAVKVISAMSKLY 430 (436) T ss_pred HHHHHHHHhh-hccccccCCCHHHHHHHHHHHHHHHHHHHhCCcccCCCCcceEEeecCCCCEEEEEEEEEEEEeeeeEE Confidence 9999999875 679999 799999999999999999999999999999987776553 6688999999999999999999 Q ss_pred EEEEEE Q lcl|NC_011270. 560 VRYSIA 565 (581) Q Consensus 560 ~~~~~~ 565 (581) +++.++ T Consensus 431 ~ti~v~ 436 (436) T protein:vir:78 431 MTVSVS 436 (436) T ss_pred EEEEEC Confidence 999999 No 14 >protein:vir:102359 Length: 356 # NCBI annotation: XkdK protein # Family: family:all:632 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529565;genbank:gi:90592650;genbank:GeneID:3974491 Probab=100.00 E-value=3.7e-50 Score=291.61 Aligned_cols=322 Identities=8% Similarity=0.052 Sum_probs=220.4 Q ss_pred eeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchh Q lcl|NC_011270. 172 TIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQ 251 (581) Q Consensus 172 ~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~ 251 (581) .-+|++ .. ..+-+-.+....+...-++.+ ...+++.. ...++..++.. T Consensus 1 ~~glp~---------------------------i~--i~f~~~a~ta~~~g~rGiv~~-il~d~~~~--~~~~~~~~~v~ 48 (356) T protein:vir:10 1 MAGLVN---------------------------IN--IEFKELATSFIQRSKAGIVAI-ILKDTTKM--YKELTSEDDIP 48 (356) T ss_pred CCCCCc---------------------------ee--EEEeecceeeccCCccceEEE-EEecCCcc--eeEEeccccch Confidence 111111 01 111111111111111111211 11222221 22233333322 Q ss_pred hhhhhhhhhhccccccceeeeeeeecCCcce-----eEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHH Q lcl|NC_011270. 252 DFYGPAFDEAGNVQSEITLCAQLAITNGAST-----ILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQ 326 (581) Q Consensus 252 ~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~-----~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~ 326 (581) +.+.... ..-.+..+.++..+ ...+.... ..+++||.++|++||..+++.+.||. .++++| T Consensus 49 ~~~~~~n----------~~~i~~~~~g~~~~~~~~~p~~~~~~~---~~t~~~y~~aL~~le~~~fn~l~~~~-~d~~~~ 114 (356) T protein:vir:10 49 ISLSADN----------KKYIKYGFVGATDNEKVLRPSKVIIST---FTEDGKVEDILEELESVEFNYLCMPE-AIEAEK 114 (356) T ss_pred hHHHHHH----------HHHHHHHhhccccccccccceeeeeec---ccCchhHHHHHHHhcCccceEEEecC-CChHHH Confidence 2111100 00011111111111 11111111 23567999999999998888777775 567999 Q ss_pred HHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHH Q lcl|NC_011270. 327 ALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGK 406 (581) Q Consensus 327 ~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl 406 (581) +++.+|++|||++++.+++.|... ...|+|.++++.++.. . ++..+.++++++|+||+ T Consensus 115 ~~~~a~ikr~r~~~~~~~~~V~~~---------------~~aD~EgIInv~n~~~-~------~g~~~t~~~~~~~vAG~ 172 (356) T protein:vir:10 115 TKIVTWIKKIREEESTEAKAVLAN---------------IKADNEAIINFTENVV-V------DGEEITAEKYTTRVASL 172 (356) T ss_pred HHHHHHHHHHHhcCCcEEEEEecC---------------CCCCCceeEEeecCeE-e------cceeechhHHHHHHHHH Confidence 999999999999988888777532 1358899999987643 2 45578899999999999 Q ss_pred hhccchhcccccccccCcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCC-----CcccceEEeehhhHH Q lcl|NC_011270. 407 SVSAIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT-----SLHTREWNIIGQQDV 481 (581) Q Consensus 407 ~a~~~~~~slt~~~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~t-----d~~~~~i~v~R~~d~ 481 (581) +|+++.++|+||++++++..+ .+|+++|+++++++|.++|.+ +++.|+|++||||+++ +..|++|+++|++|. T Consensus 173 ~Ag~~~n~S~T~~~~~~~~~~-~~~t~~e~~~ai~~G~lvl~~-d~~~V~I~~~VNSltt~t~~k~~~f~Kirvvr~~D~ 250 (356) T protein:vir:10 173 IASTPNTQSITYAPLDEVESI-VKIDKASADAKVQAGELILRR-LSGKIRIARGINSLTTLTAEKGEIFQKIKLVDTKDL 250 (356) T ss_pred HhccchhccccceecCCcccc-ccCCHHHHHHHHhCCeEEEEE-EcCeEEEEecCccceecCCCCCcchhhhHHHHHHHH Confidence 999999999999999987655 589999999999999999965 5678999999999864 456999999999999 Q ss_pred HHHHHHHHHhhhcCCCc-cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEE-----------------------ee Q lcl|NC_011270. 482 MVYRIRDYLDADGLIGM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQ-----------------------IE 537 (581) Q Consensus 482 i~~~ir~~~~~~~fiG~-~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~-----------------------~~ 537 (581) +.++|++.++ +.|||| +|+.++|.+|+++|++||++|+++|+|+++...++.. .. T Consensus 251 i~~Di~~~f~-~~yiGKv~N~~dgr~~l~~ai~~y~~~L~~~~~I~~~~~~eid~e~q~~~~~~~g~d~~~~~d~~v~~~ 329 (356) T protein:vir:10 251 ISKDIKNIYV-EKYLRKCPNTYDNKCLFIVAVQSYLTELAKQELIDSNFTVEIDLEKQKEYLEGKKIAVSKMKENEIKEA 329 (356) T ss_pred HHHHHHHHHh-hccccccCCCHHHHHHHHHHHHHHHHHHHhCCccccCceeEecccchHHHhhhccccccccccceeecc Confidence 9999999875 689999 7999999999999999999999999998643221100 11 Q ss_pred cCCCEEEEEEEEEecCceeEEEEEEEE Q lcl|NC_011270. 538 RQPDVIEVRYEWRPAYPLNYIVVRYSI 564 (581) Q Consensus 538 ~~~~~~~v~i~v~pv~~~e~I~~~~~~ 564 (581) ...+.+++.+.++|+++||+||+++.+ T Consensus 330 ~~~~~v~~~~~v~~vdamE~iy~ti~v 356 (356) T protein:vir:10 330 NTGSNGFYLINLKLVDAMEDINIRVQM 356 (356) T ss_pred cCCcEEEEEEEEEEEeeeeeEEeEEeC Confidence 345789999999999999999999988 No 15 >protein:vir:98824 Length: 774 # NCBI annotation: putative phage tail sheath protein # Family: family:all:661 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851105;genbank:gi:117530262;genbank:GeneID:4484488 Probab=100.00 E-value=4.3e-46 Score=269.28 Aligned_cols=546 Identities=18% Similarity=0.181 Sum_probs=280.0 Q ss_pred CeeccccccC---CCcccc-cCcccccccccccCceeeEEEe--cCCCCceeee-------eEEcCcCCceeeEEEEEEe Q lcl|NC_011270. 1 MAIDFSQYQT---PGVYTE-AVGAPQLGIRSSVPTAVAIFGT--AVGYQTYRES-------IRINPDTGETITTQILALV 67 (581) Q Consensus 1 ~~~~~~~~~~---~~~~~~-~~g~~~~~~~~~~~~~~~~~~~--~~g~~~~~~~-------~~~~~~~~~~~evq~v~~~ 67 (581) ++--.+.-+. .+.|+. +-|+-++-......+-++.-|+ +.|-...+|. +.+..-.++...| .+-++ T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 216 (774) T protein:vir:98 138 VAPVVSSKETVAATGPYTGSTSGRYFLRVDDVAAGVATIKWQFVPLGQNPLNWAAVTTDIDVNIAAGAGSSSNV-LIPLT 216 (774) T ss_pred ecceeccccceeeccccccccCceEEEEEcccccceeeeEEEEEecCCCcccceeeeeeeeeeeeccCCCccce-EEEee Confidence 2211111111 123442 3344444443334444445554 2232222221 1111111111111 11111 Q ss_pred --------ccccceeEEE--------EeCce-eccccccCCCHHHHHHHH-HhcCCCCc-ceEEE--EcCC--------- Q lcl|NC_011270. 68 --------GEPTGGSFKL--------SLAGE-PTGNIPFNATQGQVQSAL-RALPNVED-DEVTV--LGDP--------- 117 (581) Q Consensus 68 --------~~~~~GtF~l--------~~~g~-~T~~i~~~asa~~v~~aL-e~l~~i~~-~~V~~--~~~~--------- 117 (581) |..++.+.+| ..+.. .+.+|-...-+..+-.+| ++..+|.. ..++| ..+. T Consensus 217 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~GVYVEEVpSG 296 (774) T protein:vir:98 217 SNNISVRFGTVVGAALNLVEGNSWSIRVNKYVVSVPIFSGDLPNQIVTSLISACAGVEPFGEITRNVEDNGVVIQLEPAL 296 (774) T ss_pred cCceeEEEeeeeeeeEeeecCCccceeecceeeeeccccCcchhhhhhhhhhhhcccccccceEEEEecCceEEEEeCCC Confidence 2222222333 22221 123443333444554444 57777755 33333 3211 Q ss_pred -----C-ceEE-EEecCC-------cccccc---c----cceeccC---CCceEEEEEcccccceeeeccccccc-cc-e Q lcl|NC_011270. 118 -----G-GPWT-VTFTKA-------VAALTK---D----VTGLTGG---DDPDLNIASEQTGVPAMNRALAKKGI-KT-D 171 (581) Q Consensus 118 -----g-~~w~-Vtf~g~-------~~~l~~---~----~~~l~~g---~~~~v~v~~~~~g~~~~~~~~~~~~~-~~-~ 171 (581) | -... --|.|- .|.|.- | .....|| ....+...-...|.+.+...+...+. +. - T Consensus 297 vrtIeGGV~TSVAAFVG~A~rGPvn~PvlITS~aD~~~~Fg~~~GGl~GassA~r~~~~~sG~~~L~i~A~~pGawGN~I 376 (774) T protein:vir:98 297 TGSISNRFSFYVTANDNTANRGFTTSPALVTTIPDPAIHFTSFQGGLDGPRSAFRDFYTFNGTPLLRLQAVSEGNWGNQV 376 (774) T ss_pred CccccccccceeeeecccccCCCCCcCEEEeehhHhhhhhccccCCccccceeeeeeeeecccceEEEEEeecCcCCCce Confidence 1 1111 123221 111100 0 0000110 00011111111111111000000000 00 0 Q ss_pred eeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccc-cceeEE---EEe--ecCCcccceeEec Q lcl|NC_011270. 172 TIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDP-GDIVQL---SYR--YTDPNYHEVIRFT 245 (581) Q Consensus 172 ~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~-~~~~~~---s~~--~~~~~~~e~~~~~ 245 (581) ...+.....+... .......+........++..++ ..+..+. ...... .+. ...+... . T Consensus 377 tV~I~~~t~~~~~------l~v~~~~~s~f~~~~a~e~~tv----~~~~~~~~~~v~e~~dn~~i~~~~~~~~~-----~ 441 (774) T protein:vir:98 377 TVSIYPVNNSEFR------LNVQDLNGSAFNPPLADEVYTV----KLGDTNESGELNALLDSKFIRGFFLPKSI-----D 441 (774) T ss_pred EEEEEecCCceeE------EEEEecCCccccccccceeEEE----ecccccccceeeeeeceeeEeeccccccc-----c Confidence 0000000000000 0000000000000000000000 0000000 000000 000 0000000 0 Q ss_pred cCcchhhhhh-hhhhhhc-c-ccccceeeeee-eecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCC Q lcl|NC_011270. 246 DPDDIQDFYG-PAFDEAG-N-VQSEITLCAQL-AITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTG 321 (581) Q Consensus 246 d~~~~~~~~~-~a~~~~g-~-~~~~i~~~~~~-~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~ 321 (581) ..+....... ..+...+ . ....+...... ..............+.++.+.+..+|..++++++..+ ..++++... T Consensus 442 ~in~vs~lv~~~~~~~a~~d~~~~~~~~~~~~~~~~~~~~v~v~lagG~Dg~~tt~~~igg~~~~~~~tg-i~aLl~a~~ 520 (774) T protein:vir:98 442 SINYDAALVRQSPLRLAPPDESETDVENPAHVDFYGPNVLVDVTLENGYDGPPVTNDDYVSIIRTLENQP-VHILLVGTT 520 (774) T ss_pred cccccccccccchhcccccccccccccccccccccCCcceEEEeecCCCCcccccchheecccccccccc-eeEEEcCcc Confidence 0000000000 0000000 0 00000000000 0001111112222233344556778999998888754 456666777 Q ss_pred cHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHH Q lcl|NC_011270. 322 AQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAA 401 (581) Q Consensus 322 ~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa 401 (581) +..++..+.+||++++..++.|+++++.+++.+. ++.+.....|+|+|+++++|+....+...+ ..+.+|+ ++ T Consensus 521 ~~~V~~aii~~~e~~~~~~~~r~avid~p~g~t~---~~Ai~~r~~f~S~~aal~~Pwvkv~D~~~g-~~~~vPp---Sg 593 (774) T protein:vir:98 521 NVGVQQALITEAERASDSDGLRIAVLAAPPRTTP---TLAASVTRGFNSTRAVMVAGWFTYAGQPNS-SRYGVPG---AA 593 (774) T ss_pred chhhHHHHHHHHHHhhhcccceEEEEECCCCCCH---HHHHHHHhccCCceEEEEeCcEEEeccCCC-ceeecCh---hH Confidence 8889999999999999888999999998766543 455677889999999999998877665433 3345564 58 Q ss_pred HHHHHhhccchhcccccccccCcccc------cccCCHHHHHHHHhCCcEEEE-EeCCCeEEEEEeeeccCCCcccceEE Q lcl|NC_011270. 402 AVAGKSVSAIAAMPLTRKVIRGFSGP------AEVQRDGEKSRESSEGLMVIE-KTPRNLVHVRHGVTTDPTSLHTREWN 474 (581) Q Consensus 402 ~vAgl~a~~~~~~slt~~~l~g~~~~------~~~~t~~e~~~l~~~Gv~~l~-~~~~~~v~i~~~itT~~td~~~~~i~ 474 (581) ++||+.|+.|+|+||.|++|.|+.++ ...+++.|++.|+.+|++++. ...++++++ ||-+|+.+|++|++|+ T Consensus 594 ~vAGl~ArtDv~kSPANk~I~Givg~ai~~~l~~~~t~ae~d~Ln~~gIN~i~itt~g~G~rv-WG~RTlssDp~wr~In 672 (774) T protein:vir:98 594 VYAGKLAAIDFFVSPAARSLVGPLFNIIESDTDNYTSRSNQDIYSAARLEVLSLDTVDRTYRF-ASGVTLSTDPAWERIY 672 (774) T ss_pred HHHHHHHhcCcccccCCceeecceeccccccccccccchhhhhhcccccceeEEEEcCCcEEE-EcccccCCCcccceEe Confidence 88999999999999999999998754 344578899999999999986 356777765 5667888999999999 Q ss_pred eehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCccc-----eeEEeecCCCEEEEEEEE Q lcl|NC_011270. 475 IIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL-----KARQIERQPDVIEVRYEW 549 (581) Q Consensus 475 v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~-----~~~~~~~~~~~~~v~i~v 549 (581) +||++|+|+++|++.++| |++|||++.+|..|+..++.||.+||++|+|.++++. ..++.+++.|+++++|.+ T Consensus 673 VRRlfd~Ie~SI~~~~~~--~VfEPNd~~l~~~I~~sI~~fL~~L~~~GaL~G~~~V~~D~etNt~~dI~~G~l~i~I~v 750 (774) T protein:vir:98 673 LRRVHDVVRQGAHAILRN--YVAMPNSRLVRNQIAAALNAFMGELKRNGNIVSFRPAIIDGSNNSTAAYFSRELYVSLQF 750 (774) T ss_pred ehhhHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceecceEEEEcCCCCCHHHhhCCEEEEEEEE Confidence 999999999999999876 6779999999999999999999999999999998753 245567889999999999 Q ss_pred EecCceeEEEEEEEEEeccceEEE Q lcl|NC_011270. 550 RPAYPLNYIVVRYSIAPETGDITS 573 (581) Q Consensus 550 ~pv~~~e~I~~~~~~~~~tg~~~~ 573 (581) +|++|+|||+++|++++++.++.= T Consensus 751 aP~~PAEfIilri~q~t~~~~l~E 774 (774) T protein:vir:98 751 QPLYSADYIYVTISRDTETSPLGE 774 (774) T ss_pred EecCCcceEEEEEEEeecceeccC Confidence 999999999999999999887654 No 16 >protein:vir:108052 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595294;genbank:gi:161622600;genbank:GeneID:5783771 Probab=100.00 E-value=4.6e-42 Score=247.20 Aligned_cols=549 Identities=15% Similarity=0.087 Sum_probs=253.6 Q ss_pred CeeccccccCCCc---------ccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCcee-----eEEEEEE Q lcl|NC_011270. 1 MAIDFSQYQTPGV---------YTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETI-----TTQILAL 66 (581) Q Consensus 1 ~~~~~~~~~~~~~---------~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~-----evq~v~~ 66 (581) -=|.+-.|-|... |...+|.|..... ..-+.. .....|.+.-+-+|........+ ....++. T Consensus 28 ~~vg~~~~gp~~~p~~v~s~~~~~~~fg~~~~~~~--~~~~~~--~~f~~~g~~~~vvrv~~~~~~~~~~~~~~~~~~~~ 103 (660) T protein:vir:10 28 ALVGKFQWGPAFQVTQITNEVELVDLFGGPNNEVA--DYFMSG--MNFLQYGNDLRTVRVVSREFAKNASPIAGNIETTI 103 (660) T ss_pred eEEeecCCCCCccCeEcCCHHHHHHHcCCcCCCch--hHHHHH--HHHHhCCceEEEEEecccccccccccccccceeEE Confidence 1133334444422 2235566542110 000000 01122333222233321110000 0001111 Q ss_pred e--cc--ccceeEEEEeCceecc----ccccCCCHHHHHHHHHhcCCCC-cceEEEEcCCCceEEEEecCCcc----ccc Q lcl|NC_011270. 67 V--GE--PTGGSFKLSLAGEPTG----NIPFNATQGQVQSALRALPNVE-DDEVTVLGDPGGPWTVTFTKAVA----ALT 133 (581) Q Consensus 67 ~--~~--~~~GtF~l~~~g~~T~----~i~~~asa~~v~~aLe~l~~i~-~~~V~~~~~~g~~w~Vtf~g~~~----~l~ 133 (581) . +. ..+..-++.+.+..-. ...++++.......+..-...+ ...+.........|...|.+... .+. T Consensus 104 ~~~~~~~~~g~~~~v~~~~~~~~~~~~v~~~~a~g~~~~~~~~ta~~~~~a~~v~~~~~~~~~~~~~~~~~~~~~~~a~s 183 (660) T protein:vir:10 104 TTAGSNYAVGDKINIKYNQTVVESEGRVTSVDTDGKILSVFIPSAKIIAYARSLNQYPTLGPAWTAEVTSASSGVSGTIT 183 (660) T ss_pred eeccccccccceeeEeeccccccccccceeeccccceeeeccccccccccccccccccccccceeEEEecccCcccccee Confidence 1 10 1122223333322110 1111221111111110000000 00111111122345555542211 111 Q ss_pred cccceeccCCCceEEEEEccccc-ceeeeccccccccceeeeeccccccceeeEeccceeEEeecccccccCcc------ Q lcl|NC_011270. 134 KDVTGLTGGDDPDLNIASEQTGV-PAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTR------ 206 (581) Q Consensus 134 ~~~~~l~~g~~~~v~v~~~~~g~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~------ 206 (581) +.... . .++..+.+....... ........... .....+.....+. .+....+. .....+...... T Consensus 184 v~~~v-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~a~~~g~---~G~~i~v~-i~~~~~~~~~~~~~~~~~ 255 (660) T protein:vir:10 184 VGKIV-T-DSGILLTEAENSEEAITSLEFQAALKK--FAMPGVVALYPGE---IGSTLEVE-IVSKAAYEAGSSKMLDVY 255 (660) T ss_pred eeeee-c-cCcceEEeeeccccccccccceeeccc--cccceeeeecccc---cCcceeEE-EeeccccCCcceeEEeee Confidence 11100 0 011111111100000 00000000000 0000000000000 00000000 000000000000 Q ss_pred ----eeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhh---hhhhhhccccccceeeeeeeecCC Q lcl|NC_011270. 207 ----DDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYG---PAFDEAGNVQSEITLCAQLAITNG 279 (581) Q Consensus 207 ----~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~---~a~~~~g~~~~~i~~~~~~~~~~g 279 (581) ....+.......+. ...+...+.....+. ..+.+.........+... .......+..+............+ T Consensus 256 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~v~~~g~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~ 333 (660) T protein:vir:10 256 PGGGTRASIAKAVFNYGP-QTDDQYAIIVRRDGA-IVESVVLSTKEGEKDVYGNNIYLDDYFAKGTSNYIYATSLNWPKG 333 (660) T ss_pred eccceeeEEeeeeccccc-ccccccccccccCCc-ccceeeeeccccccccccceeeeehhhcCCCccEEEEEeccCCCC Confidence 00000000000000 000000000001110 001110000000000000 000111111222222111111111 Q ss_pred ccee--EEeeeccCCcccchhhHHHHHHHHhcCC---ceEEEEeCC------CcHHHHHHHHHHHHHHhcCCCcEEEEEe Q lcl|NC_011270. 280 ASTI--LACAVDPEGDTVTMGDYQNALNKFRDED---EIAIIVAGT------GAQPIQALVQQHVSAQSNNKYERRAILG 348 (581) Q Consensus 280 ~~~~--~~~~~~~~~~~~t~~dy~~al~~l~~~~---~~~iv~~~t------~~~~i~~~l~~~v~~~~~~~~~~~avvg 348 (581) .... +.++.++ .++.+..|+..++++|+..+ ...+++|+. +..+++++|.+||++++ .|+++++ T Consensus 334 ~~~~~~l~gg~~~-~~~~t~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~al~~~~~~~~----~~~aiid 408 (660) T protein:vir:10 334 FSGIINLSGGISA-NDKVTAGDLMQGWDLFADREALHINLLIAGAVAGEGDEVASTVQKHVVSIADERQ----DCLAFIS 408 (660) T ss_pred cccceeeeccccC-ccccccchhhhhhhhhhhhhhcccceEEEcCcCCCchhhhHHHHHHHHHHHHhhC----CEEEEEe Confidence 1111 2222222 23456778999998887654 233445532 23468888889998874 4899998 Q ss_pred cCCC--------CCchhHHHHHHH-------hhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchh Q lcl|NC_011270. 349 MDGS--------VTPVPSATRIAN-------AQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAA 413 (581) Q Consensus 349 ~~~~--------~~~~~~~~~~~~-------a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~ 413 (581) .+.. .+.+...++... ...+++.|+++++|+..+.+...+......|..++||.+|.+.....+| T Consensus 409 ~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~ 488 (660) T protein:vir:10 409 PPKGLLVNVPLTRAVDNLIDWRTGAGTFDANNMNISTTYAAIDGNYKYQYDKYNDVNRWVPLAADLAGLCARTDDVSQPW 488 (660) T ss_pred cCcccccccccccCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCCceeEechhHHHHHHHHHhhccCCcE Confidence 6532 122333332221 2357899999999988777665444333444557777777777788899 Q ss_pred ccccccccc---CcccccccCCHHHHHHHHhCCcEEEEEeCC-CeEEEEEeeeccCCCc-ccceEEeehhhHHHHHHHHH Q lcl|NC_011270. 414 MPLTRKVIR---GFSGPAEVQRDGEKSRESSEGLMVIEKTPR-NLVHVRHGVTTDPTSL-HTREWNIIGQQDVMVYRIRD 488 (581) Q Consensus 414 ~slt~~~l~---g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~-~~v~i~~~itT~~td~-~~~~i~v~R~~d~i~~~ir~ 488 (581) +||.|+.+. |+.+++..+++.|++.|+++||++++..++ +++++ ||.+|..+++ .|++|++||++|+|++.|++ T Consensus 489 ~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~~~~G~~~-wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~ 567 (660) T protein:vir:10 489 MSPAGYNRGQILNVLKLAIEPRQAQRDRMYQEAINPVVGFAGGDGFVL-FGDKTATKVPSPMDHINVRRLFNMLKKNIGD 567 (660) T ss_pred EccCCeeeceeeccceeeecCChhhHHhHhhCCceEEEEeeCCCcEEE-EcccccCCCCcccceEehhhHHHHHHHHHHH Confidence 999999754 566788889999999999999999987765 57765 7778877664 79999999999999999999 Q ss_pred HHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCCCEEEEEEEEEecCceeEEEEEEEE Q lcl|NC_011270. 489 YLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSI 564 (581) Q Consensus 489 ~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~ 564 (581) .+++ |++|||++.+|..||..|+.||++||++|+|.+|. ...+++.+++.++++|+|.++|++|||||++||+. T Consensus 568 ~~~~--~v~epn~~~l~~~i~~~i~~fL~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~i~~~P~~pae~I~~~~~~ 645 (660) T protein:vir:10 568 ASKY--KLFELNDNFTRSSFRMEVSQYLDGIKALGGIYEGRVVCDTTVNTPAVIDRNEFIANIYVKPARSINYITLNFVA 645 (660) T ss_pred HHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCccEEEEEEEE Confidence 9876 67799999999999999999999999999999985 33567778899999999999999999999999965 Q ss_pred EeccceEEEEEeecccC Q lcl|NC_011270. 565 APETGDITSTIEGTTSF 581 (581) Q Consensus 565 ~~~tg~~~~~~~~~~~~ 581 (581) .. +| .+| T Consensus 646 ~~-~~---------~~~ 652 (660) T protein:vir:10 646 TS-TG---------ADF 652 (660) T ss_pred ee-cC---------ccH Confidence 43 22 344 No 17 >protein:vir:79798 Length: 717 # NCBI annotation: tail sheath subunit # Family: family:all:12069 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429630;genbank:gi:156564120;genbank:GeneID:5525563 Probab=100.00 E-value=8e-42 Score=245.90 Aligned_cols=499 Identities=14% Similarity=0.096 Sum_probs=234.5 Q ss_pred Ceecccc-----------------ccCCCcccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEEE Q lcl|NC_011270. 1 MAIDFSQ-----------------YQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQI 63 (581) Q Consensus 1 ~~~~~~~-----------------~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~ 63 (581) +|.-++. -.||.-|+..+|..-......+|+.+++.-. ++ T Consensus 132 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~----------------- 188 (717) T protein:vir:79 132 VAATFTLPNGGIVEATFLLKARGVIIPPNNYTLDVGTEEDMKAGTQPTFAQVLLN------EN----------------- 188 (717) T ss_pred eEEEEEcCCCceeeeeeeeeecceEeCCCcceEeccChhhhhcCCCchhhhhhhc------cc----------------- Confidence 3332221 1223333333333332222222221111100 00 Q ss_pred EEEeccccceeEEEEeC---cee--ccccccC---------CCHHHHHHHHHhc---------CCCCcceEEEEcCCCce Q lcl|NC_011270. 64 LALVGEPTGGSFKLSLA---GEP--TGNIPFN---------ATQGQVQSALRAL---------PNVEDDEVTVLGDPGGP 120 (581) Q Consensus 64 v~~~~~~~~GtF~l~~~---g~~--T~~i~~~---------asa~~v~~aLe~l---------~~i~~~~V~~~~~~g~~ 120 (581) |.-.-+.+.=+|..+|. |++ +.-|.-+ +...++.-.||-| .++..++-.+-.-.+.. T Consensus 189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 268 (717) T protein:vir:79 189 VADMESEITVSYEFTYKDAQGETKTSEVLDNNTDKDGKPMIAKGADVTIKLEHVALAGLKLYADGIEVVDAKAFTVAGDQ 268 (717) T ss_pred hhhccceeEEEEEEEeecccCcchhhhhhcCCCCCCCceeEEecccceeehhhhhhhhhHHhhcchhhhhhhheeeecce Confidence 00000011123444443 222 1111111 1111111111111 11111110000000111 Q ss_pred EEEEec-C-Ccc-ccccc-cceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEeccce-eEEe Q lcl|NC_011270. 121 WTVTFT-K-AVA-ALTKD-VTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYV-VTRV 195 (581) Q Consensus 121 w~Vtf~-g-~~~-~l~~~-~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~-v~~v 195 (581) -++.-- + .++ .|... ..+|.--..+-+.+...-.|...-. -. .++...|.. .+++ T Consensus 269 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~n~----------~~----------~~v~~~D~~~~~~~ 328 (717) T protein:vir:79 269 LTIHSNSKMKLGASLEAQYAYNLVEVIQPVIELESIFGGGVYND----------IM----------RKVESKDGAVTVTI 328 (717) T ss_pred EEEEecCCcccchhhHHHHHhhHHHhhccceEEeecccCceeee----------ee----------eEEecCCceEEEEE Confidence 111100 0 000 00000 0000000001111111111110000 00 001111110 0111 Q ss_pred ecccc--------cccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhcccccc Q lcl|NC_011270. 196 NAGED--------GEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSE 267 (581) Q Consensus 196 ~~~~d--------g~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~ 267 (581) +.... .-..+..++......+|+++-++...++..- +-...+.........+.+ ..++..... T Consensus 329 t~~~~~~g~~~~~pl~~ts~dy~~~~~~vdgI~~~~~~~V~~~g---~~s~a~a~~~~g~~s~d~------a~f~Gg~dg 399 (717) T protein:vir:79 329 TKPESKRGMISEDPLVFKSGDYTNFKMLVDAINNHPFNNVVRAR---TKPEFEATFTSTLQAAAD------AKFSGGKDE 399 (717) T ss_pred ecccccCcceeccccccccCceeeeeeeecccccCchhheeeee---cccccceeeeecccCchh------hccCCCccc Confidence 10000 0001111222222334444333322222210 000111110000000000 111111222 Q ss_pred ceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCC--------cHHHHHHHHHHHHHHhcC Q lcl|NC_011270. 268 ITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTG--------AQPIQALVQQHVSAQSNN 339 (581) Q Consensus 268 i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~--------~~~i~~~l~~~v~~~~~~ 339 (581) +..-.+....+- ++-....+.. .+ ..++..|+..+...+++|+.. ...++..+.+||..++.. T Consensus 400 l~~~~ee~Y~~l------Ggk~~d~g~l--t~-~aays~LE~~dVDlVil~ga~adtt~ga~~d~va~alad~caalSal 470 (717) T protein:vir:79 400 LSLDKEEMYKRL------GGEKNEEGFV--TK-QGAYQYLENYEVDYVIPLGVHADTKLIGKYDDFAYQLALACAVMSHY 470 (717) T ss_pred cccchhhhhccc------cccccccccc--cc-hhhhhhcCcceeEEEEecCccccccccchhhhHHHHHHHHHHHhhhc Confidence 221111111000 0000011111 11 256777877654444444322 135677899999998766 Q ss_pred CCcEEEEEecCCCCCc--hhHHHHHHH----hh-----------ccC--------CccEEEEEcCeeEecccccCCceec Q lcl|NC_011270. 340 KYERRAILGMDGSVTP--VPSATRIAN----AQ-----------SIK--------DQRVALISPSSFVYYAPELNREVVL 394 (581) Q Consensus 340 ~~~~~avvg~~~~~~~--~~~~~~~~~----a~-----------~~n--------s~r~~~v~~~~~~~~~~~~~~~~~~ 394 (581) .+.++.+++....... ...+..+.. +. .++ +.+..++.+.+..... ........ T Consensus 471 ~r~ai~VI~l~sp~D~~~AtVe~~~~kLs~~Aaa~~~~d~~~a~a~~~~~~~idis~y~~vv~~~~~iv~~-~~~~~~~~ 549 (717) T protein:vir:79 471 NSVTIGIIPTTTPSDISLAGVEEHVKKLENYANEFYMRDRFGNIIFDADRNKIDLGQFIEVVAGPDFIVRN-TRLGQMAS 549 (717) T ss_pred cccceeeeccccccccchhhHHHHHHHHHhhhhhhhhhcchhccccccccccccccceeeeeecceeEEEc-CCCceeec Confidence 6677777775432211 111111110 00 011 1122223322222222 11222223 Q ss_pred CHHHHHHHHHHHhhccchhcccccccccCcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCCCcccceEE Q lcl|NC_011270. 395 GGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWN 474 (581) Q Consensus 395 p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~td~~~~~i~ 474 (581) | .||++||+.+++++|+||+|++|+|+.++..++++.|++.|+++|+++|+..++++++++.++|+...+..|++|+ T Consensus 550 p---~AG~vAGldA~rGVwkSPANk~I~GVvgLa~~lT~sE~d~Ln~aGIntIr~~~GrGirVWGaRTtasd~sdWryIn 626 (717) T protein:vir:79 550 T---PDASYIGMVSQLKTQSAPTNKPLPSVTALRYTYSANQLNRLTKARFATFKYKQDGSIGVVDAPTSAHAGSDYTRLS 626 (717) T ss_pred C---HHHHHHHHHhcCCcccccccceecccccCcccCCHHHHHHHhhCCeEEEEEeCCceEEEEeeeecCCCCcccceee Confidence 3 3799999999999999999999999999999999999999999999999988888999999988877667899999 Q ss_pred eehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCcc-ceeEEeecCCCEEEEEEEEEecC Q lcl|NC_011270. 475 IIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRN-LKARQIERQPDVIEVRYEWRPAY 553 (581) Q Consensus 475 v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~-~~~~~~~~~~~~~~v~i~v~pv~ 553 (581) +||++|+|+++|++.++| ||||||++.+|..|++.|.+||++||++|+|.+|+. ..+++.+++.++++|+|.++|++ T Consensus 627 VRRl~D~Ie~sIr~al~~--yVgEPNd~~tr~~Ik~sI~afL~~L~r~GAI~GykvdvtnT~~di~~G~l~V~I~vaPv~ 704 (717) T protein:vir:79 627 TARIVKEAVNAVREVADP--FIGEPNDTGNRNALTAAVDKRLSKMIENKALLGFDFRLVVTPQQELLGEGSIELSLEAPN 704 (717) T ss_pred hhhhHHHHHHHHHHHHHH--hccccCCHHHHHHHHHHHHHHHHHHHhcCceecceeeEecChhHhhCCEEEEEEEEEecC Confidence 999999999999999864 899999999999999999999999999999999984 34566778889999999999999 Q ss_pred ceeEEEEEEEEEe Q lcl|NC_011270. 554 PLNYIVVRYSIAP 566 (581) Q Consensus 554 ~~e~I~~~~~~~~ 566 (581) |||||++++++.- T Consensus 705 PaEfI~ititITA 717 (717) T protein:vir:79 705 ELRRLTTIVSLSA 717 (717) T ss_pred cccEEEEEEEEeC Confidence 9999999997665 No 18 >protein:vir:106984 Length: 743 # NCBI annotation: contractile sheath protein gp18 # Family: family:all:661 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195136;genbank:gi:58532913;uniprot:Q5GQN6;genbank:GeneID:3260481 Probab=100.00 E-value=2.2e-41 Score=243.48 Aligned_cols=540 Identities=12% Similarity=0.066 Sum_probs=236.0 Q ss_pred CeeccccccCCCcccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEEEEEEeccccceeEEEEeC Q lcl|NC_011270. 1 MAIDFSQYQTPGVYTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALVGEPTGGSFKLSLA 80 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~~~~~~GtF~l~~~ 80 (581) +.|..+....+..... +... ... .........+-........... ......++..+... ..+. -. T Consensus 132 V~v~~~~~d~~~~~~~----~~~~---~~~-~~~~~~~~~~~~~~~~v~~~~~----~~~~~~~~~~~~~~-~~~~--~~ 196 (743) T protein:vir:10 132 GVLVDRGADYIVTFAA----TPTD---TAV-GTQLLFSYSGTLVTGEILSYDS----ATNTATITASGTLT-SQYL--LD 196 (743) T ss_pred EEEecCCCcceeeeec----cccc---ccc-ceeeeecccccccccceeeeee----cCcceeeeeccccc-eeee--cc Confidence 2221111111100000 0000 000 0000000000000000000000 00011111111000 0000 00 Q ss_pred ceeccccccCCCHHHHHHHHHhcCCCCcceEEEEcC--CCceEEEEecCCcccc---------------------ccccc Q lcl|NC_011270. 81 GEPTGNIPFNATQGQVQSALRALPNVEDDEVTVLGD--PGGPWTVTFTKAVAAL---------------------TKDVT 137 (581) Q Consensus 81 g~~T~~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~--~g~~w~Vtf~g~~~~l---------------------~~~~~ 137 (581) ......++........+... .......+.+.+. .+..++++.....+.+ .+... T Consensus 197 ~~~~~~~a~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~tv~v~~~~~~vg~~v~~~~~~~~~~~~~~~~~~v~~~ 273 (743) T protein:vir:10 197 TPEQGLIGSFTDNSTTEVGR---TPGTYSNVPASGGTGTGATFNVVVADAGGGVGGSVVVTLANPGTGYNQGETLTIASA 273 (743) T ss_pred cccccccccccccccccccc---cccceeeEEecccccccccccccccccccccccccccccccccceeeeccccccccc Confidence 11111111111111111111 1111111111110 0111222221100000 00000 Q ss_pred eeccCCCceEEEEEcccccceeeecccc-ccccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeee Q lcl|NC_011270. 138 GLTGGDDPDLNIASEQTGVPAMNRALAK-KGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVV 216 (581) Q Consensus 138 ~l~~g~~~~v~v~~~~~g~~~~~~~~~~-~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~v 216 (581) ....+......+.....+..+....... ....................+.. + ....+.......+. .+ T Consensus 274 ~~~~~t~~~~~~~~~~~g~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~t~~----~---~~~~~~~~d~~~v~----v~ 342 (743) T protein:vir:10 274 ATGDGTDILVTVATLSDGTIAITELKDWYLNTEIGSTGIKLGDIGPRPGTSQ----F---ATDNGITDDQVHFA----VI 342 (743) T ss_pred cccccccchhheecccccceeeeecccccccchhhccccccccccccceeee----c---cccccccccceEEE----Ee Confidence 0000000000111111111111100000 00000000000000000000000 0 00000000000000 00 Q ss_pred ecccc----cccceeEEEEeecCCccc----ceeEeccCcchhhhh----------hhhhhh-hccccccceeeeeeeec Q lcl|NC_011270. 217 DGGHI----DPGDIVQLSYRYTDPNYH----EVIRFTDPDDIQDFY----------GPAFDE-AGNVQSEITLCAQLAIT 277 (581) Q Consensus 217 d~~~~----d~~~~~~~s~~~~~~~~~----e~~~~~d~~~~~~~~----------~~a~~~-~g~~~~~i~~~~~~~~~ 277 (581) +..+. ..............++.. ........-.....+ ..+... .+.....+......... T Consensus 343 ~~~~~~~~~~~~v~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 422 (743) T protein:vir:10 343 DTTGELTGTANTIVERLTYLSKLSDARSEENANIYYKNVINEQSAYLYHGNDAAVQIAASGEAWGQSSDQVLADAGTAFS 422 (743) T ss_pred cCcceeeeccCceeEEEeeeecccccccccCcceeecceeccccceeeccCcccceeeeccccCccccceeeeecccccc Confidence 00000 000000000000000000 000000000000000 000000 00000000000000111 Q ss_pred CCcceeEEeeeccCCcccchhhHHHHHHHHhcCC---ceEEEEeC-----CCcHHHHHHHHHHHHHHhcCCCcEEEEEec Q lcl|NC_011270. 278 NGASTILACAVDPEGDTVTMGDYQNALNKFRDED---EIAIIVAG-----TGAQPIQALVQQHVSAQSNNKYERRAILGM 349 (581) Q Consensus 278 ~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~---~~~iv~~~-----t~~~~i~~~l~~~v~~~~~~~~~~~avvg~ 349 (581) ...........+.++..++..+|..+++.|+..+ ...+++|+ .+..++++++.+||++++ .++++++. T Consensus 423 ~~~~~~~~~~gG~d~~~~~~~~~~~~~~~~~~~~~~~~~ll~~p~~~~~~~~~~~v~~a~~~~~~~~~----~~~a~~d~ 498 (743) T protein:vir:10 423 RTTGYWVNLAGGNDDFAYDAGEFGAAMDLFLDTEETEIDFVLMGGSMADEADTKSKATKVIAIAASRK----DALAFVSP 498 (743) T ss_pred cccceEEEeecCccccccchhHHHHHHHHhhhccccCcceEEecCcccCccchHHHHHHHHHHHHhhC----CeEEEEec Confidence 1111111111122233456678999888887653 23455564 234678888899998774 48999987 Q ss_pred CCCCCc------------hhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccc Q lcl|NC_011270. 350 DGSVTP------------VPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLT 417 (581) Q Consensus 350 ~~~~~~------------~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt 417 (581) +..... ........+...++++|+++++|+..+.+...+......|..++|+.+|.......+|+||. T Consensus 499 p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~a~~D~~~g~~~spa 578 (743) T protein:vir:10 499 HKGNQIASTGNVALSSAQQKENTIAFFSDLTSTSYAVFDSGYKYVYDRFTDKYRYIPCNGDVAGLCVQTSNQLDDWYSPA 578 (743) T ss_pred CCCccccccccccccccccchHHHHHHHhccCCeeEEEEccceeeeccccCceeEechhHHHHHHHHHhhccCCcEEccC Confidence 643211 11122233455678999999999877766554443344445577777777777888999999 Q ss_pred cccccCc---ccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeecc-CCCcccceEEeehhhHHHHHHHHHHHhhh Q lcl|NC_011270. 418 RKVIRGF---SGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTD-PTSLHTREWNIIGQQDVMVYRIRDYLDAD 493 (581) Q Consensus 418 ~~~l~g~---~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~-~td~~~~~i~v~R~~d~i~~~ir~~~~~~ 493 (581) |+++.|+ .+++..+++.|++.|+++||+++..+++++++++ |-+|+ ..|+.|++|++||++|+|+++|++.++| T Consensus 579 n~~~~gi~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~w-G~rT~~s~d~~~~~i~vrR~~~~i~~si~~~~~~- 656 (743) T protein:vir:10 579 GLNRGGILNAVKLAYNPNKADRDELYQNRINPVVSLRGQGITLF-GDKTALAAPSAFDRINVRRLFLNLEKRARRLAEG- 656 (743) T ss_pred CeeeeeeeccccceecCChhHHHhHhhCCceEEEEecCCeEEEE-cccccCCCCcccceEeehhhHHHHHHHHHHHHHH- Confidence 9997655 4456678999999999999999988888888764 65676 5689999999999999999999999986 Q ss_pred cCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccc Q lcl|NC_011270. 494 GLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETG 569 (581) Q Consensus 494 ~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg 569 (581) |++|||++.+|..|+..|.+||+.||++|+|++|. ...+++.+++.|+++++|.++|++|||||.|||+ ...+| T Consensus 657 -~v~e~n~~~~~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~-~~~~~ 734 (743) T protein:vir:10 657 -VLFEQNDATTRAGFSSALNSYLSEVQARRGVTDYLVICDESNNTPDIIDRNEFVAEVYVKPTRSINFITITFT-ATKTG 734 (743) T ss_pred -hccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEE-EeecC Confidence 56699999999999999999999999999999986 3456777889999999999999999999999995 33333 Q ss_pred -eEEEEEeec Q lcl|NC_011270. 570 -DITSTIEGT 578 (581) Q Consensus 570 -~~~~~~~~~ 578 (581) ++.--+ +- T Consensus 735 ~~~~e~~-~~ 743 (743) T protein:vir:10 735 VTFSEVV-GR 743 (743) T ss_pred cchHhhh-cC Confidence 211111 11 No 19 >protein:vir:104477 Length: 749 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214663;genbank:gi:61806304;genbank:GeneID:3294532 Probab=100.00 E-value=1e-40 Score=239.81 Aligned_cols=544 Identities=11% Similarity=-0.012 Sum_probs=225.6 Q ss_pred Ceeccc---cccCCCccc--ccCcccccccc------ccc--Cceee--EEE------ecCCCCc------eeeeeEEcC Q lcl|NC_011270. 1 MAIDFS---QYQTPGVYT--EAVGAPQLGIR------SSV--PTAVA--IFG------TAVGYQT------YRESIRINP 53 (581) Q Consensus 1 ~~~~~~---~~~~~~~~~--~~~g~~~~~~~------~~~--~~~~~--~~~------~~~g~~~------~~~~~~~~~ 53 (581) +.|... ..+....+. ......+.... ... ..... ... ..++-.. ........+ T Consensus 133 l~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~a~~~~~ 212 (749) T protein:vir:10 133 IGIFVTDAGADQVVVVPAPGSGNEHEFVADAAVSAASGAAGKVFKYSIILTIDDVVGTFAPGSATTITIGGSAESVNVLA 212 (749) T ss_pred eEEEEEcCCCceeeeeecCCccceeeEEeeecccccccccccccccceeeeeccccceeecccceeeeccCccccccccc Confidence 222111 100000000 00000000000 000 00000 000 0000000 000000000 Q ss_pred cCCceeeEEEEEEeccccceeEEEEeCceeccccccCC--CHHHHHHHHHhcCCCCcceEEE----EcCCCceEEEEecC Q lcl|NC_011270. 54 DTGETITTQILALVGEPTGGSFKLSLAGEPTGNIPFNA--TQGQVQSALRALPNVEDDEVTV----LGDPGGPWTVTFTK 127 (581) Q Consensus 54 ~~~~~~evq~v~~~~~~~~GtF~l~~~g~~T~~i~~~a--sa~~v~~aLe~l~~i~~~~V~~----~~~~g~~w~Vtf~g 127 (581) .......+ .+.......+|...... +...+.+. .+..++..+....+-....+.+ ....+..-.++... T Consensus 213 ~~~~~~~~-~~~~~s~~~~~~~a~~~----~v~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~~~~~~~~g~~~~it~v~ 287 (749) T protein:vir:10 213 YDATNKKL-EIGLPSGGVTGILADNQ----VITQGTNTAKINVTIERKLLVALNKSSIEFAASDVVQDTNSTNITITSVR 287 (749) T ss_pred ccCCcceE-EEeeecccccceeeeee----cccccccccccccccccchhhhhccccceeeccccccCCccceeEEEeee Confidence 00000000 01110111111111100 00000000 0001111110000000001111 11111111111110 Q ss_pred CccccccccceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEecccee---EE-eeccccccc Q lcl|NC_011270. 128 AVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVV---TR-VNAGEDGEA 203 (581) Q Consensus 128 ~~~~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v---~~-v~~~~dg~~ 203 (581) +...-. ..+++- ...-.....|...+.... .........+. .... ...++....+ +. .+...+. T Consensus 288 ~~~~~~---~~~t~~---~~~~~a~~~gt~~~~~~~--~g~~D~~~v~v-~~~~-g~~~~~~g~v~e~~~~~~~~~~~-- 355 (749) T protein:vir:10 288 DEYTER---EYLPGV---KWINVAPRPGTSLYANGV--GGHRDEMHVIL-VDID-GGVTGTVGALLERYIDVSKASDA-- 355 (749) T ss_pred cccccc---ccccce---eeccccccccceeeeecc--cCCCCceEEEE-ecCC-Ceeeecccceeeeeeeccccccc-- Confidence 000000 000000 000000000000000000 00000000000 0000 0000000000 00 0000000 Q ss_pred CcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhcc--cccccee----------- Q lcl|NC_011270. 204 NTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGN--VQSEITL----------- 270 (581) Q Consensus 204 ~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~--~~~~i~~----------- 270 (581) .....+.....+....... .+....... +.........+..-........+..... ....... T Consensus 356 --~~~~~~~~~~~~~~~~~s~-~v~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 431 (749) T protein:vir:10 356 --KTSVGETNYYAEVIKQKSE-FIYWAEHES-TLYAATSSASDGLFGQTAANRQFNLFRSAAGSVDYPAGVTTLGSKNNA 431 (749) T ss_pred --cccccccchhhhhhccCCC-EEEEEeccc-ccccccccccccccccccccceeeccccccccceeccccccccccCCc Confidence 0000000000000000000 000000000 0000000000000000000000000000 0000000 Q ss_pred eeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCC---ceEEEEe--CC---CcHHHHHHHHHHHHHHhcCCCc Q lcl|NC_011270. 271 CAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDED---EIAIIVA--GT---GAQPIQALVQQHVSAQSNNKYE 342 (581) Q Consensus 271 ~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~---~~~iv~~--~t---~~~~i~~~l~~~v~~~~~~~~~ 342 (581) ..-..+.++..... . ......+..++..++++|...+ ++.++++ +. +..+++.++.+||++++ . T Consensus 432 ~~~~~~~gg~d~~~--~--~~~~~~~~~~~~~~~~~l~~~~~~~~~~li~~~~~~~~~~~~~v~~al~~~~~~~~----~ 503 (749) T protein:vir:10 432 TYYYRLSGGVNYTV--S--AGQYTITNTDIGSAYELIGDPESQIVDFIISGPSGTSDANALAKITSLVNIAEERR----D 503 (749) T ss_pred EEEEEccCCccccc--c--cccccccchhHHHHHHHhhhhhhcccceEEEecCCCCcchhHHHHHHHHHHHhhcC----C Confidence 00011122211111 1 1122345678888888886543 2333332 22 23457778888888775 4 Q ss_pred EEEEEecCCCCCc------hhHHHHH-HHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcc Q lcl|NC_011270. 343 RRAILGMDGSVTP------VPSATRI-ANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMP 415 (581) Q Consensus 343 ~~avvg~~~~~~~------~~~~~~~-~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~s 415 (581) ++++++.+..... ....... .+...++++|+++++|+.++.+...+......|..++|+.+|......++|+| T Consensus 504 ~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~S 583 (749) T protein:vir:10 504 CMVFVSPRRGNVIGISNTTTITTNIVDFFKKLPSSSYMVFDSGYKYIYDKYNDVYRYIPCNGDTAGLCLQTNEISEPWFS 583 (749) T ss_pred EEEEEcCCCCcccccccchhhhhHHHHHHhhccCceeEEEEccceeeeccccCceEEechHHHHHHHHHHhhccCCcEEC Confidence 7888875433211 1111112 22344678889999998877665543333344555778888888888889999 Q ss_pred ccccc---ccCcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeecc-CCCcccceEEeehhhHHHHHHHHHHHh Q lcl|NC_011270. 416 LTRKV---IRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTD-PTSLHTREWNIIGQQDVMVYRIRDYLD 491 (581) Q Consensus 416 lt~~~---l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~-~td~~~~~i~v~R~~d~i~~~ir~~~~ 491 (581) |.|++ |.|+.+++..+++.|++.|+++||++++.+++++++++ |-+|+ ..|+.|++|++||++|+|++.|++.++ T Consensus 584 Pan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gIn~i~~~~g~G~~~w-G~rT~~s~d~~~~~i~vRRl~~~ie~si~~~~~ 662 (749) T protein:vir:10 584 PAGFQRGVLRNAIKLAYTPNKAQRDQLYANRVNPIVSFPGQGVVLY-GDKTALGFASAFDRINIRRLFLTVERVISTAAK 662 (749) T ss_pred cCCceeeeeeccccceeecChhHHHhhhhCCceEEEEecCCeEEEE-cceecCCCCcccceeehhhhHHHHHHHHHHHHH Confidence 99986 66778888999999999999999999998888888765 55665 668899999999999999999999987 Q ss_pred hhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCcc----ceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEec Q lcl|NC_011270. 492 ADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPE 567 (581) Q Consensus 492 ~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~----~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~ 567 (581) | |++|||++.+|..|+..|..||+.||++|+|++|.. +.+++.+++.|+++++|.++|++|+|||++||+. . T Consensus 663 ~--~v~epn~~~l~~~i~~~i~~fL~~l~~~G~i~~f~V~~d~~~Nt~~~i~~G~~~~~i~~~P~~pae~I~~~~~~--~ 738 (749) T protein:vir:10 663 A--QLFEQNDEAQRSLFINIVEPYLRDVQGRRGVVDFLVKCDSTNNTPEAVDRGEFYAEVFLKPTRTINYVQLTFVA--T 738 (749) T ss_pred H--hhcCCCCHHHHHHHHHHHHHHHHHHHhcCCeeeeEEEEcCCCCCHHHhhCCEEEEEEEEEecCCccEEEEEEEE--e Confidence 6 677999999999999999999999999999999853 3556777899999999999999999999999853 2 Q ss_pred cceEEEEEeeccc Q lcl|NC_011270. 568 TGDITSTIEGTTS 580 (581) Q Consensus 568 tg~~~~~~~~~~~ 580 (581) +++++ .|--.| T Consensus 739 ~~~~~--~~e~~s 749 (749) T protein:vir:10 739 RTGVS--FAEVAS 749 (749) T ss_pred ecCcc--hHHHhC Confidence 22221 111111 No 20 >protein:vir:106427 Length: 679 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944106;genbank:gi:38640150;genbank:GeneID:2658188 Probab=100.00 E-value=4.5e-41 Score=241.75 Aligned_cols=551 Identities=10% Similarity=0.040 Sum_probs=234.6 Q ss_pred CeeccccccCCCc---------ccccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCce------------- Q lcl|NC_011270. 1 MAIDFSQYQTPGV---------YTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGET------------- 58 (581) Q Consensus 1 ~~~~~~~~~~~~~---------~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~------------- 58 (581) -=|.+-.|-|-.. |...||.|..... ..-+. .....-|.+.-+-+|+....... T Consensus 28 ~~vg~~~~gp~~~p~~i~s~~~~~~~fg~~~~~~~--~~~~~--~~~f~~gg~~~~vvrv~~~~~~~~~~~~~~~~~~~~ 103 (679) T protein:vir:10 28 ALVGKFNWGPAYQISQVVSEVDLVDKFGRPDDQTA--DSFFS--GVNFLNYGNDLRLVRVLNETKSRNSSALYQSLSYTI 103 (679) T ss_pred eeeecccCCCCccCEEecCHHHHHHHcCCcccccc--hHHHH--HHHHHhCCCeEEEEEccCcccccccccccccccccc Confidence 2222333333321 2334555432100 00000 00011111111111211000000 Q ss_pred -------eeEEEEEEe--ccc-cceeEE-EEeCcee-ccccccCCCHHHHHHHHHhcCCCCc---ceEEE-EcCCCceEE Q lcl|NC_011270. 59 -------ITTQILALV--GEP-TGGSFK-LSLAGEP-TGNIPFNATQGQVQSALRALPNVED---DEVTV-LGDPGGPWT 122 (581) Q Consensus 59 -------~evq~v~~~--~~~-~~GtF~-l~~~g~~-T~~i~~~asa~~v~~aLe~l~~i~~---~~V~~-~~~~g~~w~ 122 (581) ..-.++++. +.. ..+.++ +.-.+.. +..++.. .......+....+.... ..+.. +...+..-. T Consensus 104 ~~~~~~~~~~~~v~~~~~~s~~~~~~~~~~~~~~~~~~~~v~~~-~~~~~a~~~~~~~~l~~a~~~~~~~~t~~~g~~~~ 182 (679) T protein:vir:10 104 TSPGVDYKVGDVVNVLQGGNVIATGKVTVVNASGGIVAFYVPTA-AIIDKAKSLNDYPALDNAWQIQFAAGGPGAGQAAT 182 (679) T ss_pred cccccccccccceeeeeCCCcccceeEEEeeccCceeeeeeccc-ccccccccccccceecccceeeeeeccccccceee Confidence 000001110 000 001110 0001100 0000000 00000000001111100 00000 000000000 Q ss_pred E-----------EecCCcc-ccccccceeccCCCceEEEEEcccccceeeeccccccccce--eeeeccccccceeeEec Q lcl|NC_011270. 123 V-----------TFTKAVA-ALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTD--TIRVVNPNSGQVYVLGT 188 (581) Q Consensus 123 V-----------tf~g~~~-~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~--~~~l~~~~~~~~~vtgt 188 (581) + ++...-. ...+. .+. +....+.-.....+.+..... ........ ...+..+..... ... T Consensus 183 ~~v~~v~~~~~~~~~~~~~a~~~i~--~~~-~~~~t~~~~~~~~~~~~~~A~-~~g~~gn~i~v~~va~~~~~~~--~~~ 256 (679) T protein:vir:10 183 ATVVGINLDSTIFVPNDEYAMSAIS--ERS-ETKRTFIDICEEMKVPAIVAR-YAGTYGDNIKVLMIAYKDYYKF--NEA 256 (679) T ss_pred eeeeeeccCCceeeccccccccccc--ccc-ccchhhhhhhhccccceeeee-cccccCCcceEEEEeecccccc--ccc Confidence 0 0000000 00000 000 000000000000000000000 00000000 000000000000 000 Q ss_pred cceeEEeecccccc-cCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhh--h-hhhccc Q lcl|NC_011270. 189 DYVVTRVNAGEDGE-ANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPA--F-DEAGNV 264 (581) Q Consensus 189 d~~v~~v~~~~dg~-~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a--~-~~~g~~ 264 (581) ...+.......... .............. .............. ..+....+..............+.. + ...++. T Consensus 257 ~a~v~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~vvv-~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 334 (679) T protein:vir:10 257 GKIVSVNTINPKVFPTGLDYGNVTPSSYL-EFGPQNESQFAFIV-FNNGVAVESKILSTKPGDRDIYGTSIYINEYFGNG 334 (679) T ss_pred ccccccccccccccccccccccceeeeec-ccccccccceeeEE-ecccccccceeeecccccccccchhhhhhhhhcCc Confidence 00000000000000 00000000000000 00000000000000 0011111111110000000000000 0 001111 Q ss_pred cccceeeeeeeecCCcceeEEeeeccC-CcccchhhHHHHHHHHhcC--CceE-EEEeCCC------cHHHHHHHHHHHH Q lcl|NC_011270. 265 QSEITLCAQLAITNGASTILACAVDPE-GDTVTMGDYQNALNKFRDE--DEIA-IIVAGTG------AQPIQALVQQHVS 334 (581) Q Consensus 265 ~~~i~~~~~~~~~~g~~~~~~~~~~~~-~~~~t~~dy~~al~~l~~~--~~~~-iv~~~t~------~~~i~~~l~~~v~ 334 (581) .............-.....+....+.. ....+..++.++++.+... ..+. +++|+.. ..+++..+.+||+ T Consensus 335 ~~~~v~~~~~~~~~~~~~~~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~~~~ 414 (679) T protein:vir:10 335 YSSFVQGVAESWPVGYTGVLAFGGGQSSNTDISAAEFMKGWDMFADREHTDVNLFIAGAVAGEGAQIASTVQKAVVAIAD 414 (679) T ss_pred ccceeeeccccccccccceeeccCCccCCCccchhhhhhhhhhhhcccccccceEEecCCCCCchhhhHHHHHHHHHHHH Confidence 111110000000000011111111111 1224556788888766543 3333 4455432 3568888999999 Q ss_pred HHhcCCCcEEEEEecCCCCC-----chhHHHHHHH-------------hhccCCccEEEEEcCeeEecccccCCceecCH Q lcl|NC_011270. 335 AQSNNKYERRAILGMDGSVT-----PVPSATRIAN-------------AQSIKDQRVALISPSSFVYYAPELNREVVLGG 396 (581) Q Consensus 335 ~~~~~~~~~~avvg~~~~~~-----~~~~~~~~~~-------------a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~ 396 (581) ++++ ++++++.+.... .....+.+.. ...++|.|.++++|+.++.+...+......|. T Consensus 415 ~~~~----~~ai~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s 490 (679) T protein:vir:10 415 ERRD----CLVLISPPREYMINQPAASVVRKLVDWRRGVNQAGISLDDNMNIGTTYASVDGNYKYQYDKYNDVNRWIPLA 490 (679) T ss_pred hhCC----eEEEEeccccccccccccchHHHHHHHHhhcccccchhhhhhccCcceEEEEccceeeecccCCceEEechH Confidence 8854 889987653321 1122222222 23477899999999888777654443344455 Q ss_pred HHHHHHHHHHhhccchhccccccccc---CcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCCC-cccce Q lcl|NC_011270. 397 QFMAAAVAGKSVSAIAAMPLTRKVIR---GFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTS-LHTRE 472 (581) Q Consensus 397 ~~~Aa~vAgl~a~~~~~~slt~~~l~---g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~td-~~~~~ 472 (581) .++|+++|.+....++|+||.|+++. |+.++...+++.|++.|+++||++|+..+++++++ ||-+|+..+ .+|++ T Consensus 491 g~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~g~G~~~-wG~rT~~~~~s~~~~ 569 (679) T protein:vir:10 491 ADIAGLCARTDTVGQPWQSPAGFNRGQIVNVIKLAVDTRQAHRDEMYTNGINPIVGFAGQGYIL-YGDKTASQAPTPFDR 569 (679) T ss_pred HHHHHHHHHhhccCCcEECcCCeeeccccccccceeecChhhHHhhhhCCceEEEEecCCeEEE-EcccccCCCCcccce Confidence 57888888888888899999998765 55678888999999999999999999888888876 566777655 57999 Q ss_pred EEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCcc----ceeEEeecCCCEEEEEEE Q lcl|NC_011270. 473 WNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPDVIEVRYE 548 (581) Q Consensus 473 i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~----~~~~~~~~~~~~~~v~i~ 548 (581) |++||++++|+++|++.++| |++|||++.+|..||..|..||..||++|+|.+|.. ..+++.+++.|+++++|. T Consensus 570 i~vrR~~~~i~~si~~~~~~--~v~epn~~~~~~~i~~~i~~fL~~l~~~gal~gf~v~~d~~~nt~~~i~~G~~~~~i~ 647 (679) T protein:vir:10 570 INVRRLFNLLKKSISESAKY--KLFELNDAFTRSSFRSEVGSYLDTIRSLGGIYDFRVVCDESNNTPAVIDRNEFVATIL 647 (679) T ss_pred EehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEE Confidence 99999999999999999876 677999999999999999999999999999999863 356777899999999999 Q ss_pred EEecCceeEEEEEEEEEeccceEEEEEeecccC Q lcl|NC_011270. 549 WRPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 549 v~pv~~~e~I~~~~~~~~~tg~~~~~~~~~~~~ 581 (581) ++|++|+|||++||+.... . .+| T Consensus 648 ~~p~~pae~i~~~~~~~~~--~--------~~~ 670 (679) T protein:vir:10 648 IKPARSINYITLSFVATST--G--------ADF 670 (679) T ss_pred EEecCCccEEEEEEEEeec--C--------ccH Confidence 9999999999999853322 1 345 No 21 >protein:vir:98263 Length: 664 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239196;genbank:gi:66391671;genbank:GeneID:3416365 Probab=100.00 E-value=1.2e-40 Score=239.48 Aligned_cols=541 Identities=13% Similarity=0.063 Sum_probs=240.9 Q ss_pred CeeccccccCCCcccccC-ccccc-ccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEEEEEEe-------cccc Q lcl|NC_011270. 1 MAIDFSQYQTPGVYTEAV-GAPQL-GIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQILALV-------GEPT 71 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~-g~~~~-~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~-------~~~~ 71 (581) |+. .-|++|.+-. +...+ +...+. .++-|+- ..|+.++...|+-. |.+. T Consensus 1 ma~-----~~PgVyv~E~~~~~~i~~~~ts~----------~~~vG~~-------~~Gp~~~p~~i~s~~d~~~~fG~~~ 58 (664) T protein:vir:98 1 MAL-----QSPGIETKETSVQSTVVRNSTGR----------AAIVGKF-------SWGPAYQIRQISNEVELVNYFGAPD 58 (664) T ss_pred Cce-----ecCceEEEecCCCcccccccccc----------eEEEeec-------cCCCCCccEEecCHHHHHHhcCCcc Confidence 884 3588888632 11111 111100 0111110 01222222222110 1111 Q ss_pred cee--------EEEEeCcee-cccccc-CC--CHHHHHHHHH---------------------hcCCCCcceEEEEcCCC Q lcl|NC_011270. 72 GGS--------FKLSLAGEP-TGNIPF-NA--TQGQVQSALR---------------------ALPNVEDDEVTVLGDPG 118 (581) Q Consensus 72 ~Gt--------F~l~~~g~~-T~~i~~-~a--sa~~v~~aLe---------------------~l~~i~~~~V~~~~~~g 118 (581) ..+ |-|.+++.- ...+.. ++ .+..+..+|. +-+......+...+.+| T Consensus 59 ~~~~~~~~v~~~f~ngg~~~~vvRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g 138 (664) T protein:vir:98 59 NLTADYFMSAVNFLQYGNDLRLVRVVDKDAAKNASAIFNQIKTTIASQGSNYNVGDVIKVKYSNTVVEENGKVTQVDSNG 138 (664) T ss_pred ccchhHHHHHHHHHhcCCeEEEEEecCccccccccccccccceeeccCCcccccccceeEeecCcccccceeeccCCCCC Confidence 100 111111100 000000 00 0000000000 00000000111112222 Q ss_pred ceEEEEecCCccccc-cc---------------cceeccCCCceEEEEEcccc-cceeeeccccccccceeeeecccccc Q lcl|NC_011270. 119 GPWTVTFTKAVAALT-KD---------------VTGLTGGDDPDLNIASEQTG-VPAMNRALAKKGIKTDTIRVVNPNSG 181 (581) Q Consensus 119 ~~w~Vtf~g~~~~l~-~~---------------~~~l~~g~~~~v~v~~~~~g-~~~~~~~~~~~~~~~~~~~l~~~~~~ 181 (581) ..+.+.-.-....+. .. ....+.|......+...... ...+.. .......+...... T Consensus 139 n~~~v~i~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~s~g~a~a~~v~~v~~d~~~~~~~------~~~a~~~i~~~~~~ 212 (664) T protein:vir:98 139 KILAVTIPKRKKSLLVLNRSVLTQIFLLVGTTEIVSQSSGVSASITIDGIESDSGITLLN------LDIAKETIQGTSFQ 212 (664) T ss_pred ceeeEeeccCccceeecccccccccceecccceeeeeecccceeeecccccccceeeccc------cceeeeccccccce Confidence 222221100000000 00 00000000000000000000 000000 00000000000000 Q ss_pred ceeeEeccceeE-----------EeecccccccCcceeeeeeeeeeecccc------------cccceeEEEEeecCCcc Q lcl|NC_011270. 182 QVYVLGTDYVVT-----------RVNAGEDGEANTRDDLYTIQRVVDGGHI------------DPGDIVQLSYRYTDPNY 238 (581) Q Consensus 182 ~~~vtgtd~~v~-----------~v~~~~dg~~~~~~~~~ti~~~vd~~~~------------d~~~~~~~s~~~~~~~~ 238 (581) ............ .+........... ..+......+.. ...+...+.....+.. T Consensus 213 ~~~~~~~~~~~~a~~~G~~Gn~isv~i~s~~~~~~~---~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 288 (664) T protein:vir:98 213 TLTQKYQIPSVVALYPGELGSTVQVEIISKAAYDTG---AMISGYPSGISVKNSGRSVMTYGPQTDNQYAFVVRRGGIV- 288 (664) T ss_pred eeeeccccceeeeeecccccceeeeeecccccccCc---ceEeeccCceecccceeeeeeccccCccceeEEEecCCce- Confidence 000000000000 0000000000000 000000000000 0000001111111110 Q ss_pred cceeEeccCcchhhhhhhh---hhhhccccccceeeeeeeecCCcceeEEeeecc-CCcccchhhHHHHHHHHhcCCc-- Q lcl|NC_011270. 239 HEVIRFTDPDDIQDFYGPA---FDEAGNVQSEITLCAQLAITNGASTILACAVDP-EGDTVTMGDYQNALNKFRDEDE-- 312 (581) Q Consensus 239 ~e~~~~~d~~~~~~~~~~a---~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~-~~~~~t~~dy~~al~~l~~~~~-- 312 (581) .+.+......+..+..+.. .....+..+.................+....+. ..+..+..++.++|++|++.+. T Consensus 289 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~~~~~~g~~~~~tgl~~l~~~~~~~ 368 (664) T protein:vir:98 289 QESFIVSTDKTDKDIYGVNIYMDDFFANGGSQYVFGTSMNWPKGFSGILEFGGGLSSNDTVGADELMTGWDMFADREALH 368 (664) T ss_pred eeeEEeecccCcccceeeeeechhheecccceeeeeecccCCcccceeEeccCccccccccCchhHHHHHHhhhcccccc Confidence 0111000000000000000 000001111111111111111111111111111 1223456788899999887543 Q ss_pred -eEEEEeCCCc------HHHHHHHHHHHHHHhcCCCcEEEEEecCC--------CCCchhHHHHH-----------HHhh Q lcl|NC_011270. 313 -IAIIVAGTGA------QPIQALVQQHVSAQSNNKYERRAILGMDG--------SVTPVPSATRI-----------ANAQ 366 (581) Q Consensus 313 -~~iv~~~t~~------~~i~~~l~~~v~~~~~~~~~~~avvg~~~--------~~~~~~~~~~~-----------~~a~ 366 (581) ..+++|+... .+++.++.+||+++++ ++++++... ........+.. .... T Consensus 369 ~~ll~~p~~~~~~~~~~~~v~~al~~~a~~~~~----~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (664) T protein:vir:98 369 VPLLIAGGCAGESVEIASTVQKHVISIGDERQD----CTVFVSPPRSLLVNIPLATAVDNIVEWRTGYKISGGTPVDNNL 444 (664) T ss_pred cceEEecCCCCCcHHHHHHHHHHHHHHHHhcCC----eEEEEccccceeccCCccccHHHHHHHhhhccccccchhhhhc Confidence 3345665322 2577778888887753 777766432 22222222211 1234 Q ss_pred ccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhccccccccc---CcccccccCCHHHHHHHHhCC Q lcl|NC_011270. 367 SIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIR---GFSGPAEVQRDGEKSRESSEG 443 (581) Q Consensus 367 ~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~---g~~~~~~~~t~~e~~~l~~~G 443 (581) .++++|.++++|+.++.+...+......|..++|+.+|.......+|+||.|+.+. |+.++...+++.|++.|+++| T Consensus 445 ~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl~A~~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~g 524 (664) T protein:vir:98 445 NVSSSYGFLDGNYKYQYDKYNDVNRWVPLAGDIAGLCVYTDSVANPWMSPAGYNRGQIRNCIKLAIEPRTAHRDAMYQVQ 524 (664) T ss_pred CCccceEEEEcCeEEEecccCCceEEechHHHHHHHHHHhhhcCCcEECcCCceeeeeeccccceeecChhhHHHHHhCC Confidence 58899999999988777655433333344557888888888888899999999654 566778889999999999999 Q ss_pred cEEEEEeCC-CeEEEEEeeeccCCC-cccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011270. 444 LMVIEKTPR-NLVHVRHGVTTDPTS-LHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVD 521 (581) Q Consensus 444 v~~l~~~~~-~~v~i~~~itT~~td-~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~ 521 (581) +++++..++ +++++ ||-+|..++ ++|++|++||++++|+++|++.+++ |++|||++.+|..|+..|+.||++||+ T Consensus 525 In~i~~~~~~~G~~~-wG~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~--~v~epn~~~l~~~i~~~i~~~L~~l~~ 601 (664) T protein:vir:98 525 INPVTGFAGGSGFVL-YGDKTLTSVPSPFDRINVRRLFNMIKKDIGDNAKY--KLFENNDDFTRASFRMDTGQYMTNIRA 601 (664) T ss_pred CeEEEEeeCCCcEEE-EcccccCCCCcccceEeehhHHHHHHHHHHHHHHH--hhcCCCCHHHHHHHHHHHHHHHHHHHh Confidence 999988776 56765 566777655 5899999999999999999999876 566999999999999999999999999 Q ss_pred CCceeCCc----cceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceEEEEEeecccC Q lcl|NC_011270. 522 NNIIRGYR----NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 522 ~gaI~~~~----~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~~~~~~~~~~ 581 (581) +|+|.+|. ...+++.++++|+++++|.++|++|+|||.+||+ ...+|-=-..++|-+-. T Consensus 602 ~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~-q~~~~~~~~e~~~~~~~ 664 (664) T protein:vir:98 602 LGGCYDYRVICDTTNNTPDVIDRNEFVATVYVKPPRSINYITLNFV-ATSTGADFDELVGPQAV 664 (664) T ss_pred cCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEE-EeecCcchhHhcccccC Confidence 99999985 3456777889999999999999999999999995 44444211222332222 No 22 >protein:vir:7206 Length: 659 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049780;genbank:gi:9632592;genbank:GeneID:1258597 Probab=100.00 E-value=2.2e-40 Score=238.03 Aligned_cols=488 Identities=15% Similarity=0.085 Sum_probs=230.5 Q ss_pred Cee-ccccccCCCccc---------ccCcccccccccccCceeeEEEe--cCCCCceeeeeEEcCcCC------------ Q lcl|NC_011270. 1 MAI-DFSQYQTPGVYT---------EAVGAPQLGIRSSVPTAVAIFGT--AVGYQTYRESIRINPDTG------------ 56 (581) Q Consensus 1 ~~~-~~~~~~~~~~~~---------~~~g~~~~~~~~~~~~~~~~~~~--~~g~~~~~~~~~~~~~~~------------ 56 (581) +-| +.. ..+.+-. ...+..|... ......|. .+|..+..+. .+.+.. T Consensus 81 vRv~~~~--~~~~a~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~g~~g~v~~--~~~~~~~~~~~v~t~~~~ 151 (659) T protein:vir:72 81 VRAVDRD--TAKNSSPIAGNIDYTISTPGSNYAVG-----DKITVKYVSDDIETEGKITE--VDADGKIKKINIPTGKNY 151 (659) T ss_pred EEccCCc--ccccccccccccceeecccccccccc-----eeeeeeeccccccccceEEE--eeccccceeeeecccccc Confidence 111 111 0100000 0001111000 00011111 1111111110 000000 Q ss_pred ------------ceeeEEEEEEeccccceeEEEEeCceeccccccCCCHHHHHHHHHhcCCCCcceEEEEcCCCceEEEE Q lcl|NC_011270. 57 ------------ETITTQILALVGEPTGGSFKLSLAGEPTGNIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVT 124 (581) Q Consensus 57 ------------~~~evq~v~~~~~~~~GtF~l~~~g~~T~~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~g~~w~Vt 124 (581) ...+++++.-......+.|++.- ....+..-... T Consensus 152 a~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~v~----------------------------------v~~~~~~~~~~ 197 (659) T protein:vir:72 152 AKAKEVGEYPTLGSNWTAEISSSSSGLAAVITLGK----------------------------------IITDSGILLAE 197 (659) T ss_pred ccccccccccccccceeeEEeeccccccceEEEEE----------------------------------eecCcceeeee Confidence 00011111111111111111100 00000000000 Q ss_pred ecCC---cccccccccee-------------ccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEec Q lcl|NC_011270. 125 FTKA---VAALTKDVTGL-------------TGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGT 188 (581) Q Consensus 125 f~g~---~~~l~~~~~~l-------------~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgt 188 (581) ..+. ...+..+.... +.+...++++.....+..... ...... ...+.. T Consensus 198 v~~~~~a~~~~~~~~~v~~~~~~~~~a~~~gt~g~~~tv~i~~~~~~~~~~~----------~~v~~~-~~~~~~----- 261 (659) T protein:vir:72 198 IENAEAAMTAVDFQANLKKYGIPGVVALYPGELGDKIEIEIVSKADYAKGAS----------ALLPIY-PGGGTR----- 261 (659) T ss_pred ccccchhhhcccccccccccccceeeeccccccccceeEEEcccccccccee----------eeeecc-cccccc----- Confidence 0000 00000000000 001111111111000000000 000000 000000 Q ss_pred cceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEec-cCc--ch--hhhhhhhhhhhcc Q lcl|NC_011270. 189 DYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFT-DPD--DI--QDFYGPAFDEAGN 263 (581) Q Consensus 189 d~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~-d~~--~~--~~~~~~a~~~~g~ 263 (581) ......+... .+..+. .........++. .+.+... ... +. ...+... ...+ T Consensus 262 --------------a~~~~~~~~~-----~~~~~~--~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~--~~~~ 317 (659) T protein:vir:72 262 --------------ASTAKAVFGY-----GPQTDS--QYAIIVRRNDAI-VQSVVLSTKRGEKDIYDSNIYIDD--FFAK 317 (659) T ss_pred --------------cccceeeeee-----eccccc--ccceeeecccce-eeeeeeeeccccccccchhhhhhh--hhhc Confidence 0000000000 000000 000000000000 0000000 000 00 0000000 0111 Q ss_pred ccccceeeeeeeecCCccee--EEeeeccCCcccchhhHHHHHHHHhcCC---ceEEEEeCC------CcHHHHHHHHHH Q lcl|NC_011270. 264 VQSEITLCAQLAITNGASTI--LACAVDPEGDTVTMGDYQNALNKFRDED---EIAIIVAGT------GAQPIQALVQQH 332 (581) Q Consensus 264 ~~~~i~~~~~~~~~~g~~~~--~~~~~~~~~~~~t~~dy~~al~~l~~~~---~~~iv~~~t------~~~~i~~~l~~~ 332 (581) .+............-..... +.++.+. ....+..|+..++++|+..+ ...+++|+. +..++++.+.+| T Consensus 318 ~~~~~v~~~~~~~~~~~~~~~~l~gg~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~~v~~~l~~~ 396 (659) T protein:vir:72 318 GGSEYIFATAQNWPEGFSGILTLSGGLSS-NAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGESLETASTVQKHVVSI 396 (659) T ss_pred CCceEEEEEecccCCcccccccccccccc-cccccchhHHHHHHHhhhccccceeEEEecCCCCcchhhhHHHHHHHHHH Confidence 11111111111110001111 1111111 12346678899998886543 344555543 234688889999 Q ss_pred HHHHhcCCCcEEEEEecCC--------CCCchhHHHHHH-------HhhccCCccEEEEEcCeeEecccccCCceecCHH Q lcl|NC_011270. 333 VSAQSNNKYERRAILGMDG--------SVTPVPSATRIA-------NAQSIKDQRVALISPSSFVYYAPELNREVVLGGQ 397 (581) Q Consensus 333 v~~~~~~~~~~~avvg~~~--------~~~~~~~~~~~~-------~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~ 397 (581) |+++++ ++++++... ..+.+.+..+.. ....+++.|+++++|+..+.+...+......|.. T Consensus 397 ~~~~~~----~~~~~d~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg 472 (659) T protein:vir:72 397 GDARQD----CLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNHKYQYDKYNDVNRWVPLAA 472 (659) T ss_pred HhhhCC----EEEEEcCccccccCCCcccCHHHHHHHHhhccccccccccccceeEEEEcCceeeccccCCceEEechHH Confidence 988854 777776542 222222222222 1224789999999998877666543333334445 Q ss_pred HHHHHHHHHhhccchhccccccccc---CcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCCCc-ccceE Q lcl|NC_011270. 398 FMAAAVAGKSVSAIAAMPLTRKVIR---GFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSL-HTREW 473 (581) Q Consensus 398 ~~Aa~vAgl~a~~~~~~slt~~~l~---g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~td~-~~~~i 473 (581) ++|+.+|.+.....+|+||.|+++. |+.++...+++.|++.|+++||++++..+++++++ ||-+|+.+++ .|++| T Consensus 473 ~vAGl~Ar~D~~~G~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~-wG~rT~~~~~s~~~~i 551 (659) T protein:vir:72 473 DIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVL-YGDKTATSVPSPFDRI 551 (659) T ss_pred HHHHHHHHhhccCCcEEccCCeeeceeeccccccccCChhHHHHHhhCCceEEEEecCCeEEE-EcccccCCCCcccceE Confidence 7788888888888899999999754 55667788999999999999999999888888865 5667776664 89999 Q ss_pred EeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCCCEEEEEEEE Q lcl|NC_011270. 474 NIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQPDVIEVRYEW 549 (581) Q Consensus 474 ~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~~~~~v~i~v 549 (581) ++||++|+|+++|++.++| |++|||++.+|..|+..|+.||++||++|+|++|. ...+++.+++.|+++++|.+ T Consensus 552 ~vrR~~~~i~~si~~~~~~--~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~~~i~~G~~~~~i~~ 629 (659) T protein:vir:72 552 NVRRLFNMLKTNIGRSSKY--RLFELNNAFTRSSFRTETAQYLQGNKALGGIYEYRVVCDTTNNTPSVIDRNEFVATFYI 629 (659) T ss_pred eehhHHHHHHHHHHHHHHH--hhcCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCHHHhhCCeEEEEEEE Confidence 9999999999999999876 67799999999999999999999999999999985 34567778999999999999 Q ss_pred EecCceeEEEEEEEEEeccceEEEEEeeccc Q lcl|NC_011270. 550 RPAYPLNYIVVRYSIAPETGDITSTIEGTTS 580 (581) Q Consensus 550 ~pv~~~e~I~~~~~~~~~tg~~~~~~~~~~~ 580 (581) +|++|+|||++||+ ...+|-=...+-|... T Consensus 630 ~p~~pae~I~~~~~-~~~~~~~~~e~~~~~~ 659 (659) T protein:vir:72 630 QPARSINYITLNFV-ATATGADFDELTGLAG 659 (659) T ss_pred EecCCccEEEEEEE-EeecCcchHHhcccCC Confidence 99999999999996 3444432233334433 No 23 >protein:vir:100539 Length: 663 # NCBI annotation: gp18 tail sheath monomer # Family: family:all:661 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656380;genbank:gi:109290131;genbank:GeneID:4156517 Probab=100.00 E-value=3.1e-40 Score=237.18 Aligned_cols=516 Identities=12% Similarity=0.058 Sum_probs=225.8 Q ss_pred Cee-cccccc-CCCcc-----cccCcccccccccccCceeeEEEecCCCCceee--eeEEcCcCCceeeEEEEEEecccc Q lcl|NC_011270. 1 MAI-DFSQYQ-TPGVY-----TEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRE--SIRINPDTGETITTQILALVGEPT 71 (581) Q Consensus 1 ~~~-~~~~~~-~~~~~-----~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~--~~~~~~~~~~~~evq~v~~~~~~~ 71 (581) |-| +...-. ..+.. ....+..-.. ..+..... ......... ...... .+... .+... . T Consensus 81 vRv~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~g~~~~v~--~~~~~~~~~~~~~~~~~-~g~~~---~~~~~---~ 147 (663) T protein:vir:10 81 VRVIDMEQAKNASPLFNQIEVTITTEGQGYT----VGDTVSIK--HNTTTVTEEGKVTKVDA-DGKIK---ALFVP---S 147 (663) T ss_pred EecCCcccccccccccccceeeEeecccCcc----ccceeeec--ccccccccCcceeeecc-CCcee---EEEec---c Confidence 111 110000 00000 0000000000 00000000 000000000 000000 00000 01000 0 Q ss_pred ceeEEEEeCceeccccccCCCHHHHHHHHHhcCCCCcceEEEE--cCCCc--eEEEEec-C--------------Ccccc Q lcl|NC_011270. 72 GGSFKLSLAGEPTGNIPFNATQGQVQSALRALPNVEDDEVTVL--GDPGG--PWTVTFT-K--------------AVAAL 132 (581) Q Consensus 72 ~GtF~l~~~g~~T~~i~~~asa~~v~~aLe~l~~i~~~~V~~~--~~~g~--~w~Vtf~-g--------------~~~~l 132 (581) +.-+... .+....+..-++...+++ ...+.......+. ...+. .+.+... . +.+.+ T Consensus 148 a~~~~~a-~~~~~~~~~~~a~~~~v~----~~~~~~~~a~av~~i~~dg~vt~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 222 (663) T protein:vir:10 148 SAVIAKA-KQLGTYPVLGDNWRAEVS----GASGGSAATLTLGGIVVDSGVTFGNSEEAPDVMTSTKVLANFAKYGMPLI 222 (663) T ss_pred ccccccc-cccccccccccceeeEEe----eccccccccceeEeeecCCceeEEeeeccccccccceeeeecccccccee Confidence 0000000 000000000000000000 0000000000000 00000 0000000 0 00000 Q ss_pred ccccceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeee Q lcl|NC_011270. 133 TKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTI 212 (581) Q Consensus 133 ~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti 212 (581) ..... ...|....+.+........ .....+ .+..+...... ... ...+ . ...+..++ T Consensus 223 ~a~~~-g~~G~~i~v~~~~~~~~~~------------~~~~~v-~~~~g~~~~~~-~~~---~~~g---~--~~~~~~~~ 279 (663) T protein:vir:10 223 SAVYP-GEIGSTVEVEVISKTAFQS------------GAAQPI-YPFGGTRASNA-RSV---IQYG---P--MTDDQFAI 279 (663) T ss_pred eeecc-cccCcceeEeecccccccc------------cceeee-cccCccccccc-ccc---cccc---c--ccchhhcc Confidence 00000 0000111111000000000 000000 00000000000 000 0000 0 00000000 Q ss_pred eeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccC- Q lcl|NC_011270. 213 QRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPE- 291 (581) Q Consensus 213 ~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~- 291 (581) .+...+. ......++....... .......+...+ .+..+.........+..+....+....+.+ T Consensus 280 --~~~~~g~-~~e~~~ls~~~~~~~---------~~~~~~~~~~~~---~~~~s~~v~~~~~~~~~~~~~~~~l~gg~~~ 344 (663) T protein:vir:10 280 --IVRRDGI-VVESTVLSTRRGDRD---------VYGNNIFMDDYF---RNGSSNFIYASSVNWPAGFTGIIQLGGGASA 344 (663) T ss_pred --cccCCCc-ccceeeeeccccccc---------cchhhhhhhhhh---cCcccceeEeeccccCcccceeEEecccccC Confidence 0000000 000000000000000 000000111111 122222222222222112222222222211 Q ss_pred CcccchhhHHHHHHHHhcC---CceEEEEeCC--C----cHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCc-----hh Q lcl|NC_011270. 292 GDTVTMGDYQNALNKFRDE---DEIAIIVAGT--G----AQPIQALVQQHVSAQSNNKYERRAILGMDGSVTP-----VP 357 (581) Q Consensus 292 ~~~~t~~dy~~al~~l~~~---~~~~iv~~~t--~----~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~-----~~ 357 (581) ....+..||..+++.|.+. +.+.++++.. + ..++++.+.+||+++ +.++++++.+..... .. T Consensus 345 ~~~~~~~d~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~~l~~~~~~~----~~~~ai~d~p~~~~~~~~~~~~ 420 (663) T protein:vir:10 345 NNAVGSDELIAGWDLFADREALHVNLMIAGACKSDGVAVASTVQKHVVALADDR----QDCVAFVNPPSELLVGVPTTQA 420 (663) T ss_pred cccchhhhhhhHHhhhccccccCceEEEeecCCCCchhhHHHHHHHHHHHHHhh----CCEEEEEecCcccccccchhhh Confidence 2335777898888877654 3444444322 1 245677777777765 458999987654321 11 Q ss_pred HHHH-------------HHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhccccccccc-- Q lcl|NC_011270. 358 SATR-------------IANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIR-- 422 (581) Q Consensus 358 ~~~~-------------~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~-- 422 (581) .+.. ......++++|.++++|+.++.+...+......|..++||.+|.......+|+||.|+.+. T Consensus 421 ~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D~~~g~~~span~~~~~i 500 (663) T protein:vir:10 421 VKNIVEWRNGVTTGGEVVDNNMNISSTYAFISGNYKYQYDKYNDINRWVPLSADIAGLCAYTDQVGHPWMSPAGYRRGQL 500 (663) T ss_pred HHHHHHHhhhccccchhhhhhcccCcceEEEEecceeEecccCCceEEechHHHHHHHHHHhhccCCcEEccCCeeecce Confidence 1111 2233568899999999988877665444333344457777777777788899999999754 Q ss_pred -CcccccccCCHHHHHHHHhCCcEEEEEeCC-CeEEEEEeeeccCCC-cccceEEeehhhHHHHHHHHHHHhhhcCCCcc Q lcl|NC_011270. 423 -GFSGPAEVQRDGEKSRESSEGLMVIEKTPR-NLVHVRHGVTTDPTS-LHTREWNIIGQQDVMVYRIRDYLDADGLIGMP 499 (581) Q Consensus 423 -g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~-~~v~i~~~itT~~td-~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~ 499 (581) |+.++...+++.|++.|+++|+++++..++ +++++ ||.+|...+ +.|++|++||++|+|+++|++.++| |++|| T Consensus 501 ~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~-wG~rT~s~~~s~~~~i~vrR~~~~i~~si~~~~~~--~v~ep 577 (663) T protein:vir:10 501 RNTIKLAIEPKQSLRDTMYQVSINPVTGFAGGDGFVL-FGDKMATQVPSPFDRINVRRLFNMLKKNIGDTSKY--ELFEN 577 (663) T ss_pred eccccceeecCchhHHHHHhCCCcEEEEeeCCCcEEE-EcccccCCCCcccceEehhhHHHHHHHHHHHHHHH--hccCC Confidence 556677889999999999999999987776 56654 676777655 5899999999999999999999876 67799 Q ss_pred CCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccc-eEEEE Q lcl|NC_011270. 500 IYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETG-DITST 574 (581) Q Consensus 500 n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg-~~~~~ 574 (581) |++.+|..|+..|+.||.+||++|+|.+|. ...+++.+++.++++++|.++|++|+|||+++|+.. .+| +|.-- T Consensus 578 n~~~l~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~-~~~~~f~e~ 656 (663) T protein:vir:10 578 NDAFTRQSFRMEVSQYLDNIRSLGGVYDFRVVCDTTNNTPQVIDSNEFVATIYIKAPRSINYITLNFVAT-STGANFDEL 656 (663) T ss_pred CCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEE-ecCccHHHH Confidence 999999999999999999999999999985 335667788999999999999999999999999754 333 22110 Q ss_pred EeecccC Q lcl|NC_011270. 575 IEGTTSF 581 (581) Q Consensus 575 ~~~~~~~ 581 (581) + |-..- T Consensus 657 ~-~~~~~ 662 (663) T protein:vir:10 657 I-GPAQL 662 (663) T ss_pred H-HHHhc Confidence 0 00000 No 24 >protein:vir:101187 Length: 663 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932509;genbank:gi:37651635;genbank:GeneID:2610680 Probab=100.00 E-value=5.2e-40 Score=235.94 Aligned_cols=544 Identities=11% Similarity=0.074 Sum_probs=244.4 Q ss_pred CeeccccccCCCc---------ccccCcccccccccccCceeeEEEecCCCCceeee-eEEcCcCCceee-----EEEEE Q lcl|NC_011270. 1 MAIDFSQYQTPGV---------YTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRES-IRINPDTGETIT-----TQILA 65 (581) Q Consensus 1 ~~~~~~~~~~~~~---------~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~e-----vq~v~ 65 (581) -=|.+-.|-|... |...||.++..... .-+... ...-|.+ .|. +|+.......+. -..++ T Consensus 28 ~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~~~--~~~v~~--~f~ngg~-~~~vvRv~~~~~~~~a~~~~~~~~~~ 102 (663) T protein:vir:10 28 AIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVTAP--YFMSAM--NFLQYGN-DLRLVRVIDMEKAKNASPLVNQVSVT 102 (663) T ss_pred eEEeeeccCCCCccEEecCHHHHHHHhCCcCccchh--HHHHHH--HHHhCCC-eEEEEEccCCcccccccccCCcceee Confidence 2223333344322 33456766532110 000000 1111222 222 233211000000 00111 Q ss_pred Eecc----ccceeEEEEeCceeccc----cccCCCHHHHHHHHHhc-CCCCcceEEEEcCCCceEEEEecCCcccc--cc Q lcl|NC_011270. 66 LVGE----PTGGSFKLSLAGEPTGN----IPFNATQGQVQSALRAL-PNVEDDEVTVLGDPGGPWTVTFTKAVAAL--TK 134 (581) Q Consensus 66 ~~~~----~~~GtF~l~~~g~~T~~----i~~~asa~~v~~aLe~l-~~i~~~~V~~~~~~g~~w~Vtf~g~~~~l--~~ 134 (581) .... ..+-..++.+.+..... ...++....+...+..- .......+......+..|..+|....+.- .. T Consensus 103 ~~~~~~~~~~g~~~~v~~~~~~~~~~~~~~~~~~n~~~~~v~~~~a~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~ 182 (663) T protein:vir:10 103 ITTEGQGYTVGDAITVKYNNATITEAGKVTAVDSDGKIKSLFVPTAEIIAKTRQLGTYPTLGDNWRIDVSGASGGSAAAL 182 (663) T ss_pred eeccccCccccccccccccccccccccceeeeccCCceEEEEeccccccccccccceeeeccccceeEeeeccccccccc Confidence 1100 01111111111100000 00000000000000000 00000000000001122334442111100 00 Q ss_pred ccceeccCCCceEEEEEccccc-ceeeeccccccccceeeeeccccccceeeEeccceeEEeecccc------------- Q lcl|NC_011270. 135 DVTGLTGGDDPDLNIASEQTGV-PAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGED------------- 200 (581) Q Consensus 135 ~~~~l~~g~~~~v~v~~~~~g~-~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~d------------- 200 (581) ....+....+..+.......+. .......... ......+.....+. .+....+ ....... T Consensus 183 ~v~~vv~~~~~~~~~~~~a~~~~~~~~~~~~~~--~~~~~~~~a~~~G~---~Gn~i~v-~i~~~~~~~~~~~~~v~~~~ 256 (663) T protein:vir:10 183 ALGNIVVDSGVTFGNSEDAPAVMTSPAVMEKYA--KFGMPLVSAVYPGE---IGSTVEV-EIVSKTAFNSGAQQTIYPFG 256 (663) T ss_pred cccceecccceeeEeeccccccccccchhhhcc--cccceeeeeecccc---cccceeE-Eecccccccccccccccccc Confidence 0000000000000000000000 0000000000 00000000000000 0000000 0000000 Q ss_pred cc-cCcceeeeeeeeeeecccccccceeEEEEeecCCc-ccceeEeccCcchhhhhhhh--h-hhhccccccceeeeeee Q lcl|NC_011270. 201 GE-ANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPN-YHEVIRFTDPDDIQDFYGPA--F-DEAGNVQSEITLCAQLA 275 (581) Q Consensus 201 g~-~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~-~~e~~~~~d~~~~~~~~~~a--~-~~~g~~~~~i~~~~~~~ 275 (581) +. ......+..+.+..+ +....-....+.. +........... ...+.. + ....+..+......... T Consensus 257 ~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~s~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (663) T protein:vir:10 257 GTRTSNARGVIQYGPMTD-------DQFAIIVRRDGIVVESTVLSTRKGDR--DVYGSNIFMDDYFRNGGSNFIFASSEG 327 (663) T ss_pred cccccccceeeeeccccc-------cceeEEEecCCcceeeeeeeeccccc--ccchhhhhhhhhhccCcceEEEEeecc Confidence 00 000000000000000 0000000111100 001111110000 000000 0 01112222222111111 Q ss_pred ecCCcceeEEeeeccC-CcccchhhHHHHHHHHhcCCce---EEEEeC--C----CcHHHHHHHHHHHHHHhcCCCcEEE Q lcl|NC_011270. 276 ITNGASTILACAVDPE-GDTVTMGDYQNALNKFRDEDEI---AIIVAG--T----GAQPIQALVQQHVSAQSNNKYERRA 345 (581) Q Consensus 276 ~~~g~~~~~~~~~~~~-~~~~t~~dy~~al~~l~~~~~~---~iv~~~--t----~~~~i~~~l~~~v~~~~~~~~~~~a 345 (581) ........+....+.+ ..+.+..||..++++|++.+.. .++++. . ...++++.+.+||++++ .+++ T Consensus 328 ~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~al~~~a~~~~----~~~a 403 (663) T protein:vir:10 328 WPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQKYVVSLADDRQ----DCVA 403 (663) T ss_pred cCccccceeEcccccCCCccccchhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHHHHHHHHHhhC----CEEE Confidence 1111111122111111 2335678999999998875432 233332 1 22567788888888774 4899 Q ss_pred EEecCCCCC-----chhHHHHHH-------------HhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHh Q lcl|NC_011270. 346 ILGMDGSVT-----PVPSATRIA-------------NAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKS 407 (581) Q Consensus 346 vvg~~~~~~-----~~~~~~~~~-------------~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~ 407 (581) +++.+.+.. .......++ ....++++|.++++|+..+.+...+......|..++|+++|.+. T Consensus 404 i~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~P~~~~~d~~~~~~~~~p~s~~vAGl~Ar~D 483 (663) T protein:vir:10 404 IVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDINRWVPLAADIAGLCAYTD 483 (663) T ss_pred EEecCcccccccccccchHHHHHHHHhccccccchhhccccCcceEEEEccceEEecccCCceEEechhHHHHHHHHHhh Confidence 998764321 111122222 23457899999999988877665444444445557788888888 Q ss_pred hccchhccccccc---ccCcccccccCCHHHHHHHHhCCcEEEEEeCC-CeEEEEEeeeccCCC-cccceEEeehhhHHH Q lcl|NC_011270. 408 VSAIAAMPLTRKV---IRGFSGPAEVQRDGEKSRESSEGLMVIEKTPR-NLVHVRHGVTTDPTS-LHTREWNIIGQQDVM 482 (581) Q Consensus 408 a~~~~~~slt~~~---l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~-~~v~i~~~itT~~td-~~~~~i~v~R~~d~i 482 (581) ....+|+||.|+. |.|+.+++..+++.|++.|+++||++++..++ +++++ ||.+|+..+ +.|++|++||++++| T Consensus 484 ~~~g~~~sPan~~~~~i~g~~~~~~~~~~~~~~~Ln~~gin~i~~~~~~~G~~~-wG~rT~s~~~s~~~~i~vrR~~~~i 562 (663) T protein:vir:10 484 QVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVL-FGDKMATQVPSPFDRINVRRLFNML 562 (663) T ss_pred ccCCceEccCCceeccccccccceeccChhHHHHHhhCCceEEEEEeCCCcEEE-EcccccCCCCcccceEehhhHHHHH Confidence 8888999999986 55677788999999999999999999987776 57765 677887765 489999999999999 Q ss_pred HHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCCCEEEEEEEEEecCceeEE Q lcl|NC_011270. 483 VYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQPDVIEVRYEWRPAYPLNYI 558 (581) Q Consensus 483 ~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~~~~~v~i~v~pv~~~e~I 558 (581) +++|++.++| |++|||++.+|..|+..|+.||.+||++|+|.+|. ++.+++.++++|+++++|.++|++|+||| T Consensus 563 ~~si~~~~~~--~v~e~n~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~i 640 (663) T protein:vir:10 563 KKNIGDTSKY--ELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDRNEFVGTIYVKPPRSINYI 640 (663) T ss_pred HHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceE Confidence 9999999876 67799999999999999999999999999999985 34567778999999999999999999999 Q ss_pred EEEEEEEeccceEEEEEeecccC Q lcl|NC_011270. 559 VVRYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 559 ~~~~~~~~~tg~~~~~~~~~~~~ 581 (581) ++||+... +.+ +| T Consensus 641 ~~~~~~~~--~~~--------~~ 653 (663) T protein:vir:10 641 TLNMVATS--TGA--------NF 653 (663) T ss_pred EEEEEEee--cCc--------cH Confidence 99986442 222 23 No 25 >protein:vir:103456 Length: 659 # NCBI annotation: tail sheath monomer # Family: family:all:661 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803108;genbank:gi:116326388;genbank:GeneID:4405485 Probab=100.00 E-value=5.4e-40 Score=235.85 Aligned_cols=549 Identities=14% Similarity=0.098 Sum_probs=240.7 Q ss_pred ccccCCCcccc--cCccc----------ccccccccCc-eeeEEEecCC----CCceeeeeEEcC--------cCCceee Q lcl|NC_011270. 6 SQYQTPGVYTE--AVGAP----------QLGIRSSVPT-AVAIFGTAVG----YQTYRESIRINP--------DTGETIT 60 (581) Q Consensus 6 ~~~~~~~~~~~--~~g~~----------~~~~~~~~~~-~~~~~~~~~g----~~~~~~~~~~~~--------~~~~~~e 60 (581) =.+..|++|.+ ..+.. +.+...+.|. .......... |++.+..- ..+ ..+...- T Consensus 1 ~~~~~PgVyv~e~~~~~~~~~~~ts~~~fvG~~~~Gp~~~p~~i~s~~d~~~~fG~~~~~~-~~~~~v~~~f~ngg~~~~ 79 (659) T protein:vir:10 1 MTLLSPGIELKETTVQSTVVNNSTGTAALAGKFQWGPAFQIKQVTNEVDLVNTFGQPTAET-ADYFMSAMNFLQYGNDLR 79 (659) T ss_pred CceecCceEEEEecCCceecccCccceEEEecccCCCCCccEEecCHHHHHHHcCCcCCCc-chhHHHHHHHhhCCCeEE Confidence 12233444432 01111 2222222221 1111100000 11110000 000 0011111 Q ss_pred EEEEEEecc-------c--cceeEEEEeCceecc---ccccCCCHHHHHHHHHhcCCCCcceEEEEcCCCceEEEEe--- Q lcl|NC_011270. 61 TQILALVGE-------P--TGGSFKLSLAGEPTG---NIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTF--- 125 (581) Q Consensus 61 vq~v~~~~~-------~--~~GtF~l~~~g~~T~---~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~g~~w~Vtf--- 125 (581) | |.+.+. + .+..|+..+.|.... .++.-..+. -+ .....+.+....+..-.+.. T Consensus 80 v--vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~-g~~~~v~~vd~~~~~~~~~i~~~ 148 (659) T protein:vir:10 80 V--VRAVDRDTAKNSSPIAGNIEYTISTPGSNYAVGDKITVKYVSD--------AI-ETEGKITEVDTDGKIKKINIPTA 148 (659) T ss_pred E--EEccCcccccccccccccceeeEeecccccccccceeeeecCC--------Cc-cccceeeEEecccccceeeeccc Confidence 1 111100 0 011233333321100 000000000 00 00011111100000000000 Q ss_pred --------cCCcccccccc----ceeccCCCceEEEEEcccccceeee-------ccccccccceeeeeccc--ccccee Q lcl|NC_011270. 126 --------TKAVAALTKDV----TGLTGGDDPDLNIASEQTGVPAMNR-------ALAKKGIKTDTIRVVNP--NSGQVY 184 (581) Q Consensus 126 --------~g~~~~l~~~~----~~l~~g~~~~v~v~~~~~g~~~~~~-------~~~~~~~~~~~~~l~~~--~~~~~~ 184 (581) .|+.+.+..+. ....++....+.+............ ................+ ...... T Consensus 149 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~a~t~~~~~~~~~~~~~~~v~a~~~G 228 (659) T protein:vir:10 149 KIIAKAKEVGEYPTLGSNWTAEISSSSSGLAAVITLGKIITDSGILLAEIENAEAAMTAVDFQANLKKYGIPGVVALYPG 228 (659) T ss_pred ccccccccccccceeeeeeeeeeeeeccccceeeEEeeeecCCceeEEeeccccccccccccccceeecccccccccccc Confidence 01111100000 0000000111111111000000000 00000000000000000 000000 Q ss_pred eEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEE------------eecCCcccceeEeccCcchhh Q lcl|NC_011270. 185 VLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSY------------RYTDPNYHEVIRFTDPDDIQD 252 (581) Q Consensus 185 vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~------------~~~~~~~~e~~~~~d~~~~~~ 252 (581) ..+ +..........+........ +......++..........+ ...++ ..+...+.......+ T Consensus 229 ~~g-~~~tv~~~~~a~~~~~~~v~---v~~~~~~~~~a~~~t~~~~~~~~~~~~~~~~v~~~~~-~~~~~~~~~~~~~~~ 303 (659) T protein:vir:10 229 ELG-DKIEIEIVSKADYAKGASAL---LPIYPGGGTRASTAKAVFGYGPQTDSQYAIIVRRNDA-IVQSVVLSTKRGEKD 303 (659) T ss_pred eec-ccceEEEechhhccccceee---eeeeeecccccccceeeeeeccccccchhhccccccc-eeeeeeeeccccccc Confidence 000 00000000000000000000 00000000000000000000 00000 000110100000000 Q ss_pred hhhh---hhhhhccccccceeeeeeeecCCccee--EEeeeccCCcccchhhHHHHHHHHhcCC---ceEEEEeCC---- Q lcl|NC_011270. 253 FYGP---AFDEAGNVQSEITLCAQLAITNGASTI--LACAVDPEGDTVTMGDYQNALNKFRDED---EIAIIVAGT---- 320 (581) Q Consensus 253 ~~~~---a~~~~g~~~~~i~~~~~~~~~~g~~~~--~~~~~~~~~~~~t~~dy~~al~~l~~~~---~~~iv~~~t---- 320 (581) .... ......+..+................. +.++.+. ....+..|+..++++|+..+ ...+++|+. T Consensus 304 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~l~gg~~~-~~~~~~~~~~~~~~~~~~~~~~~~~il~~p~~~~~~ 382 (659) T protein:vir:10 304 IYDSNIYIDDFFAKGGSEYIFATAQNWPEGFSGILTLSGGLSS-NAEVTAGDLMEAWDFFADRESVDVQLFIAGSCAGES 382 (659) T ss_pred cccchhhhhhhhccCcccEEEEeecccCCCccceeeecccccc-cccccchhHHHHHHHhhhccccceeEEEecCCCCcc Confidence 0000 001111122222211111111111111 1122221 23356678999888886543 344555543 Q ss_pred --CcHHHHHHHHHHHHHHhcCCCcEEEEEecCC--------CCCchhHHHHHHH-------hhccCCccEEEEEcCeeEe Q lcl|NC_011270. 321 --GAQPIQALVQQHVSAQSNNKYERRAILGMDG--------SVTPVPSATRIAN-------AQSIKDQRVALISPSSFVY 383 (581) Q Consensus 321 --~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~--------~~~~~~~~~~~~~-------a~~~ns~r~~~v~~~~~~~ 383 (581) +..+++..+.+||+.+++ ++++++.+. ..+...+..+... ...++|.|+++++|+.++. T Consensus 383 ~~~~~~v~~al~~~~~~~~~----~~~~~d~p~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~s~~~~l~~p~~~~~ 458 (659) T protein:vir:10 383 LETASTVQKHVVSIGDARQD----CLVLCSPPRETVVGIPVTRAVDNLVNWRTAAGSYTDNNFNISSTYAAIDGNYKYQY 458 (659) T ss_pred hhhhHHHHHHHHHHHHhhCC----eEEEEcCccccccCCCcccCHHHHHHHHHhcccccccccccCcceEEEEeCcEEEe Confidence 235688889999988854 777776542 1222222222221 1247899999999988877 Q ss_pred cccccCCceecCHHHHHHHHHHHhhccchhcccccccc---cCcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEe Q lcl|NC_011270. 384 YAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVI---RGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHG 460 (581) Q Consensus 384 ~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l---~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~ 460 (581) +...+......|..++|+.+|.......+|+||.|+++ .|+.+++..+++.|++.|+++||++++..++++++++ | T Consensus 459 d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~w-G 537 (659) T protein:vir:10 459 DKYNDVNRWVPLAADIAGLCARTDNVSQTWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVLY-G 537 (659) T ss_pred cccCCceEEechHHHHHHHHHHHhccCCceEccCCceeeeeeccccceecCCHhHHHHHhhCCeeEEEEeCCCeEEEE-c Confidence 66544433344445788888888888899999999974 4566777889999999999999999998888888765 5 Q ss_pred eeccCCC-cccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEE Q lcl|NC_011270. 461 VTTDPTS-LHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQ 535 (581) Q Consensus 461 itT~~td-~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~ 535 (581) -+|+.++ +.|++|++||++++|+++|++.++| |++|||++.+|..|+..|+.||+.||++|+|++|. ...+++ T Consensus 538 ~rT~~~~~s~~~~i~vrR~~~~i~~si~~~~~~--~v~e~n~~~l~~~i~~~i~~fL~~l~~~gal~~~~V~~d~~~nt~ 615 (659) T protein:vir:10 538 DKTATSVPSPFDRINVRRLFNMLKTNIGRSSKY--RLFELNNAFTRSSFRTETAQYLQGIKALGGIYEYRVVCDTTNNTP 615 (659) T ss_pred ccccCCCCcccceEehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeEEEEEcCCCCCH Confidence 5666655 5899999999999999999999876 66799999999999999999999999999999985 345677 Q ss_pred eecCCCEEEEEEEEEecCceeEEEEEEEEEeccceEEEEEeeccc Q lcl|NC_011270. 536 IERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTS 580 (581) Q Consensus 536 ~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~~~~~~~~~ 580 (581) .+++.++++++|.++|++|+|||.++|+.....-++. .|.|-.. T Consensus 616 ~~i~~G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~-e~~~~~~ 659 (659) T protein:vir:10 616 SVIDRNEFVATFYIQPARSINYITLNFVATATGADFD-ELTGLAG 659 (659) T ss_pred HHhhCCeEEEEEEEEecCCcceEEEEEEEEecCcchH-HhhccCC Confidence 7889999999999999999999999997552221221 1111111 No 26 >protein:vir:104858 Length: 729 # NCBI annotation: T4-like tail sheath protein # Family: family:all:661 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214361;genbank:gi:61806001;genbank:GeneID:3294378 Probab=100.00 E-value=8e-40 Score=234.93 Aligned_cols=554 Identities=11% Similarity=0.057 Sum_probs=231.9 Q ss_pred Cee-------------ccccccCCC---cc--cccCcccccccccccCceeeEEEecCCCCceeeeeEEcCcCCceeeEE Q lcl|NC_011270. 1 MAI-------------DFSQYQTPG---VY--TEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRESIRINPDTGETITTQ 62 (581) Q Consensus 1 ~~~-------------~~~~~~~~~---~~--~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq 62 (581) +-| .+....... .. ...-.....................+|..+...++.+...... + T Consensus 86 vRv~~~~~~~a~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~G~~gn~~~v~v~~~~~~----~ 161 (729) T protein:vir:10 86 VRADDYNTQTGVGLKNAFVTGGVGVGATMLRITSNTHYNQLGYDENTISGVSVAAKNPGTWANGIKVAIIDGKAD----Q 161 (729) T ss_pred EecCcccccccccccccccccccccccccccccccccccccccccCCCcceEEEEeccCccccceeeEEecccCc----c Confidence 000 000000000 00 0000000111111111112222223333333333322211110 0 Q ss_pred EEEEec---cccceeEEEEeCceec------------cccccCCCHHHHHHHHHhcCCCCc--ceEEEEcCCCceEEEEe Q lcl|NC_011270. 63 ILALVG---EPTGGSFKLSLAGEPT------------GNIPFNATQGQVQSALRALPNVED--DEVTVLGDPGGPWTVTF 125 (581) Q Consensus 63 ~v~~~~---~~~~GtF~l~~~g~~T------------~~i~~~asa~~v~~aLe~l~~i~~--~~V~~~~~~g~~w~Vtf 125 (581) .+++.. ...+-.|.+++.+... ..+..+.+..... ...+..... ..............+.. T Consensus 162 ~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--v~~~s~~~~~~~~~~~~~~~~~~~~~~~ 239 (729) T protein:vir:10 162 ILTVASGNTTAVGSAVTQSISKTIGTATGTTTIDGVLKGIVTGSTDTTLE--VKVISHISAAGVETAVEYQQNGTYTFDN 239 (729) T ss_pred eeeeeccccccceeeeeeeccccccccccceeeeeeeccccccccccccc--ceecccccccccceeccccccceeeecc Confidence 111100 0000011111111000 0000000000000 000000000 00000000000011110 Q ss_pred cCCccccccccceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccc-----eeeEeccceeEEeecccc Q lcl|NC_011270. 126 TKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQ-----VYVLGTDYVVTRVNAGED 200 (581) Q Consensus 126 ~g~~~~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~-----~~vtgtd~~v~~v~~~~d 200 (581) .+..+.+....... +.....+.................. ....+ .+.+.. ......+...... ...+ T Consensus 240 ~~s~~~~a~~~~~~--~~~~~~t~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~t~~~~~~~~~~~d~~~~~~-~d~~ 311 (729) T protein:vir:10 240 SGSVNVIAAGSSGS--GSAKSYTAQTDWFESQNIVLSNSTL----EWDSI-ADAPGTSTYVSTRGGKNDEIHVLV-IDDK 311 (729) T ss_pred cCccceeeeccccc--cccccceeeeccccccccccccccc----ccccc-ccccccccccccccccccccceee-eccc Confidence 11111000000000 0000000000000000000000000 00000 000000 0000000000000 0000 Q ss_pred ccc----Ccceeee-eeeeeeeccccccc-----cee--EEEEeecCCcccceeEeccCcchhhhhhhhhh---hhcccc Q lcl|NC_011270. 201 GEA----NTRDDLY-TIQRVVDGGHIDPG-----DIV--QLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFD---EAGNVQ 265 (581) Q Consensus 201 g~~----~~~~~~~-ti~~~vd~~~~d~~-----~~~--~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~---~~g~~~ 265 (581) +.. ....... .+....+....... ++. ...+............................ .+.... T Consensus 312 ~~~~~~~g~vve~~~~~s~~~~~~~~~~~~~~~~~vi~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 391 (729) T protein:vir:10 312 GTITGNSGTILEKHLSLSKAKDAEYSVGSSSYWRDFLATNSKYIFGGGATSGITTTGYSVSSTNTLDTDSGWDQNAEGVN 391 (729) T ss_pred cccccCcccceeeeeeeeeccccccccccccccceeeccccceeeecccccccccccccccccceecccccccccccccc Confidence 000 0000000 00000000000000 000 00000000000000000000000000000000 000000 Q ss_pred ccceeeeeeeecCCccee-EEeeeccCCcccchhhHHHHHHHHhcCCceEE---EEe-----CCCcHHHHHHHHHHHHHH Q lcl|NC_011270. 266 SEITLCAQLAITNGASTI-LACAVDPEGDTVTMGDYQNALNKFRDEDEIAI---IVA-----GTGAQPIQALVQQHVSAQ 336 (581) Q Consensus 266 ~~i~~~~~~~~~~g~~~~-~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~i---v~~-----~t~~~~i~~~l~~~v~~~ 336 (581) ..........+.+|.... ........+......++..++++|++.+...+ +++ ......++..+.+||+++ T Consensus 392 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~g~~~l~~~~~~~~~~~~~~~~~~~~~~~~~v~~a~~~~~~~~ 471 (729) T protein:vir:10 392 FGASGVATLTLAGGTNYGDKTDLTTSGALSSGVDDIISGYTLFENTEEIEVDFILMGAAHHPKEQSQAVAEKVTAVAEAR 471 (729) T ss_pred ccccceeEEEeecccccccccccccccccccchhHHHHHHHHhhcccccccceeeecCCCCCccchHHHHHHHHHHHHhc Confidence 000000111111111100 00001111223445678899999987654322 222 345667888899999887 Q ss_pred hcCCCcEEEEEecCCCC--------------CchhHHHHHHHhhcc-CCccEEEEEcCeeEecccccCCceecCHHHHHH Q lcl|NC_011270. 337 SNNKYERRAILGMDGSV--------------TPVPSATRIANAQSI-KDQRVALISPSSFVYYAPELNREVVLGGQFMAA 401 (581) Q Consensus 337 ~~~~~~~~avvg~~~~~--------------~~~~~~~~~~~a~~~-ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa 401 (581) ++ ++++++..... ......+.......+ +++++++++|+..+.+...+......|..++|+ T Consensus 472 ~~----~~a~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~~~~~~p~s~~~aG 547 (729) T protein:vir:10 472 KD----AVAFISPYRQAFLNDTVSGTVTVSNIDQTTENVVGFYAPLSSSTYSVFDSGYKYMFDRFNNTFRYVPLNGDIAG 547 (729) T ss_pred CC----eEEEecccccccccccccccccccccchhhHHHHHHHhhccCCceEEEEcCeeEEecccCCceEEechhHHHHH Confidence 53 77777643210 011112223333444 456788888887776654443333444557888 Q ss_pred HHHHHhhccchhcccccccccC---cccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeecc-CCCcccceEEeeh Q lcl|NC_011270. 402 AVAGKSVSAIAAMPLTRKVIRG---FSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTD-PTSLHTREWNIIG 477 (581) Q Consensus 402 ~vAgl~a~~~~~~slt~~~l~g---~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~-~td~~~~~i~v~R 477 (581) .+|.+..+..+|+||.|+++.+ +.++...+++.|++.|+++||++++.+++++++++ |-+|+ ..|+.|++|++|| T Consensus 548 l~a~~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~w-G~rT~~~~d~~~~~i~vrR 626 (729) T protein:vir:10 548 TCARTDIEQFPWFSPAGTARGPILNSVKLVYNPGKKQRDILYSNRINPVILSPGAGIILF-GDKTGFGKSSAFDRINVRR 626 (729) T ss_pred HHHHhhccCCcEEccCCccccceecccceeeecChhhHhhhhhCCceEEEEecCCeEEEE-cceecCCCCcccceeehhh Confidence 8888888888999999998655 45566788999999999999999998888888765 55665 6789999999999 Q ss_pred hhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCCCEEEEEEEEEecC Q lcl|NC_011270. 478 QQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQPDVIEVRYEWRPAY 553 (581) Q Consensus 478 ~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~~~~~v~i~v~pv~ 553 (581) ++|+|++.|++.++| |++|||++.+|..|+..|.+||..||++|+|.+|. ...+++.+++.|+++++|.++|++ T Consensus 627 ~~~~i~~si~~~~~~--~v~epn~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~~G~~~~~v~~~p~~ 704 (729) T protein:vir:10 627 LFIYLEDAISAAAKD--QLFEFNDELTRTNFVNIVEPFLRDVQAKRGIFDFVVICDETNNTAAVIDSNEFVADIFIKPAR 704 (729) T ss_pred hHHHHHHHHHHHHHH--hhcCCCCHHHHHHHHHHHHHHHHHHHhccceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecC Confidence 999999999999876 67799999999999999999999999999999985 445677789999999999999999 Q ss_pred ceeEEEEEEEEEeccce---EEEEE Q lcl|NC_011270. 554 PLNYIVVRYSIAPETGD---ITSTI 575 (581) Q Consensus 554 ~~e~I~~~~~~~~~tg~---~~~~~ 575 (581) |+|||.+||+....-=+ +-+.| T Consensus 705 p~e~i~~~~~~~~~~~~~~e~~~~~ 729 (729) T protein:vir:10 705 SINFIGLTFVATRTGVAFEEVIGSV 729 (729) T ss_pred CccEEEEEEEEeecCccHHHHHhcC Confidence 99999999866543211 22223 No 27 >protein:vir:5833 Length: 742 # NCBI annotation: similar to tail sheath protein # Family: family:all:661 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835619;genbank:gi:30044022 Probab=100.00 E-value=1.3e-39 Score=233.73 Aligned_cols=529 Identities=16% Similarity=0.084 Sum_probs=246.1 Q ss_pred CeeccccccCCCcccc--------cCccc--------ccccccccCceeeEE-----EecCCCCceeeeeEEcCcCCcee Q lcl|NC_011270. 1 MAIDFSQYQTPGVYTE--------AVGAP--------QLGIRSSVPTAVAIF-----GTAVGYQTYRESIRINPDTGETI 59 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~--------~~g~~--------~~~~~~~~~~~~~~~-----~~~~g~~~~~~~~~~~~~~~~~~ 59 (581) |=|+|.+ ++ + |.. .++.| +.++.+. ...++. -+.++.-...--+ ..|.++... T Consensus 102 ~~~~~~~-~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 175 (742) T protein:vir:58 102 MFLGYDP-QK-G-YTDVSYVDVQLAGTPTDTILFSYSLDGSSTT--HSLTINLNAPSVTLPSNIVPLFFY-YEPYTGSIT 175 (742) T ss_pred eeeeccc-CC-C-cccceeEEEEEccCCCeeEEEeeecCCCcce--eEEEEEeeceeEeeccccceeeeE-eccccceEE Confidence 6677655 21 1 221 11111 1111111 111110 0111111000000 122222110 Q ss_pred eEEEEEEeccccceeEEEEeCceecccccc-----------------------CCCHHHHHHHHHhcCCCCcceEEEEcC Q lcl|NC_011270. 60 TTQILALVGEPTGGSFKLSLAGEPTGNIPF-----------------------NATQGQVQSALRALPNVEDDEVTVLGD 116 (581) Q Consensus 60 evq~v~~~~~~~~GtF~l~~~g~~T~~i~~-----------------------~asa~~v~~aLe~l~~i~~~~V~~~~~ 116 (581) =..+|. -.++.|.++++- .+|.+|+| |..+.+.--++|+. ++... T Consensus 176 ~~~~~~--~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~ 243 (742) T protein:vir:58 176 LQSSVN--YSGLTLNYTVSK--ATTPWVYFAEYGTPTSSLTLYKGFYLEGIDLNSFNKQFVVSIENI--------TVNRE 243 (742) T ss_pred Eeeecc--cCCCcccceeee--eecCcccccccCCCccceeeeecccccccccCcccceeeEEEeee--------eeccc Confidence 000000 012223333321 33455553 22222222222111 01111 Q ss_pred CCc------eEEEEecC--------------------Ccc--------c--cccccce-eccCC----CceEEEEEcccc Q lcl|NC_011270. 117 PGG------PWTVTFTK--------------------AVA--------A--LTKDVTG-LTGGD----DPDLNIASEQTG 155 (581) Q Consensus 117 ~g~------~w~Vtf~g--------------------~~~--------~--l~~~~~~-l~~g~----~~~v~v~~~~~g 155 (581) .|. .-.|.|.. .-| . ...++.. ++||. .+.+.+....+. T Consensus 244 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~n~~~~~~~~~~~~~ 323 (742) T protein:vir:58 244 KGQVLYPSFDVVVHFRDIRGVSANTEYIRFRQVNLNPESPNYIERVIGNMTFEFDGERIVTGGEYPNQVPFLRVVVSQDI 323 (742) T ss_pred CCceeccceeEEEEEeeccCCCCCccceeeeeeecCCCCcceeeecccceeeeeccceeeecccccccccceeeEecccc Confidence 110 11233311 100 0 1112221 23331 133333333322 Q ss_pred cceeeeccccccccceeeeeccccccceeeEecc--ceeEEeeccc-----------ccccCcceeeeeeeeeeeccccc Q lcl|NC_011270. 156 VPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTD--YVVTRVNAGE-----------DGEANTRDDLYTIQRVVDGGHID 222 (581) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd--~~v~~v~~~~-----------dg~~~~~~~~~ti~~~vd~~~~d 222 (581) ........ .-+....+++.. .++..+.... .+...++.+. ....++.--+..+.....+.... T Consensus 324 ~~~~~~~s--~~~~~~~~~~~~--v~d~~~~~~~~~~v~~~~t~~~~~pp~~~~~~e~v~~ngG~~f~v~s~~~~g~~i~ 399 (742) T protein:vir:58 324 KQNVAGVE--KWVPVGFEGIYS--VGDFTVIVNELTNVSIPVTDSAIIPPMRFTRIEQITLSGGASFSVISNQPYGFNIQ 399 (742) T ss_pred CcCcccee--EEEecccccccc--ccceeeeccccccceeeccccccCCcccccccceeecccCcceEEEEecccCccee Confidence 11100000 000000000000 0000000000 0000000000 00000000001111010000000 Q ss_pred ccceeEEEEeecCCcccceeEeccCc-chhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccC-------Ccc Q lcl|NC_011270. 223 PGDIVQLSYRYTDPNYHEVIRFTDPD-DIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPE-------GDT 294 (581) Q Consensus 223 ~~~~~~~s~~~~~~~~~e~~~~~d~~-~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~-------~~~ 294 (581) ... .++-.........+...+.. +..+.......... ..+-........+.+|............ ... T Consensus 400 ~~~---as~~~s~ln~~~~V~Gt~aa~~~~d~~t~~~v~s~-~~alp~~a~sv~laGG~dg~v~v~~~~~D~iG~~~~~d 475 (742) T protein:vir:58 400 DSR---HSYWLSPFKDDELIIGTELVLPALDVSTEFGVSSW-EEALPEFSFLMPFQGGSDGYIRVDENEPDTIGRVKITP 475 (742) T ss_pred ccC---cceEEeccCCceEEEeehhhccccccchheecccc-ccccceeeEEEeecCCccccccccCCCccccccccccc Confidence 000 00000000000011110000 00000000000000 0000111223344444432221111100 001 Q ss_pred cchhhHHHHHHHHhcCCce-EEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccE Q lcl|NC_011270. 295 VTMGDYQNALNKFRDEDEI-AIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRV 373 (581) Q Consensus 295 ~t~~dy~~al~~l~~~~~~-~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~ 373 (581) ...++| .+|.+|...+.+ .+++|+..+..+++.+.+||+.+.+ |+.++...+.. .......+.....+++.|. T Consensus 476 ~~~adr-TGL~ALlev~eVtILiAPG~t~~~v~aav~A~la~a~~----Rl~vL~D~P~~-~tt~~~A~a~r~~~nSsra 549 (742) T protein:vir:58 476 ALLANY-ERLLPLLTEDQFDLVLTPYLTFADHAGTVNAFINRAEN----RFLYLFDIAGD-DDTENLAISLAGYINSSFA 549 (742) T ss_pred ccccch-hHHHHhhhcCCCcEEEEcCCCchHHHHHHHHHHHhhcC----CeEEEEecCCC-CchHHHHHHHHhccCCceE Confidence 112344 346666555444 4566777778888999999988754 34444333222 2233455667788999999 Q ss_pred EEEEcCeeEecccccCCceecC-HHHHHHHHHHHhhccchhcccccccccCcccccccCCHHHHHHHHhCCcEEEEEeCC Q lcl|NC_011270. 374 ALISPSSFVYYAPELNREVVLG-GQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPR 452 (581) Q Consensus 374 ~~v~~~~~~~~~~~~~~~~~~p-~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~ 452 (581) ++++|+....++ .....+| ..++||.+|....+..+++||.|+.+.+.. ..++.|++.|+++|+++++.. + T Consensus 550 aly~PwVkv~d~---~~~r~vPpSgaIAGL~ARtD~erGvw~SPANrgii~~~----~~s~se~d~LN~~GINtIrsf-G 621 (742) T protein:vir:58 550 TTFFPWVRRLTN---KGMRTVPASLAAYRSIRTTDPETGLAPVGARRGVVTGE----PVRQVDWEDLYNNRINPIVRV-G 621 (742) T ss_pred EEEeceeeeccC---CcceeechHHHHHHHHHHhccCCceEecCCcceeeecc----ccchhhHHHHhhCCceEEEEC-C Confidence 999987655433 2233344 446677777777777789999998654332 457889999999999999764 5 Q ss_pred CeEEEEEeeecc-CCCcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc-- Q lcl|NC_011270. 453 NLVHVRHGVTTD-PTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR-- 529 (581) Q Consensus 453 ~~v~i~~~itT~-~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~-- 529 (581) +++++ ||-+|+ ..|++|++|++||++|+|+++|++.++| |++|||++.+|..|+..|++||+.||++|+|.||+ T Consensus 622 ~G~rl-WGnRTlassDs~wryInVRRlfd~Ie~SI~~a~q~--~VfEPNd~~L~~sIk~sInafL~~L~aqGALlGfrV~ 698 (742) T protein:vir:58 622 NDVLL-FGQKTMLNVNSALNRINVRRLLIVMRNRISQILSS--YLFENNTSENRLRAEALVRQYLESLRLRGAVTDYEVA 698 (742) T ss_pred CcEEE-EcceecCCCCcccceEeehhhHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEE Confidence 57765 566776 6689999999999999999999999875 57799999999999999999999999999999985 Q ss_pred -cceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceEE Q lcl|NC_011270. 530 -NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDIT 572 (581) Q Consensus 530 -~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~ 572 (581) +++++..+++.|+++++|.++|++|+|||.++|.+...+-+++ T Consensus 699 lDetNTpeDI~~Gklvv~I~vAP~~PAEfI~lrf~it~tga~Fs 742 (742) T protein:vir:58 699 IDSVTTPTDIDNNTLRARVTVQPARSIEYIDITFVITPTGVEIT 742 (742) T ss_pred EcCCCCHHHhhCCEEEEEEEEEccCCcceEEEEEEEEecccccC Confidence 3445666789999999999999999999999999988888888 No 28 >protein:vir:6894 Length: 660 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861870;genbank:gi:32453661;genbank:GeneID:1494296 Probab=100.00 E-value=1.1e-39 Score=234.15 Aligned_cols=557 Identities=13% Similarity=0.072 Sum_probs=235.3 Q ss_pred Cee----------ccccccCCCcccccCcccccccccccCceeeEE-E------------------------ecCCCCce Q lcl|NC_011270. 1 MAI----------DFSQYQTPGVYTEAVGAPQLGIRSSVPTAVAIF-G------------------------TAVGYQTY 45 (581) Q Consensus 1 ~~~----------~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~-~------------------------~~~g~~~~ 45 (581) |+. ++.+ ..+++.++ ..-+++...+.|-..++. . ...-|.+ T Consensus 1 ~~~~~PgVyv~e~~~~~-~i~~v~ts--~~~fvG~~~~Gp~~~p~~i~s~~~~~~~fG~~~~~~~~~~~~~~~f~~~g~- 76 (660) T protein:vir:68 1 MALLSPGVELKETTVQS-TVVNNSTG--TAALAGKFQWGPAFQIKQITDEVALVDMFGTPNTDTADYFMSAMNFLQYGN- 76 (660) T ss_pred CccccCceEEEEecCCc-ccccCCCc--ceeEEecccCCCCccCEEecCHHHHHHhcCCccCccchhHHHHHHHHhCCC- Confidence 111 1111 01111111 000122222222111100 0 0011122 Q ss_pred eee-eEEcCcCCc---eeeEEEEEE--ecc----ccceeEEEEeCceeccc----cccCCCHHHHHHHHHhcCCCC-cce Q lcl|NC_011270. 46 RES-IRINPDTGE---TITTQILAL--VGE----PTGGSFKLSLAGEPTGN----IPFNATQGQVQSALRALPNVE-DDE 110 (581) Q Consensus 46 ~~~-~~~~~~~~~---~~evq~v~~--~~~----~~~GtF~l~~~g~~T~~----i~~~asa~~v~~aLe~l~~i~-~~~ 110 (581) .|. +|....... ..-.+.+.. ... ..+-..++++.+..... .+.+........-+..-...+ ... T Consensus 77 ~~~vvRv~~~~~~~~~~~~~~~~~~t~~~~g~~~~~g~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~ta~~~~~a~~ 156 (660) T protein:vir:68 77 DLRVVRAVDRDTAKNSSPVAGNINFTISSAGTNYRVGDKVVVKYSTDIIEPDGEVTSVDSDGKILNIFIPSGKIIAKAKE 156 (660) T ss_pred eEEEEEecccccccccccccccceeeeeccCcceeeeeeeeeecccccccccccceeeeecCceeeeeecccccccccee Confidence 222 222100000 000000110 000 01111222222111000 000000000000000000000 000 Q ss_pred EEEEcCCCceEEEEecCCccccccccc--eeccCCCceEEEEEccccc---ceee-eccc----------cccccc-eee Q lcl|NC_011270. 111 VTVLGDPGGPWTVTFTKAVAALTKDVT--GLTGGDDPDLNIASEQTGV---PAMN-RALA----------KKGIKT-DTI 173 (581) Q Consensus 111 V~~~~~~g~~w~Vtf~g~~~~l~~~~~--~l~~g~~~~v~v~~~~~g~---~~~~-~~~~----------~~~~~~-~~~ 173 (581) +......+..|..+|.+..+....... .........+.+.....+. .+.. .... ....+. ... T Consensus 157 ~~~~~~~~~~~~~~v~~~~~~~~~~~~v~~~~~d~~~~~~~~~ta~~~~~~~~~~~~~~~~~~~~~~A~~~g~~G~~i~v 236 (660) T protein:vir:68 157 IGEYPELGSNWTAEMSGSSSGLSAVITIDSVVMDSGILLTEVETSEEAITSLTFQESIKKYGVPGVVALYPGELGDQLEI 236 (660) T ss_pred eccccccccceeEEeecccccceeeeeeccccccccceeeeeccccccccccceeeeecccCccccccccccccccceEE Confidence 000011123344444322111100000 0000000000000000000 0000 0000 000000 000 Q ss_pred eeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhh Q lcl|NC_011270. 174 RVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDF 253 (581) Q Consensus 174 ~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~ 253 (581) .+..+... ................. ......+.+ ....+.+...+.....+.. .+.+.+.......+. T Consensus 237 ~~~~~a~~--~~~~~~~~~~~~~~~~~--~~~~~~~~~-------~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~~~~~~ 304 (660) T protein:vir:68 237 EIVSKADY--DKGASAQLKIYPDGGTR--YSTAKAIFG-------YGPQTDDQYAIIVRRNDSV-VQSVVLSTKRGERDI 304 (660) T ss_pred EEeccccc--cccccccceeeeccccc--ccceeeEee-------cccccccceeeeeecCCcc-eeeeeeecccccccc Confidence 00000000 00000000000000000 000000000 0000000001111111110 001100000000000 Q ss_pred hhhhh---hhhccccccceee-eeeeecCCcc-eeEEeeeccCCcccchhhHHHHHHHHhcC---CceEEEEeCCC---- Q lcl|NC_011270. 254 YGPAF---DEAGNVQSEITLC-AQLAITNGAS-TILACAVDPEGDTVTMGDYQNALNKFRDE---DEIAIIVAGTG---- 321 (581) Q Consensus 254 ~~~a~---~~~g~~~~~i~~~-~~~~~~~g~~-~~~~~~~~~~~~~~t~~dy~~al~~l~~~---~~~~iv~~~t~---- 321 (581) .+... ....+..+..... .......-.. ..+.++.++ ....+..++..+++.|... +...++++... T Consensus 305 ~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~gg~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 383 (660) T protein:vir:68 305 YGSNIFIDDFFAKGASNYIFATAQGWPKGFSGVIKLNGGLSS-NETVEAGDLMEAWDLFADRESVNAQLFIAGSCAGESL 383 (660) T ss_pred cccceeeehhhccCcccEEEEeecCCCccccceeeecccccc-ccccccchhhhHHHHhhhhhccccceeeccccCCCch Confidence 00000 0001111111110 0000000000 001111111 1223445666666655443 33333443221 Q ss_pred --cHHHHHHHHHHHHHHhcC----CCcEEEEEecCCCCCchhHHHHHHH-------hhccCCccEEEEEcCeeEeccccc Q lcl|NC_011270. 322 --AQPIQALVQQHVSAQSNN----KYERRAILGMDGSVTPVPSATRIAN-------AQSIKDQRVALISPSSFVYYAPEL 388 (581) Q Consensus 322 --~~~i~~~l~~~v~~~~~~----~~~~~avvg~~~~~~~~~~~~~~~~-------a~~~ns~r~~~v~~~~~~~~~~~~ 388 (581) ..+++..+.+||+++++. ..++.++++.....+.+.+.++... ...+++.|+++++|+..+.+...+ T Consensus 384 ~~~~~v~~~l~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~ 463 (660) T protein:vir:68 384 EVASTVQKHVVAIGDSRQDCLVLCSPPRAAVVGIPVNRAVDNLVDWRTASGTYTDNNFNISSTYAAIDGNYKYQYDKYND 463 (660) T ss_pred HHHHHHHHHHHHHHHhhCCeEEEEcccceeEecCCCCCCHHHHHHHHhhcccccccccccCcceEEEEcCceEEecccCC Confidence 236788899999888541 1123333344444444444333321 224789999999999888776554 Q ss_pred CCceecCHHHHHHHHHHHhhccchhcccccccccC---cccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccC Q lcl|NC_011270. 389 NREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRG---FSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDP 465 (581) Q Consensus 389 ~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g---~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~ 465 (581) ......|..++||.+|....+.++|+||.|+.+.+ +.+++..+++.|++.|+++||++++..+++++++ ||-+|+. T Consensus 464 ~~~~~p~sg~~AGl~Ar~d~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~-wG~rT~~ 542 (660) T protein:vir:68 464 VNRWVPLAADIAGLCARTDNISQPWMSPAGYNRGQILNVIKLAIETRQAQRDRLYQEAINPVTGTGGDGYVL-YGDKTAT 542 (660) T ss_pred ceEEechhHHHHHHHHHHhccCCcEEccCCeeeceeeccceeeecCChhHHHHHhhCCceEEEEecCCeEEE-EcceecC Confidence 44444555688888888888888999999997654 4567778999999999999999999888888876 5556766 Q ss_pred CC-cccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCC Q lcl|NC_011270. 466 TS-LHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQP 540 (581) Q Consensus 466 td-~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~ 540 (581) ++ +.|++|++||++++|+++|++.++| |++|||++.+|..|+..|+.||..||++|+|.+|. ...+++.++++ T Consensus 543 ~~~s~~~~i~vrR~~~~i~~si~~~~~~--~v~epn~~~~~~~i~~~i~~~L~~l~~~gal~gf~V~~d~~~nt~~~i~~ 620 (660) T protein:vir:68 543 SVPSPFDRINVRRLFNMVKTNIGSASKY--RLFELNNAFTRSSFRTETSQYLQGIKALGGVYNFKVVCDTTNNTPAVIDR 620 (660) T ss_pred CCCcccceEehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEecCCCCHHHhhC Confidence 65 5799999999999999999999876 56699999999999999999999999999999985 34567778899 Q ss_pred CEEEEEEEEEecCceeEEEEEEEEEeccce---EEEEEee Q lcl|NC_011270. 541 DVIEVRYEWRPAYPLNYIVVRYSIAPETGD---ITSTIEG 577 (581) Q Consensus 541 ~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~---~~~~~~~ 577 (581) ++++++|.++|++|+|||.++|+.....-+ +-..+-| T Consensus 621 G~~~~~i~~~p~~pae~i~l~~~~~~~~~~~~e~~~~v~~ 660 (660) T protein:vir:68 621 NEFVATFYLQPARSINYITLNFVATATGADFDELIGAVGG 660 (660) T ss_pred CeEEEEEEEEecCCcceEEEEEEEeecCccHHHHHHhhcC Confidence 999999999999999999999954332211 1122222 No 29 >protein:vir:101804 Length: 663 # NCBI annotation: gp18 # Family: family:all:661 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238881;genbank:gi:66391956;genbank:GeneID:3416631 Probab=100.00 E-value=1.2e-38 Score=228.47 Aligned_cols=557 Identities=12% Similarity=0.068 Sum_probs=239.8 Q ss_pred ccccCCCcccccCccc------------ccccccccCceeeEEEe-cCC----CCceeeeeEEcC-------cCCceeeE Q lcl|NC_011270. 6 SQYQTPGVYTEAVGAP------------QLGIRSSVPTAVAIFGT-AVG----YQTYRESIRINP-------DTGETITT 61 (581) Q Consensus 6 ~~~~~~~~~~~~~g~~------------~~~~~~~~~~~~~~~~~-~~g----~~~~~~~~~~~~-------~~~~~~ev 61 (581) =.+.-|++|.+-...+ +++...+.|...++.-+ ... |.+.+..-...- ..+...-| T Consensus 1 ~~~~~Pgvyv~e~~~~~~i~~~~t~~~~~vG~~~~Gp~~~p~~v~~~~~~~~~fg~~~~~~~~~~~~~~~f~ngg~~~~v 80 (663) T protein:vir:10 1 MALLSPGIEMKETSINSTVVRSATGRAAIVGKFAWGPAYEVRQVTNEVELVDMFGSPDNVTAPYFMSAMNFLQYGNDLRL 80 (663) T ss_pred CceecCceEEEEecCCccccccCcccceeEeecccCCCCccEEecCHHHHHHhcCCcCCcchhHHHHHHHHHhCCCeEEE Confidence 2234455555422100 12222222211111000 000 011000000000 00111111 Q ss_pred EEEEEe-----cccc--ceeEEEEeCce---eccccccCCCHHHHHHHHHhcCCCCcc--eEEEEcC------------- Q lcl|NC_011270. 62 QILALV-----GEPT--GGSFKLSLAGE---PTGNIPFNATQGQVQSALRALPNVEDD--EVTVLGD------------- 116 (581) Q Consensus 62 q~v~~~-----~~~~--~GtF~l~~~g~---~T~~i~~~asa~~v~~aLe~l~~i~~~--~V~~~~~------------- 116 (581) -+|.-. ..+. +...++...+. ....|........+... -....+... .+.+... T Consensus 81 vRv~~~~~~~~a~~~~~~~~~~~~~~~~~~~~g~~i~v~~~~~~~~~~-~~~~~~~~~~~~~~v~~~ta~~~~~~~~v~~ 159 (663) T protein:vir:10 81 VRVIDMEKAKNASPLVNQVSVTITTEGQGYTVGDAITVKYNNATITEA-GKVTAVDSDGKIKSLFVPTAEIIAKTRQLGT 159 (663) T ss_pred EEccCCccccccccccccceeEEeeccccccccccccccccccccccc-ccceeeecccceEEEeecccccccccccccc Confidence 111000 0000 11112111111 00011111000000000 000000000 0000000 Q ss_pred ---CCceEEEEecCCcccc--ccccceeccCCCceEEEEEcccccceeeeccccc-cccceeeeeccccccceeeEeccc Q lcl|NC_011270. 117 ---PGGPWTVTFTKAVAAL--TKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKK-GIKTDTIRVVNPNSGQVYVLGTDY 190 (581) Q Consensus 117 ---~g~~w~Vtf~g~~~~l--~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~-~~~~~~~~l~~~~~~~~~vtgtd~ 190 (581) ....|.++|....+.. .........+. .+.+................. ..+.....+.....+.. +.. T Consensus 160 ~~~~~~~~~~~~s~~s~~~~~a~~v~~v~~d~--~~~v~~~~~a~~~~t~~~~~~~~~~~~~~~i~A~~~G~~---Gn~- 233 (663) T protein:vir:10 160 YPTLGDNWRIDVSGASGGSAAALALGNIVVDS--GVTFGNSEDAPAVMTSPAVMEKYAKFGMPLISAVYPGEI---GST- 233 (663) T ss_pred ceeeccceeeEeeeccCccccccccceecccc--ceEEeeccccccccccccccccccccccceEEeccCCcc---cce- Confidence 0011222221100000 00000000000 011111100000000000000 00000000000000000 000 Q ss_pred eeEEeecccccccCcceeee-eeeeeeecc--------cccccceeEEEEeecCCc-ccceeEeccCcchhhhhhhh--- Q lcl|NC_011270. 191 VVTRVNAGEDGEANTRDDLY-TIQRVVDGG--------HIDPGDIVQLSYRYTDPN-YHEVIRFTDPDDIQDFYGPA--- 257 (581) Q Consensus 191 ~v~~v~~~~dg~~~~~~~~~-ti~~~vd~~--------~~d~~~~~~~s~~~~~~~-~~e~~~~~d~~~~~~~~~~a--- 257 (581) ..+......+........ ......... .....+...+.....+.. +.......... ....+.. T Consensus 234 --i~V~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~--~~~~~~~~~~ 309 (663) T protein:vir:10 234 --VEVEIVSKTAFNSGAQQTIYPFGGTRTSNARGVIQYGPMTDDQFAIIVRRDGIVVESTVLSTRKGD--RDVYGSNIFM 309 (663) T ss_pred --eeeeeccccccccccccceecccccccccccceeecccccccceeeEeecCCcceeeecccccccc--cccccchhhh Confidence 000000000000000000 000000000 000000000000011000 00000000000 0000000 Q ss_pred hhhhccccccceeeeeeeecCCcceeEEeeeccC-CcccchhhHHHHHHHHhcCCc---eEEEEeCC------CcHHHHH Q lcl|NC_011270. 258 FDEAGNVQSEITLCAQLAITNGASTILACAVDPE-GDTVTMGDYQNALNKFRDEDE---IAIIVAGT------GAQPIQA 327 (581) Q Consensus 258 ~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~-~~~~t~~dy~~al~~l~~~~~---~~iv~~~t------~~~~i~~ 327 (581) .....+..+.+..........+....+....+.+ ....+..||..+++.|++.+. ..++++.. ....++. T Consensus 310 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gg~d~~~~~~~~~~~~~~~~l~~~~~~~~~~~i~~~~~~~~~~~~~~v~~ 389 (663) T protein:vir:10 310 DDYFRNGGSNFIFASSEGWPAGFTGIIQLGGGTSANADVGADELIKGWDLFSDREALHVNLMIAGACGSDGAEIASTVQK 389 (663) T ss_pred hhhhcCCcceEEEEeecccCccccceeEeccccCCccccchhhhHHHHHhhhcccccceeEEEeccCCCCchhhHHHHHH Confidence 0011111222211111111111222222221111 123467889999999887642 22333321 1245778 Q ss_pred HHHHHHHHHhcCCCcEEEEEecCCCCC-----chhHHHHH-------------HHhhccCCccEEEEEcCeeEecccccC Q lcl|NC_011270. 328 LVQQHVSAQSNNKYERRAILGMDGSVT-----PVPSATRI-------------ANAQSIKDQRVALISPSSFVYYAPELN 389 (581) Q Consensus 328 ~l~~~v~~~~~~~~~~~avvg~~~~~~-----~~~~~~~~-------------~~a~~~ns~r~~~v~~~~~~~~~~~~~ 389 (581) .+.+||++++ .++++++.+.... ....+... .....++++|.++++|+.++.+...+. T Consensus 390 ~l~~~a~~~~----~~~ai~d~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~ 465 (663) T protein:vir:10 390 YVVSLADDRQ----DCVAIVNPPAELMVGIPTSTAVKNIVEWRNGMTGSGEVVDNNMNISSTYAFIIGNYKYQYDKYNDI 465 (663) T ss_pred HHHHHHHhhC----CEEEEEecCcccccccccccchHHHHHHHHhccccccchhhhcccCccceEEEcCceEEecccCCc Confidence 8888888774 4899998764321 11122222 223457899999999988877665444 Q ss_pred CceecCHHHHHHHHHHHhhccchhccccccc---ccCcccccccCCHHHHHHHHhCCcEEEEEeCC-CeEEEEEeeeccC Q lcl|NC_011270. 390 REVVLGGQFMAAAVAGKSVSAIAAMPLTRKV---IRGFSGPAEVQRDGEKSRESSEGLMVIEKTPR-NLVHVRHGVTTDP 465 (581) Q Consensus 390 ~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~---l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~-~~v~i~~~itT~~ 465 (581) .....|..++|+.+|.+.....+|+||.|+. |.|+.+++..+++.|++.|+++|+++++..++ +++++ ||.+|+. T Consensus 466 ~~~~p~sg~vAGl~Ar~D~~~g~~~sPan~~~~~i~g~~~~~~~~~~~e~~~Ln~~gin~i~~~~~~~G~~~-wG~rT~~ 544 (663) T protein:vir:10 466 NRWVPLAADIAGLCAYTDQVSHPWMSPAGYRRGQIRNCIKLAIEPKQSMRDTMYQVAINPVTGFAGGDGFVL-FGDKMAT 544 (663) T ss_pred eEEechhHHHHHHHHHhhccCCceEccCCceeccccccccceeecChhHHHHHhhCCceEEEEEeCCCcEEE-EcccccC Confidence 3334455577888888888888999999986 45667788899999999999999999987776 57765 6778876 Q ss_pred CC-cccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCC Q lcl|NC_011270. 466 TS-LHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQP 540 (581) Q Consensus 466 td-~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~ 540 (581) ++ +.|++|++||++++|+++|++.++| |++|||++.+|..|+..|..||.+||++|+|.+|. ...+++.+++. T Consensus 545 ~~~s~~~~i~vrR~~~~i~~si~~~~~~--~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~ 622 (663) T protein:vir:10 545 QVPSPFDRINVRRLFNMLKKNIGDTSKY--ELFENNDAFTRQSFRMETSQYLDGIRSLGGCYDFRVVCDTTNNTPNVIDR 622 (663) T ss_pred CCCcccceEehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhC Confidence 65 5899999999999999999999876 67799999999999999999999999999999985 34567778999 Q ss_pred CEEEEEEEEEecCceeEEEEEEEEEeccceEEEEEeecccC Q lcl|NC_011270. 541 DVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 541 ~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~~~~~~~~~~ 581 (581) |+++++|.++|++|+|||+++|+....--+++ .|-|-.-- T Consensus 623 G~~~~~i~~~p~~pae~i~~~~~~~~~~~~~~-e~~~~~~~ 662 (663) T protein:vir:10 623 NEFVGTIYVKPPRSINYITLNMVATSTGANFD-ELIGPMQL 662 (663) T ss_pred CeEEEEEEEEecCCcceEEEEEEEeecCccHH-HHHHHHhc Confidence 99999999999999999999985432221110 00000000 No 30 >protein:vir:5711 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839870;genbank:gi:30065725;genbank:GeneID:1260618 Probab=100.00 E-value=3.3e-41 Score=242.49 Aligned_cols=372 Identities=13% Similarity=0.063 Sum_probs=216.5 Q ss_pred eccccccceeeEeccceeEEeecccccccCcceeeeeee-eeeecccccccceeEEEEeecCCcccceeEeccCcchhhh Q lcl|NC_011270. 175 VVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQ-RVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDF 253 (581) Q Consensus 175 l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~-~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~ 253 (581) ..... .++++...+.-+-.+ ....+........ +..+ +...+.. .+....+...+.. .+.+...+... T Consensus 1 m~~~~-~GV~v~e~~~g~~~i-----~~v~tav~~~vg~a~~~d-~~~~~~~---~pv~i~s~~~~~~-~~g~~~tl~~a 69 (396) T protein:vir:57 1 MSDYH-HGVQVLEINDGTRVI-----STVSTAIVGMVCTASDAD-AETFPLN---KPVLITNVQSAIA-KAGKKGTLAAS 69 (396) T ss_pred CCCCC-CceEEEEcCCCcccc-----cccCCceEEEEEeccCCC-cccccCc---cCeEeecchhhhh-hcccccchHHH Confidence 01110 112211111100000 0000000000000 0000 0000000 0001111111100 01111111111 Q ss_pred hhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCCceEEE------EeCCCcHHHHH Q lcl|NC_011270. 254 YGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDEDEIAII------VAGTGAQPIQA 327 (581) Q Consensus 254 ~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv------~~~t~~~~i~~ 327 (581) +-..++..+. .-.................+.+.....+........+++.+|.+.+....+ +|......+++ T Consensus 70 l~~~~~~~~~--~~~vv~~~~~~~~~~~~~~a~t~~~iiG~~~~~~~~tgl~al~~~~~~~~~~p~i~~ap~~~~~~v~~ 147 (396) T protein:vir:57 70 LQAIADQSKP--VTVVVRVEDGTGDDEETKLAQTVSNIIGTTDENGQYTGLKALMGAESVTGVKPRILGVPGLDTKEVAV 147 (396) T ss_pred HHHhhhcCCc--eeEeeeccccccccccccccccceeeeeeccccccchhhhhhhhcccceeEEeccccCcccchhHHHH Confidence 1101111000 000000000000000000000000000111122344555565555443333 34556677888 Q ss_pred HHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHh Q lcl|NC_011270. 328 LVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKS 407 (581) Q Consensus 328 ~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~ 407 (581) ++.+||+++. +++++.++...+ .+..++....+++.|+++++|+....+...+......|+.++||.+|... T Consensus 148 al~~~~~~~~-----~~~~~d~p~~~~---~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d 219 (396) T protein:vir:57 148 ALASVCQELN-----AFGYISAWGCKT---ISEVKAYRQNFSQRELMVIWPDFLAWDTVTSTTATAYATARALGLRAKID 219 (396) T ss_pred HHHHHhhhCc-----eEEEEcCCCCCC---HHHHHHHHhccCCceEEEEcceeeeecccCCceeEEehhHHHHHHHHHhh Confidence 8999998763 688887765543 34456677789999999999988776654443333444557888888888 Q ss_pred hccchhcccccccccCcccccccC------CHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCCCcccceEEeehhhHH Q lcl|NC_011270. 408 VSAIAAMPLTRKVIRGFSGPAEVQ------RDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDV 481 (581) Q Consensus 408 a~~~~~~slt~~~l~g~~~~~~~~------t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~td~~~~~i~v~R~~d~ 481 (581) ...++++||.|++|.|+.++...+ ++.|.+.|+++|++++. +++++++ ||-+|+.+|++|++|++||++|+ T Consensus 220 ~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~t~~--~~~G~~~-wG~rT~~~d~~~~~i~vrR~~~~ 296 (396) T protein:vir:57 220 QEQGWHKTLSNVGVNGVTGISASVFWDLQKPGTDADLLNEAGVTTLV--RRDGFRF-WGNRTCSDDPLFLFESYTRTAQV 296 (396) T ss_pred hccCcEeccCCceeccccccceecccccCCcchhhhhhhhcCcEEEE--cCCCEEE-EcccccCCCcccceeehhhHHHH Confidence 888899999999999998876654 46789999999999994 3456765 67778888999999999999999 Q ss_pred HHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCcc----ceeEEeecCCCEEEEEEEEEecCceeE Q lcl|NC_011270. 482 MVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPDVIEVRYEWRPAYPLNY 557 (581) Q Consensus 482 i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~----~~~~~~~~~~~~~~v~i~v~pv~~~e~ 557 (581) |++.|++.+++ |++|||++.+|..|+..|+.||+.||++|+|.+|+. ..+++.+++.|+++++|.++|++|+|| T Consensus 297 i~~~i~~~~~~--~v~e~n~~~~~~~i~~~i~~~l~~l~~~gal~g~~v~~d~~~n~~~~i~~G~~~~~v~~~p~~p~e~ 374 (396) T protein:vir:57 297 LADTMAEAHMW--AIDKPITATLIRDIIDGINAKFRELKNNGYIVDGTCWFSEESNDAETLKAGKLYIDYDYTPVPPLEN 374 (396) T ss_pred HHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcce Confidence 99999999864 788999999999999999999999999999999863 345666788999999999999999999 Q ss_pred EEEEEEEEeccceEEEEEeecccC Q lcl|NC_011270. 558 IVVRYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 558 I~~~~~~~~~tg~~~~~~~~~~~~ 581 (581) |.+++++.++- -..+ T Consensus 375 I~~~~~~~~~~---------~~~~ 389 (396) T protein:vir:57 375 LTLRQRITSRY---------LASL 389 (396) T ss_pred EEEEEEEchHH---------HHHH Confidence 99999876543 3333 No 31 >protein:vir:80984 Length: 666 # NCBI annotation: gp18 tail sheath protein # Family: family:all:661 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469499;genbank:gi:157311456;genbank:GeneID:5602117 Probab=100.00 E-value=1.2e-39 Score=234.01 Aligned_cols=524 Identities=14% Similarity=0.097 Sum_probs=224.8 Q ss_pred CeeccccccCCCcccccCccccccccc-ccCceeeEEEecCCCC---ceeeeeEEcCcCCceeeEEEEEEeccccceeEE Q lcl|NC_011270. 1 MAIDFSQYQTPGVYTEAVGAPQLGIRS-SVPTAVAIFGTAVGYQ---TYRESIRINPDTGETITTQILALVGEPTGGSFK 76 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~g~~---~~~~~~~~~~~~~~~~evq~v~~~~~~~~GtF~ 76 (581) ++..|-.+.-. .|...+..+-....+ ...+........+|-. +.....+..+. ..+. .+.... T Consensus 67 ~~~~f~~~g~~-~~v~R~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~---------~g~~~~ 133 (666) T protein:vir:80 67 SGANFLQYGND-LRVVRVLNKEKAKNATALAGNIEFEITNEGSNYEVGDTIKIKHNRQ---DIET---------AGKVTK 133 (666) T ss_pred HHHHHhcCCCe-EEEEEecCccccccccccccceeEEEeeccccccccccccccccCc---cccc---------CcceEE Confidence 11111111000 111111000000000 0011111111222211 11111110000 0000 001111 Q ss_pred EEeCceecc-ccccCCCHHHHHHHHHhcCCCCcceEEEEcCCCceEEEEecCCccc----cccccc----ee-------c Q lcl|NC_011270. 77 LSLAGEPTG-NIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVAA----LTKDVT----GL-------T 140 (581) Q Consensus 77 l~~~g~~T~-~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~g~~w~Vtf~g~~~~----l~~~~~----~l-------~ 140 (581) +...+..+. .++ ++.-+.. ....+..... ...|..+|....+. +.+... .+ . T Consensus 134 i~~~~~~~~~~~~---ta~~~~~----a~~~~~~~~v-----~~~~~~~~~~~~~~~~~a~~V~~~~~~~~~~~~~~~~a 201 (666) T protein:vir:80 134 VDGDGKVKGVFIP---TGKIIAH----AKAIGVYPEL-----DGDWTAEFTSSSGNGSAALSVTKIVTDSGLLLTDLETS 201 (666) T ss_pred Eeecceeeeeecc---hhhhccc----ccccccccee-----eccceeeeccccccceeeeeeeeeecCCccceeeeccc Confidence 111111000 000 0000000 0000000000 00112222100000 000000 00 0 Q ss_pred cCCCceEEEEEcc--cccceeeeccccccccc-eeeeeccccccceeeEeccceeEEeecccc-cccCcceeeeeeeeee Q lcl|NC_011270. 141 GGDDPDLNIASEQ--TGVPAMNRALAKKGIKT-DTIRVVNPNSGQVYVLGTDYVVTRVNAGED-GEANTRDDLYTIQRVV 216 (581) Q Consensus 141 ~g~~~~v~v~~~~--~g~~~~~~~~~~~~~~~-~~~~l~~~~~~~~~vtgtd~~v~~v~~~~d-g~~~~~~~~~ti~~~v 216 (581) ++......+.... .+.+...... ...... ....+.+.. ..... . .. ...... .......+..+ T Consensus 202 ~~~~t~~~~~~~~~~~~~~a~~a~~-~g~~g~~l~v~i~~~~--~~~~~-~-~~---~~~~~~~~~~~~~~~~~~----- 268 (666) T protein:vir:80 202 RANITNQTFLTKLQKYDMPAVSAIY-AGEIGNSLEVEILARS--AFKNT-A-PD---LTMYPYGGERTAARNLIP----- 268 (666) T ss_pred cccccccccccccccccchhhhhhc-ccccccceeeeecccc--ccccc-c-cc---ceeeeccccccccceeee----- Confidence 0000000000000 0000000000 000000 000000000 00000 0 00 000000 00000000000 Q ss_pred ecccccccceeEEEEeecCCcccceeEeccCcchhhh-----hhhhhhhhccccccce-eeee---------eeecCCcc Q lcl|NC_011270. 217 DGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDF-----YGPAFDEAGNVQSEIT-LCAQ---------LAITNGAS 281 (581) Q Consensus 217 d~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~-----~~~a~~~~g~~~~~i~-~~~~---------~~~~~g~~ 281 (581) .... ......+.....+ ...|.+......+.... +...+ .......+. ...+ ..+.+|.. T Consensus 269 -~~~~-~~~~~~~~v~~~g-~~~e~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~ 343 (666) T protein:vir:80 269 -YAPQ-NDNQYAFIVRRDG-VVVESYVLSTLKGDKDVYGNSIYMDDF--FGRGSSQYIYATAQGWVDGFSGIISLAGGVS 343 (666) T ss_pred -eccc-cccceeeEeccCC-ccceeeecccccccccccchhhhhhhh--hccccceeeeecccccccccceEEEecCCCC Confidence 0000 0000000000000 11111111111111000 01000 000011100 0000 11111211 Q ss_pred eeEEeeeccCCcccchhhHHHHH--HHHhcCCceEEE-EeC-----CCcHHHHHHHHHHHHHHhcC----CCcEEEEEec Q lcl|NC_011270. 282 TILACAVDPEGDTVTMGDYQNAL--NKFRDEDEIAII-VAG-----TGAQPIQALVQQHVSAQSNN----KYERRAILGM 349 (581) Q Consensus 282 ~~~~~~~~~~~~~~t~~dy~~al--~~l~~~~~~~iv-~~~-----t~~~~i~~~l~~~v~~~~~~----~~~~~avvg~ 349 (581) ......... +......++..+. .++++.+.+.++ +|+ .+.+.++..+.+||+++++. ..+++++++. T Consensus 344 ~~~~~~~~~-~~~~~~g~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~vd~ 422 (666) T protein:vir:80 344 ANEATTGGV-GADPFIGAMMQGWGLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDERQDCLVMVSPPRSTVVNI 422 (666) T ss_pred ccccccccc-ccccccccchhhhhhhhhhcccccceEeecCcCCcccchHHHHHHHHHHHHhhcceEEEeecceeEEeec Confidence 111000000 0011112233322 223333333444 443 23567888899999988641 2234455555 Q ss_pred CCCCCchhHHHHHHH-------hhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccc- Q lcl|NC_011270. 350 DGSVTPVPSATRIAN-------AQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVI- 421 (581) Q Consensus 350 ~~~~~~~~~~~~~~~-------a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l- 421 (581) .+..+.....++... ...++|.|.++++|+.++.+...+......|..++||.+|......++|+||.|+.+ T Consensus 423 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~l~~p~~~~~d~~~~~~~~~p~sg~~AGl~Ar~D~~~g~~~sPan~~~~ 502 (666) T protein:vir:80 423 PVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIAGLCARTDAVSQPWMSPAGYNRG 502 (666) T ss_pred CCCCCHHHHHHHHHhcccchhhhcccCcceEEEEcCceEEecccCCceeEechHHHHHHHHHHHhhcCCceEccCCeecc Confidence 554444444333221 235889999999998888776544433444556888888888888899999999964 Q ss_pred --cCcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCC-CcccceEEeehhhHHHHHHHHHHHhhhcCCCc Q lcl|NC_011270. 422 --RGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT-SLHTREWNIIGQQDVMVYRIRDYLDADGLIGM 498 (581) Q Consensus 422 --~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~t-d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~ 498 (581) .|+.+++..+++.|++.|+++||++++.+++++++++ |-+|... ++.|++|++||++++|++.|++.++| |++| T Consensus 503 ~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~g~G~~~w-G~rT~~~~~s~~~~i~vRRl~~~i~~si~~~~~~--~v~e 579 (666) T protein:vir:80 503 QIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILM-GDKTATTVPSPFDRINVRRLFNMLKKNIGDSSKY--KLFE 579 (666) T ss_pred eeeccccceeecChhHHHhhhhCCeeEEEEeCCCeEEEE-ccccCCCCCcccceeehhhHHHHHHHHHHHHHHH--hccC Confidence 4556788899999999999999999988888888775 4466554 45899999999999999999999886 5669 Q ss_pred cCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceEEEE Q lcl|NC_011270. 499 PIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITST 574 (581) Q Consensus 499 ~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~~~ 574 (581) ||++.+|..|+..|..||++||++|+|.+|. .+.+++.+++.|+++++|.++|++|||||++||+.. .+| T Consensus 580 pn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~di~~G~~~~~i~~~P~~Pae~I~~~~~~~-~~~----- 653 (666) T protein:vir:80 580 NNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPAKSINYIMLNFTAV-ATG----- 653 (666) T ss_pred CCCHHHHHHHHHHHHHHHHHHHhcCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEe-ecC----- Confidence 9999999999999999999999999999986 345677889999999999999999999999999633 233 Q ss_pred EeecccC Q lcl|NC_011270. 575 IEGTTSF 581 (581) Q Consensus 575 ~~~~~~~ 581 (581) .+| T Consensus 654 ----~~~ 656 (666) T protein:vir:80 654 ----SDF 656 (666) T ss_pred ----ccH Confidence 233 No 32 >protein:vir:1845 Length: 392 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052269;genbank:gi:9634076;genbank:GeneID:1262448 Probab=100.00 E-value=1.6e-41 Score=244.25 Aligned_cols=361 Identities=11% Similarity=0.082 Sum_probs=219.1 Q ss_pred eccccccceeeEeccceeEEeecccccccCcceeeeeeeeeee-cccccccceeEEEEeecCCcccceeEeccCcchhhh Q lcl|NC_011270. 175 VVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVD-GGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDF 253 (581) Q Consensus 175 l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd-~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~ 253 (581) ..+ .-+++++...+.-...+.. .++..... +....+ ++...+ ..+....+...+.... T Consensus 1 m~~-~~~Gv~v~e~~~g~~~i~~-----~~tav~g~-vgta~~~~~~~~~--------------~~~p~~its~~~~~~~ 59 (392) T protein:vir:18 1 MSD-FHHGTKVIEINDGTRVIST-----VATAIVGM-VWTASDADAETFP--------------LNEPVLITNVQSAIAK 59 (392) T ss_pred CCC-CCCCeEEEEcCCCceeeec-----cCcceeEE-EEeccCCCCcccc--------------cccceEeechHHHHhh Confidence 222 2244444333221111100 11111111 000000 000000 0111111111111111 Q ss_pred hhhhhhhhccccccceeeeeeeecCCcceeEEeeeccC----Ccccch---------hhHHHHHHHHhcCC------ceE Q lcl|NC_011270. 254 YGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPE----GDTVTM---------GDYQNALNKFRDED------EIA 314 (581) Q Consensus 254 ~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~----~~~~t~---------~dy~~al~~l~~~~------~~~ 314 (581) + |. ...+..+++..+.++...++...+... ...++. ..-..++.+|+..+ ... T Consensus 60 ~-------g~-~gtl~~al~~~~~ngg~~~~vv~v~~~~~~~~~~~t~~dliG~~~~~~~~tg~~al~~~~~~~~~~p~i 131 (392) T protein:vir:18 60 A-------GK-KGTLSASLQAIADQSKPVTVVVRVAEGTGDDAEAQTTSNIIGGTDENGKYTGIKALLTAEAVTGVKPRI 131 (392) T ss_pred c-------CC-CcchHHHHHHhhcccCceEEEecccccccccccccchhhheecccccchhhhHHHHHhhhhhhceeehh Confidence 1 10 001111222223333222222111000 000010 11112222232222 123 Q ss_pred EEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceec Q lcl|NC_011270. 315 IIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVL 394 (581) Q Consensus 315 iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~ 394 (581) +++|+.++..+++.+.+||++++ ++++++.....+ ..+.++....++|.|.++++|+....+...+...... T Consensus 132 l~ap~~~~~~v~~~l~~~~~~~~-----~~~~~d~~~~~~---~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p 203 (392) T protein:vir:18 132 LGVPGLDTQEVATALASVCISLR-----AFGYVSAWGCKT---ISEAMAYRENFSQRELMVIWPDFLAWDTTANATATAY 203 (392) T ss_pred cccCccchHHHHHHHHHHHhhcC-----cEEEEecCCCCC---HHHHHHHHhhccCceEEEEeCceeeecccCCceEEec Confidence 55677778889999999998763 577887655543 3444566678999999999998877665443333344 Q ss_pred CHHHHHHHHHHHhhccchhcccccccccCcccccccC------CHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCCCc Q lcl|NC_011270. 395 GGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQ------RDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSL 468 (581) Q Consensus 395 p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~------t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~td~ 468 (581) |..++||.+|.......+++||.|++|.|+.++...+ ++.|.+.|+++||+++. +++++++ ||-+|+.+|+ T Consensus 204 ~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~--~~~G~~~-wG~rT~~~d~ 280 (392) T protein:vir:18 204 ATARALGLRAYIDQTIGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVTTLV--RKDGFRF-WGNRTCSDDP 280 (392) T ss_pred hHHHHHHHHHhhhccCCceEccCCceeeceeecceecccccCCCcchhhhhhhcCceEEE--cCCCEEE-EcccccCCCc Confidence 4457778888888888899999999999998876654 46788999999999994 3556765 6777888899 Q ss_pred ccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCCCEEE Q lcl|NC_011270. 469 HTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQPDVIE 544 (581) Q Consensus 469 ~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~~~~~ 544 (581) +|++|++||++|+|+++|++.+++ |++|||++.+|..|+..|+.||++||++|+|.+|+ ...+++.+++.|+++ T Consensus 281 ~~~~i~~rR~~~~i~~~i~~~~~~--~v~e~n~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~ 358 (392) T protein:vir:18 281 LFLFENYTRTAQVLADTMAEAHMW--AVDKPITASLIRDIVDGINAKFRELKSNGYIVDGECWFDEESNDKETLKAGKLY 358 (392) T ss_pred ccceeehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhcCcccceEEEEecCCCCHHHhhCCeEE Confidence 999999999999999999999864 78899999999999999999999999999999975 334566678899999 Q ss_pred EEEEEEecCceeEEEEEEEEEeccce-EEEEEee Q lcl|NC_011270. 545 VRYEWRPAYPLNYIVVRYSIAPETGD-ITSTIEG 577 (581) Q Consensus 545 v~i~v~pv~~~e~I~~~~~~~~~tg~-~~~~~~~ 577 (581) ++|.++|++|+|||.+++++.++--+ +-..|-- T Consensus 359 ~~v~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 392 (392) T protein:vir:18 359 IDYDYTPVPPLESLTLRQRITDKYLVNLAESVNS 392 (392) T ss_pred EEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999999999988776422 1111100 No 33 >protein:vir:98553 Length: 395 # NCBI annotation: gp21 # Family: family:all:115 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958076;genbank:gi:41057373;genbank:GeneID:2744224 Probab=100.00 E-value=1.3e-40 Score=239.23 Aligned_cols=373 Identities=11% Similarity=0.077 Sum_probs=221.0 Q ss_pred eccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeec Q lcl|NC_011270. 139 LTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDG 218 (581) Q Consensus 139 l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~ 218 (581) +. ---+-|.|.+...|..++ ..+..+..+.. +. ..+... T Consensus 1 m~-~~~~GV~v~e~~~g~~~v------~~v~tav~~~v------------------gt-a~~~~~--------------- 39 (395) T protein:vir:98 1 MS-DFHHGTQVIEINDGTRVI------STVATAVVGMV------------------CT-ASDADA--------------- 39 (395) T ss_pred CC-CCCCCeEEEEcCCCcccc------cccCcceEEEE------------------ee-ccCCCc--------------- Confidence 11 001223333333332211 00111111110 00 001000 Q ss_pred ccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhcccccccee-eeeeeecCCcceeEEeeeccCCcccch Q lcl|NC_011270. 219 GHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITL-CAQLAITNGASTILACAVDPEGDTVTM 297 (581) Q Consensus 219 ~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~-~~~~~~~~g~~~~~~~~~~~~~~~~t~ 297 (581) ...+.. .+....+...... .+.+...+.+.+-..++..+ ..... ..............+...+...+.... T Consensus 40 -~~~p~~---~pv~v~s~~~~~~-~~g~~~tl~~al~~~~~~~~---~~~~vv~~~~~~~~~~~~~~a~~~~~i~g~~~~ 111 (395) T protein:vir:98 40 -TLFPLN---EPVLITNVQSAIA-KAGKKGTLAASLQAIADQSK---PVTVVVRVEDGTGDDEEAALAQTVSNIIGGTDE 111 (395) T ss_pred -cccccc---cceEeechHHhHh-hcccccchhhHHHHHhhccC---ceEEEeecccccccccccccccccccccccccc Confidence 000000 0001111111110 11111122221111111111 00000 000000000000111111111111111 Q ss_pred hhHHHHHHHHhcCCc------eEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCc Q lcl|NC_011270. 298 GDYQNALNKFRDEDE------IAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQ 371 (581) Q Consensus 298 ~dy~~al~~l~~~~~------~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~ 371 (581) ..-.+++++|++.+. ..+++|+..+..+++++.+||++++ +++++.++...+ ..+.++....++|+ T Consensus 112 ~~~~Tgl~al~~~~~~~~~~p~il~ap~~~~~~v~~al~~~~~~~~-----~~~~~d~p~~~t---~~~a~~~~~~~~s~ 183 (395) T protein:vir:98 112 NGKYTGIKALLTAQAVTGVKPRILGVPGLDTKEVAVALASAAIKLR-----AFAYVSAWGCKT---ISEAMEYRKNFSQR 183 (395) T ss_pred ccchhHHHHHhhhhhhhccchhhcccccccccHHHHHHHHHhhhcC-----cEEEEEcCCCCC---HHHHHHHHhccCCc Confidence 222345555554332 2345677777888899999998763 577887765443 34445667789999 Q ss_pred cEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccccCcccccccC------CHHHHHHHHhCCcE Q lcl|NC_011270. 372 RVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQ------RDGEKSRESSEGLM 445 (581) Q Consensus 372 r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~------t~~e~~~l~~~Gv~ 445 (581) |.++++|+....+...+......|..++||.+|......++++||.|+++.|+.++...+ ++.|++.|+++||+ T Consensus 184 ~~~~~~p~~~~~d~~~~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~i~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~ 263 (395) T protein:vir:98 184 ELMVIWPDFLAWDTVKNTTATAYATARALGLRAYIDQTVGWHKTLSNVGVQGVTGISASVFWDLQASGTDADLLNEAGVT 263 (395) T ss_pred eEEEEecceeEecccCCceeeechHHHHHHHHHHhhcccCcEeccCCceeecccccceecccccCCCcchHHhhhhcCcE Confidence 999999988776554433333344557778888888888899999999999998876654 47899999999999 Q ss_pred EEEEeCCCeEEEEEeeeccCCCcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCce Q lcl|NC_011270. 446 VIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNII 525 (581) Q Consensus 446 ~l~~~~~~~v~i~~~itT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI 525 (581) ++. +++++++ ||-+|+.+|++|++|++||++|+|++.|++.+++ |++|||++.+|..|+..++.||++||++|+| T Consensus 264 ~~~--~~~G~~~-wG~rT~s~d~~~~~i~~rR~~~~i~~~i~~~~~~--~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l 338 (395) T protein:vir:98 264 TLV--RKDGFRF-WGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMW--AVDKPITATLIRDIVDGINAKFRELKSNGYI 338 (395) T ss_pred EEE--cCCCEEE-EcccccCCCcccceeehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCce Confidence 994 3456764 6778888899999999999999999999999864 7789999999999999999999999999999 Q ss_pred eCCcc----ceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccce-----EEE Q lcl|NC_011270. 526 RGYRN----LKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGD-----ITS 573 (581) Q Consensus 526 ~~~~~----~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~-----~~~ 573 (581) .+|+. .+++..++..|+++++|.++|++|+|||.+++++.++--+ |-+ T Consensus 339 ~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~I~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:98 339 VEGKCWFDEESNDKETLKAGKLYIDYDYTPVPPLESLTLRQRITDKYLVNLAESVNS 395 (395) T ss_pred eceEEEEecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99863 3456667889999999999999999999999988765422 111 No 34 >protein:vir:6594 Length: 666 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891725;genbank:gi:33620670;genbank:GeneID:1725344 Probab=100.00 E-value=2.2e-39 Score=232.56 Aligned_cols=541 Identities=13% Similarity=0.074 Sum_probs=240.5 Q ss_pred CeeccccccCCCc---------ccccCcccccccccccCceeeEEEecCCCCceeee-eEEcCcCCceeeE-----EEEE Q lcl|NC_011270. 1 MAIDFSQYQTPGV---------YTEAVGAPQLGIRSSVPTAVAIFGTAVGYQTYRES-IRINPDTGETITT-----QILA 65 (581) Q Consensus 1 ~~~~~~~~~~~~~---------~~~~~g~~~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~~~~~~~~ev-----q~v~ 65 (581) -=|.+-.|-|... |...||.|..... ..-+.. ....-| |-.|. +|........+.. ..++ T Consensus 28 ~fvG~~~~Gp~~~p~~v~s~~~~~~~fG~~~~~~~--~~~~~~--~~f~ng-g~~~~vvrv~~~~~~~~~~~~~~~~~~~ 102 (666) T protein:vir:65 28 ALVGKFQWGPAFQIIQVTNEVELVNKFGQPDNNTA--DYFMSG--ANFLQY-GNDLRVVRVLNKEKAKNATALAGNVEFE 102 (666) T ss_pred eEEecccCCCCccCEEecCHHHHHHHcCCccccch--hHHHHH--HHHHhc-CceEEEEEccCcccccccccccCceeee Confidence 2234444455432 2335565542110 000000 001111 22232 2332111111000 0000 Q ss_pred --Eecc--ccceeEEEEeCceecc--c--cccCCCHHHHHHHHHhcCCCCcceEEEEcCCC------ceEEEEecCCcc- Q lcl|NC_011270. 66 --LVGE--PTGGSFKLSLAGEPTG--N--IPFNATQGQVQSALRALPNVEDDEVTVLGDPG------GPWTVTFTKAVA- 130 (581) Q Consensus 66 --~~~~--~~~GtF~l~~~g~~T~--~--i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~g------~~w~Vtf~g~~~- 130 (581) ..+. ..+-+.++.+.+..-. . ...+..+..... ..++...+. .....+ ..|...|....+ T Consensus 103 ~~~~g~~~~~g~~~~V~~~~~~~~~~~~~~~~~~~~~~~g~---~~~t~~~~~--~~~~~g~~~~l~~~~~~~~~~~~~~ 177 (666) T protein:vir:65 103 ITNEGSNYEVGDTIKIKHNRQDIETAGKVTKVDGDGKVKGV---FIPTGKIIA--HAKAIGVYPELDGGWTAEFTSSSGN 177 (666) T ss_pred EeeccccccccceEEEEeccccccccccccccccccccccc---ccccceeec--cccccCcceeEeeccceeecccCcc Confidence 0111 1122233333221100 0 000000000000 000000000 000000 012223321110 Q ss_pred ---ccccccceeccCCCceEEEEEcccccceeeec---ccccccc-ceeeeeccccccc---eee-Eecccee--EEeec Q lcl|NC_011270. 131 ---ALTKDVTGLTGGDDPDLNIASEQTGVPAMNRA---LAKKGIK-TDTIRVVNPNSGQ---VYV-LGTDYVV--TRVNA 197 (581) Q Consensus 131 ---~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~---~~~~~~~-~~~~~l~~~~~~~---~~v-tgtd~~v--~~v~~ 197 (581) .+.+.... ..|. +............... ......+ ...........+. +.. .+.+... ..... T Consensus 178 ~~~a~sv~~~~-~~g~---~~~~~~~~a~~~~~~~~~~~~~~~~~~~a~~A~~~g~~g~~i~v~i~~~~~~~~~~~~l~~ 253 (666) T protein:vir:65 178 GSAALSVTKIV-TDSG---LLLTDLETSRANITNQTFLTKLKKYDMPAVSAIYAGEIGNSLEVEILARSAFKNTAPDLTM 253 (666) T ss_pred cccceeeeecc-cccc---eeeeeecccccccccccccccccccccceeeeeeccccccceeEEeecccccccccccccc Confidence 00000000 0000 0000000000000000 0000000 0000000000000 000 0000000 00000 Q ss_pred ccc-cccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhh---hhhccccccceeee- Q lcl|NC_011270. 198 GED-GEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAF---DEAGNVQSEITLCA- 272 (581) Q Consensus 198 ~~d-g~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~---~~~g~~~~~i~~~~- 272 (581) ... ....... ..+....... ....+.....+ ...|.+......+......... +...+..+.+.... T Consensus 254 ~~~~~~~~~~~------~~~~~~~~~~-~~~~~~v~~~g-~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 325 (666) T protein:vir:65 254 YPYGGERTAAR------NLIPYAPQND-NQYAFIVRRDG-VVVESYVLSTLKGDKDVYGNSIYMDDFFARGSSQYIYATA 325 (666) T ss_pred cccccccccce------eeeccccccc-ccceeeeecCC-cccceeecccCcccccccchhhhhhhhhcccccceeeeec Confidence 000 0000000 0000000000 00000000111 1112221111111111000000 00001111111000 Q ss_pred ---------eeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCC---ceEEEEeC-----CCcHHHHHHHHHHHHH Q lcl|NC_011270. 273 ---------QLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDED---EIAIIVAG-----TGAQPIQALVQQHVSA 335 (581) Q Consensus 273 ---------~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~---~~~iv~~~-----t~~~~i~~~l~~~v~~ 335 (581) ...+.+|........ +..+......++..++++|++.+ ...+++|+ .++.++++++.+||++ T Consensus 326 ~~~~~~~~~~~~~~~g~~~~~~~~-~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~p~~~~~~~~~~~v~~~l~~~~~~ 404 (666) T protein:vir:65 326 QGWVDGFSGIISLAGGVSANEATT-GGVGADPFIGAMMQGWDLFAERESIHVNLLIAGACAGEGDAFSTVQKHAVSIGDE 404 (666) T ss_pred ccccccccceEEccCCCCcCcccc-cccccccccccHHHHHHHHhhhhhccCCceeecCcCCccchhHHHHHHHHHHHhh Confidence 011111111100000 00011123346778888877654 23334443 3356888889999988 Q ss_pred HhcCCCcEEEEEec--------CCCCCchhHHHHHHH-------hhccCCccEEEEEcCeeEecccccCCceecCHHHHH Q lcl|NC_011270. 336 QSNNKYERRAILGM--------DGSVTPVPSATRIAN-------AQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMA 400 (581) Q Consensus 336 ~~~~~~~~~avvg~--------~~~~~~~~~~~~~~~-------a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~A 400 (581) +++ ++++++. ....+.....++... ...++|+|.++++|+.++.+...+......|..++| T Consensus 405 ~~~----~~a~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~sg~vA 480 (666) T protein:vir:65 405 RQD----CLVMVSPPRSTVVNIPVTTAIDNLIAWREGSGNYNENNMNINTTYAVIDGNYKYQYDKYNDVNRWVPLAADIA 480 (666) T ss_pred ccc----eEEEeccccceeeecCCCCCHHHHHHHHHhcccccccccccCcceEEEEcCceEEecccCCceeEechHHHHH Confidence 854 5666553 333333333322221 124789999999998887766544433444445777 Q ss_pred HHHHHHhhccchhccccccccc---CcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCC-CcccceEEee Q lcl|NC_011270. 401 AAVAGKSVSAIAAMPLTRKVIR---GFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT-SLHTREWNII 476 (581) Q Consensus 401 a~vAgl~a~~~~~~slt~~~l~---g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~t-d~~~~~i~v~ 476 (581) +.+|.+.....+|+||.|+.+. |+.+++..+++.|++.|+++||++++..++++++++ |-+|..+ ++.|++|++| T Consensus 481 Gl~Ar~D~~~g~~~span~~~~~i~g~~~~~~~~~~~~~~~Ln~~gIn~i~~~~~~G~~~w-G~rT~~~~~s~~~~i~vr 559 (666) T protein:vir:65 481 GLCARTDAVSQPWMSPAGYNRGQIMNVVKLAIEPRKAHRDRLYQAAINPVIGAGGEGFILM-GDKTATTVPSPFDRINVR 559 (666) T ss_pred HHHHHHhccCCcEEccCCeecceeeccccceeecChhHHHhhhhCCceEEEEeCCCeEEEE-ecccCCCCCcccceEehh Confidence 8888888788899999999754 556778899999999999999999988888888765 5566554 4689999999 Q ss_pred hhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCCCEEEEEEEEEec Q lcl|NC_011270. 477 GQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQPDVIEVRYEWRPA 552 (581) Q Consensus 477 R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~~~~~v~i~v~pv 552 (581) |++++|+++|++.++| |++|||++.+|..|+..|..||++||++|+|.+|. ...+++.+++.|+++++|.++|+ T Consensus 560 R~~~~i~~si~~~~~~--~v~epn~~~l~~~i~~~i~~~L~~l~~~gal~g~~V~~d~~~nt~~~i~~G~~~~~i~~~p~ 637 (666) T protein:vir:65 560 RLFNMLKKNIGDSSKY--KLFENNDNFTRASFRMEVSQYLSTIRSLGGIYDFRVQCDTTNNTPDVIDRNEFVASMFIKPA 637 (666) T ss_pred hHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEec Confidence 9999999999999876 67799999999999999999999999999999986 34567778899999999999999 Q ss_pred CceeEEEEEEEEEeccceEEEEEeecccC Q lcl|NC_011270. 553 YPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 553 ~~~e~I~~~~~~~~~tg~~~~~~~~~~~~ 581 (581) +|||||+++|+... +| .+| T Consensus 638 ~pae~i~~~~~~~~-~~---------~~~ 656 (666) T protein:vir:65 638 KSINYIMLNFTAVA-TG---------SDF 656 (666) T ss_pred CCcceEEEEEEEee-cC---------ccH Confidence 99999999995432 22 235 No 35 >protein:vir:79141 Length: 391 # NCBI annotation: major tail seath protein gpFI # Family: family:all:115 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165276;genbank:gi:145708101;genbank:GeneID:5247152 Probab=100.00 E-value=6.4e-41 Score=240.95 Aligned_cols=360 Identities=12% Similarity=0.065 Sum_probs=222.6 Q ss_pred ccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccC Q lcl|NC_011270. 168 IKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDP 247 (581) Q Consensus 168 ~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~ 247 (581) |... ...++++...+..+-.+ ....+...........-+....+. .+.+..... T Consensus 1 M~~~-------~~pGv~v~e~~~~~~~i-----~~~~tav~~~vg~a~~a~~~~~p~--------------n~pv~iss~ 54 (391) T protein:vir:79 1 MPTD-------YHHGVRVVELNDGTRPI-----RTIETAVAGIVCTADDADAATFPL--------------DTPVLLTNP 54 (391) T ss_pred CCCC-------CCCCeEEEECCCCcccc-----cccCCceEEEEeeccccccccccc--------------ccCEEeccH Confidence 1100 01122222211100000 001111111100000000000010 011222222 Q ss_pred cchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCC----------cccchhhHHHHHHHHhcCCceE--- Q lcl|NC_011270. 248 DDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEG----------DTVTMGDYQNALNKFRDEDEIA--- 314 (581) Q Consensus 248 ~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~----------~~~t~~dy~~al~~l~~~~~~~--- 314 (581) .+....+ |+. ..+..+++..+.++...++........ +....+.-..+++.|+..+... T Consensus 55 ~~~~~~~-------g~~-gtl~~al~~~~~~gg~~~~vv~~~~~~~~~~~~~~~~g~~~~~~~~tGl~~l~~~~~~~~~~ 126 (391) T protein:vir:79 55 QAYIGKA-------GDK-GTLAHTLDAITDQTNPLTVVVRVAGGASEAETTSNLIGTTNAAGRYTGMKALLTARNRFGVA 126 (391) T ss_pred HHHHHhc-------CCc-cccchhhhhhhcccccceeeeccccccccccccccccccccchhhhHHHhhhhhhhhhhccc Confidence 2111111 110 112223333344444333332221110 0111223445566555544332 Q ss_pred ---EEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCc Q lcl|NC_011270. 315 ---IIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNRE 391 (581) Q Consensus 315 ---iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~ 391 (581) +++|+.+...+++.+.++|++++ ++++++.+...+. ...+.....+++.|+++++|+....+...+... T Consensus 127 p~~l~~p~~~~~~v~~al~~~~~~~~-----~~ai~d~p~~~t~---~~a~~~~~~~~s~~~a~~~P~~~~~d~~~~~~~ 198 (391) T protein:vir:79 127 PRILAVPGLDSLPVGTELVTIAQKLR-----AFAYLSAYGCQTK---EEAVAYRSNFGQREAMVMWPDFVGWDTAANAET 198 (391) T ss_pred chhhcCCccchhHHHHHHHHHHhhcC-----cEEEEECCCCCCH---HHHHHHHhccCCceeEEecceeeeecCcCCcee Confidence 33456667778888888887653 5778877654433 344566778999999999998877665544444 Q ss_pred eecCHHHHHHHHHHHhhccchhcccccccccCcccccccCCHH------HHHHHHhCCcEEEEEeCCCeEEEEEeeeccC Q lcl|NC_011270. 392 VVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDG------EKSRESSEGLMVIEKTPRNLVHVRHGVTTDP 465 (581) Q Consensus 392 ~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~t~~------e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~ 465 (581) ...|..++||.+|.......+++||.|++|.|+.++...++.. |.+.|+++||+++. +++++++ ||-+|+. T Consensus 199 ~~p~s~~~AG~~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~~I~t~~--~~~G~~~-wG~rT~~ 275 (391) T protein:vir:79 199 TLWATARAVGLRAKIDNDTGWHKTLSNVAVGGVTGLSRDVFWDLQDPATDAGYLNANEVTTLV--HRDGYRF-WGSRTCS 275 (391) T ss_pred eechHHHHHHHHHHhhhcccceeccCCceehhhhccccccccccccccchhhhhhhcCceEEE--CCCcEEE-EcccccC Confidence 4445557777777777777799999999999999888766543 45789999999984 3566765 5767888 Q ss_pred CCcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCcc----ceeEEeecCCC Q lcl|NC_011270. 466 TSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPD 541 (581) Q Consensus 466 td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~----~~~~~~~~~~~ 541 (581) +|++|++|++||++|+|+++|++.+++ |++|||++.+|..|+..|+.||+.||++|+|.+|+. +.++..+++.| T Consensus 276 ~d~~~~~i~~rR~~~~i~~~i~~~~~~--~v~epn~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G 353 (391) T protein:vir:79 276 ADPLFAFENYTRTAQVLADTMAEAHMW--ANDLPMTPTLVRDLLEGINAKLRMLTRNGYLLGGAAWFDADANSKDTLKAG 353 (391) T ss_pred CCcccceeehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeceEEEEecCCCCHHHhhCC Confidence 899999999999999999999999864 778999999999999999999999999999999863 34566678999 Q ss_pred EEEEEEEEEecCceeEEEEEEEEEeccce-----EEEE Q lcl|NC_011270. 542 VIEVRYEWRPAYPLNYIVVRYSIAPETGD-----ITST 574 (581) Q Consensus 542 ~~~v~i~v~pv~~~e~I~~~~~~~~~tg~-----~~~~ 574 (581) +++++|.++|++|+|||.+++++.++--+ |.++ T Consensus 354 ~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~a 391 (391) T protein:vir:79 354 QLAIDYDYTPVPPLENLTFRQRITDRYLMQFAEAVKAA 391 (391) T ss_pred EEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 99999999999999999999988776533 4444 No 36 >protein:vir:79181 Length: 390 # NCBI annotation: gp30, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111061;genbank:gi:134288772;genbank:GeneID:4960700 Probab=100.00 E-value=2.6e-41 Score=243.11 Aligned_cols=363 Identities=13% Similarity=0.094 Sum_probs=222.4 Q ss_pred ccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccC Q lcl|NC_011270. 168 IKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDP 247 (581) Q Consensus 168 ~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~ 247 (581) |... ...++++.......-.+ ....+........ . .+..... + ...+.+..++. T Consensus 1 M~~~-------~~~Gv~v~e~~~~~~~i-----~~~~tav~~~vg~-a-~dad~~~-------~-----p~n~pv~its~ 54 (390) T protein:vir:79 1 MPQD-------YHHGVRVIEINEGGRPI-----RSVSTAVLGVVCT-A-ADADASA-------F-----PLNTPVLLTNV 54 (390) T ss_pred Cccc-------cCCCeEEEEcCCCcccc-----cccCCceeEEEEe-c-CCCCccc-------c-----ccccceEeecH Confidence 1100 11122221111100000 0011111100000 0 0000000 0 00001111111 Q ss_pred cchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcc-c---------chhhHHHHHHHHhcCCc----- Q lcl|NC_011270. 248 DDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDT-V---------TMGDYQNALNKFRDEDE----- 312 (581) Q Consensus 248 ~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~-~---------t~~dy~~al~~l~~~~~----- 312 (581) .+ ....+|.. ..+..+++..+.++...++.+.+....+. . .......+|.+|+..+. T Consensus 55 ---~~----~~~~~g~~-~tL~~al~~~~~~~~~~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tgl~al~~~~~~~~~~ 126 (390) T protein:vir:79 55 ---VA----ALGKAGKK-GTLRRTLDAIGKQTKPLTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALGVK 126 (390) T ss_pred ---HH----HHHhcCCC-ccchhhhhhhcccccceEEEEeeccccccccccceeeecccccccchhhhhhhhhhhhhccc Confidence 11 11111111 11222334444455444443333211110 0 11122344444443332 Q ss_pred -eEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCc Q lcl|NC_011270. 313 -IAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNRE 391 (581) Q Consensus 313 -~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~ 391 (581) ..+++|..+...+++.+..||+++. ++++++.+...+. .+.++....+++.|.++++|+....+...+... T Consensus 127 p~il~ap~~~~~~v~~~l~~~a~~~~-----~~ai~D~p~~~t~---~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~ 198 (390) T protein:vir:79 127 PRILAAPGLDTQPVAAALAATAQSLR-----AMAYVSASGCKTK---EEAAAYRRQFGQREIMVIWPDWLGWDDTTNSTA 198 (390) T ss_pred cccccCCcccchHHHHHHHHhhhhcc-----eEEEEEccCCCCH---HHHHHHhcCCCCceEEEEcCceeecccccCcee Confidence 2344566677788888998887663 6888887655443 334566778999999999998877655444444 Q ss_pred eecCHHHHHHHHHHHhhccchhcccccccccCcccccccCCHH------HHHHHHhCCcEEEEEeCCCeEEEEEeeeccC Q lcl|NC_011270. 392 VVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDG------EKSRESSEGLMVIEKTPRNLVHVRHGVTTDP 465 (581) Q Consensus 392 ~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~t~~------e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~ 465 (581) ...|+.++||.+|.......+++||.|++|.|+.++...++.. |.+.|+++|++++. +++++++ ||-+|+. T Consensus 199 ~~p~s~~~Ag~~a~~D~~~g~~~spsN~~i~gi~~~~~~~~~~~~~~~~~a~~Ln~~gi~t~~--~~~G~~~-wG~rT~~ 275 (390) T protein:vir:79 199 VIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLV--NRNGFRF-WGERTCS 275 (390) T ss_pred EeehHHHHHHHHHhhhccCCcEEccCCceeeccceeeeeccccccccchhhhhhhhcCcEEEE--cCCCEEE-EeccccC Confidence 4455667888888888888899999999999998887766544 45689999999984 3566765 6777888 Q ss_pred CCcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCCC Q lcl|NC_011270. 466 TSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQPD 541 (581) Q Consensus 466 td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~~ 541 (581) +|++|++|++||++|+++++|++.+++ |++|||++.+|..|+..++.||..||++|+|.+|+ ...++..+++.| T Consensus 276 ~d~~~~~i~vrR~~~~i~~~i~~~~~~--~v~e~~~~~~~~~i~~~i~~~L~~l~~~gal~g~~v~~d~~~nt~~~i~~G 353 (390) T protein:vir:79 276 DDPKFAFENYTRTAQVAADSIAEAQMP--VVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILASG 353 (390) T ss_pred CCcccceeeehhhHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCC Confidence 899999999999999999999999864 67799999999999999999999999999999985 334566678899 Q ss_pred EEEEEEEEEecCceeEEEEEEEEEeccce-EEEEEee Q lcl|NC_011270. 542 VIEVRYEWRPAYPLNYIVVRYSIAPETGD-ITSTIEG 577 (581) Q Consensus 542 ~~~v~i~v~pv~~~e~I~~~~~~~~~tg~-~~~~~~~ 577 (581) +++++|.++|++|+|||.+++++.++-=+ +...|-+ T Consensus 354 ~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~v~~ 390 (390) T protein:vir:79 354 KAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred EEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999999999999988776422 2222222 No 37 >protein:vir:78206 Length: 390 # NCBI annotation: gp33, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111183;genbank:gi:134288694;genbank:GeneID:4960666 Probab=100.00 E-value=8.8e-41 Score=240.17 Aligned_cols=357 Identities=14% Similarity=0.108 Sum_probs=221.2 Q ss_pred ccceeeeeccccccceeeEeccceeEEeeccccc--ccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEec Q lcl|NC_011270. 168 IKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDG--EANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFT 245 (581) Q Consensus 168 ~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg--~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~ 245 (581) |... ...++++.. +..+... ...+..... + ...-..++.. . ...+.+.+. T Consensus 1 M~~~-------~~~Gv~v~e-------~~~g~~~i~~~~tav~g~-v---g~a~~ad~~~---------~-pln~pv~i~ 52 (390) T protein:vir:78 1 MPQD-------YHHGVRVIE-------INEGGRPIRSVSTAVLGV-V---CTAADADASA---------F-PLNTPVLLT 52 (390) T ss_pred Cccc-------ccCCeEEEE-------cCCCcccccccCcceeEE-E---EcccCcCccc---------c-ccccceEec Confidence 1100 011122111 1111110 000000000 0 0000000000 0 000111111 Q ss_pred cCcchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcc-cchhhH---------HHHHHHHhcCCc--- Q lcl|NC_011270. 246 DPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDT-VTMGDY---------QNALNKFRDEDE--- 312 (581) Q Consensus 246 d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~-~t~~dy---------~~al~~l~~~~~--- 312 (581) ...+ ....+|. ...+..+++..+.++...++.+.+....+. .+..++ ..++.+|+..+. T Consensus 53 s~~~-------~~~~~g~-~gtL~~al~~~~~~gg~~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~ 124 (390) T protein:vir:78 53 NVVA-------ALGKAGK-KGTLRRTLDAIGKQTKPLTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALG 124 (390) T ss_pred cHHH-------HHhhcCC-CceehhhhhhhccccCceEEEEEecccccccccccccccccccccccchhhhhhhhhhhhc Confidence 1111 1111111 112333445555555555555444221111 111111 123333332221 Q ss_pred ---eEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccC Q lcl|NC_011270. 313 ---IAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELN 389 (581) Q Consensus 313 ---~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~ 389 (581) ..+++|+.+...+++.+..||++++ ++++++.+...+ ..+.+.....++++|.++++|+....+...+. T Consensus 125 ~~p~il~ap~~~~~~v~~~l~~~a~~~~-----~~aivD~p~~~t---~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~ 196 (390) T protein:vir:78 125 VKPRILAAPGLDTQPVAAALAATAQSLR-----AMAYVSASGCKT---KEEAAAYRKQFGQREIMVIWPDWLGWDDTTNS 196 (390) T ss_pred ceehhhcccccchHHHHHHHHHhhcccc-----eEEEEecCCCCC---HHHHHHHhhccCCceEEEEcCceEeecccCCc Confidence 1234566677788888999987663 678888765443 33445667789999999999988765554333 Q ss_pred CceecCHHHHHHHHHHHhhccchhcccccccccCcccccccCCHHH------HHHHHhCCcEEEEEeCCCeEEEEEeeec Q lcl|NC_011270. 390 REVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGE------KSRESSEGLMVIEKTPRNLVHVRHGVTT 463 (581) Q Consensus 390 ~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~t~~e------~~~l~~~Gv~~l~~~~~~~v~i~~~itT 463 (581) .....|+.++||.+|.......+++||.|++|.|+.++..+++..+ .+.|+.+|++++. +++++++ ||-+| T Consensus 197 ~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~--~~~G~~~-wG~rT 273 (390) T protein:vir:78 197 TAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLV--NRNGFRF-WGERT 273 (390) T ss_pred ccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEE--cCCCEEE-Ecccc Confidence 3333444577777777777778999999999999999887766544 4678999999984 3456765 67788 Q ss_pred cCCCcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecC Q lcl|NC_011270. 464 DPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQ 539 (581) Q Consensus 464 ~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~ 539 (581) +.+|++|++|++||++|+++++|++.+++ |++|||++.+|..|+..++.||++||++|+|.+|+ ...++..+++ T Consensus 274 ~s~d~~~~~i~~rR~~~~i~~~i~~~~~~--~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~ 351 (390) T protein:vir:78 274 CSDDPKFAFENYTRTAQVAGDSIAEAQMP--VVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILA 351 (390) T ss_pred cCCCcccceeehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhh Confidence 88899999999999999999999999874 67799999999999999999999999999999985 3456677789 Q ss_pred CCEEEEEEEEEecCceeEEEEEEEEEecc-----ceEEE Q lcl|NC_011270. 540 PDVIEVRYEWRPAYPLNYIVVRYSIAPET-----GDITS 573 (581) Q Consensus 540 ~~~~~v~i~v~pv~~~e~I~~~~~~~~~t-----g~~~~ 573 (581) .|++++++.++|++|+|||.+++++.++- ++|.+ T Consensus 352 ~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:78 352 SGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999999999999999988762 22333 No 38 >protein:vir:103993 Length: 390 # NCBI annotation: phage tail sheath protein # Family: family:all:115 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293729;genbank:gi:72537699;genbank:GeneID:3608123 Probab=100.00 E-value=8.8e-41 Score=240.17 Aligned_cols=357 Identities=14% Similarity=0.108 Sum_probs=221.2 Q ss_pred ccceeeeeccccccceeeEeccceeEEeeccccc--ccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEec Q lcl|NC_011270. 168 IKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDG--EANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFT 245 (581) Q Consensus 168 ~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg--~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~ 245 (581) |... ...++++.. +..+... ...+..... + ...-..++.. . ...+.+.+. T Consensus 1 M~~~-------~~~Gv~v~e-------~~~g~~~i~~~~tav~g~-v---g~a~~ad~~~---------~-pln~pv~i~ 52 (390) T protein:vir:10 1 MPQD-------YHHGVRVIE-------INEGGRPIRSVSTAVLGV-V---CTAADADASA---------F-PLNTPVLLT 52 (390) T ss_pred Cccc-------ccCCeEEEE-------cCCCcccccccCcceeEE-E---EcccCcCccc---------c-ccccceEec Confidence 1100 011122111 1111110 000000000 0 0000000000 0 000111111 Q ss_pred cCcchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcc-cchhhH---------HHHHHHHhcCCc--- Q lcl|NC_011270. 246 DPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDT-VTMGDY---------QNALNKFRDEDE--- 312 (581) Q Consensus 246 d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~-~t~~dy---------~~al~~l~~~~~--- 312 (581) ...+ ....+|. ...+..+++..+.++...++.+.+....+. .+..++ ..++.+|+..+. T Consensus 53 s~~~-------~~~~~g~-~gtL~~al~~~~~~gg~~~~vv~v~~~~~~~~~~~~~ig~~~~~~~~tg~~al~~~~~~~~ 124 (390) T protein:vir:10 53 NVVA-------ALGKAGK-KGTLRRTLDAIGKQTKPLTVVVRVAEGKDADETTSNVIGTVTPDGKYTGIKALLAAQGALG 124 (390) T ss_pred cHHH-------HHhhcCC-CceehhhhhhhccccCceEEEEEecccccccccccccccccccccccchhhhhhhhhhhhc Confidence 1111 1111111 112333445555555555555444221111 111111 123333332221 Q ss_pred ---eEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccC Q lcl|NC_011270. 313 ---IAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELN 389 (581) Q Consensus 313 ---~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~ 389 (581) ..+++|+.+...+++.+..||++++ ++++++.+...+ ..+.+.....++++|.++++|+....+...+. T Consensus 125 ~~p~il~ap~~~~~~v~~~l~~~a~~~~-----~~aivD~p~~~t---~~~a~~~~~~~~s~~~~~~~p~~~~~d~~~~~ 196 (390) T protein:vir:10 125 VKPRILAAPGLDTQPVAAALAATAQSLR-----AMAYVSASGCKT---KEEAAAYRKQFGQREIMVIWPDWLGWDDTTNS 196 (390) T ss_pred ceehhhcccccchHHHHHHHHHhhcccc-----eEEEEecCCCCC---HHHHHHHhhccCCceEEEEcCceEeecccCCc Confidence 1234566677788888999987663 678888765443 33445667789999999999988765554333 Q ss_pred CceecCHHHHHHHHHHHhhccchhcccccccccCcccccccCCHHH------HHHHHhCCcEEEEEeCCCeEEEEEeeec Q lcl|NC_011270. 390 REVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQRDGE------KSRESSEGLMVIEKTPRNLVHVRHGVTT 463 (581) Q Consensus 390 ~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~t~~e------~~~l~~~Gv~~l~~~~~~~v~i~~~itT 463 (581) .....|+.++||.+|.......+++||.|++|.|+.++..+++..+ .+.|+.+|++++. +++++++ ||-+| T Consensus 197 ~~~~p~s~~~Agl~a~~D~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~ln~~gi~t~~--~~~G~~~-wG~rT 273 (390) T protein:vir:10 197 TAVIPAPAIAAGLRAKIDNDIGWHKTISNVVVNGVSGISADVSWDLQDPATDAGYLNEHEVTTLV--NRNGFRF-WGERT 273 (390) T ss_pred ccccchHHHHHHHHHHhhcCCCcEECcCCceeeceeecceecccccccccchhhhhhhcCcEEEE--cCCCEEE-Ecccc Confidence 3333444577777777777778999999999999999887766544 4678999999984 3456765 67788 Q ss_pred cCCCcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecC Q lcl|NC_011270. 464 DPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQ 539 (581) Q Consensus 464 ~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~ 539 (581) +.+|++|++|++||++|+++++|++.+++ |++|||++.+|..|+..++.||++||++|+|.+|+ ...++..+++ T Consensus 274 ~s~d~~~~~i~~rR~~~~i~~~i~~~~~~--~v~e~n~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~i~ 351 (390) T protein:vir:10 274 CSDDPKFAFENYTRTAQVAGDSIAEAQMP--VVDGPLNPSLARDIVESINGWFRQQVANGYLIGGSAWIDPEPNTADILA 351 (390) T ss_pred cCCCcccceeehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEccCCCCHHHhh Confidence 88899999999999999999999999874 67799999999999999999999999999999985 3456677789 Q ss_pred CCEEEEEEEEEecCceeEEEEEEEEEecc-----ceEEE Q lcl|NC_011270. 540 PDVIEVRYEWRPAYPLNYIVVRYSIAPET-----GDITS 573 (581) Q Consensus 540 ~~~~~v~i~v~pv~~~e~I~~~~~~~~~t-----g~~~~ 573 (581) .|++++++.++|++|+|||.+++++.++- ++|.+ T Consensus 352 ~G~~~~~v~~~p~~pae~I~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:10 352 SGKAYIDYDYTPVPPLENLVLRQRITDRFLADFPARVAG 390 (390) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 99999999999999999999999988762 22333 No 39 >protein:vir:6079 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878220;genbank:gi:33438919;genbank:GeneID:1457754 Probab=100.00 E-value=1e-40 Score=239.80 Aligned_cols=370 Identities=13% Similarity=0.098 Sum_probs=219.1 Q ss_pred eccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeec Q lcl|NC_011270. 139 LTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDG 218 (581) Q Consensus 139 l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~ 218 (581) ++. -.+.|.|.+...|..++ ..+..+...... .. .+.. T Consensus 1 m~~-~~~Gv~v~e~~~~~~~v------~~~~tav~~fvG------------------ta-~~~~---------------- 38 (396) T protein:vir:60 1 MSD-YHHGVQVLEINEGTRVI------STVSTAIVGMVC------------------TA-SDAD---------------- 38 (396) T ss_pred CCC-CCCCeEEEEcCCCcccc------cccCceeEEEEe------------------cc-cccc---------------- Confidence 211 11334444443333211 001111111110 00 0000 Q ss_pred ccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhccccccceeeeee--eecCCcceeEEeeeccCCcccc Q lcl|NC_011270. 219 GHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQL--AITNGASTILACAVDPEGDTVT 296 (581) Q Consensus 219 ~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~--~~~~g~~~~~~~~~~~~~~~~t 296 (581) +...+.. .+....+...+. ..+.+...+.+.+-..++..+ ....-..+ ...+......+.......+... T Consensus 39 ~~~~p~~---~p~~v~s~~~~~-~~~g~~~tl~~a~~~~~~~gg----~~~~vv~~~~~~~~~~~~~~~~~~~~~~~~~d 110 (396) T protein:vir:60 39 AEIFPLN---KPVLITNVQSAI-AKAGKKGTLAASLQAIADQSK----PVTVVVRVEDGTGEDEETKLAQTVSNIIGTTD 110 (396) T ss_pred cccccCc---cCeEeechHHHH-HhhcCcchhHHHHHHHhhccC----ceEEEEeccccccccccccccccccccccccc Confidence 0000000 000111111110 011111112221111111111 11111111 0011111111111111011111 Q ss_pred hhhHHHHHHHHhcCCc------eEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCC Q lcl|NC_011270. 297 MGDYQNALNKFRDEDE------IAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKD 370 (581) Q Consensus 297 ~~dy~~al~~l~~~~~------~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns 370 (581) ......++++|++.+. ..+++|+.....+++++.+||++++ ++++++.+...+ .++.++....+++ T Consensus 111 ~~~~~tg~~al~~~~~~~~~~~~il~ap~~~~~~v~~al~~~~~~~~-----~~~i~d~p~~~~---~~~a~~~~~~~~s 182 (396) T protein:vir:60 111 ENGQYTGLKALLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLR-----AFGYISAWGCKT---ISEVKAYRQNFSQ 182 (396) T ss_pred ccccccchhhhhhcccceeeeeeeccccccccHHHHHHHHHHhccCC-----eEEEEeCCCCCC---HHHHHHHHhhcCC Confidence 1122334444443332 2244567777888999999987663 678887765433 3445566678999 Q ss_pred ccEEEEEcCeeEecccccCCceecC-HHHHHHHHHHHhhccchhcccccccccCcccccccC------CHHHHHHHHhCC Q lcl|NC_011270. 371 QRVALISPSSFVYYAPELNREVVLG-GQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQ------RDGEKSRESSEG 443 (581) Q Consensus 371 ~r~~~v~~~~~~~~~~~~~~~~~~p-~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~------t~~e~~~l~~~G 443 (581) .|+++++|+....+... ...+.+| ...+||.+|......++++||.|++|.|+.++...+ +..|++.|+++| T Consensus 183 ~~~~~~~p~~~~~d~~~-~~~~~~p~s~~~AG~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~g 261 (396) T protein:vir:60 183 RELMVIWPDFLAWDTVA-STTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESG 261 (396) T ss_pred ceEEEEeCceeeecccC-CceeEEchhHHHHHHHHHhhhccCcEeCcCCceecceeeceeecccccCCCcchhhhhhhcC Confidence 99999999887765543 3344445 457777777777788899999999999998776544 568899999999 Q ss_pred cEEEEEeCCCeEEEEEeeeccCCCcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCC Q lcl|NC_011270. 444 LMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNN 523 (581) Q Consensus 444 v~~l~~~~~~~v~i~~~itT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~g 523 (581) |+++. +++++++ ||-+|+.+|++|++|++||++|++++.|++.+++ |++|||++.+|..|+..|+.||+.||++| T Consensus 262 I~~~~--~~~G~~~-wG~rT~~~d~~~~~i~~rR~~~~i~~~i~~~~~~--~v~e~n~~~~~~~i~~~i~~~l~~l~~~g 336 (396) T protein:vir:60 262 VTTLI--RRDGFRF-WGNRTCSDDPLFLFENYTRTAQVLADTMAEAHMW--AVDKPITATLIRDIVDGINAKFRELKTNG 336 (396) T ss_pred cEEEE--cCCCEEE-EcccccCCCcccceeehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCC Confidence 99994 3566765 6778888899999999999999999999999864 77899999999999999999999999999 Q ss_pred ceeCCc----cceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceEEEEEeecccC Q lcl|NC_011270. 524 IIRGYR----NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 524 aI~~~~----~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~~~~~~~~~~ 581 (581) +|.+|+ ...++..++..|+++++|.++|++|+|||.+++++.++ +-..+ T Consensus 337 al~g~~~~~d~~~nt~~~i~~G~~~~~i~~~p~~pae~I~~~~~~~~~---------~~~~~ 389 (396) T protein:vir:60 337 YIVDATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDK---------YLANL 389 (396) T ss_pred ceeceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchH---------HHHHH Confidence 999975 33456667889999999999999999999999988776 22222 No 40 >protein:vir:79092 Length: 477 # NCBI annotation: gp13, phage tail sheath protein # Family: family:all:115 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111213;genbank:gi:134288804;genbank:GeneID:4960766 Probab=100.00 E-value=3.4e-39 Score=231.50 Aligned_cols=418 Identities=11% Similarity=0.094 Sum_probs=215.8 Q ss_pred cccccceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceee-------------------------- Q lcl|NC_011270. 132 LTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYV-------------------------- 185 (581) Q Consensus 132 l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~v-------------------------- 185 (581) |. +.+. |-|.|.+...|..++...... + ....+...-.+....+ T Consensus 1 M~---~~~~----pGVyv~E~~~g~~~I~~v~Ts--v-~~~VG~a~~~p~n~pv~its~~d~~~~g~~~~~~tL~~Av~~ 70 (477) T protein:vir:79 1 MA---ANYL----HGVETIEKETGSRPVKVVKSA--V-IGLIGTAPIGPVNTPVQSLSDVDAAQFGPQLAGFTIPQALDA 70 (477) T ss_pred Cc---CCCC----CCeEEEEecCCcccccccCCc--e-EEEEeecccCCCcccEEEccHHHHHHhcCCCCCCcHHHHHHH Confidence 11 0111 123444444433222111000 0 0000000000000000 Q ss_pred ----EeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcc-hhhh---hhhh Q lcl|NC_011270. 186 ----LGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDD-IQDF---YGPA 257 (581) Q Consensus 186 ----tgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~-~~~~---~~~a 257 (581) .+...++ ++.............................................. ....+ ..+. .... T Consensus 71 ~f~ngg~~~~v--vrV~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~--~~~~~~~~~~~~~~~~~ 146 (477) T protein:vir:79 71 VYDYGSGTVIV--INVLDPAVHKSNAASESVTFDAATGRAKLAHPAAANLVLKNDSGGTTY--TEGTDYAVDLINGVITR 146 (477) T ss_pred HhhcCCceEEE--EeccCCccccccccccccccccccccccccccccceeEEeeccccccc--ccCccccccccchhhhh Confidence 0001111 111110000000000000000000000000000000000000000000 00000 0000 0000 Q ss_pred hhhhccccccceeeeeeeecCCcceeEEe-eeccCCcccchhhHHHHHHHHhcCCc------eEEEEeC-CCcHHHHHHH Q lcl|NC_011270. 258 FDEAGNVQSEITLCAQLAITNGASTILAC-AVDPEGDTVTMGDYQNALNKFRDEDE------IAIIVAG-TGAQPIQALV 329 (581) Q Consensus 258 ~~~~g~~~~~i~~~~~~~~~~g~~~~~~~-~~~~~~~~~t~~dy~~al~~l~~~~~------~~iv~~~-t~~~~i~~~l 329 (581) .. .+... .....+...+..+....... ...+.. .......++++|+..+. ..+..|+ +.+..+++.+ T Consensus 147 ~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~---~a~~~~tg~~al~~~~~~~~~~~~iv~apg~~~~~~v~~~l 221 (477) T protein:vir:79 147 IK-TGTIP-AAATAAKATYDYADPTKVTAADIIGAV---NAAGMRTGMKALKDTYNLYGYFSKILIAPAYCTQNSVSVEL 221 (477) T ss_pred hh-ccccc-cccceeeceeccCCcccceeeeecccc---cccccchhhhhhhhhhhhcccccceeeccccccchhHHHHH Confidence 00 00000 00001111111111100000 000000 11122223333332221 1233444 3566788999 Q ss_pred HHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHH----HhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHH Q lcl|NC_011270. 330 QQHVSAQSNNKYERRAILGMDGSVTPVPSATRIA----NAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAG 405 (581) Q Consensus 330 ~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~----~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAg 405 (581) .+||++++ ++++++.+............. ....++|.|+.+++|+....+...+......|...+||.+|. T Consensus 222 ~~~~~~~~-----~~a~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~~~~~~~~~~~p~s~~~ag~~a~ 296 (477) T protein:vir:79 222 EAMAVQLG-----AIAYIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHVKVYDIATNAERLEPLSSRAAGLRAR 296 (477) T ss_pred HHHHhhcC-----eEEEEecCCCCChHHHhhhhhhccccccccccceEEEEcCeeEEecccCCceeeechHHHHHHHHHH Confidence 99998663 788888765544433332222 123578999999999887766544443333444577777777 Q ss_pred HhhccchhcccccccccCcccccccC------CHHHHHHHHhCCcEEEEEeCCCeEEEEEeeecc---CCCcccceEEee Q lcl|NC_011270. 406 KSVSAIAAMPLTRKVIRGFSGPAEVQ------RDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTD---PTSLHTREWNII 476 (581) Q Consensus 406 l~a~~~~~~slt~~~l~g~~~~~~~~------t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~---~td~~~~~i~v~ 476 (581) ......+++||.|+++.|+.++...+ ++.|++.|+++||++++..++++++++.+ +|+ ..++.|+++++| T Consensus 297 ~d~~~g~~~span~~~~gv~~~~~~~~~~~~~~~~~~~~L~~~~i~~i~~~~~~G~~~wG~-rT~~~~~~~~~~~~i~vr 375 (477) T protein:vir:79 297 VDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSGLRLWGN-RTAAWPTVTHMRNFENVR 375 (477) T ss_pred hhccCCceEccCCceeecceecccccccccCCChhhHHHHhhCCceEEEEecCCcEEEEcc-cccCCCCCCccceeeehh Confidence 77777899999999999998876654 45789999999999998888888876544 555 346789999999 Q ss_pred hhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCcc----ceeEEeecCCCEEEEEEEEEec Q lcl|NC_011270. 477 GQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPDVIEVRYEWRPA 552 (581) Q Consensus 477 R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~----~~~~~~~~~~~~~~v~i~v~pv 552 (581) |++|+|++.|++.+++ |++|||++.+|.+|+..|++||+.||++|+|.+|+. ..+++.+++.|+++++|.++|+ T Consensus 376 R~~~~i~~~~~~~~~~--~v~e~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~~~~~~nt~~~i~~G~~~~~i~~~p~ 453 (477) T protein:vir:79 376 RTGDVINESLRYFSQQ--FVDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAWFDPARNPKEELAAGHLLINYKYTVP 453 (477) T ss_pred hHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCeEEEEEEEEec Confidence 9999999999999875 777999999999999999999999999999999863 3456677899999999999999 Q ss_pred CceeEEEEEEEEEeccceEEEEEeec Q lcl|NC_011270. 553 YPLNYIVVRYSIAPETGDITSTIEGT 578 (581) Q Consensus 553 ~~~e~I~~~~~~~~~tg~~~~~~~~~ 578 (581) +|+|||.+++++.++- +..--.|. T Consensus 454 ~p~e~i~~~~~~~~~~--~~~~~~~~ 477 (477) T protein:vir:79 454 PPLERLTYETEITSEY--LLTLKGGN 477 (477) T ss_pred CCceeEEEEEEEechH--HhhhccCC Confidence 9999999999886543 22222233 No 41 >protein:vir:5663 Length: 671 # NCBI annotation: tail sheath protein # Family: family:all:661 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899602;genbank:gi:34419589;genbank:GeneID:2545776 Probab=100.00 E-value=2.5e-38 Score=226.74 Aligned_cols=535 Identities=12% Similarity=0.023 Sum_probs=223.9 Q ss_pred Cee-ccccccCCCcccccCcccccccccccCc-eeeEEEecCCCCceeeeeEEcCcCCceeeEEEEEEeccccceeEEEE Q lcl|NC_011270. 1 MAI-DFSQYQTPGVYTEAVGAPQLGIRSSVPT-AVAIFGTAVGYQTYRESIRINPDTGETITTQILALVGEPTGGSFKLS 78 (581) Q Consensus 1 ~~~-~~~~~~~~~~~~~~~g~~~~~~~~~~~~-~~~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~~~~~~GtF~l~ 78 (581) +-| +.. ..+.........++......... ....... .+..+. +. ..+...++... ......+.|+.. T Consensus 81 vrv~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~----~~--~~~~~~~~~~~--a~~~~~~~~~~~ 149 (671) T protein:vir:56 81 VRICDAT--TAQNATPLYNAVEYTIGASNGCVVGDDITIT-YSGVGA----LT--AKGKVLEVDAG--NNNAASKIFLPS 149 (671) T ss_pred EEecCcc--ccccchhhccccccccccCcceeeceeeeee-cCcccc----cc--cCcceeEEeee--ccceeeeeeccc Confidence 111 110 00000000000000000000000 0000000 000000 00 00001111000 001111122211 Q ss_pred eCce------eccccccCCCHHHHHHHHHhcCCCCcceEEEEcC--CCceEEEEecCCccccccc-cceeccCCCceEEE Q lcl|NC_011270. 79 LAGE------PTGNIPFNATQGQVQSALRALPNVEDDEVTVLGD--PGGPWTVTFTKAVAALTKD-VTGLTGGDDPDLNI 149 (581) Q Consensus 79 ~~g~------~T~~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~--~g~~w~Vtf~g~~~~l~~~-~~~l~~g~~~~v~v 149 (581) +..- .+.+...+.+.. +.-+...+.+... .+..+ +.+........ ....+ +....... T Consensus 150 ~~~v~~~~~~~~~~~~~~~t~~---------~~~~~~~v~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~-~~~~~~~~ 216 (671) T protein:vir:56 150 AEIVAAAKSDGNYPSVGTITLQ---------PTQGDIALTNIEIIDTGSVY---FPNIELAFDALTAIETE-GGALKYAD 216 (671) T ss_pred eeEEEeeecccccccccccccc---------ccccceeeeeecccccceEE---Eeccccccccccccccc-cccccchh Confidence 1100 001111111110 0011111111110 11111 00000000000 00000 00000000 Q ss_pred EEcccccceeeeccccccccceeeeecccccccee------eEeccceeEEeecccccccCcceeeeeeeeeeecccccc Q lcl|NC_011270. 150 ASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVY------VLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDP 223 (581) Q Consensus 150 ~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~------vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~ 223 (581) .....+.+.+........-................ .+..+.. ..................+ ..... T Consensus 217 ~~~~~~~~~~~a~~~g~~g~~~~v~v~~~~~~~~~~a~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-------~~~~~ 288 (671) T protein:vir:56 217 LIEKQGFPRLSARYVGDFGDAISVEIINYADYQTAFAFAAGHTLGDIE-LPIYPDGGTRSINLSSYFT-------FGPSN 288 (671) T ss_pred hhhcccccccccccccccCcceEEEEecccccccccccccceeeeecc-ccccccccccccccceeec-------ccccc Confidence 00000000000000000000000000000000000 0000000 0000000000000000000 00000 Q ss_pred cceeEEEEeecCCcccceeEecc-Cc----chhhhhhhhhhhhccccccceeeeeeee-cCCcceeEEeeeccCCcccch Q lcl|NC_011270. 224 GDIVQLSYRYTDPNYHEVIRFTD-PD----DIQDFYGPAFDEAGNVQSEITLCAQLAI-TNGASTILACAVDPEGDTVTM 297 (581) Q Consensus 224 ~~~~~~s~~~~~~~~~e~~~~~d-~~----~~~~~~~~a~~~~g~~~~~i~~~~~~~~-~~g~~~~~~~~~~~~~~~~t~ 297 (581) .+.........+. ..+.+.... .. .....+...+...+ ...+........ .......+. ++..+.... T Consensus 289 ~~~~~~~v~~~g~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~---gg~d~~~~~ 362 (671) T protein:vir:56 289 SNQYAVIVRVSGE-VEEAFIVSTNPGDKDVNGQSIFIDEYFENS--GSAYITAIAEGWKTESGAYNFG---GGSDANAGA 362 (671) T ss_pred cccceeEEeecCc-cceeEEEeecccccccchhhhhhhhhhccc--CceEEEecCcccCCcccccccc---Cccccccch Confidence 0000000001110 001111100 00 00011111111111 111111111000 001111111 112223456 Q ss_pred hhHHHHHHHHhcCCc--eEEEE-eC-CCc--HHHHHHHHHHHHHHhcCCCcEEEEEecCCC--------CCchhHHHHH- Q lcl|NC_011270. 298 GDYQNALNKFRDEDE--IAIIV-AG-TGA--QPIQALVQQHVSAQSNNKYERRAILGMDGS--------VTPVPSATRI- 362 (581) Q Consensus 298 ~dy~~al~~l~~~~~--~~iv~-~~-t~~--~~i~~~l~~~v~~~~~~~~~~~avvg~~~~--------~~~~~~~~~~- 362 (581) .++.++++++++... ..+++ |. ... ...+...++.+..+....+++++++..... .+...+.++. T Consensus 363 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 442 (671) T protein:vir:56 363 DDWMFGLDMLSDPEVLYTNLVIAGNAAAEEVSIASTVQKYAIDSVGNVRQDCVVFVSPPQSLIVNKQAGTAVANIQGWRT 442 (671) T ss_pred hHHHHHHHhhhhccccceeEEEcCCCCCccchhHHHHHHHHHHHHHhhcCCEEEEEeccccccccccccccHHHHHHHhh Confidence 688999999986533 23333 22 111 111222233333344444567888764321 1112111111 Q ss_pred ----------HHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccc---cCcccccc Q lcl|NC_011270. 363 ----------ANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVI---RGFSGPAE 429 (581) Q Consensus 363 ----------~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l---~g~~~~~~ 429 (581) .....++|.|.++++|+.++.+...+......|..++||++|.+.....+|+||.|+.+ .|+.+++. T Consensus 443 ~~~~~~~~~~~~~~~~~s~~~~~~~p~~~~~d~~~~~~~~~p~s~~~AGl~Ar~D~~~g~~~span~~~~~i~g~~~~~~ 522 (671) T protein:vir:56 443 GIDPTNGQAVVDNLNVSTTYAVIDGNYKYQYDKYNDRNRWVPLAGDIAGLCAYTDQVSQPWMSPAGFNRGQIKGVNRLAV 522 (671) T ss_pred hccccchhhhhhhccCCcceEEEecCceEEecccCCceeEechHHHHHHHHHHhhccCCcEECcCCceecccccccccee Confidence 12345789999999998887766543333334455777788888778889999999764 46667888 Q ss_pred cCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCCC-cccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHH Q lcl|NC_011270. 430 VQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTS-LHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQV 508 (581) Q Consensus 430 ~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~td-~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~i 508 (581) .+++.|++.|+++||++++..+++++++ ||-+|+..+ ++|++|++||++++|+++|++.++| |++|||++.+|..| T Consensus 523 ~~~~~~~~~Ln~~gIn~i~~~~~~G~~~-wG~rT~~~~~~~~~~i~vrR~~~~i~~si~~~~~~--~v~epn~~~~~~~i 599 (671) T protein:vir:56 523 DLRRAHRDALYQIGINPVVGFAGQGFVL-YGDKTATQQASAFDRINVRRLFNLLKKAISDAAKY--RLFELNDEFTRSSF 599 (671) T ss_pred ecChhHHHHHhhCCceEEEEecCCeEEE-EcceecCCCCcccceEehhhHHHHHHHHHHHHHHH--hcCCCCCHHHHHHH Confidence 9999999999999999999888888776 555776654 6899999999999999999999876 66799999999999 Q ss_pred HHHHHHHHHHHHhCCceeCCc----cceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceEEEEEee Q lcl|NC_011270. 509 KASAEAALVWLVDNNIIRGYR----NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEG 577 (581) Q Consensus 509 k~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~~~~~~ 577 (581) +..|..||..||++|+|.+|. ...+++.+++.|+++++|.++|++|+|||.++|+....--+|. .+-| T Consensus 600 ~~~i~~fL~~l~~~gal~g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~Pae~I~~~~~~~~~~~~f~-e~~~ 671 (671) T protein:vir:56 600 KSEIDAYLTNIQDLGGVYDFRVVCDETNNPGSVIDRNEFVASIYVKPAKSINFITLNFVATSTDADFA-EIIG 671 (671) T ss_pred HHHHHHHHHHHHhCCceeeeEEEEcCCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEeecCcchh-hhcC Confidence 999999999999999999985 3456677889999999999999999999999997553332322 2223 No 42 >protein:vir:1996 Length: 495 # NCBI annotation: major tail subunit # Family: family:all:369 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050643;genbank:gi:9633530;genbank:GeneID:2636269 Probab=100.00 E-value=6.3e-39 Score=230.03 Aligned_cols=430 Identities=12% Similarity=0.063 Sum_probs=261.0 Q ss_pred cCCCCcceEEEEcCCCceEEEEecCC-----ccc----cccccceeccCCCceE---EEEEcccccceeeec-------- Q lcl|NC_011270. 103 LPNVEDDEVTVLGDPGGPWTVTFTKA-----VAA----LTKDVTGLTGGDDPDL---NIASEQTGVPAMNRA-------- 162 (581) Q Consensus 103 l~~i~~~~V~~~~~~g~~w~Vtf~g~-----~~~----l~~~~~~l~~g~~~~v---~v~~~~~g~~~~~~~-------- 162 (581) |++|.--++--+-.-. -+.++|... +|. +-+-+..|.+|..+.. .+....+....+... T Consensus 1 m~~i~F~~IP~~iRvP-~~y~E~dns~A~~g~~~~~q~vLiiGq~la~gs~~~~~pv~v~s~~~a~~~fG~GS~la~M~~ 79 (495) T protein:vir:19 1 MSDISFNAIPSDVRVP-LTYIEFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQASAAFGQGSMLALMAD 79 (495) T ss_pred CCCCchhhCCcccccC-eEEEEEccCCCCcCCcCCCceEEEEEecCcccccccceeEEecCHHHHHHhcCcCcHHHHHHH Confidence 4444322221000000 134455321 111 0111112222322222 222211111111000 Q ss_pred c-ccccccce--eeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecc--------------cccccc Q lcl|NC_011270. 163 L-AKKGIKTD--TIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGG--------------HIDPGD 225 (581) Q Consensus 163 ~-~~~~~~~~--~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~--------------~~d~~~ 225 (581) + -....-.. ...+.+. .++..+|+ ....+.....|...-.+--..++..|..+ +..+.+ T Consensus 80 a~~~~n~~~~l~~i~~~D~--aG~aA~g~--it~tg~at~~G~l~l~I~g~~v~v~V~~gdTaa~vA~al~aaina~~~l 155 (495) T protein:vir:19 80 AFLNANRVAELWCIPQGNG--TGNAAVGE--ISLSGTAGENGSLVTYIAGQRLAVSVAAGATGAALADLLVARIKGQPDL 155 (495) T ss_pred HHHHhCCcceEEEEeeCCh--hhceeEEE--EEEeecCCCCcEEEEEECCEEEEEEecCCCCHHHHHHHHHHHhcCCccC Confidence 0 00000011 1122221 11222222 11111111111111000001112222211 111111 Q ss_pred eeEEEEeec--CCcccceeEeccCcchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHH Q lcl|NC_011270. 226 IVQLSYRYT--DPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNA 303 (581) Q Consensus 226 ~~~~s~~~~--~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~a 303 (581) -+......+ +.....+++++ ...||. .|++.+.+...-.......++....++.+...++|+..+ T Consensus 156 PvTA~~~~~~~~~~a~~~VtlT------------Ar~kG~-~n~idi~~~~~~ge~~p~Glt~titamsgGag~PDia~a 222 (495) T protein:vir:19 156 PVTAEVRADSGDDDTHADVVLS------------AKFTGA-LSAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISAS 222 (495) T ss_pred ceEEEeeccCCCCcCceeEEEE------------Eeeccc-cccceeEEEeecccccccceeEEEEecCCCCCCcchHHH Confidence 111111110 00111122222 123554 366666655444344555566677777777788899999 Q ss_pred HHHHhcCCceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEe Q lcl|NC_011270. 304 LNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVY 383 (581) Q Consensus 304 l~~l~~~~~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~ 383 (581) |++|.+++++.|++|++|.+++. ++++|++......+++.++.-. ...+...+..+++...||+|+.+++.. T Consensus 223 laal~~~~~~~I~~P~tD~asL~-al~~~l~~rw~~~~q~~g~~~~---a~~gT~~~l~t~g~~~N~~~it~~~~~---- 294 (495) T protein:vir:19 223 IAGMGDLQYKYIVMPYTDEPNLN-LLRTELQERWGPVNQADGFAVT---VLSGTYGDISTFGVSRNDHLISCMGIA---- 294 (495) T ss_pred HHHhccCCCcEEEEecCcHHHHH-HHHHHHHHhhhHHHhcCeEEEE---eecCCHHHHHHhhhccCCceEEEEecC---- Confidence 99999999999999999999985 5899999877654444333222 223456778899999999999987531 Q ss_pred cccccCCceecCHHHHHHHHHHHh---hccchhcccccccccCcc--cccccCCHHHHHHHHhCCcEEEEEeCCCeEEEE Q lcl|NC_011270. 384 YAPELNREVVLGGQFMAAAVAGKS---VSAIAAMPLTRKVIRGFS--GPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVR 458 (581) Q Consensus 384 ~~~~~~~~~~~p~~~~Aa~vAgl~---a~~~~~~slt~~~l~g~~--~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~ 458 (581) +..-|++.+||.+|+.+ .+.||++|++..+|+|+. +...+|+.+|++.|+.+|+.+++...+|.|+|+ T Consensus 295 -------gsp~~~~~~AAA~aa~~A~~l~~DPArPL~tl~L~Gi~~p~~~~r~~~~ern~LL~~Gist~~V~~~G~V~I~ 367 (495) T protein:vir:19 295 -------GAPEPSYLYAATLCAVASQALSIDPARPLQTLTLPGRMPPAVGDRFTWSERNALLFDGISTFNVNDGGEMQIE 367 (495) T ss_pred -------CCCCcHHHHHHHHHHHHHHHhhcccccccCceeecceecCCccccCChHHHHHHHhCCcceEEECCCCeEEEE Confidence 22235566655555554 488999999999999997 556799999999999999999998889999999 Q ss_pred EeeeccCC------CcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHH-----------HHHHHHHHHHHHHHHHHh Q lcl|NC_011270. 459 HGVTTDPT------SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDT-----------TIVQVKASAEAALVWLVD 521 (581) Q Consensus 459 ~~itT~~t------d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~-----------~r~~ik~~i~~~L~~l~~ 521 (581) |.||||++ |+.|.+|.++|+++|+++.+|..+. .+|.|+|..++ +-..||+++.+.+++|+. T Consensus 368 R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~-~kfpR~KLa~d~~~~~~gq~IvTp~~ir~ell~~~~~le~ 446 (495) T protein:vir:19 368 RMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRIT-QKFPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWEN 446 (495) T ss_pred eeeeeeeecCCCCcchhhhhhHHHHHHHHHHHHHHHHHh-hhcCCcccccCCCCCCCcccccChHHHHHHHHHHHHhhhh Confidence 99999964 7789999999999999999999997 56988877665 668999999999999999 Q ss_pred CCceeCCcc----ceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEe Q lcl|NC_011270. 522 NNIIRGYRN----LKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAP 566 (581) Q Consensus 522 ~gaI~~~~~----~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~ 566 (581) +|++++++. ..++++.+|++|+.+.+....+..++.+-.++++-. T Consensus 447 ~given~~~~~~~LiVerd~~dpnRln~~~p~d~vn~L~V~A~~i~f~L 495 (495) T protein:vir:19 447 AGLVEDFDTFKEELYVARNKDDKDRLDVLCGPNLINQFRIFAAQVQFIL 495 (495) T ss_pred hccccChhhhcceeEEEECCCCCcEEEEEecceeeCceeeeeeeeeeeC Confidence 999999864 457888899999999999999999999888888877 No 43 >protein:vir:96740 Length: 388 # NCBI annotation: phage tail sheath protein FI-like # Family: family:all:115 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039839;genbank:gi:126010913;genbank:GeneID:5076250 Probab=100.00 E-value=7.1e-40 Score=235.21 Aligned_cols=361 Identities=10% Similarity=0.065 Sum_probs=209.8 Q ss_pred ccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccC Q lcl|NC_011270. 168 IKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDP 247 (581) Q Consensus 168 ~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~ 247 (581) |. ... ...+++++...+.-+-.+ ...++.... .+-...+.-...+... +.........+.. +.. T Consensus 1 m~----~~~-~~~hGv~v~ev~~g~~~i-----~~~~tavi~-~Vgta~~ad~~~p~~~---~~~i~~~~d~~~~-~~~- 64 (388) T protein:vir:96 1 MP----VID-QFEHNGISIETHEPPPPM-----GPPGDNVVA-WVVTAPDKHADVAFSV---PFRVANTADAQYL-DST- 64 (388) T ss_pred CC----CCC-CCCCceEEEEcCCCcccc-----cccCcceeE-EEEecCCCcccccccc---ceeeecchhhhhh-hcc- Confidence 11 000 111223332211110000 001111110 1110111000001100 0001111111100 000 Q ss_pred cchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeecc-CCccc----------chhhHHHHHHHHhcCCc--eE Q lcl|NC_011270. 248 DDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDP-EGDTV----------TMGDYQNALNKFRDEDE--IA 314 (581) Q Consensus 248 ~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~-~~~~~----------t~~dy~~al~~l~~~~~--~~ 314 (581) +.....+..++...+.++...+....+.. ..... ....-...+++++..+. .. T Consensus 65 --------------~~~~gtl~~al~~~~~~~~~~~~vv~v~~g~~~~at~a~iig~~~~~tg~~~gl~al~~~~~~p~i 130 (388) T protein:vir:96 65 --------------GNELGTGWHAASETLKKTSVPQYFIVVPEGADDAATMANIIGGIDPTTGRRTGIAALTECTERPTL 130 (388) T ss_pred --------------ccccccchhhhHhhhccCCceEEEEEeccccccccccceeeeecccccchhhHHHHhhhcccceeE Confidence 00000011111112222222111111100 00000 00112244555554333 34 Q ss_pred EEEeC-CCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHH-HHHhhccCCccEEEEEcCeeEecccccCCce Q lcl|NC_011270. 315 IIVAG-TGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATR-IANAQSIKDQRVALISPSSFVYYAPELNREV 392 (581) Q Consensus 315 iv~~~-t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~-~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~ 392 (581) +++|+ ++...+++++.+||++++ ++++++++........... ......++|+|+++++|+....+... ...+ T Consensus 131 l~aPg~s~~~~v~~al~~~~~~~~-----~~~i~D~p~~~~~~~~~~~~~~~~~~~~s~~~~~~~P~~~~~d~~~-~~~~ 204 (388) T protein:vir:96 131 IGAPGFSQNKAVIDALASMAKRLK-----CRAVIDGPSGSTQDAIDLSGLLGGEGTGHDRVYMVDPMPAIYSRKA-QGNI 204 (388) T ss_pred EEeeccccchHHHHHHHHHHhhcC-----cEEEEeccCCchhHHHHHHhhhhccCcCcceEEEEeCceeeecccC-Ccee Confidence 55565 566789999999998763 6888887754332222111 12344689999999999887665433 3445 Q ss_pred ecCHHHHHHHHHHHhhccchhcccccccccCccccc------ccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCC Q lcl|NC_011270. 393 VLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPA------EVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT 466 (581) Q Consensus 393 ~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~------~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~t 466 (581) .+|+ ++.+||+.|..++++||.|+++ ++.++. ..+++.|.+.|+++||+++..++++++++ ||-+|+ T Consensus 205 ~~p~---s~~~AG~~a~~D~~~spaN~~i-~i~g~~~~~~~~~~~~~~~~~~Ln~~gI~~i~~~~~~G~~~-wG~rT~-- 277 (388) T protein:vir:96 205 YVPP---STIAMGAVAAVKPWESPGNQGV-LIQDVARVIDYNILDKSTEGDLLNRNGVSYFARTSMGGFSL-IGNRTV-- 277 (388) T ss_pred eech---HHHHHHHHHhhcCcccccCeeE-EeeeecccccccccCChhhHHhhhhcCceEEEEecCCcEEE-Eccccc-- Confidence 5665 4788899999999999999988 344443 33467899999999999998888888876 555664 Q ss_pred CcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCcc----ceeEEeecCCCE Q lcl|NC_011270. 467 SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPDV 542 (581) Q Consensus 467 d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~----~~~~~~~~~~~~ 542 (581) +|++|++||++|+|+++|++.+++ |++|||++.+|..|+..|+.||++||++|+|.+|+. ..++..+++.|+ T Consensus 278 --~~~~i~vrR~~~~i~~si~~~~~~--~v~epn~~~~~~~i~~~i~~fL~~l~~~Gal~g~~~~~d~~~nt~~~i~~G~ 353 (388) T protein:vir:96 278 --TGKFISFVGLEDAIARKLEAASQR--AMSKQLTKSFMEQEIKKINLFMQDLVAAEIIPGGEVYLHPTLNTVERYKNGS 353 (388) T ss_pred --CCcceeehhhHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEecCCCCHHHhhCCE Confidence 499999999999999999999875 677999999999999999999999999999999753 345666788999 Q ss_pred EEEEEEEEecCceeEEEEEEEEEeccce--EEEEE Q lcl|NC_011270. 543 IEVRYEWRPAYPLNYIVVRYSIAPETGD--ITSTI 575 (581) Q Consensus 543 ~~v~i~v~pv~~~e~I~~~~~~~~~tg~--~~~~~ 575 (581) ++++|.++|++|+|||.+++++.++==+ +..-| T Consensus 354 ~~~~i~~~p~~pae~I~~~~~~~~~~~~~~~~~~~ 388 (388) T protein:vir:96 354 WYIVIDYGRYSPNEHMIFHLNAVDRIVEEFIEEVL 388 (388) T ss_pred EEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhC Confidence 9999999999999999999998765443 22222 No 44 >protein:vir:1172 Length: 391 # NCBI annotation: hypothetical protein # Family: family:all:115 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490621;genbank:gi:17313241;genbank:GeneID:927300 Probab=100.00 E-value=3.8e-40 Score=236.73 Aligned_cols=362 Identities=12% Similarity=0.057 Sum_probs=214.6 Q ss_pred ccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeee-eeeeecccccccceeEEEEeecCCcccceeEecc Q lcl|NC_011270. 168 IKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTI-QRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTD 246 (581) Q Consensus 168 ~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti-~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d 246 (581) |-... ...++++.......-.+.. ..+....... .+..+.. ..+.. .+......... ...|.+ T Consensus 1 M~~~~------~~~GV~v~e~~~~~~~i~~-----v~tavig~vg~a~~a~~~-~~~~~---~p~~v~s~~~~-~~~~g~ 64 (391) T protein:vir:11 1 MAADQ------YHHGVRVQEINDGTRPIRT-----IATAIIGLVATAEDADAT-AFPLD---TPVLITNVQAA-IGKAGT 64 (391) T ss_pred CCCCc------CCCcEEEEECCCCcceecc-----cCCceeEEEEecCCCCCc-ccccc---ccEEEecchhh-heecCC Confidence 10000 0112222111110000000 0000000000 0000000 00000 00111111111 111111 Q ss_pred CcchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCC-cccchhhHH---------HHHHHHhcC-Cc--- Q lcl|NC_011270. 247 PDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEG-DTVTMGDYQ---------NALNKFRDE-DE--- 312 (581) Q Consensus 247 ~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~-~~~t~~dy~---------~al~~l~~~-~~--- 312 (581) ... +..+++..+.++...++........ ...+..|+. .++.++... .. T Consensus 65 ~~t------------------l~~al~~~~~~~g~~~~vv~~~~~~~~~~t~~d~~g~~~a~~~~~g~~a~~~~~~~~~~ 126 (391) T protein:vir:11 65 SGT------------------LPASLQAIADQANAATVVVRVKPGEDEAATNSAVIGGVSADGKYTGMKALLAAKARLGV 126 (391) T ss_pred Ccc------------------chhhhhhhhccccceeEEeeecccccccccchhhhcccccccchhhhhhhhhhhhhhee Confidence 111 1222333333443333333222111 111111111 112222211 11 Q ss_pred --eEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCC Q lcl|NC_011270. 313 --IAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNR 390 (581) Q Consensus 313 --~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~ 390 (581) ..+.+|+.+..++++++.+||+++ .++++++.+...+ ....++....++|+|+++++|+....+...+ . T Consensus 127 ~p~~~~ap~~~~~~v~~al~~~~~~~-----~~~~i~D~p~~~t---~~~a~~~r~~~~s~~~~~~~p~~~~~~~~~~-~ 197 (391) T protein:vir:11 127 VPRILGVPGLDTQPVATALIAIAQQL-----RAFAYVSASGCKT---KEEATAYRENFAAREAMVIWPDFLTWSTVVN-Q 197 (391) T ss_pred ccccccccccccHHHHHHHHHhhccc-----ceEEEEEcCCCCC---HHHHHHHhhhcCCceEEEEcCcceecccccC-c Confidence 123456667778888899888765 3788887665443 3455666778999999999998876654433 3 Q ss_pred ceecC-HHHHHHHHHHHhhccchhcccccccccCcccccccCC------HHHHHHHHhCCcEEEEEeCCCeEEEEEeeec Q lcl|NC_011270. 391 EVVLG-GQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQR------DGEKSRESSEGLMVIEKTPRNLVHVRHGVTT 463 (581) Q Consensus 391 ~~~~p-~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~t------~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT 463 (581) ...+| ...+|+.+|.......+++||.|++|.|+.++...++ +.|.+.|+++|++++. +++++++ ||-+| T Consensus 198 ~~~~p~s~~~ag~~a~~d~~~g~~~span~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~~~--~~~G~~~-wG~rT 274 (391) T protein:vir:11 198 TVPAPAVAQALGLRARIDQEVGWHKTLSNVAVNGVTGISADVFWDLQSPSTDANYLNENEVTTLV--QEGGFRF-WGSRT 274 (391) T ss_pred eEEechHHHHHHHHHHhhccCCcEEccCCceeeceeecccccccccCCCcchhhhhhhcCcEEEE--cCCCEEE-Ecccc Confidence 34444 4466666666666777999999999999988866553 6788999999999984 4566765 67778 Q ss_pred cCCCcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCcc----ceeEEeecC Q lcl|NC_011270. 464 DPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQ 539 (581) Q Consensus 464 ~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~----~~~~~~~~~ 539 (581) +.+|++|++|++||++|++++.|++.+++ |++|||++.+|..|+..+..||++||++|+|.+|+. ..+++.+++ T Consensus 275 ~~~d~~~~~i~vrR~~~~i~~~~~~~~~~--~v~e~n~~~~~~~i~~~i~~~l~~l~~~g~l~g~~~~~~~~~n~~~~i~ 352 (391) T protein:vir:11 275 CSDDPLFAFENYTRTAQVLADTIAEAHMW--AVDKPMHPSLVRDILEGVNAKFRELKGLGLIIDAQAWYDPNVNDKDTLK 352 (391) T ss_pred cCCCcccceeehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhccceeceEEEEecCCCCHHHhh Confidence 88899999999999999999999999864 677999999999999999999999999999999863 345566788 Q ss_pred CCEEEEEEEEEecCceeEEEEEEEEEeccce-EEEEEee Q lcl|NC_011270. 540 PDVIEVRYEWRPAYPLNYIVVRYSIAPETGD-ITSTIEG 577 (581) Q Consensus 540 ~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~-~~~~~~~ 577 (581) .|+++++|.++|++|+|||.+++++.++-=+ +..+|-- T Consensus 353 ~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~a 391 (391) T protein:vir:11 353 AGKLRITYDYTPVPPLEDLTFFQKITDSYLVDFASRVNA 391 (391) T ss_pred CCeEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhcC Confidence 9999999999999999999999988776422 1111111 No 45 >protein:vir:4517 Length: 498 # NCBI annotation: tail sheath protein # Family: family:all:369 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599043;genbank:gi:19549001;genbank:GeneID:935236 Probab=100.00 E-value=2.1e-40 Score=238.16 Aligned_cols=425 Identities=13% Similarity=0.078 Sum_probs=256.2 Q ss_pred cccccCCCHHHHHHHHHhcCCCCcceEEEEcCCCceEEEEecCCcccc-------ccccceeccCCCceEEEEEccc--- Q lcl|NC_011270. 85 GNIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVAAL-------TKDVTGLTGGDDPDLNIASEQT--- 154 (581) Q Consensus 85 ~~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~g~~w~Vtf~g~~~~l-------~~~~~~l~~g~~~~v~v~~~~~--- 154 (581) =.|+|| .+|+ ++.+ --+.++|....+.+ -+-+..|.+|..+..+...... T Consensus 1 M~IsF~-----------~IP~----~iRv-----P~~y~E~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~ 60 (498) T protein:vir:45 1 MTISFN-----------TIPS----NTLV-----PLFYAEMDNQAANTAQDSGASLLIGHANNGAEIVANSLVLMPSADY 60 (498) T ss_pred CCCchh-----------hcCc----cccc-----CeEEEEEeCCCCCCCCCCcceEEEEecCCccccccceeEEecCHHH Confidence 112221 1221 0000 01334452211111 1111222223322222212111 Q ss_pred ccceeeec---------ccccccccee--eeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeeccc--- Q lcl|NC_011270. 155 GVPAMNRA---------LAKKGIKTDT--IRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGH--- 220 (581) Q Consensus 155 g~~~~~~~---------~~~~~~~~~~--~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~--- 220 (581) ....+... .-....-.++ ..+.+. .++..+|+ ....+.....|...-.+--..++..|..+. T Consensus 61 a~~lfG~GSml~~M~~a~~~~n~~~~l~~i~~~d~--aG~aA~g~--it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa 136 (498) T protein:vir:45 61 ARQICGAGSQLARMVEAYRQTDPFGELYVIAVPEA--TGAAATVT--LTVTGEATESGTVNVYVGRTRVQAPVTNGDNVT 136 (498) T ss_pred HHHhcCcCcHHHHHHHHHHHhCCcceEEEEeeCCc--ccceeEEE--EEeecccCCCcEEEEEECCEEEEEEecCCCCHH Confidence 11000000 0000000111 122111 11121221 111111111111100000001111111111 Q ss_pred -----------ccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhccccccceeeeeeeec---CCcceeEEe Q lcl|NC_011270. 221 -----------IDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAIT---NGASTILAC 286 (581) Q Consensus 221 -----------~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~---~g~~~~~~~ 286 (581) ..+.+-+.. .....+++++ ...||..+|++.+.+..... ......++. T Consensus 137 ~vA~al~aaina~~~lPVTA------~~~~~~VtlT------------Ar~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~ 198 (498) T protein:vir:45 137 TIASSIQDAINAVPTLPFTA------SSSAGVVTLT------------ARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQI 198 (498) T ss_pred HHHHHHHHHHhCCCCCceEE------EecCceEEEE------------eeccCccccceeEEEeeccccccccccceeeE Confidence 001000000 0001111111 13478888999888765432 234445566 Q ss_pred eeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHHHHHHHHHHHhcC-----CCcEEEEEecCCCCCchhHHHH Q lcl|NC_011270. 287 AVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNN-----KYERRAILGMDGSVTPVPSATR 361 (581) Q Consensus 287 ~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~~l~~~v~~~~~~-----~~~~~avvg~~~~~~~~~~~~~ 361 (581) ...++.+...++|+.++|++|.++++..|++|++|++++.+ +++|++.++.. .+..+++. ...+...+. T Consensus 199 ~itamagGag~PD~a~alaal~~~~~~~I~~p~~D~asL~a-l~~~L~~~sgRw~~~~q~~g~~~~-----a~~gT~~~l 272 (498) T protein:vir:45 199 AVATGTAGTGAPVLTGAVAAMADEPFDYIGLPFNDTASVNT-LVTEMNDTSGRWSYARQLYGHVYT-----AKTGTLSEL 272 (498) T ss_pred EEEccCCCccCchhHHHHHHhccCCccEEEEeeCCHHHHHH-HHHHHhhhhhhhhHHhhcCeEEEE-----eccCCHHHH Confidence 66666667778899999999999999999999999999866 79999875432 22233333 234457788 Q ss_pred HHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhh---ccchhcccccccccCcc--cccccCCHHHH Q lcl|NC_011270. 362 IANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSV---SAIAAMPLTRKVIRGFS--GPAEVQRDGEK 436 (581) Q Consensus 362 ~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a---~~~~~~slt~~~l~g~~--~~~~~~t~~e~ 436 (581) .+++...||+|+.+.+.. ....-|++.+||.+|+.+| +.||++|++..+|+|+. .+..+|+.+|+ T Consensus 273 ~t~g~~~N~~~it~~~~~----------~~~~sp~~~~AAa~aa~~A~~l~~DPArPL~tl~L~Gi~~p~~~~r~~~~er 342 (498) T protein:vir:45 273 VNAGDQFNQQHITLAGYE----------KETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPAPKGKRFTMTEQ 342 (498) T ss_pred HHhhhccCCceEEEEecC----------CCCCChHHHHHHHHHHHHHHHhhcccccccCceeecceecCCchhcCChHHH Confidence 899999999999987531 2334577888888888887 89999999999999997 56679999999 Q ss_pred HHHHhCCcEEEEEeCCCeEEEEEeeeccCC------CcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHH------- Q lcl|NC_011270. 437 SRESSEGLMVIEKTPRNLVHVRHGVTTDPT------SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDT------- 503 (581) Q Consensus 437 ~~l~~~Gv~~l~~~~~~~v~i~~~itT~~t------d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~------- 503 (581) +.|+.+|+.+++.. +|.|+|+|.||||++ |+.|.+|.++|+++|+++.+|..+. .+|.++|..++ T Consensus 343 n~LL~~Gist~~V~-~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~-~kfpR~KLa~dg~~~~~g 420 (498) T protein:vir:45 343 QTLLSHGVATAYVE-SGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSAYVLRKLKSVIT-SKYGRHKLASDGTRFGPG 420 (498) T ss_pred HHHHhCCcceEEEc-CCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhh-hhcCCeeecccCcccCCC Confidence 99999999999875 668999999999974 7889999999999999999999997 56999887766 Q ss_pred ----HHHHHHHHHHHHHHHHHhCCceeCCcc----ceeEEeecCCCEEEEEEEEEecCceeEE----EEEEEEEeccc Q lcl|NC_011270. 504 ----TIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPDVIEVRYEWRPAYPLNYI----VVRYSIAPETG 569 (581) Q Consensus 504 ----~r~~ik~~i~~~L~~l~~~gaI~~~~~----~~~~~~~~~~~~~~v~i~v~pv~~~e~I----~~~~~~~~~tg 569 (581) +...||+++.+.+++|+.+|++++++. ..++++.+|++|+.+.+....+..++-+ -++++++.++- T Consensus 421 q~IvTp~~ir~ell~~y~~le~~givEn~~~~~~~LiVerd~~dpnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:45 421 QAIVTPAVIKGELLATYRQLERAGIVENYELFKQYLVVERDASVPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred CcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcEEEEEecccccCchhhhhhhhhhheehhhcCC Confidence 778999999999999999999999864 4578888999999999977777777644 44455555544 No 46 >protein:vir:4463 Length: 498 # NCBI annotation: Tail Sheath protein # Family: family:all:369 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700386;genbank:gi:23505458;genbank:GeneID:955665 Probab=100.00 E-value=1.1e-40 Score=239.67 Aligned_cols=430 Identities=12% Similarity=0.091 Sum_probs=259.0 Q ss_pred cccccCCCHHHHHHHHHhcCCCCcceEEEEcCCCceEEEEecCCcccc-------ccccceeccCCCce---EEEEEccc Q lcl|NC_011270. 85 GNIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVAAL-------TKDVTGLTGGDDPD---LNIASEQT 154 (581) Q Consensus 85 ~~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~g~~w~Vtf~g~~~~l-------~~~~~~l~~g~~~~---v~v~~~~~ 154 (581) =.|+|| .+|+ ++.+ . -+.++|....+.+ -+-+..+.+|..+. +.|....+ T Consensus 1 M~IsF~-----------~IP~----~iRv---P--~~y~E~dns~A~~~~~~q~vLiiGq~la~gs~~~~~~v~v~s~~~ 60 (498) T protein:vir:44 1 MAISFN-----------SIPS----DTRV---P--LFYAEMDNSAANTARDSGASLLIGHASNDASIAVNSLVLVSSVDY 60 (498) T ss_pred CCCchh-----------hcCc----cccc---C--eEEEEEeCCCCCCCcCCcceEEEEecCcccccccceeEeecCHHH Confidence 112222 1221 0000 0 1334442211111 11112222232221 12211111 Q ss_pred ccceeeec---------ccccccccee--eeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecc---- Q lcl|NC_011270. 155 GVPAMNRA---------LAKKGIKTDT--IRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGG---- 219 (581) Q Consensus 155 g~~~~~~~---------~~~~~~~~~~--~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~---- 219 (581) ....+... .-....-..+ ..+.+ ..++..+|+ ....+.....|...-.+--..++..|..+ T Consensus 61 a~~~fG~GSml~~M~~a~~~~n~~~~l~~i~~~D--~aG~aAtg~--it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa 136 (498) T protein:vir:44 61 ARQICGAGSQLARMVGAYRKTDPFGELYVIAVPE--STGAAATVA--LTVTGEATETGTVNVYTGRTRVQAPVTSGDDAA 136 (498) T ss_pred HHHhcCcccHHHHHHHHHHHhCCCceeEEEecCC--cccceeEEE--EEeecccCCCcEEEEEECCEEEEEEecCCCCHH Confidence 11001000 0000000111 12211 111121221 11111111111110000000111111111 Q ss_pred ----------cccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhccccccceeeeeeeecC---CcceeEEe Q lcl|NC_011270. 220 ----------HIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITN---GASTILAC 286 (581) Q Consensus 220 ----------~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~---g~~~~~~~ 286 (581) +..+.+-+... ....++.++ ...||..+|++.+.+...... ..+..++. T Consensus 137 ~vA~al~aaina~~~lPVTA~------~~~~~vtlT------------Ar~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~ 198 (498) T protein:vir:44 137 AVAVSIKDAVNANPDLPFTAT------SEAGVVTLT------------ARHKGLYGNEIPVTLNYYGFGGGEVLPAGVNI 198 (498) T ss_pred HHHHHHHHHHhCCCCCceEEe------eccceEEEE------------EeccCcccCcceEEEeeccCccccccccceeE Confidence 11111111000 001111111 134788899999887765532 23345566 Q ss_pred eeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhh Q lcl|NC_011270. 287 AVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQ 366 (581) Q Consensus 287 ~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~ 366 (581) ...++.+...++|+.++|++|.+++|+.|++|++|++++.+ +++|++.++..+..++...|.......+...+..+++. T Consensus 199 titamsgGag~PDia~alaal~~~~~~~i~~p~~D~asl~a-l~~~L~~~sgRw~~~~q~~g~~~~a~~gT~a~l~t~g~ 277 (498) T protein:vir:44 199 TVASGVKGAGAPALNDAVAAMGDEPFDYIGLPFNDTASVNS-MATEMNDSSGRWSYVRQLYGHVYTAKTGTLSELVAAGD 277 (498) T ss_pred EEEcccCCccCchhHHHHHhhccCCccEEEEeecCHHHHHH-HHHHHhhhhcchHHHhhcCeEEEEeccCCHHHHHHhhh Confidence 66677777788899999999999999999999999999866 89999877654333333333333344556788889999 Q ss_pred ccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhh---ccchhcccccccccCcc--cccccCCHHHHHHHHh Q lcl|NC_011270. 367 SIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSV---SAIAAMPLTRKVIRGFS--GPAEVQRDGEKSRESS 441 (581) Q Consensus 367 ~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a---~~~~~~slt~~~l~g~~--~~~~~~t~~e~~~l~~ 441 (581) ..||+|+.+.+.. ....-|++.+||.+|+.+| +.||++|++..+|+|+. .+..+|+.+|++.|+. T Consensus 278 ~~N~~~it~~~~~----------~~~~sp~~~~AAa~a~~aA~~l~~DPArPL~tl~L~Gi~~p~~~~r~~~~ern~LL~ 347 (498) T protein:vir:44 278 QFNLQHITLAGYE----------KDTQTPADELAASRTARAAVFIRNDPARPTQTGELVDMLPAPKGKRFTTTEQQTLLS 347 (498) T ss_pred ccCCceEEEEecC----------CCCCCHHHHHHHHHHHHHHHHhhcccccccCceeecccccCCchhcCChHHHHHHHh Confidence 9999999987531 2333577888888888887 89999999999999997 4568999999999999 Q ss_pred CCcEEEEEeCCCeEEEEEeeeccCC------CcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHH-----------H Q lcl|NC_011270. 442 EGLMVIEKTPRNLVHVRHGVTTDPT------SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDT-----------T 504 (581) Q Consensus 442 ~Gv~~l~~~~~~~v~i~~~itT~~t------d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~-----------~ 504 (581) +|+.+++.. +|.|+|+|.||||++ |+.|.+|.++|+++|+++.+|..+. .+|.|+|..++ + T Consensus 348 ~Gist~~V~-~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~-~kfpR~KLa~d~~~~~~gq~IvT 425 (498) T protein:vir:44 348 HGVATAYVE-SGVLRIQRDITTYRKNAYGVADNSYLDSETLHTSAYVLRRLKSVIT-SKYGRHKLANDGTRFGSGQAIVT 425 (498) T ss_pred cCcceEEEc-CCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhh-hhcCCcccccCCcccCCCccccc Confidence 999999875 668999999999974 7889999999999999999999996 56988875554 6 Q ss_pred HHHHHHHHHHHHHHHHhCCceeCCcc----ceeEEeecCCCEEEEEEEEEecCceeEEE----EEEEEEeccc Q lcl|NC_011270. 505 IVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPDVIEVRYEWRPAYPLNYIV----VRYSIAPETG 569 (581) Q Consensus 505 r~~ik~~i~~~L~~l~~~gaI~~~~~----~~~~~~~~~~~~~~v~i~v~pv~~~e~I~----~~~~~~~~tg 569 (581) ...||+++.+.+++|+.+|++++++. ..++++.+|++|+.+.+....+..++-+- ++++++.++- T Consensus 426 p~~ir~eli~~y~~le~~givEn~~~~~~~LiVerd~~dpnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:44 426 PAVIRGELGSTYRQMEREGIVENFDLFQQHLIVERNANDSNRLDVLFPPDYVNQLRVFAVLNQFRLQYSEEAA 498 (498) T ss_pred HHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcEEEEEecccccCchhhhhhhhhhhhhhhhhcC Confidence 78999999999999999999999864 45788889999999999777777776543 3444444444 No 47 >protein:vir:489 Length: 498 # NCBI annotation: putative sheath protein # Family: family:all:369 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543096;swissprot:trembl:q8w623;genbank:gi:18249908;uniprot:Q8W623;genbank:GeneID:929698 Probab=100.00 E-value=8.6e-40 Score=234.75 Aligned_cols=425 Identities=12% Similarity=0.054 Sum_probs=255.6 Q ss_pred cccccCCCHHHHHHHHHhcCCCCcceEEEEcCCCceEEEEecCC-----ccc--cccccceeccCCCce---EEEEEccc Q lcl|NC_011270. 85 GNIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKA-----VAA--LTKDVTGLTGGDDPD---LNIASEQT 154 (581) Q Consensus 85 ~~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~g~~w~Vtf~g~-----~~~--l~~~~~~l~~g~~~~---v~v~~~~~ 154 (581) =.|+||.=+..++ +- -+.++|... +|. +-+-+..+.+|..+. +.+....+ T Consensus 1 M~IsF~~IP~~iR---------------vP-----~~y~E~dns~A~~~~~~qrvLiiGq~la~gt~~~~~~v~v~s~~~ 60 (498) T protein:vir:48 1 MTISFSAVPSDTL---------------VP-----LFYAEMDNSAANTAVTSAPALLIGHASNDAAIEVNSLVLMPSADY 60 (498) T ss_pred CCccccccCcccc---------------cc-----eEEEEEecCCCccccCCcceEEEeecCccccccccceEEecCHHH Confidence 1222221111111 00 122333110 110 001111122222221 12211111 Q ss_pred ccceeeec--------c-ccccccce--eeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecc---- Q lcl|NC_011270. 155 GVPAMNRA--------L-AKKGIKTD--TIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGG---- 219 (581) Q Consensus 155 g~~~~~~~--------~-~~~~~~~~--~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~---- 219 (581) ....+... + -....-.. ...+.+. .++..+|+ ..........|...-.+--..++..|..+ T Consensus 61 a~~~fG~GS~l~~M~~a~~~~n~~~~l~~i~~~D~--ag~aA~g~--it~tg~at~~G~l~l~Igg~~v~v~V~~gdTaa 136 (498) T protein:vir:48 61 ARQICGAGSQLARMVDVYRQTDPFGELYVIAVPEA--RGAAATVR--VTVTGEAEESGTLSLYVGRSSVQVPVVNGDDAT 136 (498) T ss_pred HHHhcCcccHHHHHHHHHHHhCCCceeEEEeeCCc--ccceeEEE--EEecccccCCceEEEEECCEEEEEeecCCCCHH Confidence 11111000 0 00000011 1112111 11111221 11111111111100000000011111111 Q ss_pred ----------cccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhccccccceeeeeeeec---CCcceeEEe Q lcl|NC_011270. 220 ----------HIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAIT---NGASTILAC 286 (581) Q Consensus 220 ----------~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~---~g~~~~~~~ 286 (581) +..+.+-+... ....++.++ ...||..+|++.+.+..... ...+..++. T Consensus 137 ~vA~al~aai~a~~~lPVTA~------~~~~~VtlT------------Ar~kG~~GN~I~l~~~~~~~~~ge~~p~Glt~ 198 (498) T protein:vir:48 137 AVATAIKEAVNGVITLPFAAS------SDAGVVTLT------------ARHKGLYGNELPVCLNYYGSGGGEILPAGLQV 198 (498) T ss_pred HHHHHHHHHHhCCCCcceEEE------ecCcEEEEE------------eeecccccccceeeeeeccCcccccccceeeE Confidence 00010000000 001111111 13378888999988876543 234445566 Q ss_pred eeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHHHHHHHHHHHHhcC-----CCcEEEEEecCCCCCchhHHHH Q lcl|NC_011270. 287 AVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQALVQQHVSAQSNN-----KYERRAILGMDGSVTPVPSATR 361 (581) Q Consensus 287 ~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~~~l~~~v~~~~~~-----~~~~~avvg~~~~~~~~~~~~~ 361 (581) ...++.+...++|+.++|++|.+++++.|++|++|++++.+ +++|++.++.. .+..+++. ...+...+. T Consensus 199 ~itamsgGag~PDia~aLaal~~~~~~~I~~p~~D~asl~a-l~~~L~~~sgRw~~~~q~~g~~~~-----a~~gT~~~l 272 (498) T protein:vir:48 199 VTEAGTAGSGAPDLTAAVAAMGDEAFDFIGLPFNDAASINM-MMTEMNDSSGRWSYARQLYGHVYT-----AKLGTLSEL 272 (498) T ss_pred EEEcccCCccCcchHHHHHhhccCCccEEEEeecCHHHHHH-HHHHHhhhhhhhhHHhhcCeEEEE-----eccCCHHHH Confidence 66667766788899999999999999999999999999866 89999876543 22233333 234457788 Q ss_pred HHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhh---ccchhcccccccccCcc--cccccCCHHHH Q lcl|NC_011270. 362 IANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSV---SAIAAMPLTRKVIRGFS--GPAEVQRDGEK 436 (581) Q Consensus 362 ~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a---~~~~~~slt~~~l~g~~--~~~~~~t~~e~ 436 (581) .+++...||+|+.+.+. .+....|++.+||.+|+..| +.||++|++..+|+|+. .+..+|+.+|+ T Consensus 273 ~t~g~~~N~~~it~~~~----------~~~~~~p~~~~AAa~a~~aA~~l~~DPArPLqtl~L~Gi~~p~~~~r~~~~er 342 (498) T protein:vir:48 273 VNAGDMHNQQHITLAGY----------EKETQSPVDELVASRLAREAVFIRNDPARPTQTGELVGMLPAPKGKRFIMTEQ 342 (498) T ss_pred HHhhhccCCceEEEEec----------CCCCCChHHHHHHHHHHHHHHhhhccccccccceeeeccccCCchhcCChHHH Confidence 89999999999998753 23344688888888888887 89999999999999997 56679999999 Q ss_pred HHHHhCCcEEEEEeCCCeEEEEEeeeccCC------CcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHH------- Q lcl|NC_011270. 437 SRESSEGLMVIEKTPRNLVHVRHGVTTDPT------SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDT------- 503 (581) Q Consensus 437 ~~l~~~Gv~~l~~~~~~~v~i~~~itT~~t------d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~------- 503 (581) +.|+.+|+.+++. +++.|+|+|.||||++ |+.|.+|.++|+++|+++.+|..+. .+|.++|..++ T Consensus 343 n~LL~~Gist~~V-~~G~V~I~R~ITTY~~n~~G~~D~syLDi~T~~tl~yvr~~~r~~i~-~kfpR~KLa~dg~~~~~g 420 (498) T protein:vir:48 343 QTLLSHGVATAYV-EGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTSAYVLRKLKSVIT-SKYGRHKLANDGTRFGPG 420 (498) T ss_pred HHHHhcCcceEEE-cCCeEEEEeeeeeeeecCCCCcchhhhhhhhHHHHHHHHHHHHHHhh-hhcCCceecccCcccCCC Confidence 9999999999986 7889999999999974 7889999999999999999999997 56999887776 Q ss_pred ----HHHHHHHHHHHHHHHHHhCCceeCCcc----ceeEEeecCCCEEEEEEEEEecCceeEEE----EEEEEEeccc Q lcl|NC_011270. 504 ----TIVQVKASAEAALVWLVDNNIIRGYRN----LKARQIERQPDVIEVRYEWRPAYPLNYIV----VRYSIAPETG 569 (581) Q Consensus 504 ----~r~~ik~~i~~~L~~l~~~gaI~~~~~----~~~~~~~~~~~~~~v~i~v~pv~~~e~I~----~~~~~~~~tg 569 (581) +...||+++.+.+++|+.+|++++++. ..++++.+|++|+.+.+....+..++-+- ++++++..+- T Consensus 421 q~IvTp~~ir~eli~~y~~le~~given~~~~~~~LiVerd~~dpnRln~~~p~d~vn~L~V~A~~~~f~lq~~~~~~ 498 (498) T protein:vir:48 421 QAIVTPAVIKGELLATYRQMERAGIVENYDLFKQYLIVERDADNPNRLNTLFPPDYVNQLRVFAVVNQFRLQYSEESA 498 (498) T ss_pred CcccchHHHHHHHHHHHHhhhhhccccChhhhcceeEEEECCCCCcEEEEEecccccCchhhhhhhhhhhhhhhhcCC Confidence 779999999999999999999999864 45788889999999999777777776543 3444444443 No 48 >protein:vir:2035 Length: 396 # NCBI annotation: gpFI # Family: family:all:115 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046778;genbank:gi:9630349;genbank:GeneID:1261516 Probab=100.00 E-value=3.5e-39 Score=231.41 Aligned_cols=379 Identities=12% Similarity=0.087 Sum_probs=218.4 Q ss_pred eccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeec Q lcl|NC_011270. 139 LTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDG 218 (581) Q Consensus 139 l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~ 218 (581) ++. -.+.|.|.+...|..++.... .+..+.... ..+... T Consensus 1 m~~-~~~GV~v~e~~~g~~~i~~v~------tav~~~vg~-------------------a~~a~~--------------- 39 (396) T protein:vir:20 1 MSD-YHHGVQVLEINEGTRVISTVS------TAIVGMVCT-------------------ASDADA--------------- 39 (396) T ss_pred CCC-CCCCeEEEEcCCCcceeeecC------CceeEEEee-------------------eccCCC--------------- Confidence 321 124566666666554332221 111111110 000000 Q ss_pred ccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcccchh Q lcl|NC_011270. 219 GHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMG 298 (581) Q Consensus 219 ~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~ 298 (581) ...+.. .+..+.....+.. .+.....+.+.+-..++..+.....+... ....... ....+.+.....+..... T Consensus 40 -~~~~l~---~pvlvts~~~~~~-~~g~~~tL~~al~~~~~ngg~~~~v~~~~-~~~~~~~-~~~~a~t~~~~~~~~~~~ 112 (396) T protein:vir:20 40 -ETFPLN---KPVLITNVQSAIS-KAGKKGTLAASLQAIADQSKPVTVVMRVE-DGTGDDE-ETKLAQTVSNIIGTTDEN 112 (396) T ss_pred -ccccCc---cCEEeechHHHHh-hcccccchhhhhhhhhccCceeEEEEecc-ccccccc-cccccccccccccccccc Confidence 000000 0000111111100 01111111111111111111000000000 0000000 000000000000000001 Q ss_pred hHHHHHHHHhcCCce------EEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCcc Q lcl|NC_011270. 299 DYQNALNKFRDEDEI------AIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQR 372 (581) Q Consensus 299 dy~~al~~l~~~~~~------~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r 372 (581) ....++.+|...+.. .++.|+.....+++.+.+||+++. ++++++.+...+ .++.++....++++| T Consensus 113 ~~~tg~~al~~~~~~~~~~p~i~~ap~~~~~~v~~al~~~~~~~~-----~~~~iD~p~~~~---~~~a~~~r~~~~s~~ 184 (396) T protein:vir:20 113 GQYTGLKAMLAAESVTGVKPRILGVPGLDTKEVAVALASVCQKLR-----AFGYISAWGCKT---ISEVKAYRQNFSQRE 184 (396) T ss_pred cccchhhhhhhhccccccchhhhhhhhhccHHHHHHHHHHHhcCC-----cEEEEecCCCCC---HHHHHHHhhCCCCce Confidence 112233333332222 233456667778888899887763 677887765443 344456667899999 Q ss_pred EEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccccCccccccc------CCHHHHHHHHhCCcEE Q lcl|NC_011270. 373 VALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEV------QRDGEKSRESSEGLMV 446 (581) Q Consensus 373 ~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~------~t~~e~~~l~~~Gv~~ 446 (581) +++++|+....+...+......|..++||.+|......++++||.|++|.|+.++... ++..|.+.|+++||++ T Consensus 185 ~~~~~P~~~~~d~~~~~~~~~p~s~~~Ag~~a~~d~~~g~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gi~~ 264 (396) T protein:vir:20 185 LMVIWPDFLAWDTVTSTTATAYATARALGLRAKIDQEQGWHKTLSNVGVNGVTGISASVFWDLQESGTDADLLNESGVTT 264 (396) T ss_pred EEEEcCccccccCcCCcceeechhHHHHHHHHHhhhhcCcEeccCCceeccceecceecccccCCCcchhhhhhhcCcEE Confidence 9999998876654433333334455777777777778889999999999999887654 3567899999999999 Q ss_pred EEEeCCCeEEEEEeeeccCCCcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCcee Q lcl|NC_011270. 447 IEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIR 526 (581) Q Consensus 447 l~~~~~~~v~i~~~itT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~ 526 (581) +. +++++++ ||-+|+.+|++|++|++||++|+|++.|++.+++ |++|||++.+|..|+..++.||++||++|+|. T Consensus 265 ~~--~~~G~~~-wG~rT~s~d~~~~~i~~rR~~~~i~~~~~~~~~~--~v~e~~~~~~~~~i~~~i~~~L~~l~~~G~l~ 339 (396) T protein:vir:20 265 LI--RRDGFRF-WGNRTCSDDPLFLFENYTRTAQVVADTMAEAHMW--AVDKPITATLIRDIVDGINAKFRELKTNGYIV 339 (396) T ss_pred EE--cCCCEEE-EcccccCCCcccceeehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCccee Confidence 94 3556665 5778888899999999999999999999999874 78899999999999999999999999999999 Q ss_pred CCcc----ceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccce-EEEEEeec Q lcl|NC_011270. 527 GYRN----LKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGD-ITSTIEGT 578 (581) Q Consensus 527 ~~~~----~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~-~~~~~~~~ 578 (581) +|+. ..++..++..|+++++|.++|++|+|||.+++++.++-=+ +-..|--- T Consensus 340 g~~v~~d~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~~~~~~~~~~~~ 396 (396) T protein:vir:20 340 DATCWFSEESNDAETLKAGKLYIDYDYTPVPPLENLTLRQRITDKYLANLVTSVNSN 396 (396) T ss_pred ceEEEEecCCCCHHHhhCCEEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHhhcC Confidence 9863 3345567788999999999999999999999988765311 00000000 No 49 >protein:vir:100323 Length: 393 # NCBI annotation: tail sheath protein FI # Family: family:all:115 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655492;genbank:gi:109289960;genbank:GeneID:4157366 Probab=100.00 E-value=4.3e-38 Score=225.43 Aligned_cols=362 Identities=11% Similarity=0.057 Sum_probs=216.7 Q ss_pred ccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccC Q lcl|NC_011270. 168 IKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDP 247 (581) Q Consensus 168 ~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~ 247 (581) |. .....-.++++...+.-...+ ....+..-. ++-..- +. +. .. +. ..+.+..... T Consensus 1 m~-----m~~~~~~GV~v~e~~~g~~~i-----~~~~tav~~-~vgta~-~~--~~---~~--~p-----ln~pv~i~s~ 56 (393) T protein:vir:10 1 MS-----ILDTYLHGVEVVEVNAGGVTI-----STAATSVIG-VVCTGD-QA--DA---ET--FP-----LNTPVLITNP 56 (393) T ss_pred CC-----CCCccCCCeEEEEcCCCccee-----cccCcceeE-EEeecc-Cc--Cc---cc--cc-----CccceEecch Confidence 10 000011222222211110000 000111100 010000 00 00 00 00 0011111111 Q ss_pred cchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcc---------cchhhHHHHHHHHhcCCc------ Q lcl|NC_011270. 248 DDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDT---------VTMGDYQNALNKFRDEDE------ 312 (581) Q Consensus 248 ~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~---------~t~~dy~~al~~l~~~~~------ 312 (581) .+.... .|. ...+..+.+..+.++...++.+.+...... ........++.+|..... T Consensus 57 ~~~~~~-------~g~-~g~L~~al~~~~~~~~~~~~vv~v~~~~~~~~t~~~iig~~~~~~~tgl~al~~~~~~~~~~p 128 (393) T protein:vir:10 57 LNYLEK-------AGS-TGTLRRTLNSIGSIVKTPTVIVRVAESDDSDTLTANIVGTQENGKFTGIKALLTAQSTVFVKP 128 (393) T ss_pred HHHHHh-------hCC-ccchhhhhhhhhcccCceEEEeecccCccccccccccccccccchhhHHHHHHhhhhhcceee Confidence 111111 111 111222333344444444443333211100 001122344555443221 Q ss_pred eEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCce Q lcl|NC_011270. 313 IAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREV 392 (581) Q Consensus 313 ~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~ 392 (581) ..++.|+.+..++.+++.++|++++. +.++..++..+ ..+.+.....+++.|.++++|+....+...+.... T Consensus 129 ~li~apg~~~~~~~~al~~~~~~~~~-----~~~v~d~~~~t---~~~ai~~~~~~~s~~~~~~~P~~~~~d~~~~~~~~ 200 (393) T protein:vir:10 129 KLLCVPQHDNQAVATELLSVAKKLNA-----FAFISDNGATT---KEQAYTYRQNFSQREGMMIFGDWKSYNTDKKAYDT 200 (393) T ss_pred eeeeeccccchHHHHHHHHHhhccCc-----EEEEEcCCCCC---HHHHHHHhhhcCCceEEEEecccccccccCCceeE Confidence 23455777777888889999987742 33444444333 33445566778999999999988765554334344 Q ss_pred ecCHHHHHHHHHHHhhccchhcccccccccCcccccccC------CHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCC Q lcl|NC_011270. 393 VLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQ------RDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPT 466 (581) Q Consensus 393 ~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~------t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~t 466 (581) ..|..++||.+|.......+++||.|++|.|+.++.... ++.|.+.|+++||+++. +++++++ ||-+|+.+ T Consensus 201 ~p~s~~~Ag~~a~~d~~~G~~~spaN~~l~gi~~~~~~~~~~~~~~~~~~~~Ln~~gI~t~~--~~~G~~~-wG~rT~s~ 277 (393) T protein:vir:10 201 DYAVARACALQAYIDKTVGWHKNISNVELDGVTGITKAVEFDINESSTEANYLNEKGITICL--NHNGFRY-WGSRTLAT 277 (393) T ss_pred eehhHHHHHHHHHhhcCCCcEEccCCceeeceeecceecccccCCCcchhHhHhhcCceEEE--cCCCEEE-EcccccCC Confidence 455557788888888788899999999999998877653 47889999999999983 3567765 57778888 Q ss_pred CcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCC--ceeCCcc---ceeEEeecCCC Q lcl|NC_011270. 467 SLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNN--IIRGYRN---LKARQIERQPD 541 (581) Q Consensus 467 d~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~g--aI~~~~~---~~~~~~~~~~~ 541 (581) |++|++|++||++|+|++.|++.+++ |++|||++.+|..|+..++.||+.||+.| +|.+++. ++++..+++.| T Consensus 278 d~~~~~i~vrR~~~~i~~~i~~~~~~--~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~~al~g~~v~~~~~nt~~~i~~G 355 (393) T protein:vir:10 278 DTRWAFQQSVRTAQIIKETIGAGLAW--AVDMPLTPLRVKTMLEAINNKLRSWASGDDPRILGARVWVAEEITADIIKSG 355 (393) T ss_pred CcccceeehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhccccccccceEEecCCCCHHHhhCC Confidence 99999999999999999999999875 67799999999999999999999999855 8999863 34555668889 Q ss_pred EEEEEEEEEecCceeEEEEEEEEEecc-----ceEEEE Q lcl|NC_011270. 542 VIEVRYEWRPAYPLNYIVVRYSIAPET-----GDITST 574 (581) Q Consensus 542 ~~~v~i~v~pv~~~e~I~~~~~~~~~t-----g~~~~~ 574 (581) +++++|.++|++|+|||.+++++.++- ++|-+- T Consensus 356 ~~~~~i~~~p~~p~e~I~~~~~~~~~~~~~l~~~v~a~ 393 (393) T protein:vir:10 356 KFVIKYDYHWIPSLESLGLEQRVNDEYVVDLVNTLKAL 393 (393) T ss_pred EEEEEEEEEecCCcceEEEEEEEchHHHHHHHHHHhcC Confidence 999999999999999999999886542 111111 No 50 >protein:vir:107865 Length: 477 # NCBI annotation: gp39 # Family: family:all:115 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024712;genbank:gi:48696949;genbank:GeneID:2845953 Probab=100.00 E-value=1.7e-37 Score=222.10 Aligned_cols=422 Identities=11% Similarity=0.072 Sum_probs=215.1 Q ss_pred cCCCCcceEEEEcCCCceEEEEecCCccccccccceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccc Q lcl|NC_011270. 103 LPNVEDDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQ 182 (581) Q Consensus 103 l~~i~~~~V~~~~~~g~~w~Vtf~g~~~~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~ 182 (581) |++ .+. |-|.+.+...|..++...... + ....+.....+.+ T Consensus 1 M~~--------------------------------~~~----pGVyv~E~~~~~~~i~~v~T~--v-~~~VG~a~~gp~n 41 (477) T protein:vir:10 1 MAA--------------------------------NYL----HGVETIEKETGSRPVKVVKSA--V-IGLIGTAPIGPVN 41 (477) T ss_pred Ccc--------------------------------cCC----CCeEEEEccCCcccccccCCc--e-eEEEecccCCCCC Confidence 110 011 113333333333222111000 0 0000000000000 Q ss_pred eeeEec---cceeEEeeccccc---------ccCcceeeeeeeeeeecccccccc----------eeEEEEeecCCcccc Q lcl|NC_011270. 183 VYVLGT---DYVVTRVNAGEDG---------EANTRDDLYTIQRVVDGGHIDPGD----------IVQLSYRYTDPNYHE 240 (581) Q Consensus 183 ~~vtgt---d~~v~~v~~~~dg---------~~~~~~~~~ti~~~vd~~~~d~~~----------~~~~s~~~~~~~~~e 240 (581) ..+.-+ ++..+.. ...++ ..++.-.+..++. .+........ ............... T Consensus 42 ~pv~its~~d~~~~g~-~~~~~tL~~Av~~~f~nGg~~~~vVrV-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 119 (477) T protein:vir:10 42 TPVQSLSDVDAAQFGP-QLAGFTIPQALDAVYDYGSGTVIVINV-LDPAVHKSNAANEPVTFDAATGRAKLAHPAAANLV 119 (477) T ss_pred cCEEEccHHHHHHhcc-CCCCCcHHHHHHHHHhccceEEEEEec-Cccccccccccccccccccccceeccccccccccc Confidence 000000 0000000 00000 0011111111110 0000000000 000000000000000 Q ss_pred eeEeccCcchhhhhhhhhhhhccccccce--------eeeeeeecCCc-ceeEEeeeccCCcccchhhHHHHHHHHhcCC Q lcl|NC_011270. 241 VIRFTDPDDIQDFYGPAFDEAGNVQSEIT--------LCAQLAITNGA-STILACAVDPEGDTVTMGDYQNALNKFRDED 311 (581) Q Consensus 241 ~~~~~d~~~~~~~~~~a~~~~g~~~~~i~--------~~~~~~~~~g~-~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~ 311 (581) .................+.........+. .........+. .........+ ....+....++++|+..+ T Consensus 120 v~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g---~~~~~~~~tGl~al~~~~ 196 (477) T protein:vir:10 120 LKNDSGGTTYAEGTDYAVDLINGVITRIKTGTIPPGATAAKATYDYADPTKVTAADIIG---AVNAAGMRTGMKALKDTY 196 (477) T ss_pred ccccccccccccchhhhhhhccccceecccccccccceeeeeccccccccccccccccc---cccccchhhhhhhhhhhh Confidence 00000000000000000000000000000 00000010000 0000000000 011112223444443322 Q ss_pred c------eEEEEeC-CCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHH----HhhccCCccEEEEEcCe Q lcl|NC_011270. 312 E------IAIIVAG-TGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIA----NAQSIKDQRVALISPSS 380 (581) Q Consensus 312 ~------~~iv~~~-t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~----~a~~~ns~r~~~v~~~~ 380 (581) . ..+..|+ +.+..+++.|.+||++++ ++++++.+............. ....++++|+.+++|+. T Consensus 197 ~~~~~~~~~l~apg~~~~~~v~~~l~~~~~~~~-----~~~~~d~p~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~p~~ 271 (477) T protein:vir:10 197 NLYGYFSKILIAPAYCTQNSVSVELEAMAVQLG-----AIAYIDAPIGTTLAQALAGRGPAGTINFNTSSDRVRLCYPHV 271 (477) T ss_pred hhcchhcccccccccccchhhHHHHHHHHhhCC-----EEEEEecCCCCCHHHHHhhhhhccccccccccceEEEEcCeE Confidence 1 2233444 456678999999998763 688888765544333322221 23357899999999987 Q ss_pred eEecccccCCceecCHHHHHHHHHHHhhccchhcccccccccCcccccccC------CHHHHHHHHhCCcEEEEEeCCCe Q lcl|NC_011270. 381 FVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQ------RDGEKSRESSEGLMVIEKTPRNL 454 (581) Q Consensus 381 ~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~------t~~e~~~l~~~Gv~~l~~~~~~~ 454 (581) ...+...+......|+.++||.+|....+..+++||.|+++.|+.++...+ ++.|.+.|+++||++++...+++ T Consensus 272 ~~~d~~~~~~~~~p~s~~~ag~~a~~d~~~g~~~span~~~~gi~~~~~~~~~~~~~~~~~~~~L~~~gi~~i~~~~~~G 351 (477) T protein:vir:10 272 KVYDTATNAERLEPLSSRAAGLRARVDLDKGYWWSSSNQQLVGVTGVERPLSAMIDDPQSDVNMLNEQGITTVFSSYGSG 351 (477) T ss_pred EEecccCCceeEEchHHHHHHHHHHhhhcCCceeccCCceeccccccccccccccCCChhhHHHHhhCCceEEEEecCCc Confidence 766554443333344557777777777778899999999999998886654 45789999999999998888888 Q ss_pred EEEEEeeecc---CCCcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc-- Q lcl|NC_011270. 455 VHVRHGVTTD---PTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR-- 529 (581) Q Consensus 455 v~i~~~itT~---~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~-- 529 (581) ++++ |-+|+ ..++.|+++++||++|+|++.|++.+++ |++|||++.+|..|+..|++||+.||++|+|.+|+ T Consensus 352 ~~~w-G~rT~~~~~~~~~~~~~~vrR~~~~i~~~~~~~~~~--~v~~~~~~~~~~~i~~~i~~~l~~l~~~g~l~g~~v~ 428 (477) T protein:vir:10 352 LRLW-GNRTAAWPTVTHMRNFENVRRTGDVINESLRYFSQQ--FVDAPIDQGLIDSLVESVNGFGRKLIGDGALLGFKAW 428 (477) T ss_pred EEEE-cccccCCCCCCcccceeehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEE Confidence 8765 55565 3467899999999999999999999875 67799999999999999999999999999999985 Q ss_pred --cceeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEeccceEEEEEeec Q lcl|NC_011270. 530 --NLKARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGT 578 (581) Q Consensus 530 --~~~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~~~~~~~ 578 (581) ...++..++..++++++|.++|++|+|||.+++++.++- +..--.|. T Consensus 429 ~~~~~nt~~~i~~G~~~~~i~~~p~~p~e~i~~~~~~~~~~--~~~~~~g~ 477 (477) T protein:vir:10 429 FDPARNPKEELAAGHLLINYKYTVPPPLERLTYETEITSEY--LLTLKGGN 477 (477) T ss_pred EecCCCCHHHhhCCeEEEEEEEEecCCcceEEEEEEEcchH--HhhhhcCC Confidence 234566678899999999999999999999999886643 21111222 No 51 >protein:vir:10336 Length: 386 # NCBI annotation: ORF39 # Family: family:all:115 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758931;genbank:gi:27311206;genbank:GeneID:956116 Probab=100.00 E-value=4.5e-38 Score=225.34 Aligned_cols=358 Identities=11% Similarity=0.022 Sum_probs=206.0 Q ss_pred ccceeeeeccccccceeeEeccceeEEeecccc--cccCcceeeeeeeeeee-cccccccceeEEEEeecCCcccceeEe Q lcl|NC_011270. 168 IKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGED--GEANTRDDLYTIQRVVD-GGHIDPGDIVQLSYRYTDPNYHEVIRF 244 (581) Q Consensus 168 ~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~d--g~~~~~~~~~ti~~~vd-~~~~d~~~~~~~s~~~~~~~~~e~~~~ 244 (581) |.. -...++++... ..+.. ....+... ..+....+ +....+. ...+.. T Consensus 1 M~~-------~~~~Gv~v~ev-------~~~~~~i~~v~tav~-~~vg~a~~a~~~~~~~--------------~~pv~i 51 (386) T protein:vir:10 1 MAE-------QYLHGAEVVEI-------DNGARPIRTAQSGVI-GLVGTAPDADATAFPL--------------NTPVLI 51 (386) T ss_pred Ccc-------ccCCCeEEEEc-------CCCcccccccCccee-EEEEecCCCCCccccc--------------ccceEe Confidence 110 01112222111 11100 00111110 11100000 0000010 011111 Q ss_pred ccCcchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCC-----------cccchhhHHHHHHHHhcCCce Q lcl|NC_011270. 245 TDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEG-----------DTVTMGDYQNALNKFRDEDEI 313 (581) Q Consensus 245 ~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~-----------~~~t~~dy~~al~~l~~~~~~ 313 (581) ....+....+ +. ...+..+.+..+.++...++........ ..........++.+|...... T Consensus 52 ~s~~~~~~~~-------g~-~~tl~~a~~~~~~~gg~~~~vv~~~~~~~~~~t~~~~ig~~~~~t~~~tgl~~l~~~~~~ 123 (386) T protein:vir:10 52 AGSRREAAKL-------GA-GGTLPQAIDGIFDQTGAVVVVIRVDEGVDSAATQSNVIGKVDADTEQYTGILALLSAENT 123 (386) T ss_pred cchHHHHhhc-------CC-CcchhHHHHHHhccCceeEEEeeccccccccccchhhhcccccccchhhhhHHhhhhccc Confidence 1111111100 00 0011122233333333333322221110 011112334556655544333 Q ss_pred EEEEeCCCcH---HHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCC Q lcl|NC_011270. 314 AIIVAGTGAQ---PIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNR 390 (581) Q Consensus 314 ~iv~~~t~~~---~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~ 390 (581) ..+.|..... ..+..+.+++..+.+. .+++..... . ....+........+++.|..+++|+..+.+...+.. T Consensus 124 ~~~~p~i~~ap~~~~~~~v~~~l~~~~~~---~~~~~~~~~-~-~~~~~~a~~~~~~~~s~~~~~~~p~~~v~~~~~~~~ 198 (386) T protein:vir:10 124 VKVQPRILIAPGFSNQKAVADQLVSVADT---AAWLCHSGW-S-NTTDAAAITYRELFGSRRCEVVDPWYKVWDVETSAH 198 (386) T ss_pred ccccccccccccccchhHHHHHHHHhhcc---eEEEEEeCC-C-CCchHHHHHhhhcccccceEEecCceeeeccccccc Confidence 2232321110 1122334444444321 122221221 1 122334456677899999999999877766554444 Q ss_pred ceecCHHHHHHHHHHHhhccchhcccccccccCcccccccC------CHHHHHHHHhCCcEEEEEeCCCeEEEEEeeecc Q lcl|NC_011270. 391 EVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQ------RDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTD 464 (581) Q Consensus 391 ~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~------t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~ 464 (581) ....|..++||.+|.+.....+++||.|+++.|+.++...+ ++.|.+.|+++|++++. +++++++ ||.+|+ T Consensus 199 ~~~p~s~~~ag~~a~~D~~~G~~~spaN~~l~gv~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~--~~~G~~~-wG~rT~ 275 (386) T protein:vir:10 199 IIQPPSARHAGVMAKVHNTLGFWWSNSNQEILGIDGLCRPVDFKLDDPTCRANLLNAKEVTTTI--QQNGFRV-WGDRTC 275 (386) T ss_pred eeechHHHHHHHHHHhhhcCCcEEccCCceeecccccceecccccccCcchhhhhhhcCcEEEE--cCCCEEE-Eccccc Confidence 44444557788888888888899999999999998887544 56789999999999884 4566765 677888 Q ss_pred CCCcccceEEeehhhHHHHHHHHHHHhhhcCCCccCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----cceeEEeecCC Q lcl|NC_011270. 465 PTSLHTREWNIIGQQDVMVYRIRDYLDADGLIGMPIYDTTIVQVKASAEAALVWLVDNNIIRGYR----NLKARQIERQP 540 (581) Q Consensus 465 ~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~~~~~~~~~~~ 540 (581) .+|+.|++|++||++|+|++.|++.+++ |++|||++.+|.+|+..+++||..||++|+|.+|+ .+.++..+.+. T Consensus 276 ~~d~~~~~i~vrR~~~~i~~~~~~~~~~--~v~e~~~~~~~~~i~~~i~~~L~~l~~~g~l~g~~v~~d~~~nt~~~~~~ 353 (386) T protein:vir:10 276 SADSKWAFKNVVITNDMIADSLVRNHLW--AVDRNITKTYVEDVTEGVNNYLRHLKNIGAIAGGECWVDPELNSPDQIQQ 353 (386) T ss_pred CCCcccceeehhhHHHHHHHHHHHHHHH--hccCCCCHHHHHHHHHHHHHHHHHHHhCCceeeeEEEEcccCCCHHHhhC Confidence 8899999999999999999999999874 77899999999999999999999999999999975 33456667889 Q ss_pred CEEEEEEEEEecCceeEEEEEEEEEeccceEEEEEeecccC Q lcl|NC_011270. 541 DVIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 541 ~~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~~~~~~~~~~ 581 (581) |+++++|.++|++|+|||.+++++.++- -+.| T Consensus 354 G~~~~~i~~~p~~p~e~i~~~~~~~~~~---------~~~~ 385 (386) T protein:vir:10 354 GKVYFDYDFSAYAPAEHITFRSHMVNGY---------LTEV 385 (386) T ss_pred CeEEEEEEEEecCCceeEEEEEEEehhH---------HHhh Confidence 9999999999999999999999776543 2233 No 52 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.65 E-value=5.7e-19 Score=120.59 Aligned_cols=148 Identities=27% Similarity=0.369 Sum_probs=81.3 Q ss_pred Cee--ccccccCCCcccccCc---cccccccc-------------------------ccCceeeEE-EecCCCCceeeee Q lcl|NC_011270. 1 MAI--DFSQYQTPGVYTEAVG---APQLGIRS-------------------------SVPTAVAIF-GTAVGYQTYRESI 49 (581) Q Consensus 1 ~~~--~~~~~~~~~~~~~~~g---~~~~~~~~-------------------------~~~~~~~~~-~~~~g~~~~~~~~ 49 (581) +++ -+...-|.+.....+| .-+++.+. +..++.... ....++...+..- T Consensus 218 ~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a 297 (397) T protein:vir:23 218 LGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNA 297 (397) T ss_pred eeeeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccc Confidence 111 1111111100000000 00000000 000000000 0011111111110 Q ss_pred EEcCcCCceeeEEEEEEeccccceeEEEEeCceeccccccCCCHHHHHHHHHhcCC-CCcceEEEEcCCCceEEEEecCC Q lcl|NC_011270. 50 RINPDTGETITTQILALVGEPTGGSFKLSLAGEPTGNIPFNATQGQVQSALRALPN-VEDDEVTVLGDPGGPWTVTFTKA 128 (581) Q Consensus 50 ~~~~~~~~~~evq~v~~~~~~~~GtF~l~~~g~~T~~i~~~asa~~v~~aLe~l~~-i~~~~V~~~~~~g~~w~Vtf~g~ 128 (581) ++.......+.|.+...+.+.+|+|+|+|+|++|.+|+||||+++||.||++|++ ++..+|+|++. |++|+|+|.|+ T Consensus 298 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 375 (397) T protein:vir:23 298 -FVKLTFDPVLTTYALDLDGASAGNFTLSLDGKTSANIAYNASTATVKSAIVAIDDGVSADDVTVTGS-AGDYTITVPGT 375 (397) T ss_pred -eEEEeeccccceeeecccccCcceEEEEecCccccCcccccchhhhHHHhhhcccccccceeeeecC-CceeEEEeccc Confidence 0111234456677766677899999999999999999999999999999999987 89999999985 78999999988 Q ss_pred ccccccccceeccCCCceEEEEEcccc Q lcl|NC_011270. 129 VAALTKDVTGLTGGDDPDLNIASEQTG 155 (581) Q Consensus 129 ~~~l~~~~~~l~~g~~~~v~v~~~~~g 155 (581) ++. ....|+++..+.++|.. .| T Consensus 376 ~~~---~~~~~~~~~~~~~~~~~--~~ 397 (397) T protein:vir:23 376 LTA---DFSGLTDGEGASISVVS--VG 397 (397) T ss_pred ccc---CccccccCccccceeee--cC Confidence 764 34446666666555543 33 No 53 >protein:vir:103168 Length: 641 # NCBI annotation: gp122 # Family: family:all:661 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717789;genbank:gi:113200626;genbank:GeneID:4239177 Probab=99.54 E-value=1.8e-14 Score=95.92 Aligned_cols=435 Identities=13% Similarity=0.062 Sum_probs=145.5 Q ss_pred Ce------eccccccCCCc------ccccCcccccccccccCceeeEEEecCCC-----------Ccee---eeeEEcCc Q lcl|NC_011270. 1 MA------IDFSQYQTPGV------YTEAVGAPQLGIRSSVPTAVAIFGTAVGY-----------QTYR---ESIRINPD 54 (581) Q Consensus 1 ~~------~~~~~~~~~~~------~~~~~g~~~~~~~~~~~~~~~~~~~~~g~-----------~~~~---~~~~~~~~ 54 (581) .+ .+|... .... ..+.+-..|- +...+..+..+- ..+. ........ T Consensus 99 ~~~~~~~~~~~~~~-~~~~~~~~~~~~A~~pG~~g-------n~l~v~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 170 (641) T protein:vir:10 99 TAPLIKNLQEYETT-YESSNSNTFKFASRDAGALG-------NSVGIFITDAGPDQIAVLPAPGTGNEWEFVADEAVTAA 170 (641) T ss_pred chhhcccccccccc-ccCcCccccEEEeccCCCcC-------CceEEEEEcCCCcceeeeecccccccceeccceeeeec Confidence 11 111110 0000 0111111111 111111111110 0000 00000000 Q ss_pred CCceeeEEEEEE--eccccceeEE------EEeCcee--ccccccCCCHHHHHHHHHhcCCCCcceEEEEcCCCceE-EE Q lcl|NC_011270. 55 TGETITTQILAL--VGEPTGGSFK------LSLAGEP--TGNIPFNATQGQVQSALRALPNVEDDEVTVLGDPGGPW-TV 123 (581) Q Consensus 55 ~~~~~evq~v~~--~~~~~~GtF~------l~~~g~~--T~~i~~~asa~~v~~aLe~l~~i~~~~V~~~~~~g~~w-~V 123 (581) .+..-++..... ......+.|. +++.|.- ...+.+++....++-++...-..+...+......+..| ++ T Consensus 171 ~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~~~~~~~~~~a~~~~~~i~~~~~g~~g~~~~~~~~t~gt~~~t~ 250 (641) T protein:vir:10 171 SGASAKVYRYSIRLTLTNVVGTFVPGGATTISISGSDESVDVLAWDAGNKYLEIALPAGGVTGIFADAQVVTQGTNTAAI 250 (641) T ss_pred cCcccccccccccccccceeeecccCCcceeEecccccccccccccCCcceeeeeecCCcceeeeeeeeeccCCccceee Confidence 011111110000 0000111111 1111111 11112221111111111110000100100001111111 11 Q ss_pred EecCCccccc-cccc---------eeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEeccceeE Q lcl|NC_011270. 124 TFTKAVAALT-KDVT---------GLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVT 193 (581) Q Consensus 124 tf~g~~~~l~-~~~~---------~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~ 193 (581) .-.|....+. .+.. ..+.+....+.......- ++. +....++........ .++..+.. T Consensus 251 a~~g~~~~~~~~~~~~~ia~aat~ag~~g~~~~v~~~~v~d~---~~a-------~~~~~g~~~~~va~~--~gts~~a~ 318 (641) T protein:vir:10 251 ASSGIERRLYIGKDSGSINFAATDAVVDTNATSATISSVRNE---YAE-------REYLPGSKWVNVAAR--PGTSLYAN 318 (641) T ss_pred ecccchhhhhhccccccceeeeecccccccceeeEeeeeeee---ecc-------ccccccccccccccc--chhhhhhh Confidence 0001000000 0000 000011111111000000 000 000000000000000 00000000 Q ss_pred EeecccccccCcceeeeeeeeeeecc-----------------cccccce------------eEEEEeecCCcccceeE- Q lcl|NC_011270. 194 RVNAGEDGEANTRDDLYTIQRVVDGG-----------------HIDPGDI------------VQLSYRYTDPNYHEVIR- 243 (581) Q Consensus 194 ~v~~~~dg~~~~~~~~~ti~~~vd~~-----------------~~d~~~~------------~~~s~~~~~~~~~e~~~- 243 (581) ..+ ...+.+..+-...++. ..+..+. ..+.|-+..-...+.+. T Consensus 319 -----~~g--~~~D~~~~lv~d~~~~~~g~~g~v~e~~~~~s~~~~~~~~~~~~~~~~~~~~~~s~~v~~~~~~~~~~~~ 391 (641) T protein:vir:10 319 -----SVG--GVNDELHVLVIDVDGKITGNPGSVLERFIGVSKASDAKTSIGEVNYYKEVIKQQSAYVYWGSHETAPFLG 391 (641) T ss_pred -----hcC--CcccceEEEEEeecceeeccccceeeeeecccccCCcccccccceeeeeeeccccceEEEeccccccccc Confidence 000 0000000000000000 0000000 00000000000000000 Q ss_pred ----eccCcchhhhhhhhhhh-hccccccceeeeeeeecCCcc--e--eEEeeeccC----CcccchhhHHHHHHHHhcC Q lcl|NC_011270. 244 ----FTDPDDIQDFYGPAFDE-AGNVQSEITLCAQLAITNGAS--T--ILACAVDPE----GDTVTMGDYQNALNKFRDE 310 (581) Q Consensus 244 ----~~d~~~~~~~~~~a~~~-~g~~~~~i~~~~~~~~~~g~~--~--~~~~~~~~~----~~~~t~~dy~~al~~l~~~ 310 (581) ..+.+............ ...............+..+.. . .+.++.+.. .......+...+|++|++. T Consensus 392 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~gG~d~~~~~~~~~~~~~~~~tg~~~~~~~ 471 (641) T protein:vir:10 392 TAANAAAGDWGASALNRRYNLLRSTAGTTSFPAGAVTVGSKNNATHYYRLANGADYSASGALYNLSNVDIATAYELIEDP 471 (641) T ss_pred ccccccccccccccccccccccccccccccccccccccCCCCcceeEEEeecCcccccccccccccchhHHHHHHHhhhh Confidence 00000000000000000 000000000000000000000 0 011111110 1122345778899998876 Q ss_pred CceE---EEEeC-----CCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCC------chhHHHHHHHh-hccCCccEEE Q lcl|NC_011270. 311 DEIA---IIVAG-----TGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVT------PVPSATRIANA-QSIKDQRVAL 375 (581) Q Consensus 311 ~~~~---iv~~~-----t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~------~~~~~~~~~~a-~~~ns~r~~~ 375 (581) +... ++++. .+..+++..+.+||+++++ |+++++.+.... ....+..+... ..+++.+.++ T Consensus 472 e~~~i~~l~~~~~~~~~~~~~~v~~~~i~~ce~~~d----~~ailD~p~~~~~~~~~~~~~~~~~~~~~~~~~~s~yaa~ 547 (641) T protein:vir:10 472 ESQVIDYVLSGPAGADEAAAIAKATTITTIVESRKD----CMAFLSPLRSDVIGVSNTTTVTENLVNYFNQLPSSNYVVF 547 (641) T ss_pred hhhccceeeecCCCCCcchhHHHHHHHHHHHHhcCC----EEEEEcCCcccccCCCchhhHHHHHHHHHhhcCCCceEEE Confidence 6433 33332 2345688889999988854 999998654321 11122223332 2357888999 Q ss_pred EEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhcccccc---cccCcccccccCCHHHHHHHHhCCcEEEEEeCC Q lcl|NC_011270. 376 ISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRK---VIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPR 452 (581) Q Consensus 376 v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~---~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~ 452 (581) ++|+.++.+...+.....+|..++|+.+|.....+.+|++|.|. .|.|+.+++.++++.|++.|+++||+||+.+++ T Consensus 548 y~P~~~v~dp~~~~~~~vPpsG~iAGv~ArtD~~rGvwkAPAn~~~~~i~gv~~l~~~~~~~e~~~Lnp~gIN~ir~fpg 627 (641) T protein:vir:10 548 DSGYKYIYDKYNDVYRYVPCNGDIAGLILETGLEEEPWFSPAGFQRGVLRNAVKLAYSPNKTQRDRLYANRINPVVSFPG 627 (641) T ss_pred EeceeEeecccCCceeEecCCHHHHHHHHhhhccCCceECcCCcccceeeeeeeeeEecChhHHhhhhhcccceEEecCC Confidence 99988887766555555666778899999999999999999997 589999999999999999999999999999998 Q ss_pred CeEEEEEeeeccCCCc Q lcl|NC_011270. 453 NLVHVRHGVTTDPTSL 468 (581) Q Consensus 453 ~~v~i~~~itT~~td~ 468 (581) +++ +..+=.+.+.. T Consensus 628 ~G~--v~~~~~~~~~~ 641 (641) T protein:vir:10 628 HAM--INNNIAFHTKL 641 (641) T ss_pred cee--ecceeeeeecC Confidence 774 33433333211 No 54 >protein:vir:101326 Length: 529 # NCBI annotation: gp22 # Family: family:all:9453 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006525;genbank:gi:46401679;genbank:GeneID:2777386 Probab=99.49 E-value=4.6e-13 Score=88.21 Aligned_cols=466 Identities=11% Similarity=0.050 Sum_probs=223.4 Q ss_pred ceeEEEEe-----CceeccccccCCCHHHHHHHHHhcCCCCcc-----eEEEEcCCCceEEEE------ecCCccccccc Q lcl|NC_011270. 72 GGSFKLSL-----AGEPTGNIPFNATQGQVQSALRALPNVEDD-----EVTVLGDPGGPWTVT------FTKAVAALTKD 135 (581) Q Consensus 72 ~GtF~l~~-----~g~~T~~i~~~asa~~v~~aLe~l~~i~~~-----~V~~~~~~g~~w~Vt------f~g~~~~l~~~ 135 (581) -..|.+++ .|---.||+.|++-. ...+++.- -+-.-|.-+-..+|| .+|....+... T Consensus 1 ~~~ysi~q~ig~aSGvav~pi~~d~t~~-------~~~g~g~~v~a~Vgif~RG~i~k~~~Vt~~n~~~~LGep~~~~~g 73 (529) T protein:vir:10 1 MSQYSIQQSLGNASGVAVSPINADATLS-------TGVALNSSLWAGIGVFARGKPFTVLAVTESNYEDVLGEPLKPSSG 73 (529) T ss_pred CCceehhhhhhhhcccccCCcCcccccc-------hheecCceEEEEEEEeecCCCcceEEEchhHHHHHhccccCCCcc Confidence 00122111 112223444444322 11111111 111112234455565 12332222211 Q ss_pred ---------cceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceee-EeccceeEEeeccccccc-- Q lcl|NC_011270. 136 ---------VTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYV-LGTDYVVTRVNAGEDGEA-- 203 (581) Q Consensus 136 ---------~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~v-tgtd~~v~~v~~~~dg~~-- 203 (581) ...+-+|....|++...-.-.|-+..... ... .......+ .+... .|..+..+ . +|++. T Consensus 74 a~~E~~~h~~eA~~~~s~yVVRvv~~dak~p~i~~~~~---~~~-~~s~~~~s-~~~~l~~G~~~~iy-~---~Dgd~~~ 144 (529) T protein:vir:10 74 SQFEPIRHVYEAIQQTSGYVVRAVPDDAKFPIIMFDES---GEP-AYSALPYG-SEIELDSGEAFAIY-V---DDGDPCI 144 (529) T ss_pred hhhhhHhhhhhhhcCCceEEEEEcccccCCceEEecCC---ccc-hhhccccc-ccccccccceEEEE-E---ecCcCcc Confidence 11122333333443332222221221110 000 00000000 00000 01111111 1 11111 Q ss_pred CcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCc--chhhhhh------hhhhh-hc----ccccccee Q lcl|NC_011270. 204 NTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPD--DIQDFYG------PAFDE-AG----NVQSEITL 270 (581) Q Consensus 204 ~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~--~~~~~~~------~a~~~-~g----~~~~~i~~ 270 (581) +.... .+|+..--+...++.....+..+..-.......+|+..- +..|.++ ..+.+ ++ -..+++.. T Consensus 145 s~~~~-l~i~~~~ads~g~e~~~l~~~~~~~~g~~~~let~~~sl~~~a~dd~G~~~yl~svle~~s~~l~ai~~~e~~~ 223 (529) T protein:vir:10 145 SPTRE-LTIETATADSAGNERFLLKLTQTTSLGVVTTLETHTVSLAEEAKDDMGRLCYLPTALEARSKYLRAVVNEELIS 223 (529) T ss_pred CCceE-EEEEeeccccCCCccceeeEEEEeecCCceEEEEEEeeeeechhhhcCCccchhHHHhhccCceeeeeeecccc Confidence 11111 111111111111111111111111100100011111111 1111111 11111 00 00011111 Q ss_pred eeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcCC--ceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEe Q lcl|NC_011270. 271 CAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDED--EIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILG 348 (581) Q Consensus 271 ~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~~--~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg 348 (581) .+. +..-..--+++++++.....+.++|..|+.+|++.+ +..++-.+.-+.++-++|...|.+. .+.++.. T Consensus 224 t~~--~~t~~d~~f~~GtdG~~~~i~s~~y~~A~~~L~n~p~d~~~il~~g~y~~a~I~~L~~ic~~~-----~~d~f~D 296 (529) T protein:vir:10 224 TAK--VTNKKSLAFTGGTNGDQSKISTAAYLRAVKVLNNAPYMYTAVLGLGCYDNAAITALGKICADR-----LIDGFFD 296 (529) T ss_pred ccc--hhhhhhhhccCCccccccccchHHHHHHHHHhcCCcceeeeeeccCCccHHHHHHHHHHHhhh-----hhcEEEc Confidence 111 100011123334455555568889999999997765 3434444554666655566666332 2456678 Q ss_pred cCCCCCchhHHHHHHHhhccCCccE--EEEEcCeeEecccccCCceecCHHHHHHHHHHHhh------ccchhccccccc Q lcl|NC_011270. 349 MDGSVTPVPSATRIANAQSIKDQRV--ALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSV------SAIAAMPLTRKV 420 (581) Q Consensus 349 ~~~~~~~~~~~~~~~~a~~~ns~r~--~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a------~~~~~~slt~~~ 420 (581) +.+..++.....+...+..+....+ .+++..+ ..+.++....+.++.+-. |++|..-+ =...|.+|.++. T Consensus 297 V~~~LT~~aA~~~~e~~gl~~~~~~~~s~y~~P~-~~~D~~tg~k~~~GlsG~-A~~akargv~~na~v~g~hY~pAGe~ 374 (529) T protein:vir:10 297 VKPTLTYAEALPAVEDTGLLGTDYVSCSVYHYPF-SCKDKWTQSRVVFGLSGV-AYAAKARGVKKNSDVGGWHYSPAGEE 374 (529) T ss_pred CCCCcCHHHHHHHHHhcCccccCceeeEEEEcce-eeccccccCceeeCCCcc-eeeccccceeecccccccccccCCCc Confidence 8888887776666554444444544 3333333 345556666666553321 33332221 112366777763 Q ss_pred ccCc--ccccccCCHH--HHHHHHhCCcEEEEEeCCCeEEEEEeeeccCCCcccceEEeehhhHHHHHHHHHHHhhhcCC Q lcl|NC_011270. 421 IRGF--SGPAEVQRDG--EKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLI 496 (581) Q Consensus 421 l~g~--~~~~~~~t~~--e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fi 496 (581) =.-+ .++++.++.. |.+.|.++.++++-...+++..|...+|...+++-||.+++++++++|.+.+-+.-++..| T Consensus 375 r~~inr~~I~~ly~~d~~e~~~lv~~riNPV~~~~~g~~~idDsLt~~~knny~R~~hv~~lmn~I~~~~~k~a~~~~~- 453 (529) T protein:vir:10 375 RAVIARASIQPLYPEDTPDEEAMVKGRLNKVSVGTSGQMIIDDALTCCTQDNYLHFQHVPSLMNAISRFFVQLARQMKH- 453 (529) T ss_pred cceeecccceeccCCCccCHHHHHhhccCeeeeeccCcceeeeeeceeeeCCchhhhhHHHHHHHHHHHHHHHHHHHhh- Confidence 2211 3456666654 5667999999999887777777888899999999999999999999999999888777666 Q ss_pred CccCCHHHHHHHHHHHHHHHHHHHhCCceeCCccc--------eeEEeecCCCEEEEEEEEEecCceeEEEEEEEEEe Q lcl|NC_011270. 497 GMPIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL--------KARQIERQPDVIEVRYEWRPAYPLNYIVVRYSIAP 566 (581) Q Consensus 497 G~~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~--------~~~~~~~~~~~~~v~i~v~pv~~~e~I~~~~~~~~ 566 (581) +|+....|. ++..+..+|+.+++.|+|...++. ..+...-+.|+..|++.+.|.-....|++.=.+=- T Consensus 454 -~Pd~it~~g-l~~~l~~~L~r~~asgalv~prdp~~~G~epy~~~V~q~d~D~~~v~~~~~ptGv~Rri~~~p~l~~ 529 (529) T protein:vir:10 454 -SPDGITAAG-LTKGMTKLLDRFVASGALVAPRDPDADGTEPYVLKVTQAEFDKWEVVWACCPTGVARRIQGVPLLIK 529 (529) T ss_pred -CCChHHHHH-HHHhHHHHHHHHHhcCceecccCccCCCCCceEEEEeecccCeEEEEEEeecCCceeeEEeeeeecC Confidence 999998888 999999999999999999987653 23444567789999999999999999887543322 No 55 >protein:vir:3788 Length: 376 # NCBI annotation: tail sheath # Family: family:all:669 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536828;genbank:gi:17981837;genbank:GeneID:929216 Probab=99.33 E-value=2.3e-11 Score=78.94 Aligned_cols=332 Identities=12% Similarity=0.042 Sum_probs=179.3 Q ss_pred eEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEE----EEe-ecCCcccceeEeccCcchhhhhhhhhh Q lcl|NC_011270. 185 VLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQL----SYR-YTDPNYHEVIRFTDPDDIQDFYGPAFD 259 (581) Q Consensus 185 vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~----s~~-~~~~~~~e~~~~~d~~~~~~~~~~a~~ 259 (581) .++ ++.++..+.-.+.+..+ -+- ......+.........|+-..++.+ T Consensus 1 ~~~-------------------------~v~vn~~n~~~g~~~~~er~~Lfig~~~~~~~~~~~~~~~sdld~~lg~~-- 53 (376) T protein:vir:37 1 MFP-------------------------SVQINALNQLSGETKEIERHALFVGVGTTNQGKLLALTPDSDFDKVFGET-- 53 (376) T ss_pred CCC-------------------------eEEEecccccCCCcccccceEEeeccccccccceeeecCccchHhhhCCC-- Confidence 000 01111111111110000 000 1111222222223333332222211 Q ss_pred hhccccccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHH-hcCC--ceEEEEeC-CCcHHHHHHHHHHHHH Q lcl|NC_011270. 260 EAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKF-RDED--EIAIIVAG-TGAQPIQALVQQHVSA 335 (581) Q Consensus 260 ~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l-~~~~--~~~iv~~~-t~~~~i~~~l~~~v~~ 335 (581) .+++..-+.-+..|+... +++...+ .+.++.+|.+|++.. +... ++.++-|- ++.+.+.+ +++..+. T Consensus 54 -----~~~lk~~v~aa~~naG~~-~~~~~~~--~~~~~~~~~~Av~~a~~~~s~E~V~v~~pv~t~~a~i~a-a~~~a~e 124 (376) T protein:vir:37 54 -----DTDLKKQVRAAMLNAGQN-WFAHVYI--AQEDGYDFVECVKKANQTASFEYCVNTRYLGVDKASIGK-LQECYAE 124 (376) T ss_pred -----chHHHHHHHHHHhCCCCc-EEEEEEe--ecCCchHHHHHHHHhhhhcCceEEEEeccccccHHHHHH-HHHHHHH Confidence 112221122223343332 2222221 123556899998762 2222 23333332 33444433 5666656 Q ss_pred HhcC-CCcEEEEEecCCCC----C----chhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHH Q lcl|NC_011270. 336 QSNN-KYERRAILGMDGSV----T----PVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGK 406 (581) Q Consensus 336 ~~~~-~~~~~avvg~~~~~----~----~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl 406 (581) +... +++.+.++-..+-. . ++-...+.+..+.+.+.++.+|.-.| + ..++.+||+ T Consensus 125 l~~~~~Rpv~file~r~~~~~~~~~e~w~~y~~~~~al~~gia~~~V~~V~~~~-------g---------n~~G~~aGR 188 (376) T protein:vir:37 125 LLAKFGRRTFFIQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPLLF-------G---------NETGVLAGR 188 (376) T ss_pred HHHhcCCeEEEEEeccCcCcccccccCHHHHHHHHHHhhcccccccceeeeeeh-------h---------hhHHHHHHH Confidence 5444 56666665543210 1 11133445566777888887665322 1 246788888 Q ss_pred hh--ccchhcccccc---cccCccccc-------ccCCHHHHHHHHhCCcEEEEEeCC-CeEEEEEeeeccCCCcccceE Q lcl|NC_011270. 407 SV--SAIAAMPLTRK---VIRGFSGPA-------EVQRDGEKSRESSEGLMVIEKTPR-NLVHVRHGVTTDPTSLHTREW 473 (581) Q Consensus 407 ~a--~~~~~~slt~~---~l~g~~~~~-------~~~t~~e~~~l~~~Gv~~l~~~~~-~~v~i~~~itT~~td~~~~~i 473 (581) .+ +.+++++|-.. +|.|+...+ ..++...++.|-++|..+++.-+| .++-+-++.+-......+++| T Consensus 189 l~~aaVsVadspgRV~tG~l~gl~~~~lp~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDY~~i 268 (376) T protein:vir:37 189 LANRAVTVADSPARVQTGALVSLGSANKPLDKDRNELTLAHLKSLETARYSVPMWYPDYDGYYWADGRTLDVEGGDYQVI 268 (376) T ss_pred HhhcccchhhCccceeccccccccccccccCcCcccCCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhh Confidence 64 55678887544 333332221 357889999999999888866555 356555665444556789999 Q ss_pred EeehhhHHHHHHHHHHHhhhcCCCc--cCCH-HHHHHHHHHHHHHHHHHHhCCceeCC---------ccceeEEeecCCC Q lcl|NC_011270. 474 NIIGQQDVMVYRIRDYLDADGLIGM--PIYD-TTIVQVKASAEAALVWLVDNNIIRGY---------RNLKARQIERQPD 541 (581) Q Consensus 474 ~v~R~~d~i~~~ir~~~~~~~fiG~--~n~~-~~r~~ik~~i~~~L~~l~~~gaI~~~---------~~~~~~~~~~~~~ 541 (581) ..+|+.|.+.+.+|..+ -.+|+. .|+. ...+..++.+..=|++|.+...|-+. ++.++++.-.... T Consensus 269 e~~RVvdKa~R~vR~~a--i~~i~D~~lnst~~sia~~~~yi~~pLr~M~~s~~i~g~~fpGeI~~p~d~Di~i~w~s~~ 346 (376) T protein:vir:37 269 ENLRVVDKVARKVRLLA--IGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKT 346 (376) T ss_pred hhhhHHHHHHHHHHHHH--HHHhCCcccCcchhhHHHHHHHHHHHHHHHHhcchhccccccceeecCCCCCceEEeeccc Confidence 99999999999998763 345543 3433 34566777788888888877665553 3345566667888 Q ss_pred EEEEEEEEEecCceeEEEEEEEEEec-cce Q lcl|NC_011270. 542 VIEVRYEWRPAYPLNYIVVRYSIAPE-TGD 570 (581) Q Consensus 542 ~~~v~i~v~pv~~~e~I~~~~~~~~~-tg~ 570 (581) ++.|.+.++|..-..+|.+.|-+... =|+ T Consensus 347 ~V~I~~~v~P~~~pk~Itv~I~Ldlsn~~~ 376 (376) T protein:vir:37 347 KVTIYIKVRPYDCPKEITANIFLDLDSLGE 376 (376) T ss_pred eEEEEEEEEeccCCceEEEEEEeecCCCCC Confidence 89999999999999999988766554 355 No 56 >protein:vir:3751 Length: 376 # NCBI annotation: orf23 # Family: family:all:669 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043492;genbank:gi:9628627;genbank:GeneID:1261129 Probab=99.29 E-value=7.9e-11 Score=75.96 Aligned_cols=332 Identities=12% Similarity=0.028 Sum_probs=178.6 Q ss_pred eEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeE----EEEee-cCCcccceeEeccCcchhhhhhhhhh Q lcl|NC_011270. 185 VLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQ----LSYRY-TDPNYHEVIRFTDPDDIQDFYGPAFD 259 (581) Q Consensus 185 vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~----~s~~~-~~~~~~e~~~~~d~~~~~~~~~~a~~ 259 (581) .++ ++.++..+.-.+.+.. +-+-. .....+..+......|+-..++. T Consensus 1 ~~~-------------------------~v~vn~ln~~qg~~~~ver~~lfig~~~~~~~~~~~~~~~sdld~~lg~--- 52 (376) T protein:vir:37 1 MFP-------------------------SVQINALNQLSGETKEIERHALFVGVGTTNQGKLLALTPDSDFDKVFGE--- 52 (376) T ss_pred CCC-------------------------eEEEeeeeccCCCcccccceEEEeeccccccCceEEecCCCChHHhhCC--- Confidence 000 0011111110000000 00001 11112222222222222222221 Q ss_pred hhccccccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcC---CceEEEEeCC-CcHHHHHHHHHHHHH Q lcl|NC_011270. 260 EAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDE---DEIAIIVAGT-GAQPIQALVQQHVSA 335 (581) Q Consensus 260 ~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~---~~~~iv~~~t-~~~~i~~~l~~~v~~ 335 (581) ..+++..-+.-+..|+... +++...+ ...+..|+.+|+++.... +++.++-|-. +.+.+. .+++.... T Consensus 53 ----~ds~lk~~v~aa~~naG~~-w~a~~~~--p~~~~~~~~~Av~~a~~~~s~E~V~v~~p~~t~~a~i~-a~qa~a~e 124 (376) T protein:vir:37 53 ----TDTDLKKQVRAAMLNAGQN-WFAHVYI--AQEDGYDFVECVKKANQTASFEYCVNTRYLGVDKASIG-KLQECYAE 124 (376) T ss_pred ----CchhHHHHHHHHHhCCCCc-eEEEEEe--cCCChhhHHHHHHHHHhhCCeeEEEEecCcchhHHHHH-HHHHHHHH Confidence 1111111111222232222 2222221 124557899999776433 2222222322 333333 34444444 Q ss_pred Hhc-CCCcEEEEEecCCC----CC----chhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHH Q lcl|NC_011270. 336 QSN-NKYERRAILGMDGS----VT----PVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGK 406 (581) Q Consensus 336 ~~~-~~~~~~avvg~~~~----~~----~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl 406 (581) +.. -+++.+.++-..+- .. ++-.....+..+.+.+.|+.++...+ + ..++.+||. T Consensus 125 l~~~~~R~vffile~~g~d~~~~~ge~w~~y~~~l~a~~~gia~~~V~vV~~~~-------g---------n~~G~~aGR 188 (376) T protein:vir:37 125 LLAKFGRRTFFIQAVQGINHDQSDGETWDQYVQKLTTLQQTIVADHVCLVPLLF-------G---------NETGVLAGR 188 (376) T ss_pred HHHhcCCeEEEEEeccCCCCcccccCCHHHHHHHHHHHhccccccceeeeeeec-------c---------chHHHHHHH Confidence 433 35666666654321 11 11223445566778888887765322 1 135677787 Q ss_pred hh--ccchhccccccc---ccCccccc-------ccCCHHHHHHHHhCCcEEEEEeCC-CeEEEEEeeeccCCCcccceE Q lcl|NC_011270. 407 SV--SAIAAMPLTRKV---IRGFSGPA-------EVQRDGEKSRESSEGLMVIEKTPR-NLVHVRHGVTTDPTSLHTREW 473 (581) Q Consensus 407 ~a--~~~~~~slt~~~---l~g~~~~~-------~~~t~~e~~~l~~~Gv~~l~~~~~-~~v~i~~~itT~~td~~~~~i 473 (581) .+ +.+++++|-+.. |.|+..++ ..+....++.|-++|..+.+.-++ .++-+-+|.+-......+++| T Consensus 189 l~naaVsVadspgRV~tGai~gl~~~~~p~d~~g~el~~a~l~aLd~arysvpr~Y~gydG~Yw~dg~tl~~~gsDYq~i 268 (376) T protein:vir:37 189 LANRAVTVADSPARVQTGALVSLGSANKPLDKDGNELTLAHLKSLETARYSVPMWYPDYDGYYRADGRTLDVEGGDYQVI 268 (376) T ss_pred HHhCCcchhcCccceeecccccccccccccccCCcccchHHHHHHHhCCCeEEEeeCCCCceEEeCCeEeccCCCCeeee Confidence 64 556688876543 22322111 246778899999999988866555 456556665544556789999 Q ss_pred EeehhhHHHHHHHHHHHhhhcCCCc--cC-CHHHHHHHHHHHHHHHHHHHhCCceeC---------CccceeEEeecCCC Q lcl|NC_011270. 474 NIIGQQDVMVYRIRDYLDADGLIGM--PI-YDTTIVQVKASAEAALVWLVDNNIIRG---------YRNLKARQIERQPD 541 (581) Q Consensus 474 ~v~R~~d~i~~~ir~~~~~~~fiG~--~n-~~~~r~~ik~~i~~~L~~l~~~gaI~~---------~~~~~~~~~~~~~~ 541 (581) ..+|++|.+.+.+|... -..|+- .| ++...+..+..+..=|+.|.+.+-|-+ ..+.++++.-.... T Consensus 269 e~~RVvdKa~R~vR~~A--i~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~ei~g~~fpgei~~P~d~dI~i~w~sk~ 346 (376) T protein:vir:37 269 ENLRVVDKVARKVRLLA--IGKIADRSFNSTTSSTEYHKNYFAKPLRDMSKSATINGKDFPGECMPPKDDAITIVWQSKT 346 (376) T ss_pred hhchHHHHHHHHHHHHH--HHHhcCccccCChhHHHHHHHHHhHHHHHHHhhhhhccccccceeecCCCCceEEEeccCc Confidence 99999999999998653 335654 34 446678889999999999988866555 33455666666778 Q ss_pred EEEEEEEEEecCceeEEEEEEEEEec-cce Q lcl|NC_011270. 542 VIEVRYEWRPAYPLNYIVVRYSIAPE-TGD 570 (581) Q Consensus 542 ~~~v~i~v~pv~~~e~I~~~~~~~~~-tg~ 570 (581) ++.+.+.++|..--.+|.+.|-+... -|+ T Consensus 347 ~V~I~~~vrPy~cpk~i~~~I~LDls~~~~ 376 (376) T protein:vir:37 347 KVTIYIKVRPYDCPKEITANIFLDLDSLGE 376 (376) T ss_pred eEEEEEEEeeecCcceeEEEEEEecCCCCC Confidence 89999999999999999999987766 466 No 57 >protein:vir:276 Length: 369 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536656;genbank:gi:17975134;genbank:GeneID:929090 Probab=99.28 E-value=5.5e-11 Score=76.81 Aligned_cols=337 Identities=12% Similarity=0.020 Sum_probs=177.6 Q ss_pred ccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCC---cccceeEeccCcchhhhh Q lcl|NC_011270. 178 PNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDP---NYHEVIRFTDPDDIQDFY 254 (581) Q Consensus 178 ~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~---~~~e~~~~~d~~~~~~~~ 254 (581) ..-+++.+ -..+.+ +|..+. +.+ .+-+-..++ ..+..+......|+-..+ T Consensus 1 m~~~~V~i-------n~~n~~-qg~~~~------ver-------------~~lfig~g~~~~~~g~~~~~~~~sdld~~l 53 (369) T protein:vir:27 1 MAWPTVII-------KILNLM-NGPIAD------IEC-------------HFLFVIRGTVSGEVRNLIMVDSTSDLDDVL 53 (369) T ss_pred CCCCceEE-------eccccc-CCCccc------ccc-------------eEEEEEeccccccccceEEecCccchHhhc Confidence 11111111 000000 111111 000 000111111 112222222222222222 Q ss_pred hhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcC---CceEEEEeCCCcHHHHHHHHH Q lcl|NC_011270. 255 GPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDE---DEIAIIVAGTGAQPIQALVQQ 331 (581) Q Consensus 255 ~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~---~~~~iv~~~t~~~~i~~~l~~ 331 (581) +.+ .+++..-+.-+..|+... ..+...+ -.+..+|.+|++..... +++.++-|-++.+.+.+ +++ T Consensus 54 g~~-------ds~lk~~v~aa~~naG~~-w~a~~~p---~~~~~~~~~Av~~a~~~~s~E~V~v~~p~t~~a~i~a-aq~ 121 (369) T protein:vir:27 54 AEA-------SAEGLAIVKAAQLNGKQA-WTAGVMI---LSEEDNWQDAVKKANEVSSFEFVVLGFDAETKAMIED-AIT 121 (369) T ss_pred CCc-------ChhHHHHHHHHHhCCCCc-eEEEEEE---eCCchhHHHHHHhhhhhCCccEEEEecCcccHHHHHH-HHH Confidence 211 111111111122222222 1222222 12456899998765422 33322233233344433 344 Q ss_pred HHHHHhc-CCCcEEEEEecCC---CC-Cch----hHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHH Q lcl|NC_011270. 332 HVSAQSN-NKYERRAILGMDG---SV-TPV----PSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAA 402 (581) Q Consensus 332 ~v~~~~~-~~~~~~avvg~~~---~~-~~~----~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~ 402 (581) ....+.. -+++.+.++-+.+ .. ..+ -.....+..+.+.+.|+.++...+.. .-.++. T Consensus 122 ~a~el~~~~~R~vffi~e~~~~~~~~~~~e~w~dy~a~l~al~~g~a~~~V~vv~~~~~~--------------gn~~G~ 187 (369) T protein:vir:27 122 LRTELKNSLGREVGVLCQLPAINNDPTNGQTWSEWLADTVDIPKDVASEYISVVPNVHAA--------------GDTLGK 187 (369) T ss_pred HHHHHHHhcCCeEEEEEeccccCCCccccCCHHHHHHHHHHHhhccCcccceeeeeeccc--------------cchHHH Confidence 4444333 3566666654321 11 111 13444556677888888776332210 013566 Q ss_pred HHHHhh--ccchhcccccc---cccCccccc-----ccCCHHHHHHHHhCCcEEEEEeCC-CeEEEEEeeeccCCCcccc Q lcl|NC_011270. 403 VAGKSV--SAIAAMPLTRK---VIRGFSGPA-----EVQRDGEKSRESSEGLMVIEKTPR-NLVHVRHGVTTDPTSLHTR 471 (581) Q Consensus 403 vAgl~a--~~~~~~slt~~---~l~g~~~~~-----~~~t~~e~~~l~~~Gv~~l~~~~~-~~v~i~~~itT~~td~~~~ 471 (581) +||..+ +.+++.+|-+. .+.|...+. ..|+...++.|-++|..+++.-++ .++-+-++.+-......++ T Consensus 188 ~aGRl~n~aVsIadsp~RVktG~l~g~~~~p~d~~g~~l~~a~l~aLd~agysvp~~Y~gy~G~Yw~d~~tl~~~gsDYq 267 (369) T protein:vir:27 188 YAGRLANKEVSIADSPARVQTGSVLGNTELMKDKAGKALDLATLKALESNRIAVPMWYPDYPGQYWTTGRTLDVPGGDYQ 267 (369) T ss_pred HHHHHHhcccchhcCcceeeecccccccccccCCCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCCee Confidence 777764 44567777443 233432221 247888999999999988876655 3555556654445567899 Q ss_pred eEEeehhhHHHHHHHHHHHhhhcCCCc--cC-CHHHHHHHHHHHHHHHHHHHhC---CceeCCccceeEEeecCCCEEEE Q lcl|NC_011270. 472 EWNIIGQQDVMVYRIRDYLDADGLIGM--PI-YDTTIVQVKASAEAALVWLVDN---NIIRGYRNLKARQIERQPDVIEV 545 (581) Q Consensus 472 ~i~v~R~~d~i~~~ir~~~~~~~fiG~--~n-~~~~r~~ik~~i~~~L~~l~~~---gaI~~~~~~~~~~~~~~~~~~~v 545 (581) +|..+|+.|.+.+.+|... -..|+- .| ++...+..+..+..=|+.|.+. |-|....+.++++.-....++.| T Consensus 268 ~iE~~RVvdKa~R~vR~~A--i~~i~Dr~lnstp~sia~~~~~~~~pLr~M~ks~fpgei~~P~d~dI~i~w~~k~~V~I 345 (369) T protein:vir:27 268 DIRHIRVAMKAARKVRIRA--IARIADRTLNSTPQSIAAAKLYFTQDLRTMALTGVPGEIYPPEDEDIQIKWVNSTDVEI 345 (369) T ss_pred hhhhhhHHHHHHHHHHHHH--HHHhcCcccccChhHHHHHHHHHhhHHHHHHhhcCCeEEecCCCCceEEEeeccceEEE Confidence 9999999999999998653 335664 23 4456778888899999999875 66666666677777778889999 Q ss_pred EEEEEecCceeEEEEEEEEEeccc Q lcl|NC_011270. 546 RYEWRPAYPLNYIVVRYSIAPETG 569 (581) Q Consensus 546 ~i~v~pv~~~e~I~~~~~~~~~tg 569 (581) .+.++|...-.+|.+.|.+.-..= T Consensus 346 ~~~vrP~~~pk~it~~I~ldl~~~ 369 (369) T protein:vir:27 346 YMSVQPYECPVKITIAISVKQGDY 369 (369) T ss_pred EEEEeeccCCceEEEEEEEeccCC Confidence 999999999999999997765444 No 58 >protein:vir:78782 Length: 370 # NCBI annotation: putative tail sheath protein # Family: family:all:669 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285652;genbank:gi:148727158;genbank:GeneID:5220132 Probab=99.11 E-value=4.2e-10 Score=72.02 Aligned_cols=337 Identities=14% Similarity=0.051 Sum_probs=164.5 Q ss_pred ccceeEEeecccc--cccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhcccc Q lcl|NC_011270. 188 TDYVVTRVNAGED--GEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQ 265 (581) Q Consensus 188 td~~v~~v~~~~d--g~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~ 265 (581) -+..+ .++.... |..+ ++.+. .+=+-+.....+.........|+-..++.+ . T Consensus 1 ~~~~v-~vn~~n~~~g~~~------~~er~------------~lfig~~~~~~g~~~~~~~~sdld~~l~~~-------d 54 (370) T protein:vir:78 1 MWPYV-QIYNLNQMQGPVT------EVERH------------LLFIGSAASNTGKLLSLNAQSDFDQLLGAA-------D 54 (370) T ss_pred CCceE-EEeeccccCCCcC------cccee------------EEEEecccccccceEeecCccCHHHhcCCc-------C Confidence 00000 0111000 1111 11000 000001112222222222222322222211 1 Q ss_pred ccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcC---CceEEEEeCCCcHHHHHHHHHHHHHHhcC-CC Q lcl|NC_011270. 266 SEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDE---DEIAIIVAGTGAQPIQALVQQHVSAQSNN-KY 341 (581) Q Consensus 266 ~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~---~~~~iv~~~t~~~~i~~~l~~~v~~~~~~-~~ 341 (581) +++..-+.-+..|+...- .+...+ -++..|+.+|+++.... +++.++-|-++.+.+ +.++++...+.+. ++ T Consensus 55 s~lk~~v~aa~~naG~~~-~~~~~p---~~~~~d~~~Av~~a~~~~s~E~V~v~~~~s~~a~~-~a~~~~a~el~n~~~R 129 (370) T protein:vir:78 55 SELKANLLAARDNAGQNW-SAAAYV---LPTDKPWLDAARDAQQTQSFEGVVVLGQEWHQAAI-NAAHALNQELIAKWGR 129 (370) T ss_pred hhHHHHHHHHHhCCCCce-EEEEEE---ecCchhHHHHHHHHHhhCCccEEEEecCcchHHHH-HHHHHHHHHHHHhcCC Confidence 111111122223332222 222222 23556899999765433 223222222334444 4456666665543 45 Q ss_pred cEEEEEecCCCCCchh----HHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhh--ccchhcc Q lcl|NC_011270. 342 ERRAILGMDGSVTPVP----SATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSV--SAIAAMP 415 (581) Q Consensus 342 ~~~avvg~~~~~~~~~----~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a--~~~~~~s 415 (581) +.+.++-+......+. ...+.+..+.+.+.++.++.. ++ + ...+.+||..+ +..++.+ T Consensus 130 pv~file~~~~~~~e~w~~y~~~l~al~~gia~~~V~vvp~-~~---g------------~~~G~~aGRL~naavsVads 193 (370) T protein:vir:78 130 WQFMLLAVPAIADEQDWATYEAELATLQDGIAASSVSLIPQ-LW---P------------TLAGAYAGRLCNRAVSIADS 193 (370) T ss_pred eEEEEEeecCCCCcCCHHHHHHHHHHhhhccccccceEEee-ec---c------------ccHHHHHHHHhcCeeeeccc Confidence 6666665444333333 333455566777777766532 21 0 11345556543 2334455 Q ss_pred cccc---cccCcc-----cccccCCHHHHHHHHhCCcEEEEEeCC-CeEEEEEeeeccCCCcccceEEeehhhHHHHHHH Q lcl|NC_011270. 416 LTRK---VIRGFS-----GPAEVQRDGEKSRESSEGLMVIEKTPR-NLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRI 486 (581) Q Consensus 416 lt~~---~l~g~~-----~~~~~~t~~e~~~l~~~Gv~~l~~~~~-~~v~i~~~itT~~td~~~~~i~v~R~~d~i~~~i 486 (581) |-.. .+.|.. .....++...++.|-++|..+++.-+| .++-+-++.+-......+++|..+|+.|.+.+.+ T Consensus 194 P~Rv~tG~l~gl~~~p~d~~~~~l~~a~l~aLd~agy~vp~~Y~gy~G~Y~~d~~tl~~~gsDYq~ie~~RVvdKa~R~v 273 (370) T protein:vir:78 194 PCRVKTGALVGLGNKPVGKDGIPLPLATLQTLEANRYSVPMWYPDYDGIYWADGRTLDAEGGDYQVIENLRIAYKVARRM 273 (370) T ss_pred ceeeeccccccccccccccCCcccCHHHHHHHHhCCCeEEEeeCCCCceEEeCceEeccCCCChhhhhhhhHHHHHHHHH Confidence 4332 122221 122357889999999999888866555 3565556654445567899999999999999999 Q ss_pred HHHHhhhcCCCc--cCCH-HHHHHHHHHHHHHHHHHHhCCceeC---------CccceeEEeecCCCEEEEEEEEEecCc Q lcl|NC_011270. 487 RDYLDADGLIGM--PIYD-TTIVQVKASAEAALVWLVDNNIIRG---------YRNLKARQIERQPDVIEVRYEWRPAYP 554 (581) Q Consensus 487 r~~~~~~~fiG~--~n~~-~~r~~ik~~i~~~L~~l~~~gaI~~---------~~~~~~~~~~~~~~~~~v~i~v~pv~~ 554 (581) |..+= ..|.- .|+. ...+..+.....=|++|.+.+-|-+ ..+.+++++.....++.|.+.++|... T Consensus 274 R~~ai--~~i~D~~lnst~gsia~~~~~~~~~L~ema~s~~i~~~~fpgeI~~p~d~Di~i~w~s~~~v~I~~~v~P~~~ 351 (370) T protein:vir:78 274 RLRAI--ARIGDRSFNSTPGSTAAAITYFGKDLREMAKSTTINGQPFPGDIASPQDGDIRIQWVAKNLVSVFVVVRTVDC 351 (370) T ss_pred HHHHH--HHhCCcccCCCCcchhHHHHHHHhhHHHHHhhhhhcccccceeEeccCCCcceEEeeccceEEEEEEEEeccC Confidence 85532 23432 3433 3344455555555656655554433 334456667778888999999999999 Q ss_pred eeEEEEEEEEEeccceEEEEEee Q lcl|NC_011270. 555 LNYIVVRYSIAPETGDITSTIEG 577 (581) Q Consensus 555 ~e~I~~~~~~~~~tg~~~~~~~~ 577 (581) ..+|.+.|.+...-.+ =|| T Consensus 352 pk~Itv~I~LDls~e~----~~~ 370 (370) T protein:vir:78 352 PKGITVNIMLDLSLNN----GEG 370 (370) T ss_pred CceEEEEEEEeecccc----CCC Confidence 9999888754321111 011 No 59 >protein:vir:95263 Length: 450 # NCBI annotation: Phage conserved protein # Family: family:all:396 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944896;genbank:gi:38707836;genbank:GeneID:2744049 Probab=98.97 E-value=7.3e-09 Score=65.20 Aligned_cols=373 Identities=13% Similarity=0.089 Sum_probs=170.1 Q ss_pred ccceeeeeccccccceeeEeccceeEEeecccccccCcceeee-eeeeeeeccccc----------------ccceeEEE Q lcl|NC_011270. 168 IKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLY-TIQRVVDGGHID----------------PGDIVQLS 230 (581) Q Consensus 168 ~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~-ti~~~vd~~~~d----------------~~~~~~~s 230 (581) +-+....+ .++.........++-......... ....+...+ ....+-++.+.+ |..+-... T Consensus 1 ~~s~iVnV-~i~~~~~a~~~~~f~~~l~~~~~~-~~~~r~~~yss~~~V~~~FG~~S~ey~aA~~yF~q~p~p~~l~igr 78 (450) T protein:vir:95 1 MWNPIVNV-DITLNTAGTTREGFGLPLFLASTD-NFEERVRGYTSLTEVAEDFDENTAAYKAAKQLWSQTPKVTQLYIGR 78 (450) T ss_pred CCCceEEE-eecccccccccccceeEEEEcCCC-CCccceeeecCHHHHHHhcCCCcHHHHHHHHHHhCCCcccEEEEEe Confidence 22222221 122222222222222111111111 112222222 222222222111 10000000 Q ss_pred E----------------------eecCCc-ccceeEecc---Ccchhhhhhhhhhhhccccccce--ee-----eeeee- Q lcl|NC_011270. 231 Y----------------------RYTDPN-YHEVIRFTD---PDDIQDFYGPAFDEAGNVQSEIT--LC-----AQLAI- 276 (581) Q Consensus 231 ~----------------------~~~~~~-~~e~~~~~d---~~~~~~~~~~a~~~~g~~~~~i~--~~-----~~~~~- 276 (581) . -..+.. ....+.+.. ..++.+....++.........+. .. ..... T Consensus 79 ~~~~~t~~~~~~~~~~~~g~lt~tv~G~~~~~~~i~~s~a~s~~~va~~~~tai~~~~~~~~~~~~~s~g~~~~~t~~~~ 158 (450) T protein:vir:95 79 RAMQYTVSIPDAVTESTDYSITVAAGGGISQPYQYTAQSSDTAENVLQQFKTQIEADPTIKDKVSVNVTGSNGSATMIIA 158 (450) T ss_pred eccchhhhhhhhhccccceeEEEEecceeeeeeEEEEEecCChhhHHHHhhhhhcccceeeeeeeeeeecccceeeeeee Confidence 0 000000 000000000 00001100001100000000000 00 00000 Q ss_pred cCCcceeEEeeeccC---CcccchhhHHHHHHHHhcC--CceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCC Q lcl|NC_011270. 277 TNGASTILACAVDPE---GDTVTMGDYQNALNKFRDE--DEIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDG 351 (581) Q Consensus 277 ~~g~~~~~~~~~~~~---~~~~t~~dy~~al~~l~~~--~~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~ 351 (581) ..+....+....... ..........++++++.+. ++..++++..+++.+ .++.+|++.. .+++....-.. T Consensus 159 ~~~~~~~~~l~~~~~~~~~~g~~aet~~~a~~a~~~~~~~w~~~~~~~~~~~~i-~a~a~w~~a~----~~~f~~~~~~~ 233 (450) T protein:vir:95 159 KAGDNDFVKVTTTAQTVYIASTTADTASTALAAIEAYSTDWYFIAAEDRTQQFV-LAMASEIQAR----KKIFFTANSDV 233 (450) T ss_pred ccccchhhccccccceeEecccccccHHHHHHHHHHhhCCeEEEEecCCCHHHH-HHHHHHHhhc----CcEEEEEcCCc Confidence 000000000000000 0001122356666666543 445556666666555 4478888754 23443332111 Q ss_pred CCCch-h---HHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchh-cccccccccCccc Q lcl|NC_011270. 352 SVTPV-P---SATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAA-MPLTRKVIRGFSG 426 (581) Q Consensus 352 ~~~~~-~---~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~-~slt~~~l~g~~~ 426 (581) ..... . .......-+..|..|.++++..- -+..++.+.++|......+. ..+.||.++|+.. T Consensus 234 ~~~~~~~~~~~~~i~~~l~~~~~~~t~~~y~~~-------------~~~~~~~aa~~g~~~~~~~g~~T~~fk~l~Gv~~ 300 (450) T protein:vir:95 234 TALQGTELASANDVPAQLAKNMYTRTVCLWHHA-------------AAEDYPEMAYIAYGAPYDAGSIAWGNAQLTGVAA 300 (450) T ss_pred hhhhhhhhhcccchHHHHHhccCCeeEEEeeCC-------------CchhHHHHHHHHHhhhcccceeeeccccccceee Confidence 10000 0 00011112233445666554210 11223445555655555443 3677888888763 Q ss_pred -c----cccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCCCcccceEEeehhhHHHHHHHHHHHhhhcCC----C Q lcl|NC_011270. 427 -P----AEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLI----G 497 (581) Q Consensus 427 -~----~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fi----G 497 (581) + ...|+..|.+.|.++|++++....+. -.+.+|++.-. + .|-.+|-.|+++..|++.+.. .|+ + T Consensus 301 ~v~~~~~~~lt~~~~~al~~~~~n~y~~~~~~-~~~~~G~~~~G---~--~iD~~~~~~wl~~~iq~~l~~-ll~~~~~~ 373 (450) T protein:vir:95 301 SLQPSNQRPLTSIQKSALDVRHCNFIDLDGGV-PVVRRGITSGG---E--WIDIIRGVDWLESDLKTSLRD-LLINQKGG 373 (450) T ss_pred eccCccccccchHHHHHHHhCCcEEEEEecCc-eeeeCCeeeCc---c--hhHHHHHHHHHHHHHHHHHHH-HHHhcCCC Confidence 2 24689999999999999998776543 34677876643 3 456899999999999988864 342 4 Q ss_pred c-cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccc-----eeEEeecCCCE-EEEEEEEEecCceeEEEEEEEEEec Q lcl|NC_011270. 498 M-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNL-----KARQIERQPDV-IEVRYEWRPAYPLNYIVVRYSIAPE 567 (581) Q Consensus 498 ~-~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~-----~~~~~~~~~~~-~~v~i~v~pv~~~e~I~~~~~~~~~ 567 (581) | |=++.+...|++.|+..|++..+.|+|-+|... +..+++..... --+.+.++..-.++++.++..++=+ T Consensus 374 KiPy~~~G~~~i~a~i~~~l~~a~~~G~Ia~~~V~~~~~~~~~~~dr~~R~~~~i~~~~~laGAIh~~~i~~~v~~~ 450 (450) T protein:vir:95 374 KITYDDTGITRIRQVIETSLQRAVNRNFLSSYTVNVPKASQVALADKKARILKDVTFAGILAGAILDVDLKGTVAYE 450 (450) T ss_pred CCccChhhHHHHHHHHHHHHHHHHhcCcccceeEecCChHhcCHHHHhccCCCCeeEEEEEccceEEEEEEEEEEeC Confidence 5 778899999999999999999999999888521 12222222222 2388889999999999999988877 No 60 >protein:vir:80052 Length: 331 # NCBI annotation: gp14 # Family: family:all:396 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468718;genbank:gi:157325298;genbank:GeneID:5601743 Probab=98.94 E-value=1.3e-09 Score=69.31 Aligned_cols=308 Identities=13% Similarity=0.065 Sum_probs=154.7 Q ss_pred eeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeec---ccccccceeEEEEeecCCcccceeEeccC Q lcl|NC_011270. 171 DTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDG---GHIDPGDIVQLSYRYTDPNYHEVIRFTDP 247 (581) Q Consensus 171 ~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~---~~~d~~~~~~~s~~~~~~~~~e~~~~~d~ 247 (581) .+..+++. ...++. .......+-- +...++.+ ....+.+. T Consensus 1 ~~~~iv~V----------------------------------~v~~~~~~~~~~~~~~~~~--~~~~~t~~-~~~~y~s~ 43 (331) T protein:vir:80 1 MVETITDV----------------------------------RVHISVLYPSPRIGLGRPA--IFVKGTAM-GYKEYTTL 43 (331) T ss_pred Cccceecc----------------------------------eeeecccccccccccCcce--eEEecccc-ceEEEech Confidence 11111110 000000 0000000000 00011110 01112222 Q ss_pred cchhhhhhhhhhhhccccccceeeeeeeecCCcce-eEEeeeccCCcccchhhHHHHHHHHhcCCceEEEEeCCCcHHHH Q lcl|NC_011270. 248 DDIQDFYGPAFDEAGNVQSEITLCAQLAITNGAST-ILACAVDPEGDTVTMGDYQNALNKFRDEDEIAIIVAGTGAQPIQ 326 (581) Q Consensus 248 ~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~-~~~~~~~~~~~~~t~~dy~~al~~l~~~~~~~iv~~~t~~~~i~ 326 (581) +++..-+. ....+...+...|..+..+ ...+. .. +..+.+..+.+.+. .++..+++...+++.+. T Consensus 44 ~~v~~d~~--------~~~~~Ykaa~~~f~Q~~~~~~i~v~--~~---~~~~~~~a~~a~~~-~~w~~~~~~~~~~~~~~ 109 (331) T protein:vir:80 44 EELKDTFA--------DNTEVYAKAKAVFLQKDRPDTVAVI--TY---EDTKLLEAAEAYFL-KSWHFALLAEFKAADAL 109 (331) T ss_pred hhhccCCC--------CCcHHHHHHHHHHhccCccceEEEe--cc---chHHHHHHHHHhcc-CceeEEEeecCCHHHHH Confidence 22110000 0011122222233222221 11111 11 11122333344443 34445666666666654 Q ss_pred HHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHH Q lcl|NC_011270. 327 ALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGK 406 (581) Q Consensus 327 ~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl 406 (581) ++.+|++.. +.++.++.. ........ ..+..|.+++... .. ..+. ++.+.|. T Consensus 110 -a~a~~~~a~----~~~f~~~~~------~~~~~~~~---~~~~~~t~~~~~~----------~~---~~~~-~aa~~g~ 161 (331) T protein:vir:80 110 -ALSNLIEEQ----KFKFAVFQV------TAVADITP---LAKNTRTIAIVHS----------KT---GEKL-DAALIGN 161 (331) T ss_pred -HHHHHHhhC----CcEEEEEec------CchHHHHH---hhccccEEEEEcC----------Cc---cchh-HHHHHHH Confidence 477787643 335544432 11222212 2233343332210 01 1222 3344455 Q ss_pred hhccchhc-cccccc-ccCcccccccCCHHHHHHHHhCCcEEEEEeCCCeEEEEEeeeccCCCcccceEEeehhhHHHHH Q lcl|NC_011270. 407 SVSAIAAM-PLTRKV-IRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVY 484 (581) Q Consensus 407 ~a~~~~~~-slt~~~-l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~~v~i~~~itT~~td~~~~~i~v~R~~d~i~~ 484 (581) .+..++.+ .+.++. ++|+.. ..++..|++.|.++|++++....+ .-.+.+|.+.-. ..|-.++-.|++.. T Consensus 162 ~~~~~~g~~t~~fk~~l~GV~~--~~lt~t~~~al~~~~~N~y~~~~~-~~~~~~G~~~~G-----~~iD~~~~~dWl~~ 233 (331) T protein:vir:80 162 VASLPVGSATWKGRHGLAGITS--EELKVSEIDAIQKAGGMCYIEKAG-IAQTSEGKTVSG-----EFIDSIHGDDWIKA 233 (331) T ss_pred HHhcCccceeeeeecccCCCCC--CCCCHHHHHHHHhcCceEEEEecC-eeEEecceEeCc-----hhHHHHHHHHHHHH Confidence 56666643 466774 788753 468999999999999999987654 444667876432 46888999999999 Q ss_pred HHHHHHhhhcCCC--c-cCCHHHHHHHHHHHHHHHHHHHhCCceeCCc----c-cee--------EEeecCCCE-EEEEE Q lcl|NC_011270. 485 RIRDYLDADGLIG--M-PIYDTTIVQVKASAEAALVWLVDNNIIRGYR----N-LKA--------RQIERQPDV-IEVRY 547 (581) Q Consensus 485 ~ir~~~~~~~fiG--~-~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~----~-~~~--------~~~~~~~~~-~~v~i 547 (581) .|++.+.. .|+- | |=++.+...|++.+++.|++..+.|+|..-. + ..+ .+.+..... --+.+ T Consensus 234 ~lq~~l~~-ll~~~~kiPy~~~G~~~l~a~i~~~~~~av~~G~I~~g~~~~~~~~~v~~~~~~~~s~~dr~~R~~~~~~~ 312 (331) T protein:vir:80 234 TIETRLQK-LLTETDKLTFDARGIALLQSELTTVLNEGFANGIIDSNDETGEPNFSITALQRSDLNDDDIAKRNYKGLSF 312 (331) T ss_pred HHHHHHHH-HHHhCCCCccChhhHHHHHHHHHHHHHHHHhCCceecCccCCCcceEEEeCchhcCCHHHHhccCCCCeEE Confidence 99888763 3543 4 5688999999999999999999999996311 0 011 111112111 33888 Q ss_pred EEEecCceeEEEEEEEEEe Q lcl|NC_011270. 548 EWRPAYPLNYIVVRYSIAP 566 (581) Q Consensus 548 ~v~pv~~~e~I~~~~~~~~ 566 (581) .+.+...+++|.+.+.+++ T Consensus 313 ~~~~~gaI~~v~i~~~v~~ 331 (331) T protein:vir:80 313 RYKRSGAIHSVDVYGEVEV 331 (331) T ss_pred EEEEcceEEEEEEEEEEeC Confidence 8999999999999999988 No 61 >protein:vir:5260 Length: 502 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852765;genbank:gi:31544040;uniprot:Q7Y5T5;genbank:GeneID:2753559 Probab=98.89 E-value=1.5e-08 Score=63.50 Aligned_cols=420 Identities=11% Similarity=0.029 Sum_probs=178.5 Q ss_pred HhcCCCCcceEEEEcCCCceEEEEecCCccccccccceeccCCCceEEEEEcccccceeeecccccccccee--eeeccc Q lcl|NC_011270. 101 RALPNVEDDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDT--IRVVNP 178 (581) Q Consensus 101 e~l~~i~~~~V~~~~~~g~~w~Vtf~g~~~~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~--~~l~~~ 178 (581) .+||==.-|+|+..-.....=...| |.+..|....+.......-.+......+..... ......+.+.+. .. ..| T Consensus 1 msip~s~ivnV~i~~~~~a~~~~~f-~~~l~l~~~~~~~~~~~~~r~~~~~s~~~V~~~-FG~~s~ey~aA~~yF~-q~p 77 (502) T protein:vir:52 1 MALSISHIVNVQLNTVPKSAARKSF-GIVALFTPEAGQAFADEKTRYVYVENQRDVEQL-FGTNSETAKAAQPFFA-QSP 77 (502) T ss_pred CCCCccceeEEeecccccccccccc-CceEEEeeccCccccCCccceEEecCHHHHHHh-cCCChHHHHHHHHHhc-CCC Confidence 5666444455554322111000111 111111110000000100112111111111100 001111111000 00 022 Q ss_pred cccceeeEecc--ceeEEeecccccccCcce--------eeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCc Q lcl|NC_011270. 179 NSGQVYVLGTD--YVVTRVNAGEDGEANTRD--------DLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPD 248 (581) Q Consensus 179 ~~~~~~vtgtd--~~v~~v~~~~dg~~~~~~--------~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~ 248 (581) -+.++...+-. ........+.-....... .-.++...+++... .+++-+.... +..+ T Consensus 78 ~P~~l~igR~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~G~l~i~i~g~~~----------t~~~i~lS~~---ts~~ 144 (502) T protein:vir:52 78 RAKQLIVARWQKSASTIEATKNTLSGATLSDDLERFKSVVNGRFSLTIGGDVK----------KVDGLSFARL---ADFN 144 (502) T ss_pred ccceEEEEeccccccceeechhhhhhhhhHHhHHHhhhhcCceeEEEecceee----------eeeccccccc---cchh Confidence 23332221100 000000000000000000 00001111111000 0000000000 1111 Q ss_pred chhhhhhhhhhhhcc--------cccccee--------------------------eeeeeecCCcceeEEeeeccCCcc Q lcl|NC_011270. 249 DIQDFYGPAFDEAGN--------VQSEITL--------------------------CAQLAITNGASTILACAVDPEGDT 294 (581) Q Consensus 249 ~~~~~~~~a~~~~g~--------~~~~i~~--------------------------~~~~~~~~g~~~~~~~~~~~~~~~ 294 (581) ++......++...+. ..+.+.. +.-+.++.+...+.- ... ... T Consensus 145 ~vA~~i~~~l~~~~~~~tv~~d~~~~~F~i~s~ttg~~~~~~~~~a~~~~~~gt~~a~~l~l~~~~~av~v-~~~--~~g 221 (502) T protein:vir:52 145 AVATKIQEKLTTLSVAVSIAYDETGNRFIVSANVAGEDKKTEIDYAIDEGGEGEYIGALLKLENGQASRKV-GKN--SVS 221 (502) T ss_pred HHHHHHHhhhcccccceEEEEecCCceEEEEeccCCCcceeEEEEeecCCcchhHHHHHhccccccceeee-eee--ccc Confidence 111111111110000 0000000 000111111111111 001 112 Q ss_pred cchhhHHHHHHHHhcCC--ceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCcc Q lcl|NC_011270. 295 VTMGDYQNALNKFRDED--EIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQR 372 (581) Q Consensus 295 ~t~~dy~~al~~l~~~~--~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r 372 (581) .+.+.+.++|+++.+.. +..++++...+.+-+.++.+|++.. ++++.+.-.................+..+..| T Consensus 222 ~~aet~~~al~a~~~~~~~w~~~~~a~~~~~~~~la~a~~iea~----~~~f~~~~~d~~~~~~~~~~i~~~l~a~~~~~ 297 (502) T protein:vir:52 222 LKKETLGEALFNVAEVNNTWYGFTVAAQLTDSEVEAAAKYAQAN----TKLFGANVIRAEQIEWSADNIYKKLYDAGLDH 297 (502) T ss_pred ccccCHHHHHHHHHhccCceEEEEEeecCChhHHHHHHHHHhhc----CcEEEEEecCcceeccccchHHHHHHhccCce Confidence 23456788888876543 4445565543333344688888753 23443322111111111111222334456667 Q ss_pred EEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccch-----hcccccccccCcccccccCCHHHHHHHHhCCcEEE Q lcl|NC_011270. 373 VALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIA-----AMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVI 447 (581) Q Consensus 373 ~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~-----~~slt~~~l~g~~~~~~~~t~~e~~~l~~~Gv~~l 447 (581) .++++.. .. . ++++.+.|..+..++ ...+.||.++|+. ...++..|++.|.++|++++ T Consensus 298 t~~~y~~-----------~~---~-~~~aa~~g~~as~~f~~~~g~iT~~fk~l~GV~--~~~lt~t~~~al~~~~~N~y 360 (502) T protein:vir:52 298 TLAMFDK-----------ND---M-YPVSSALARLLSTNFAANNSTLTLKFKQQPTIT--ADEITATEFAKAKRLGINVY 360 (502) T ss_pred eEEEecC-----------Cc---c-hhHHHHHHHHHhcCCCcCcceeeecccccCCcc--cCcCCHHHHHHHHhcCceEE Confidence 7665531 01 1 223334455565544 3356788888875 44789999999999999999 Q ss_pred EEeCCCeEEEEEeeeccCCCcccceEEeehhhHHHHHHHHHHHhhhcC--CCc-cCCHHHHHHHHHHHHHHHHHHHhCCc Q lcl|NC_011270. 448 EKTPRNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGL--IGM-PIYDTTIVQVKASAEAALVWLVDNNI 524 (581) Q Consensus 448 ~~~~~~~v~i~~~itT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~f--iG~-~n~~~~r~~ik~~i~~~L~~l~~~ga 524 (581) ....+ .-.+.+|.+.-. + .|-.++-.|++...|++.+....| -+| |=++.+...|++.++..|++..+.|+ T Consensus 361 ~~~~~-~~~~~~G~~~~G---~--~iD~~~~~~Wl~~~lq~~l~~~L~~s~~kIPy~~~G~~~l~a~i~~~l~~a~~~G~ 434 (502) T protein:vir:52 361 TYFDD-VAMIAEGTVIGG---K--FADEIVILDWFVDAVQKEVFARLYKSPTKIPLTDKGQAILIAAVEKVCLEGINNGA 434 (502) T ss_pred EEecC-eeEEecCeeeCC---c--hhhHHHHHHHHHHHHHHHHHHHHHhcCCCcccChhHHHHHHHHHHHHHHHHHhcCc Confidence 77644 445667766543 3 455888889999999888754333 245 78999999999999999999999999 Q ss_pred eeC--------------------Ccc-----ceeEEeecCCCE-EEEEEEEEecCceeEEEEEEEEEe Q lcl|NC_011270. 525 IRG--------------------YRN-----LKARQIERQPDV-IEVRYEWRPAYPLNYIVVRYSIAP 566 (581) Q Consensus 525 I~~--------------------~~~-----~~~~~~~~~~~~-~~v~i~v~pv~~~e~I~~~~~~~~ 566 (581) |.. |.- .+..+.+..... --|.+.+.+...+++|.+.++++- T Consensus 435 I~~G~~~~~~~g~~~~~d~~~~gy~v~~~~~~~~s~~dr~~R~~~~~~~~~~~aGaIh~v~i~~nv~~ 502 (502) T protein:vir:52 435 FAPGKWTGAGFGNLSTGDYLDKGFYVWAAPMDTLSDSDRQARRATPIQTAVKLAGAIHSSDVIVNYNR 502 (502) T ss_pred cccccccCcccceeeecccccCceEEEeCchhhCCHHHHHcccCCCeEEEEEECceEEEEEEEEEEeC Confidence 963 110 011111222222 258999999999999999887777 No 62 >protein:vir:3165 Length: 426 # NCBI annotation: capsid protein CP67 # Family: family:all:28419 # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665936;genbank:gi:22091122;genbank:GeneID:951267 Probab=97.82 E-value=1.3e-05 Score=47.28 Aligned_cols=380 Identities=12% Similarity=0.007 Sum_probs=151.9 Q ss_pred ccceeeeeccccccceeeEeccceeEEeeccccccc----Ccceeeeeeeeeeeccccc-ccce-----e-EE-EEeecC Q lcl|NC_011270. 168 IKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEA----NTRDDLYTIQRVVDGGHID-PGDI-----V-QL-SYRYTD 235 (581) Q Consensus 168 ~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~----~~~~~~~ti~~~vd~~~~d-~~~~-----~-~~-s~~~~~ 235 (581) |......+.-.-.-.+....++..+..+.....-+. .-.-.++.++.+-+|.+.+ +... . +. ...+.. T Consensus 1 m~~~iVnV~Is~~t~A~~~~~Fg~~liigs~~~~~p~~~f~~~~~Yss~~~V~~Dfg~~s~~Y~AA~~~f~Q~~~~~r~~ 80 (426) T protein:vir:31 1 MPKQIVEIELTAEIADRPQETFTDAAIVGTAEEEPPDAEFGEVNQYSTSTSVGDDYGEDSDVYTASEAIEEMGAEQWRVM 80 (426) T ss_pred CCcceEEEEeecccccccccccceeeeeeeccccccccccchhhhhhhHHHHHhcCCCChHHHHHHHHHHhCCceeEEee Confidence 332111111000001111222222221111111110 0111112222111211110 0000 0 00 000000 Q ss_pred Cccccee--Ee-ccCcch----------hhhhhhhhhhhccccccceee-eeeeecCCcceeEEeeeccCCcccchhhHH Q lcl|NC_011270. 236 PNYHEVI--RF-TDPDDI----------QDFYGPAFDEAGNVQSEITLC-AQLAITNGASTILACAVDPEGDTVTMGDYQ 301 (581) Q Consensus 236 ~~~~e~~--~~-~d~~~~----------~~~~~~a~~~~g~~~~~i~~~-~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~ 301 (581) ......+ +- ++...+ .+.....+...-+...+.+.. ... +.+...+..+.....-.-...+.||. T Consensus 81 v~~at~~~~~~~t~~~tv~g~~~s~~a~~~~~a~~i~~~~~~~~~~~~~~~~~-~~~t~~g~~t~~~~~~~~~~s~~dw~ 159 (426) T protein:vir:31 81 VLEATEVTEEELSDGDTIDKVPILGNHEVESPDGDIEFTTDDDPDVEDFDAEI-VINSATGDVATSEDSIELTYFHADWS 159 (426) T ss_pred ccccceeeeccCCcceeecceeeeecccCcchHHHHHHhhccccccccceeee-EeccccceeeccccceeeeeccCcch Confidence 0000000 00 000000 000000000000000000000 011 11111111111111111112334554 Q ss_pred HHHHHHhcCCceEEEEeCCCc-HHHHHHHHHHHHHHhcCCCcEEEEEecCCC-CCchhHHHHHHHhhccCCccEEEEEcC Q lcl|NC_011270. 302 NALNKFRDEDEIAIIVAGTGA-QPIQALVQQHVSAQSNNKYERRAILGMDGS-VTPVPSATRIANAQSIKDQRVALISPS 379 (581) Q Consensus 302 ~al~~l~~~~~~~iv~~~t~~-~~i~~~l~~~v~~~~~~~~~~~avvg~~~~-~~~~~~~~~~~~a~~~ns~r~~~v~~~ 379 (581) +++.++. .-....++.... -..+..+..|.....+. ++..+..... .........++ ...... . +.|. T Consensus 160 -~~~~~~s-~~~~~~ia~~~~~~~~~~~~~~~~~wa~~~---~i~~va~~~e~~~~~~~~~~~a--~~~~~~--~-y~p~ 229 (426) T protein:vir:31 160 -QLDEFPS-DVNNFAVADRRFDLKGVGVLDETHSWASDE---DMGMIANGVNVDDYDSVDEAMD--VAHEVA--G-YVPS 229 (426) T ss_pred -hhhcccc-cchhhhhhccccchhhhhhhHhhhhhhhhc---ceeeeeeccchhhhcchhhhhh--hhhccc--c-cccc Confidence 4554442 111112332211 12223333344433332 2222221111 11111111111 101100 0 1111 Q ss_pred eeEecccccCCceecCHHHHHHHHHHHhhccchhcccccccccCcccccccCC--------HHHHHHHHhCCcEEEEEeC Q lcl|NC_011270. 380 SFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAMPLTRKVIRGFSGPAEVQR--------DGEKSRESSEGLMVIEKTP 451 (581) Q Consensus 380 ~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~slt~~~l~g~~~~~~~~t--------~~e~~~l~~~Gv~~l~~~~ 451 (581) ..... .....-++ ..+++++..+..+|+.+|+.+.+++........+ ..+.....++-++.|+... T Consensus 230 ~~~~~----~~~~~~~~--~~~~~~~~~aa~~~~~~~~~~~~~~~~~~~~~~~~~gv~~t~~~~~~A~~~~~~n~~~~~~ 303 (426) T protein:vir:31 230 GDLMM----IVDASDDD--LAAYQLGKFAVSEPWYNPLWNELPAGETVSKNVGDPEEQGTFEGGDEAEGEGPVNVLIDVS 303 (426) T ss_pred hhhee----ehhccccc--hhhHHhhhhhhhccccchhhhhccccccceeeccccccccccchhhhhhhcCCceEEEEec Confidence 10000 00111111 3678888899998988888776665443322222 1223334456678888766 Q ss_pred CCeEEEEEeeeccCCCcccceEEeehhhHHHHHHHHHHHhhhcCCC--c-cCCHHHHHHHHHHHHHHHHHHHhCC--cee Q lcl|NC_011270. 452 RNLVHVRHGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLIG--M-PIYDTTIVQVKASAEAALVWLVDNN--IIR 526 (581) Q Consensus 452 ~~~v~i~~~itT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fiG--~-~n~~~~r~~ik~~i~~~L~~l~~~g--aI~ 526 (581) +. ..|.+++++-.......+|=++|..|++...|+..+.. .++. | |=++.+...|++.|++-|+...+.| .+. T Consensus 304 ~~-~~i~~~~~~~G~~~~G~~iD~~~g~dwl~~~iq~~l~~-ll~~~~KIpyt~~Gi~~I~~~i~~~L~~~v~~~g~~~~ 381 (426) T protein:vir:31 304 DA-NRVSNAVTTAGADSDTSFFDIRRTKVYTAEMLELDLES-LQVSDDDVPFTEDGQAMIEDAIKGTMSGLTGSVGQPLA 381 (426) T ss_pred Cc-eeeecceeecccccchhhhhhHHHHHHHHHHHHHHHHH-HhhcCCCCccchhHHHHHHHHHHHHHHHHhcCCCcccc Confidence 54 66889998888777778899999999999999999864 4553 3 5678899999999999999888753 344 Q ss_pred CCcc--ceeE-EeecCCCEEE--EEEEEEecCceeEEEEEEEEEe Q lcl|NC_011270. 527 GYRN--LKAR-QIERQPDVIE--VRYEWRPAYPLNYIVVRYSIAP 566 (581) Q Consensus 527 ~~~~--~~~~-~~~~~~~~~~--v~i~v~pv~~~e~I~~~~~~~~ 566 (581) +|.- ++.. ...+...|++ +.+..+..-+++++.+...+++ T Consensus 382 ~y~v~~P~~~~~~~dra~R~~~~i~~~~~laGAIh~v~I~g~v~v 426 (426) T protein:vir:31 382 EYEVDVPEWDDDDVDRVNRNWGGIDLDARLAQRAHTFSLGLNVSV 426 (426) T ss_pred ceeecCCCccccchhhhhhccCCceEEEEEeCcEEEEEEEEEEeC Confidence 4531 1111 1112223332 8888889999999999998888 No 63 >protein:vir:99586 Length: 507 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039795;genbank:gi:126011045;genbank:GeneID:4818281 Probab=96.58 E-value=0.00049 Score=38.71 Aligned_cols=373 Identities=10% Similarity=-0.014 Sum_probs=149.6 Q ss_pred cccceeeeeccccccceeeEeccceeEEeecccccccCccee-eeeeeeeeeccccc--------------------ccc Q lcl|NC_011270. 167 GIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDD-LYTIQRVVDGGHID--------------------PGD 225 (581) Q Consensus 167 ~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~-~~ti~~~vd~~~~d--------------------~~~ 225 (581) .+.-...--+.++..........+..-.+-.........+.. +...+.+-++.+.+ |.. T Consensus 1 mip~s~iVnV~~~v~~~a~~~~~~~~~lilt~~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsq~p~~~~~P~~ 80 (507) T protein:vir:99 1 MISQSRYVRIVSGVGAGAPVAQRRLIMRVMTTNAVLPPGVVFESSSADAVGAYFGMASEEYKRAKAYMSFISKSINSPSY 80 (507) T ss_pred CCCccceeEEeeeccccCcccccccceeeeccccCCCccceEeecCHHHHHHhcCCChHHHHHHHHHhccCCCCCcccce Confidence 222111111122222221111111111111111111111211 12222222221111 111 Q ss_pred eeEEEEeecCCcc---c---------------ceeEe-ccCc----------------chhhhhhhhhhhhcc---cccc Q lcl|NC_011270. 226 IVQLSYRYTDPNY---H---------------EVIRF-TDPD----------------DIQDFYGPAFDEAGN---VQSE 267 (581) Q Consensus 226 ~~~~s~~~~~~~~---~---------------e~~~~-~d~~----------------~~~~~~~~a~~~~g~---~~~~ 267 (581) +-.-.+.-+.... + -.+++ .++. ++......++.+... .... T Consensus 81 L~igR~~~~~~~a~l~g~~~~~~l~~~~~~~~G~lti~v~G~~~t~~~i~lS~~ts~~~vAs~i~~~l~a~~~~~~~~~t 160 (507) T protein:vir:99 81 ISFARWVNAAIASMIVGDSLVKNLPALKAVATPTLSLSIGGTVVPIAGIDLTAALTLTDVAATLQTKIRASANAELATAT 160 (507) T ss_pred EEEEeecCccccceeecchhhhhHHHHhhhcceeEEEEEcCceeEeccccccccCCHHHHHHHHHHhhhccccccccceE Confidence 1111110000000 0 00000 0000 000000000000000 0000 Q ss_pred cee-----e--eeeeecCCccee-EEeeecc----------------CCcccchhhHHHHHHHHhcC--CceEEEEeC-- Q lcl|NC_011270. 268 ITL-----C--AQLAITNGASTI-LACAVDP----------------EGDTVTMGDYQNALNKFRDE--DEIAIIVAG-- 319 (581) Q Consensus 268 i~~-----~--~~~~~~~g~~~~-~~~~~~~----------------~~~~~t~~dy~~al~~l~~~--~~~~iv~~~-- 319 (581) ++. . +....++..+++ ++...++ -......+...++++++.+. .+..+++.. T Consensus 161 v~~d~~~~~F~v~s~~tG~~s~i~~at~~~~gt~~s~l~~~~~~~a~~~~g~~aet~~~a~~a~~~~~~nW~~~~~a~~~ 240 (507) T protein:vir:99 161 VTFNTTTNQFVLNGTTTGALAPTITAVRTDPATDISSLLGWTNTGTVFVKGQAAETPDTSISKSAAISTNFGSFIYTSTP 240 (507) T ss_pred EEEecCCceEEEEeeeccccceeEEEEcCCchhhHHHHhccccccceEeecccccCHHHHHHHHHhhcCCeEEEEEEecc Confidence 000 0 000000000000 0000000 00001223455677766543 333333332 Q ss_pred -CCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHH Q lcl|NC_011270. 320 -TGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQF 398 (581) Q Consensus 320 -t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~ 398 (581) .+++.+ .++.+|++.. ++++.+.-..... .... ..+.........+... ....|.++ T Consensus 241 ~~td~~~-lalA~wiea~----~~~f~~~~~~~~a---~~~~--~~~~~~~~~~~~~~~~------------~~~~~~~~ 298 (507) T protein:vir:99 241 ALTNDQI-TAVASWNASQ----NNMYMYSVPTTIA---NIGT--LYAAVKGFSGCALNIT------------SDSLPVDY 298 (507) T ss_pred ccChHHH-HHHHHHHhhc----CcEEEEEEecCch---hhhh--hhhhhhhcceeEEEee------------cccccchh Confidence 233343 4477777744 2344333211111 1111 1111111111111110 11234445 Q ss_pred HHHHHHHHhhccch-----hcccccccccCcccccccCCHHHHHHHHhCCcEEEEEeCC--CeEEEE-EeeeccCCCccc Q lcl|NC_011270. 399 MAAAVAGKSVSAIA-----AMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPR--NLVHVR-HGVTTDPTSLHT 470 (581) Q Consensus 399 ~Aa~vAgl~a~~~~-----~~slt~~~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~--~~v~i~-~~itT~~td~~~ 470 (581) .++.+.|..+..+. ...+.+|.++|+. ...++..|.+.|.++|++++....+ ..+.+. +|+.+ ...-+| T Consensus 299 ~~aa~~g~~as~nf~~~ng~~T~~fk~l~GV~--a~~lt~t~a~al~~~n~N~y~~~a~~~~~~~~~~~G~~~-gG~~~f 375 (507) T protein:vir:99 299 IEQSPCEILAATDYTRVNATQNYMYYQFPSRN--ITVSDDTTANLVDANRGNYIGQTQSAGQSLAFYQRGILC-GGPNDA 375 (507) T ss_pred HHHHHHHHHHhhccCcCccceeecccccCCcc--cccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeee-CCcccc Confidence 56666677776554 3356777888875 3468999999999999999865543 334444 55544 222267 Q ss_pred ceEEeehhhHHHHHHHHHHHhhhcCC--Cc-cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEEe----------- Q lcl|NC_011270. 471 REWNIIGQQDVMVYRIRDYLDADGLI--GM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQI----------- 536 (581) Q Consensus 471 ~~i~v~R~~d~i~~~ir~~~~~~~fi--G~-~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~~----------- 536 (581) .++.+.+=.+++...|+..+.. .|. +| |=++.+...|++.+++.|++-.+.|+|..-...+..|. T Consensus 376 id~d~~~~~~WL~~~iq~~l~~-l~~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~I~~Gvtl~~~q~~~in~~~~~~~ 454 (507) T protein:vir:99 376 VDMNIYANEIWLKSAISAQILS-LFLNVPRVPANETGESMLLSVIQSVVNTAKNNGTISAGKNLNVIQQQYITQISGDAN 454 (507) T ss_pred eeeeeecchHHHHHHHHHHHHH-HHhcCCCCccChhhHHHHHHHHHHHHHHHHhccccccCCcccccchheecccccccc Confidence 7788888888888888877753 454 34 67999999999999999999999999976321111100 Q ss_pred ------------------------ecCCCEEEEEEEEEecCceeEEEEEEEEE Q lcl|NC_011270. 537 ------------------------ERQPDVIEVRYEWRPAYPLNYIVVRYSIA 565 (581) Q Consensus 537 ------------------------~~~~~~~~v~i~v~pv~~~e~I~~~~~~~ 565 (581) ...+....+.+.+.-..++++|.+.-... T Consensus 455 ~~~~~~~~Gyy~~~~~~s~~~~~~r~~r~~~~~~~~y~~~gaI~~v~~~~~~v 507 (507) T protein:vir:99 455 AWRQVANIGYWLNITFSSYTNPNTQLTEWKASYQLIYSKDDAIRFVEGTDTLI 507 (507) T ss_pred cccceeccceEEEeCChHhcChhhhhccccceEEEEEEeCCeEEEEEeeeecC Confidence 01112233444444444555554444333 No 64 >protein:vir:96104 Length: 504 # NCBI annotation: hypothetical protein ORF029 # Family: family:all:396 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294446;genbank:gi:149408343;genbank:GeneID:5237223 Probab=96.39 E-value=0.00067 Score=37.99 Aligned_cols=388 Identities=11% Similarity=0.025 Sum_probs=142.2 Q ss_pred cccceeeeeccccccceeeEec-cceeEEeecccccccCcceee-eeeeeeeeccccc-c-cceeEEEEeecCC--cccc Q lcl|NC_011270. 167 GIKTDTIRVVNPNSGQVYVLGT-DYVVTRVNAGEDGEANTRDDL-YTIQRVVDGGHID-P-GDIVQLSYRYTDP--NYHE 240 (581) Q Consensus 167 ~~~~~~~~l~~~~~~~~~vtgt-d~~v~~v~~~~dg~~~~~~~~-~ti~~~vd~~~~d-~-~~~~~~s~~~~~~--~~~e 240 (581) .+.-...--+.++......... +........ .......+... ...+.+-++.+.+ + ......=+...++ .... T Consensus 1 mip~s~iV~V~~~v~~~~~~~~~~~~~l~l~~-~~~~~~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yF~~~~~~~~~P~ 79 (504) T protein:vir:96 1 MISQSRYIRIISGVGAGAPVAGRKLILRVMTT-NNVIPPGIVIEFDNANAVLSYFGAQSEEYQRAAAYFKFISKSVNSPS 79 (504) T ss_pred CCCccceeEeeecccccccccccccceeEeec-ccCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhcCCCCCcccc Confidence 2221111111111111111111 111111111 00000011111 1111111111110 0 0000000001010 0111 Q ss_pred eeEeccCcchhh-h---------hhhhhh--hhccc----------cccceee--------------------------e Q lcl|NC_011270. 241 VIRFTDPDDIQD-F---------YGPAFD--EAGNV----------QSEITLC--------------------------A 272 (581) Q Consensus 241 ~~~~~d~~~~~~-~---------~~~a~~--~~g~~----------~~~i~~~--------------------------~ 272 (581) ...+..+.+... . ....+. ..|.. ...+.++ + T Consensus 80 ~l~igR~~~~a~~~~l~g~~~~~~~~~~~~i~~G~lsitv~G~~~~~~~i~~S~~ts~~~vA~~i~~al~~~~~~~~~~~ 159 (504) T protein:vir:96 80 SISFARWVNTAIAPMVVGDNLPKTIADFAGFSAGVLTIMVGAAEKNITAIDTSAATSMDNVASIIQTEIRKNTDPQLAQA 159 (504) T ss_pred EEEEEeecCcCccceEEechhHHHHHHHhhhhceEEEEEEcceeeeecccccccccchHHHHHHHHhhhhcccccccccc Confidence 112222211100 0 000000 00100 0000000 0 Q ss_pred eee---------ecC---CcceeEEeeec-c---------C-Cc-----ccchhhHHHHHHHHhcCC--ceEEEEeC--C Q lcl|NC_011270. 273 QLA---------ITN---GASTILACAVD-P---------E-GD-----TVTMGDYQNALNKFRDED--EIAIIVAG--T 320 (581) Q Consensus 273 ~~~---------~~~---g~~~~~~~~~~-~---------~-~~-----~~t~~dy~~al~~l~~~~--~~~iv~~~--t 320 (581) ... ++. |.......... + . +. ....+-..++++++.+.. +..+.+++ . T Consensus 160 tv~~d~~~~~f~its~~tg~~~~~~~~~a~~~~~~~~lgl~~~~~~~v~g~~aet~~~al~al~~~~~~Wy~f~~a~~~~ 239 (504) T protein:vir:96 160 TVTWNPNTNQFTLVGATIGTGVLAVAKSADPQDMSTALGWSTSNVVNVAGQAADLPDAAVAKSTNVSNNFGSFLFAGATL 239 (504) T ss_pred eEEEeccCCeEEEEeeccccceeEEEeeccccchhhhhhcccccceEEeecccccHHHHHHHHHhhcCCeEEEEEEeccC Confidence 000 000 00000000000 0 0 00 001223556666666543 23334433 2 Q ss_pred CcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHH Q lcl|NC_011270. 321 GAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMA 400 (581) Q Consensus 321 ~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~A 400 (581) +++.+. ++.+|++.. .++++.++-. ... .........+..+ .+..++++. .....++.++.. T Consensus 240 ~dd~il-alA~w~ea~---~~~~~~~~~~-~~~---~~~~~~~~~~~~~-~~~~~~~~~---------~~~~~~~~~~~~ 301 (504) T protein:vir:96 240 DNDQIK-AVSAWNAAQ---NNQFIYTVAT-SLA---NLGALFDLVKGNS-GTALNVLSA---------TASNDFVEQCPS 301 (504) T ss_pred CHHHHH-HHHHHHhhc---CceEEEEEee-ccc---chhhHHHhhhhcc-eeEEEEeec---------CccchhHHHHHH Confidence 344443 467777643 2333322221 111 1111112222222 223332211 111223333333 Q ss_pred HHHHHHh-hccchhcccccccccCcccccccCCHHHHHHHHhCCcEEEEEeCCC--eEEE-EEeeeccCCCcccceEEee Q lcl|NC_011270. 401 AAVAGKS-VSAIAAMPLTRKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPRN--LVHV-RHGVTTDPTSLHTREWNII 476 (581) Q Consensus 401 a~vAgl~-a~~~~~~slt~~~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~~--~v~i-~~~itT~~td~~~~~i~v~ 476 (581) +++|... .+..-...+.||.++|+. ...++..|.+.|.++|++.+....+. ...+ -+|+.+- ..-.|..|.+. T Consensus 302 ~~~as~~f~~~ng~~T~~fk~l~GVt--a~~lt~t~~~aL~~~~~N~y~~~a~~~~~~~~~~~G~~~g-G~~~~~wiDv~ 378 (504) T protein:vir:96 302 EILAATNYDEPGASQNYMYYQFPGRN--ITVSDDTAANTVDKSRGNYIGVTQANGQQLAFYQRGILCG-GPTDAVDMNVY 378 (504) T ss_pred HHHHhcCcCcccccccccccccCCcC--cccCCHHHHHHHHhcCCeEEEEeecccceeeEEecCeeeC-Cccccchhhhh Confidence 4443333 222334477888999875 45789999999999999998654432 2333 4666553 22256678888 Q ss_pred hhhHHHHHHHHHHHhhhcCC--Cc-cCCHHHHHHHHHHHHHHHHHHHhCCceeCCcccee------EE---eecC-CCE- Q lcl|NC_011270. 477 GQQDVMVYRIRDYLDADGLI--GM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKA------RQ---IERQ-PDV- 542 (581) Q Consensus 477 R~~d~i~~~ir~~~~~~~fi--G~-~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~------~~---~~~~-~~~- 542 (581) +-.+++...|+..+.. .|. +| |=++.+...|++.++..|++-.+.|+|..-..... .. .+.. ++. T Consensus 379 ~~~~WL~~~lq~~l~~-l~~~~~kIPyt~~Gi~~l~a~i~~vl~~av~~G~I~~Gv~~~~~q~~~I~~~~~~d~~~~~~~ 457 (504) T protein:vir:96 379 ANEIWLKSAIAQALLD-LFLNVNAVPASMVGEAMTLAVLQPVLDKATSNGTFTYGKDISAVQQQYITQITGDRRAWRQVQ 457 (504) T ss_pred hhHHHHHHHHHHHHHH-HHhcCCCcccCHhhHHHHHHHHHHHHHHHHhcceeccCccCCccchheeccccccccccccee Confidence 8888888888887753 343 34 67999999999999999999999999965321110 00 0000 001 Q ss_pred ---EEEEEEEE----ec-CceeE---EEEEEEEEeccceEE-EEEeeccc Q lcl|NC_011270. 543 ---IEVRYEWR----PA-YPLNY---IVVRYSIAPETGDIT-STIEGTTS 580 (581) Q Consensus 543 ---~~v~i~v~----pv-~~~e~---I~~~~~~~~~tg~~~-~~~~~~~~ 580 (581) .++.+... +. |..++ |.+.+ -..|.|. ++|-++.- T Consensus 458 ~~GYyv~~~~~s~~s~~~r~~R~~~~~~~~y---~~~gaI~~v~~~~~~v 504 (504) T protein:vir:96 458 TLGYWINITFSSYTNSNTGLTEWKANYTLIY---SKGDAIRFVEGSDVMI 504 (504) T ss_pred ccceEEEecChhccChhHhhhccccceEEEE---EECCeEEEEEeccccC Confidence 22332110 11 11111 22222 1223322 11212111 No 65 >protein:vir:106730 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944312;genbank:gi:38638611;genbank:GeneID:2657359 Probab=94.72 E-value=0.0037 Score=33.93 Aligned_cols=390 Identities=16% Similarity=0.094 Sum_probs=160.8 Q ss_pred CeeccccccCCCcccccCcccccc--cccccCceee-EEEecCCCCceeeeeEEcCcCCceeeEEEEEEeccccceeEEE Q lcl|NC_011270. 1 MAIDFSQYQTPGVYTEAVGAPQLG--IRSSVPTAVA-IFGTAVGYQTYRESIRINPDTGETITTQILALVGEPTGGSFKL 77 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~g~~~~~--~~~~~~~~~~-~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~~~~~~GtF~l 77 (581) |+=+|+...| .|+.+. .=+-+ -+...|+.+. ..|..... .-.++.-+..+. .+... ..++|+|+| T Consensus 53 V~~~FG~~S~--ey~aA~-~yFsg~~~q~p~P~~l~igR~~~~~~---~~~l~g~~l~~~-----~la~~-~~~~g~l~i 120 (501) T protein:vir:10 53 VENWFGALSN--EAKIAD-AYFPGIVNGGQLPYDLKFARYVAADA---PASVYGIPLTGI-----TLAQL-QGYSGTLTV 120 (501) T ss_pred HHHhcCCChH--HHHHHH-HHhhhhcCCCccccEEEEEeecccCc---cceeeeceehhh-----hhhhh-hheeeEEEE Confidence 4444554333 344320 00000 0111122111 11111000 000110010111 11110 135699999 Q ss_pred EeCceec-cccccCC--CHHHHHHHHHhcCCCCcceEEEEcCCCceEEEEecCCccccccccceeccCCCceEEEEEccc Q lcl|NC_011270. 78 SLAGEPT-GNIPFNA--TQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQT 154 (581) Q Consensus 78 ~~~g~~T-~~i~~~a--sa~~v~~aLe~l~~i~~~~V~~~~~~g~~w~Vtf~g~~~~l~~~~~~l~~g~~~~v~v~~~~~ 154 (581) +.+|..+ ..|.+.+ |..++.+.|++-- +.-. ++|+|..-... +.+...++ T Consensus 121 ~i~g~~~~~~i~~s~ats~~~vA~~i~~al--~~~~----------~tv~~d~~~~~---------------f~i~~~t~ 173 (501) T protein:vir:10 121 TTAAQHVSANISLAAATSFANAATLIEAAF--TSPD----------FVVAYDALRNR---------------FTVVTNTT 173 (501) T ss_pred eeccceeeeccccccccCHHHHHHHHHHhh--cCCc----------eEEEEecccce---------------EEEEeccc Confidence 9999543 3344432 3445555554321 1112 23333211111 11112222 Q ss_pred ccce-eeeccccccccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEee Q lcl|NC_011270. 155 GVPA-MNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRY 233 (581) Q Consensus 155 g~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~ 233 (581) |..+ +... +++ . T Consensus 174 G~~~~i~~~-----------------------t~~-----------------------------------~--------- 186 (501) T protein:vir:10 174 GTAAAISAV-----------------------TGT-----------------------------------N--------- 186 (501) T ss_pred CcceeEEEe-----------------------ecc-----------------------------------c--------- Confidence 2100 0000 000 0 Q ss_pred cCCcccceeEeccCcchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcC--C Q lcl|NC_011270. 234 TDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDE--D 311 (581) Q Consensus 234 ~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~--~ 311 (581) ++.+. +.++.+....... .+ ....+ ..++++++.+. . T Consensus 187 ---------------d~a~~--------------------l~Lt~~~~a~v~~--~g-~~aet---~~~Al~a~~~~~~~ 225 (501) T protein:vir:10 187 ---------------NLADE--------------------LGLSAAAGATLQA--AG-VAADT---PASAMNRAVGLSRN 225 (501) T ss_pred ---------------cchhh--------------------hcccccCceeEEe--cC-ccccc---HHHHHHHHHhcccc Confidence 00000 0000000000000 00 00011 22344443332 2 Q ss_pred ceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCc--hhH-HHHHHHhhccCCccEEEEEcCeeEeccccc Q lcl|NC_011270. 312 EIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTP--VPS-ATRIANAQSIKDQRVALISPSSFVYYAPEL 388 (581) Q Consensus 312 ~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~--~~~-~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~ 388 (581) +..+.........-+.++.+|++... ++++...- ...... ... ......-+.-|..|.++++.. T Consensus 226 Wy~f~~a~~~~~~~~la~A~wi~a~~---~~f~~~~~-~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~--------- 292 (501) T protein:vir:10 226 WATFTTAWTAVIADRLAFAAWNSGQA---YKYMYVAP-DLEAASIVTNNAASFGAQVFAAPYQGTLPLYGD--------- 292 (501) T ss_pred eEEEEEEecCChHHHHHHHHHHHhcC---ceEEEEEe-cCcceeeecccchhHHHHHHhcCCCceEEECCC--------- Confidence 22233333233332335777776442 22222211 111110 001 111122333456676665421 Q ss_pred CCceecCHHHHHHHHHHHhhccchhc-----ccccccc-cCcccccccCCHHHHHHHHhCCcEEEEEeCC--CeEEEE-E Q lcl|NC_011270. 389 NREVVLGGQFMAAAVAGKSVSAIAAM-----PLTRKVI-RGFSGPAEVQRDGEKSRESSEGLMVIEKTPR--NLVHVR-H 459 (581) Q Consensus 389 ~~~~~~p~~~~Aa~vAgl~a~~~~~~-----slt~~~l-~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~--~~v~i~-~ 459 (581) ..| ++.+.|..+..+..+ .+.||.+ +|+. ...++.+|.+.|.++|++++....+ ..+.+. + T Consensus 293 ----~~~----~aa~~g~~as~nf~~~~g~~T~~fkql~~Gv~--a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~~ 362 (501) T protein:vir:10 293 ----QAT----AGAVMGYAASINFQLRNGRTVLAFRQFNAGVP--ATAHDLPTANALRSNNYTYIGAYANAANNYTIAYD 362 (501) T ss_pred ----CCH----HHHHHHHHHhcCcccCcceeeeeecccCCCcC--cccCCHHHHHHHHhcCCeEEEEEecccceeeEEEc Confidence 011 223345555555443 4556665 4543 3568999999999999998865433 345554 5 Q ss_pred eeeccCCCcccceEEeehhhHHHHHHHHHHHhhhcCC--Cc-cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEE- Q lcl|NC_011270. 460 GVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLI--GM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQ- 535 (581) Q Consensus 460 ~itT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fi--G~-~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~- 535 (581) |+.+ . .|..|.+++=.|+++..|+..+.. .|. +| |=++.+...|++.++..|++-.+.|+|..-......+ T Consensus 363 G~~s-G---~~~wiD~~~g~dWl~~~iq~~l~~-ll~~~~kIPyt~~G~~~l~a~i~~~l~~av~nG~Ia~Gv~l~~~q~ 437 (501) T protein:vir:10 363 GKLS-G---KFLWVDTYLDQIYLNAELQRAEFE-AMLAYNSLPYNEDGYTANYRAGVDVIDAAVTSGIIRAGVTLTNSQL 437 (501) T ss_pred ceee-c---cceehhhHhhHHHHHHHHHHHHHH-HHhcCCCcccCHHHHHHHHHHHHHHHHHHHhCcceecCcccCcccc Confidence 5422 2 456788888888888888877753 344 34 6789999999999999999999999997632111110 Q ss_pred ---e-----ec------CCC-------------------EEEEEEEEEecCceeEEEEEEEEEeccceEEEEEeecccC Q lcl|NC_011270. 536 ---I-----ER------QPD-------------------VIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 536 ---~-----~~------~~~-------------------~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~~~~~~~~~~ 581 (581) . ++ +.| ...+.+.+.-...+++|-| +++-. T Consensus 438 ~~i~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i----------------~s~~v 500 (501) T protein:vir:10 438 QQIDAVAGVAGAGALVQTRGWYFLIGNPANPGQARQNRTSPACTLWYSDGGSIQELTI----------------GSNAV 500 (501) T ss_pred eeecccccccccccceeccceEEeeCcccCChhhhhhcccCceEEEEEeCCceeEEEe----------------eeeec Confidence 0 00 111 1223333333344444433 22222 No 66 >protein:vir:94073 Length: 494 # NCBI annotation: hypothetical protein # Family: family:all:396 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453619;genbank:gi:84662655;genbank:GeneID:5142583 Probab=92.77 E-value=0.01 Score=31.54 Aligned_cols=401 Identities=12% Similarity=0.066 Sum_probs=140.8 Q ss_pred cccccc------ccceeccCC---CceEEEEEccccccee---eeccccccccceeeeec-----------------ccc Q lcl|NC_011270. 129 VAALTK------DVTGLTGGD---DPDLNIASEQTGVPAM---NRALAKKGIKTDTIRVV-----------------NPN 179 (581) Q Consensus 129 ~~~l~~------~~~~l~~g~---~~~v~v~~~~~g~~~~---~~~~~~~~~~~~~~~l~-----------------~~~ 179 (581) .|.|.+ ...-++.+. ++...+.......+.. ... ...++. +..+.. .|. T Consensus 1 m~~ip~s~iV~V~~~v~~~~~~~~~f~~~l~~~~~~~~~~r~~~y~-s~~~V~-~~FG~~S~ey~aA~~yFs~~~~q~p~ 78 (494) T protein:vir:94 1 MPNIPISQIVSINPQVVSAGGTQGTLDGLLLTQATGFPVTQPQVYF-SAADVG-TAFGLTSDEYNAALVYFAGILGGGQQ 78 (494) T ss_pred CCCCCcccEEEeeeeccccCCcccccceeEeecCccCCccceeeec-CHHHHH-HhcCCChHHHHHHHHHhhhccCCCcc Confidence 222211 111111000 0000111100000100 000 000000 000110 111 Q ss_pred ccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEecc---Ccchhhhhhh Q lcl|NC_011270. 180 SGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTD---PDDIQDFYGP 256 (581) Q Consensus 180 ~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d---~~~~~~~~~~ 256 (581) +.....-+- ..........+. .+...+..... . ...+++..++......+.|.. .+++.+.... T Consensus 79 P~~l~igR~------~~~a~~~~l~g~----~~~~tl~~~~~--~-~g~l~iti~g~~~~~~i~lS~~ts~~~vA~~i~~ 145 (494) T protein:vir:94 79 PASLTIGRY------ASAATSAAVFGA----PLTLSLAQLQT--L-SGTLIVTTDTQRTSAAINLSGATSFANAASLMTS 145 (494) T ss_pred ccEEEEEee------cCccccceeecc----chhhhHHhhhh--c-ceEEEEEEcceEEEeeecccccCChhhHHHHHhh Confidence 111111000 000000000000 00000000000 0 001111111111111111111 1122221211 Q ss_pred hhhhhcc------ccccceeeeeeeecCCcceeEEeee-----------cc----CCcccchhhHHHHHHHHhcC--Cce Q lcl|NC_011270. 257 AFDEAGN------VQSEITLCAQLAITNGASTILACAV-----------DP----EGDTVTMGDYQNALNKFRDE--DEI 313 (581) Q Consensus 257 a~~~~g~------~~~~i~~~~~~~~~~g~~~~~~~~~-----------~~----~~~~~t~~dy~~al~~l~~~--~~~ 313 (581) ++...+. ..+.+.+. .... |......... +. .......+...++++++.+. .+. T Consensus 146 ai~~a~~~v~~d~~~~~f~v~--s~tt-G~~s~is~~t~~~a~~l~lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy 222 (494) T protein:vir:94 146 GFTTPNFAITYDAQRRRFVLS--TTAT-GTTASVSAVTGTLADGVGLSTASGAYVEGSGLAADTAASALDRLAASSSTWA 222 (494) T ss_pred hhccccceEEEcccCcEEEEE--EccC-CceeEEEEeccchhhhhhhhccccceEeecCcccccHHHHHHHHHhccCceE Confidence 2111110 00000000 0000 1100000000 00 00011233466777777654 344 Q ss_pred EEEEeCC-CcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCc---hhHHHHHHHhhccCCccEEEEEcCeeEecccccC Q lcl|NC_011270. 314 AIIVAGT-GAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTP---VPSATRIANAQSIKDQRVALISPSSFVYYAPELN 389 (581) Q Consensus 314 ~iv~~~t-~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~---~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~ 389 (581) .+.+... +++.+ .++.+|++... +.++ +..-...... .........-+..+.+|.++++..- T Consensus 223 ~f~~~~~~~~~~i-lalA~wiea~~---~~~~-~~~~~~d~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~~--------- 288 (494) T protein:vir:94 223 IFTTAWAASLSDR-TALAQWTSDQV---FRRI-YAAWDQDAAGLSVNNVSSFGNIVKTTPFSNTIPVYGLL--------- 288 (494) T ss_pred EEEEecCCCHHHH-HHHHHHHhhcC---ccEE-EEEecCCcceeecccchhHHHHHHhhcCCceEEEcCCC--------- Confidence 3444433 34444 45788887542 2222 2211111110 0111111223445667777766320 Q ss_pred CceecCHHHHHHHHHHHhhccchh-----cccccc-cccCcccccccCCHHHHHHHHhCCcEEEEEeC--CCeEEEEEee Q lcl|NC_011270. 390 REVVLGGQFMAAAVAGKSVSAIAA-----MPLTRK-VIRGFSGPAEVQRDGEKSRESSEGLMVIEKTP--RNLVHVRHGV 461 (581) Q Consensus 390 ~~~~~p~~~~Aa~vAgl~a~~~~~-----~slt~~-~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~--~~~v~i~~~i 461 (581) -|. +.+.|..+..++. ..+++| .++|+. ...++.+|.+.|.++|++++.... +....+..+- T Consensus 289 ----~~~----aa~~g~~aa~~~~~~~g~~T~~~k~q~~gi~--~~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~gg 358 (494) T protein:vir:94 289 ----ANA----MIVLAWGASTNLQIAEGRTTLALRSPVSSAG--VRVDNLANANALLSNGYTYLGKYASATNTYTVTYNG 358 (494) T ss_pred ----ChH----HHHHHHHHhccccccCcceeEEeeccCCCCC--CccCCHHHHHHHHhcCCeEEEEecccCceEEEecCc Confidence 121 2233445555553 356666 345432 345788999999999999996654 3445555554 Q ss_pred eccCCCcccceEEeehhhHHHHHHHHHHHhhhcCC--Cc-cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeE---- Q lcl|NC_011270. 462 TTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLI--GM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKAR---- 534 (581) Q Consensus 462 tT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fi--G~-~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~---- 534 (581) +. . -+|..+-..+=.++++..|+..+.. .|. +| |=++.+...|++.++..|++-.+.|+|..-...... T Consensus 359 ~~-s--G~~~~id~~~~~~WL~~~iq~~l~~-ll~~~~KIPytd~G~~~l~a~i~~~l~~av~nG~I~~Gv~~~~~q~~~ 434 (494) T protein:vir:94 359 AI-G--GQFLWADTALGWIALRRNLQQALFE-TLLAYRSLPYNADGYNALYQGAQDVVSQFVAAGVIRAGVALSASQRAQ 434 (494) T ss_pred ee-c--cccceeeeeccHHHHHHHHHHHHHH-HHHhCCCcccChhhHHHHHHHHHHHHHHHHhCceeecccccCcchhhh Confidence 32 2 2444454455445666666666542 232 45 779999999999999999999999999752110000 Q ss_pred -----EeecCCCEE----EEEEE--E-------EecCceeEEEEEEEEEeccceE-EEEEeecccC Q lcl|NC_011270. 535 -----QIERQPDVI----EVRYE--W-------RPAYPLNYIVVRYSIAPETGDI-TSTIEGTTSF 581 (581) Q Consensus 535 -----~~~~~~~~~----~v~i~--v-------~pv~~~e~I~~~~~~~~~tg~~-~~~~~~~~~~ 581 (581) -..+.++.+ ++.+. + +-.+++.|+|.- .|.| .+.|.+|.-. T Consensus 435 i~~~~G~~~~~~~~~kGyy~~~~~~~s~~~ra~R~~~~~~~~y~~------~GAIh~v~i~~~~v~ 494 (494) T protein:vir:94 435 IDQAAGVPISGDVVDKGWYLQVIDPITTTVRTDRGSPTVNFWYCD------GGSIQRVVVSATTVI 494 (494) T ss_pred hhhhhcCccccceeccceeeeccCCCChhhhhccccCCceEEEEe------cCcEEEEEEeeEEeC Confidence 000000000 01100 0 011222222221 3332 2233333333 No 67 >protein:vir:107720 Length: 515 # NCBI annotation: gp17 # Family: family:all:396 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024865;genbank:gi:48697507;genbank:GeneID:2948336 Probab=92.06 E-value=0.013 Score=30.93 Aligned_cols=411 Identities=12% Similarity=0.036 Sum_probs=166.9 Q ss_pred Ceecccccc--CCCc------ccccCcccccccccccCc-eeeEEEecC---CCCc---------------eeeeeEEcC Q lcl|NC_011270. 1 MAIDFSQYQ--TPGV------YTEAVGAPQLGIRSSVPT-AVAIFGTAV---GYQT---------------YRESIRINP 53 (581) Q Consensus 1 ~~~~~~~~~--~~~~------~~~~~g~~~~~~~~~~~~-~~~~~~~~~---g~~~---------------~~~~~~~~~ 53 (581) |+|+-++.- .|.+ ....++.+.+......|. .+..+.... .|.| ++.....+. T Consensus 1 m~I~~~~~V~i~~~v~aa~~~~~~~f~~li~t~~~~~p~~r~~~y~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~P~ 80 (515) T protein:vir:10 1 MPISFDKYVAITSGVAAQQQIAARSFAIRVYTPNPMVSVDRLITATSAADVGAYFGTASEEYKRAVKNFGFISKKTRRPT 80 (515) T ss_pred CCCCceeEEEeecccccCCccccccceeeeeecccCCCccceeeecCHHHHHHhcCCChHHHHHHHHHhhhccCCccccc Confidence 665544431 1111 112233333332222222 222221111 1111 111000000 Q ss_pred --cCCc-eeeEEEEEEeccc-----------c-ceeEEEEeCcee---ccccccCC------CHHHHHHHHHhcCCCCcc Q lcl|NC_011270. 54 --DTGE-TITTQILALVGEP-----------T-GGSFKLSLAGEP---TGNIPFNA------TQGQVQSALRALPNVEDD 109 (581) Q Consensus 54 --~~~~-~~evq~v~~~~~~-----------~-~GtF~l~~~g~~---T~~i~~~a------sa~~v~~aLe~l~~i~~~ 109 (581) ..+. ..+-....+.+.. + .|+|+|+.+|.. ...|++.+ -|..++++|.+.+..... T Consensus 81 ~L~igR~~~~a~~~~l~g~~~~~~~l~~~~~is~G~ltitidG~~~~t~s~i~~S~ats~~~vAs~i~tal~~~~~~~~~ 160 (515) T protein:vir:10 81 SIQFARWQREAGPVAIYGGAKKAAALATLQAVTAGAISFLFGGATTVTVSGISFSAATSLADVASELQTALRANADANLA 160 (515) T ss_pred EEEEEeccCcccceEEEeccchhhhHHhhhcccceeEEEEEcceEEEEeeccccccccCHHHHHHHHHhhhccccccccc Confidence 0000 0011112222211 2 499999999943 36676642 244555555544433323 Q ss_pred eEEEEcC-CCceEEEEecCCccccccccceeccCCCceEEEEEcccccceeeeccccccccceeeeeccccccceeeEec Q lcl|NC_011270. 110 EVTVLGD-PGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDTIRVVNPNSGQVYVLGT 188 (581) Q Consensus 110 ~V~~~~~-~g~~w~Vtf~g~~~~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~l~~~~~~~~~vtgt 188 (581) .++++-. .+..+.|+= .+.|...++++...... T Consensus 161 ~~tv~~d~~~~~F~v~s-------------~~tG~~~~is~~~~t~~--------------------------------- 194 (515) T protein:vir:10 161 TCTVSYDPVGARFNFAG-------------SPSDDTVQESISIVPQS--------------------------------- 194 (515) T ss_pred eeEEEEecCCCeEEEEE-------------eecCCceeEEEEEecCC--------------------------------- Confidence 3332221 111111110 00011111110000000 Q ss_pred cceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEeccCcchhhhhhhhhhhhccccccc Q lcl|NC_011270. 189 DYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEI 268 (581) Q Consensus 189 d~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i 268 (581) .++.++.+. T Consensus 195 --------------------------------------------------------~~~t~~a~~--------------- 203 (515) T protein:vir:10 195 --------------------------------------------------------NPAIDVAQL--------------- 203 (515) T ss_pred --------------------------------------------------------CchhhHHHH--------------- Confidence 000000000 Q ss_pred eeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhc--CCceEEEEeCC----CcHHHHHHHHHHHHHHhcCCCc Q lcl|NC_011270. 269 TLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRD--EDEIAIIVAGT----GAQPIQALVQQHVSAQSNNKYE 342 (581) Q Consensus 269 ~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~--~~~~~iv~~~t----~~~~i~~~l~~~v~~~~~~~~~ 342 (581) +.++.+...+...+.. . +...++++++.+ ..+..+.+... ...+....+.+|+++.. +. T Consensus 204 -----lglt~~~~av~~~g~a----a---et~~~a~~a~~~~s~nWy~f~~a~~~~~~~~~a~~~a~a~~~e~~~---~~ 268 (515) T protein:vir:10 204 -----LGWNSAQGASYIAASP----V---VSPVDTLIASVAGNNNFGSILFTKNGGTGITLSDAEAIALQNQSYN---VA 268 (515) T ss_pred -----hccccccceEEecccc----c---ccHHHHHHHHHhccCCeEEEEEeecCccccchhHHHHHHHHHhhcC---ce Confidence 0011111111111001 1 123344544443 23333444321 12233344566665432 22 Q ss_pred EEEEEecCCCCCchhHHHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchh-----cccc Q lcl|NC_011270. 343 RRAILGMDGSVTPVPSATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAA-----MPLT 417 (581) Q Consensus 343 ~~avvg~~~~~~~~~~~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~-----~slt 417 (581) + +.+.......... ............+...+++.. ..+. ++...|..+..+.. ..+- T Consensus 269 ~--~~~~~~~~~~~~~-~~a~~~~~~~~~~~~~~~~~~--------------~~~~-~a~~~g~~asvnf~~~ng~iT~k 330 (515) T protein:vir:10 269 Y--KFQVGVDDTTYSS-WQAALAAIGGVNMIYSPVALA--------------AEYH-DMQDGIIEAATDFTQQGGATGYM 330 (515) T ss_pred E--EEEeccCccceec-hhhhhhhhhhcCceEEEEecc--------------Ccch-HHHHHHHHHhcCCCccchhheec Confidence 2 2222111111100 000111111112222221110 0122 23344555666543 3467 Q ss_pred cccccCcccccccCCHHHHHHHHhCCcEEEEEeCC--CeEEEE-EeeeccCCCcccceEEeehhhHHHHHHHHHHHhhhc Q lcl|NC_011270. 418 RKVIRGFSGPAEVQRDGEKSRESSEGLMVIEKTPR--NLVHVR-HGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADG 494 (581) Q Consensus 418 ~~~l~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~--~~v~i~-~~itT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~ 494 (581) +|.++|+. ...++..|.+.|.++|++++....+ ..+.+. +|+.+ ..+-.|+.|-.+|-.|+++..|+..+.. . T Consensus 331 fKq~~Git--a~~lt~t~a~al~~~~~N~Y~~~~~~~~~~~~~~~G~~~-gG~~~~~WiD~~~g~~WL~~~iq~~l~~-L 406 (515) T protein:vir:10 331 YVQFNNQT--PAVNDDTLSGILDDLNINYYGQTQVNGTNLSFYQDGVMM-GGPTDPRDSNVYANEQWLKSYAGASFMS-L 406 (515) T ss_pred cccCCCCc--cccCCHHHHHHHHhcCCeEEEEEeccCceEEEEeCCeee-CCccchhHHHHHhhHHHHHHHHHHHHHH-H Confidence 77888764 3468999999999999999965533 456665 45544 3333456678888999999988888753 4 Q ss_pred CCC--c-cCCHHHHHHHHHHHH-HHHHHHHhCCceeCCccceeEE---------ee------cCC--------------- Q lcl|NC_011270. 495 LIG--M-PIYDTTIVQVKASAE-AALVWLVDNNIIRGYRNLKARQ---------IE------RQP--------------- 540 (581) Q Consensus 495 fiG--~-~n~~~~r~~ik~~i~-~~L~~l~~~gaI~~~~~~~~~~---------~~------~~~--------------- 540 (581) |.- | |=++.+...|++.+. +.|++-.+.|+|..-......| .+ ... T Consensus 407 ~~s~~KIPytd~G~a~i~a~v~q~vl~~av~nG~I~~Gv~ls~~Q~~~i~~~~g~d~~~~~~~~~Gyy~~~~~~~~~~~~ 486 (515) T protein:vir:10 407 QLAQGKIPANIEGRGLLLGKMTKDIIPAAKLNGTFSIGKTLTVDQQLFVTELTGDDTAWQKVQNLGYWYDVQISSFVDTG 486 (515) T ss_pred HhcCCCCccChhhHHHHHHHHHHHHHHHHHhCCeeecCcccchhHHHHHHhhhcCcccccchhhcceeEecCcCCCCCcc Confidence 543 4 679999999999875 6999999999998633211110 00 011 Q ss_pred ----CEEEEEEEEEecCceeEEEEEEEEE Q lcl|NC_011270. 541 ----DVIEVRYEWRPAYPLNYIVVRYSIA 565 (581) Q Consensus 541 ----~~~~v~i~v~pv~~~e~I~~~~~~~ 565 (581) ..+.+.+-..--.++.||.+..... T Consensus 487 ~r~~~~~~~~~~y~~g~~i~~i~~~~~~v 515 (515) T protein:vir:10 487 GTTKYQAVYSLVYSKDDLIRKVVGTHTLI 515 (515) T ss_pred cccccCceeEEEEEcCceEEEEEeeeecC Confidence 1112222223333444444433222 No 68 >protein:vir:78611 Length: 501 # NCBI annotation: BcepNY3gp03 # Family: family:all:396 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294840;genbank:gi:149882903;genbank:GeneID:5291082 Probab=89.15 E-value=0.028 Score=29.10 Aligned_cols=389 Identities=14% Similarity=0.087 Sum_probs=158.6 Q ss_pred CeeccccccCCCcccccCccccc--ccccccCcee-eEEEecCCCCceeeeeEEcCcCCceeeEEEEEEeccccceeEEE Q lcl|NC_011270. 1 MAIDFSQYQTPGVYTEAVGAPQL--GIRSSVPTAV-AIFGTAVGYQTYRESIRINPDTGETITTQILALVGEPTGGSFKL 77 (581) Q Consensus 1 ~~~~~~~~~~~~~~~~~~g~~~~--~~~~~~~~~~-~~~~~~~g~~~~~~~~~~~~~~~~~~evq~v~~~~~~~~GtF~l 77 (581) |+=+|+...| .|..+. .=+- .-+...|+.+ -..|.... ....++.-+..+ +.+..- ..++|+|+| T Consensus 53 V~~~FG~~S~--ey~aA~-~yFs~~~~q~~~P~~l~igR~~~~a---~~~~l~g~~l~~-----~~la~~-~~~~G~l~i 120 (501) T protein:vir:78 53 VENWFGGLSN--EAVIAD-AYFPGIVNGGQLPYDLKFARYVAAD---APASVYGIPLTG-----VTLTQL-QGYSGTLTV 120 (501) T ss_pred HHHhcCCChH--HHHHHH-HHhhcCCCCCcccceEEEEeecccC---cceeEeccceec-----cchhhh-ceeeeEEEE Confidence 4444554333 243320 0000 0011111111 11111100 000111101111 111110 124599999 Q ss_pred EeCceecc-ccccC--CCHHHHHHHHHhcCCCCcceEEEEcCCCceEEEEecCCccccccccceeccCCCceEEEEEccc Q lcl|NC_011270. 78 SLAGEPTG-NIPFN--ATQGQVQSALRALPNVEDDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQT 154 (581) Q Consensus 78 ~~~g~~T~-~i~~~--asa~~v~~aLe~l~~i~~~~V~~~~~~g~~w~Vtf~g~~~~l~~~~~~l~~g~~~~v~v~~~~~ 154 (581) +.+|..+. .|.+. .+..++.+.|++ -++.-. ++|+|..-... +.+...++ T Consensus 121 ti~g~~~~~~i~~S~~ts~~~vA~~i~~--al~a~~----------~tv~~ds~~~~---------------f~its~t~ 173 (501) T protein:vir:78 121 TTAAQHVSSNISLAAATSFANAATLIEA--AFTSPD----------FVVSYDALRNR---------------FVVNTNAT 173 (501) T ss_pred EeccceeeeccccccccCHHHHHHHHHh--hhcCcc----------eEEEEccccce---------------EEEEeeec Confidence 99995432 34332 233344444432 111112 23443221111 11111112 Q ss_pred ccce-eeeccccccccceeeeeccccccceeeEeccceeEEeecccccccCcceeeeeeeeeeecccccccceeEEEEee Q lcl|NC_011270. 155 GVPA-MNRALAKKGIKTDTIRVVNPNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRY 233 (581) Q Consensus 155 g~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~ 233 (581) |..+ +... ++ + T Consensus 174 G~~~~i~~~-----------------------t~----------~----------------------------------- 185 (501) T protein:vir:78 174 GTAAAISAV-----------------------TG----------T----------------------------------- 185 (501) T ss_pred CCceeEEEE-----------------------ec----------c----------------------------------- Confidence 2100 0000 00 0 Q ss_pred cCCcccceeEeccCcchhhhhhhhhhhhccccccceeeeeeeecCCcceeEEeeeccCCcccchhhHHHHHHHHhcC--C Q lcl|NC_011270. 234 TDPNYHEVIRFTDPDDIQDFYGPAFDEAGNVQSEITLCAQLAITNGASTILACAVDPEGDTVTMGDYQNALNKFRDE--D 311 (581) Q Consensus 234 ~~~~~~e~~~~~d~~~~~~~~~~a~~~~g~~~~~i~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~dy~~al~~l~~~--~ 311 (581) .++.+. +.++.+....... .+ ....+ ..++++++.+. . T Consensus 186 --------------~~~a~~--------------------l~Lt~~~~a~v~~--~g-~~aet---~~~a~~a~~~~~~~ 225 (501) T protein:vir:78 186 --------------NNLADE--------------------LGLSAAAGASLQA--AG-VAADT---PASAMNRAVGLSRN 225 (501) T ss_pred --------------cchhhh--------------------hcccccCceeeEe--cc-ccccC---HHHHHHHHHhccCc Confidence 000000 0000000000000 00 00011 23344444332 2 Q ss_pred ceEEE-EeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCch--hH-HHHHHHhhccCCccEEEEEcCeeEecccc Q lcl|NC_011270. 312 EIAII-VAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPV--PS-ATRIANAQSIKDQRVALISPSSFVYYAPE 387 (581) Q Consensus 312 ~~~iv-~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~--~~-~~~~~~a~~~ns~r~~~v~~~~~~~~~~~ 387 (581) +..+. +...+++.+. ++.+|++... + ++.+..-....... .. ......-+.-|..|.+.++.. T Consensus 226 Wy~f~~a~~~~~~~~l-alA~wiea~~---~-~f~~~~~~~~~~~~~~~~~~~i~~~l~a~~y~~t~~~y~~-------- 292 (501) T protein:vir:78 226 WATFTTAWTAVIADRL-ALASWNSGQA---Y-KYMYVAPDLEPASIVTNNSASFGAQVFAAPYQGTLPLYGD-------- 292 (501) T ss_pred eEEEEEecCCCHHHHH-HHHHHHHhcC---c-eEEEEEecCCcceeecccchhHHHHHhhcCCCceEEEcCC-------- Confidence 32233 3333344443 4777887542 2 33222111111100 00 011112233456666665421 Q ss_pred cCCceecCHHHHHHHHHHHhhccchhc-----ccccccc-cCcccccccCCHHHHHHHHhCCcEEEEEeC--CCeEEEE- Q lcl|NC_011270. 388 LNREVVLGGQFMAAAVAGKSVSAIAAM-----PLTRKVI-RGFSGPAEVQRDGEKSRESSEGLMVIEKTP--RNLVHVR- 458 (581) Q Consensus 388 ~~~~~~~p~~~~Aa~vAgl~a~~~~~~-----slt~~~l-~g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~--~~~v~i~- 458 (581) ++ .++.+.|..+..+..+ .+.||.+ +|+. ...++..|.+.|.++|++++.... +..+.+. T Consensus 293 --------~~-~~aa~~g~~as~nf~~~~g~~T~~fkq~~~Gv~--a~~l~~t~a~al~~~~~N~y~~~~~~~~~~~~~~ 361 (501) T protein:vir:78 293 --------QA-TAGAVMGYAASINFQLRNGRTVLAFRQFNAGVP--ATAHDLGTANALRSNNYTYIGAYANAANNYTIAY 361 (501) T ss_pred --------cc-hHHHHHHHHHhcCcccCcceeeeeccccCCCcC--cccCCHHHHHHHHhcCCeEEEEEecccceeeEEE Confidence 11 2333445555555543 4456665 4543 346899999999999999886543 3345554 Q ss_pred EeeeccCCCcccceEEeehhhHHHHHHHHHHHhhhcCC--Cc-cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeEE Q lcl|NC_011270. 459 HGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLI--GM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKARQ 535 (581) Q Consensus 459 ~~itT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fi--G~-~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~~ 535 (581) +|..+ ..|..|.+.+=.|+++..|+..+.. .|. +| |=++.+...|++.++..|++-.+.|+|..-......+ T Consensus 362 ~G~~s----G~~~wiD~~~~~~Wl~~~iq~~l~~-ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q 436 (501) T protein:vir:78 362 DGKLS----GKFLWVDTYLDQIYLNAELQRAEFE-AMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQ 436 (501) T ss_pred cCeee----ccceeehhhhhHHHHHHHHHHHHHH-HHHhCCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCCcc Confidence 55322 2466788888778888888777653 343 34 7799999999999999999999999997632211111 Q ss_pred ----e-----ec------CCC-------------------EEEEEEEEEecCceeEEEEEEEEEeccceEEEEEeecccC Q lcl|NC_011270. 536 ----I-----ER------QPD-------------------VIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 536 ----~-----~~------~~~-------------------~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~~~~~~~~~~ 581 (581) + ++ +.| ...+.+.+.-...+++|-| +++-. T Consensus 437 ~~~I~~~~g~~~~~~~~~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i----------------~s~~v 500 (501) T protein:vir:78 437 LQQIDAAAGVAGAGQLVQMRGWYFLIGDPANPGQARQNRTTPTCTLWYSDGGSIQELTI----------------GSNAV 500 (501) T ss_pred ceeeccccCccccccceeccceEEeeccccCChhhhhhcccCcEEEEEEeCCceeEEEe----------------eeeec Confidence 0 01 111 1223333333333333333 22222 No 69 >protein:vir:101576 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958108;genbank:gi:41057654;genbank:GeneID:2716834 Probab=81.03 E-value=0.089 Score=26.33 Aligned_cols=366 Identities=13% Similarity=0.048 Sum_probs=140.8 Q ss_pred ccc---eeeeecc--ccccceeeEeccceeEEeecccccccCcceeeeeeeeeeeccccc-------------------- Q lcl|NC_011270. 168 IKT---DTIRVVN--PNSGQVYVLGTDYVVTRVNAGEDGEANTRDDLYTIQRVVDGGHID-------------------- 222 (581) Q Consensus 168 ~~~---~~~~l~~--~~~~~~~vtgtd~~v~~v~~~~dg~~~~~~~~~ti~~~vd~~~~d-------------------- 222 (581) |.. -...+++ ++.........++--........-...-...+.....+-++.+.+ T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~l~l~~~~~~~~~~~~~~~s~~~V~~~FG~~S~ey~aA~~yFsg~~~q~p~ 80 (501) T protein:vir:10 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTGLVLTQDTSVQPGQLADFFQETDVENWFGALSNEAKIADAYFPGIVNGGQL 80 (501) T ss_pred CCCCCcccceEEEEeeecccCCCccccceeEEEeccCCCCccceEEecCHHHHHHhcCCChHHHHHHHHHhhhhcCCCcc Confidence 431 1111222 111111111111111111111111111111111111111211111 Q ss_pred ccceeEEEEeecCC--------------ccccee----Eec-cCcc---------------hhhhhhhhhhhhcc----- Q lcl|NC_011270. 223 PGDIVQLSYRYTDP--------------NYHEVI----RFT-DPDD---------------IQDFYGPAFDEAGN----- 263 (581) Q Consensus 223 ~~~~~~~s~~~~~~--------------~~~e~~----~~~-d~~~---------------~~~~~~~a~~~~g~----- 263 (581) |..+-.-.+..+.. ..+..+ .++ ++.+ +......++..+.. T Consensus 81 P~~l~igR~~~~~~~~~l~g~~l~~~~la~~~~~sg~l~vti~g~~~~~~i~ls~ats~~~vAs~i~~al~~~~~tv~~d 160 (501) T protein:vir:10 81 PYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYSGTLTVTTAAQHVSANISLAAATSFANAATLIEAAFTSPDFVVAYD 160 (501) T ss_pred ccEEEEEeecCCCccceEeccchhhhhhhhcceeeeEEEEeeccceeecccccccccCHHHHHHHHhhhccCCceEEEEc Confidence 11111000000000 000000 000 0000 00000000000000 Q ss_pred -ccccceeeeeeeecCCcceeEEeeecc------------C-----CcccchhhHHHHHHHHhcCC--ceEEEEeC-CCc Q lcl|NC_011270. 264 -VQSEITLCAQLAITNGASTILACAVDP------------E-----GDTVTMGDYQNALNKFRDED--EIAIIVAG-TGA 322 (581) Q Consensus 264 -~~~~i~~~~~~~~~~g~~~~~~~~~~~------------~-----~~~~t~~dy~~al~~l~~~~--~~~iv~~~-t~~ 322 (581) ..+.+.. ... +.|.....+...++ . ......+...++++++.... +..+.... .++ T Consensus 161 ~~~~~f~i--ts~-ttG~~~~i~~~~~~~~la~~l~Lt~~~~a~v~~~g~~aet~~~a~~a~~~~~~~Wy~f~~a~~~~~ 237 (501) T protein:vir:10 161 ALRNRFTV--VTN-ATGTAAAISAVTGTNNLADELGLSAAAGATLQAAGVAADTPASAMNRAVGLSRNWATFTTAWTAVI 237 (501) T ss_pred ccCceEEE--Eee-ccCCceeEEEeeCchhhhhhcCccccccceEEecCcccccHHHHHHHHHhccCceEEEEEecCCCh Confidence 0000000 000 01111111100000 0 00011233556676665543 32233333 334 Q ss_pred HHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCchhH---HHHHHHhhccCCccEEEEEcCeeEecccccCCceecCHHHH Q lcl|NC_011270. 323 QPIQALVQQHVSAQSNNKYERRAILGMDGSVTPVPS---ATRIANAQSIKDQRVALISPSSFVYYAPELNREVVLGGQFM 399 (581) Q Consensus 323 ~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~~~---~~~~~~a~~~ns~r~~~v~~~~~~~~~~~~~~~~~~p~~~~ 399 (581) +.+ .++.+|++.. .+ ++.+..-......... ......-+.-+..|.+.++.. .+. T Consensus 238 ~~~-la~A~wiea~---~~-~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~t~~~y~~-----------------~~~ 295 (501) T protein:vir:10 238 ADR-LAFAAWNSGQ---AY-KYMYVAPDLEAASIVTNNAASFGAQVFAAPYQGTLPLYGD-----------------QAT 295 (501) T ss_pred HHH-HHHHHHHHhc---Cc-eEEEEEecCchhhhhhhhhhhHHHHHHhcCCCceEEECCC-----------------CcH Confidence 444 4578888754 22 3322221111111110 011112233355666665421 112 Q ss_pred HHHHHHHhhccchhc-----cccccccc-CcccccccCCHHHHHHHHhCCcEEEEEeCC--CeEEEE-EeeeccCCCccc Q lcl|NC_011270. 400 AAAVAGKSVSAIAAM-----PLTRKVIR-GFSGPAEVQRDGEKSRESSEGLMVIEKTPR--NLVHVR-HGVTTDPTSLHT 470 (581) Q Consensus 400 Aa~vAgl~a~~~~~~-----slt~~~l~-g~~~~~~~~t~~e~~~l~~~Gv~~l~~~~~--~~v~i~-~~itT~~td~~~ 470 (581) ++.+.|..+..+..+ .+.||.++ |+. ...++..|.+.|.++|++++....+ ....+. +|+.+ . .| T Consensus 296 ~aa~~g~~as~nf~~~~g~~T~~fkq~~~Gi~--a~~lt~t~a~al~~~~~N~y~~~~~~~~~~~~~~~G~~s-G---~~ 369 (501) T protein:vir:10 296 AGAVMGYAASINFQLRNGRTVLAFRQFNAGVP--ATAHDLPTANALRSNNYTYIGAYANAANNYTIAYDGKLS-G---KF 369 (501) T ss_pred HHHHHHHHHhhCcccCccceeeeccccCCCcC--cccCCHHHHHHHHhcCCeEEEEeccccceeeEEecCeee-c---cc Confidence 233445555555433 45566665 432 3468999999999999999865532 344444 55432 2 46 Q ss_pred ceEEeehhhHHHHHHHHHHHhhhcCC--Cc-cCCHHHHHHHHHHHHHHHHHHHhCCceeCCccceeE----Eee-----c Q lcl|NC_011270. 471 REWNIIGQQDVMVYRIRDYLDADGLI--GM-PIYDTTIVQVKASAEAALVWLVDNNIIRGYRNLKAR----QIE-----R 538 (581) Q Consensus 471 ~~i~v~R~~d~i~~~ir~~~~~~~fi--G~-~n~~~~r~~ik~~i~~~L~~l~~~gaI~~~~~~~~~----~~~-----~ 538 (581) ..|.+.+=.|+++..|+..+.. .|. +| |=++.+...|++.++..|++-.+.|+|..-...... ++. . T Consensus 370 ~wiD~~~~~~Wl~~~iq~~l~~-ll~~~~kIPyt~~G~~~l~a~v~~~l~~av~nG~I~~Gv~l~~~q~~~i~~~~g~~~ 448 (501) T protein:vir:10 370 LWVDTYLDQIYLNAELQRAEFE-AMLAYNSLPYNEDGYTALYRAGVDVIDAAVTSGIIRAGVTLTNSQLQQIDAAAGVAG 448 (501) T ss_pred eeehhhhhHHHHHHHHHHHHHH-HHHhcCCcccCHHHHHHHHHHHHHHHHHHHhCceeecCCCCCcccceeeccccCccc Confidence 6788888777887777776643 332 34 779999999999999999999999999753221110 010 0 Q ss_pred ------CCC-------------------EEEEEEEEEecCceeEEEEEEEEEeccceEEEEEeecccC Q lcl|NC_011270. 539 ------QPD-------------------VIEVRYEWRPAYPLNYIVVRYSIAPETGDITSTIEGTTSF 581 (581) Q Consensus 539 ------~~~-------------------~~~v~i~v~pv~~~e~I~~~~~~~~~tg~~~~~~~~~~~~ 581 (581) +.+ ...+.+.+.-...+++|-| +++-. T Consensus 449 ~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i----------------~s~~v 500 (501) T protein:vir:10 449 AGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQQLTI----------------GSNAV 500 (501) T ss_pred cccceeccceeEeeccccCChhhhhhccccceEEEEEeCCceeEEEe----------------eeeec Confidence 111 1223333333333444433 22222 No 70 >protein:vir:3636 Length: 501 # NCBI annotation: gp04 # Family: family:all:396 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705631;genbank:gi:23752316;genbank:GeneID:955753 Probab=71.48 E-value=0.2 Score=24.47 Aligned_cols=404 Identities=13% Similarity=0.042 Sum_probs=154.1 Q ss_pred cCC--C---CcceEEEEcCCCceEEEEecCCccccccccceeccCCCceEEEEEcccccceeeecccccccccee--ee- Q lcl|NC_011270. 103 LPN--V---EDDEVTVLGDPGGPWTVTFTKAVAALTKDVTGLTGGDDPDLNIASEQTGVPAMNRALAKKGIKTDT--IR- 174 (581) Q Consensus 103 l~~--i---~~~~V~~~~~~g~~w~Vtf~g~~~~l~~~~~~l~~g~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~--~~- 174 (581) ||. | .-|+|+..-.....=...|.+ ..|..+.. +-.+ .++.....+..... ......+.+.+. .. T Consensus 1 m~~~~ip~s~iV~V~~~v~~~~~~~~~~~~--lllt~~~~-~~~~---r~~~y~s~~~V~~~-FG~~S~ey~aA~~yFs~ 73 (501) T protein:vir:36 1 MPTTTIPIDQIVQMLPGVIGAGGAPGRLTG--LVLTQDTS-VQPG---QLADFFQETDVENW-FGALSNEAKIADAYFPG 73 (501) T ss_pred CCcCCcccceEEEEeeeeccCCCcceeeee--EEEeccCC-CCCc---ceeeecCHHHHHHh-cCCChHHHHHHHHHhhc Confidence 553 2 334554432221111333422 12211111 1001 11111111111110 011111111111 00 Q ss_pred --eccccccceeeEecc-----ceeEEeeccc-ccccCcceeeeeeeeeeecccccccceeEEEEeecCCcccceeEec- Q lcl|NC_011270. 175 --VVNPNSGQVYVLGTD-----YVVTRVNAGE-DGEANTRDDLYTIQRVVDGGHIDPGDIVQLSYRYTDPNYHEVIRFT- 245 (581) Q Consensus 175 --l~~~~~~~~~vtgtd-----~~v~~v~~~~-dg~~~~~~~~~ti~~~vd~~~~d~~~~~~~s~~~~~~~~~e~~~~~- 245 (581) -..|.+.+...-+-. ..+....... ....-...+ .++...+++ ......+.+. T Consensus 74 ~~~q~~~P~~l~igR~~~~a~~~~l~g~~l~~~~~a~~~~~s-g~l~vti~g-----------------~~~~~~i~lS~ 135 (501) T protein:vir:36 74 IVNGGQLPYDLKFARYVAADAPASVYGIPLTGVTLAQLQGYS-GTLTVTTAA-----------------QHVSANISLAA 135 (501) T ss_pred ccCCCccccEEEEEeecCcCcceeEeccchhhhhhhhcccee-EEEEEEecc-----------------eeeeeeccccc Confidence 012222222221100 0000000000 000000000 111111111 1100111111 Q ss_pred --cCcchhhhhhhhhhhhc-----------------ccc--cccee-------eeeeeecCCcceeEEeeeccCCcccch Q lcl|NC_011270. 246 --DPDDIQDFYGPAFDEAG-----------------NVQ--SEITL-------CAQLAITNGASTILACAVDPEGDTVTM 297 (581) Q Consensus 246 --d~~~~~~~~~~a~~~~g-----------------~~~--~~i~~-------~~~~~~~~g~~~~~~~~~~~~~~~~t~ 297 (581) .+.++......++..+. ..+ ..+.. +..+.++.+...... ... ... T Consensus 136 ~ts~~~vA~~i~~al~~~~~tv~~d~~~~~f~i~s~t~G~~~~i~~~t~~~~ia~~l~Lt~~~~a~v~---~~g---~~~ 209 (501) T protein:vir:36 136 ATSFANAATLIEAAFTSPDFVVAYDALRNRFTVVTNATGTAAAISAVTGTNNFADEIGLSAAAGATLQ---AAG---VAA 209 (501) T ss_pred ccCHHHHHHHHhhhhcCcceEEEEcCcceeEEEEeccCCcceeeEeeecccchhhhhcccccCcceEE---ecc---ccc Confidence 11111111111111100 000 00000 000111111110000 000 111 Q ss_pred hhHHHHHHHHhcCC--ceEEEEeCCCcHHHHHHHHHHHHHHhcCCCcEEEEEecCCCCCch--hH-HHHHHHhhccCCcc Q lcl|NC_011270. 298 GDYQNALNKFRDED--EIAIIVAGTGAQPIQALVQQHVSAQSNNKYERRAILGMDGSVTPV--PS-ATRIANAQSIKDQR 372 (581) Q Consensus 298 ~dy~~al~~l~~~~--~~~iv~~~t~~~~i~~~l~~~v~~~~~~~~~~~avvg~~~~~~~~--~~-~~~~~~a~~~ns~r 372 (581) +-..++++++.+.. +..+.++......-+.++..|++.. .+ ++.+.--....... .. ......-+.-|..| T Consensus 210 et~~~al~a~~~~s~~Wy~f~~a~~~~~~~~la~A~wiea~---~~-~f~~~~~~~~~~~~~~~~~~~i~~~l~~~~y~~ 285 (501) T protein:vir:36 210 DTPASAMNRAVGLSRNWATFTTAWTAVIADRLAFASWNSGQ---AY-KYMYVAPDLEAASIVSNNAASFGAQVFAAPYQG 285 (501) T ss_pred ccHHHHHHHHHhccCceEEEEEecCCChHHHHHHHHHHhhc---Cc-eEEEEEecCchhhhhccchhhHHHHHHhcCCCc Confidence 23456666665543 3323334433333334578888744 23 33222211111110 00 11112233446667 Q ss_pred EEEEEcCeeEecccccCCceecCHHHHHHHHHHHhhccchhc-----ccccccc-cCcccccccCCHHHHHHHHhCCcEE Q lcl|NC_011270. 373 VALISPSSFVYYAPELNREVVLGGQFMAAAVAGKSVSAIAAM-----PLTRKVI-RGFSGPAEVQRDGEKSRESSEGLMV 446 (581) Q Consensus 373 ~~~v~~~~~~~~~~~~~~~~~~p~~~~Aa~vAgl~a~~~~~~-----slt~~~l-~g~~~~~~~~t~~e~~~l~~~Gv~~ 446 (581) .+.++.. ..+ ++.+.|..+..+..+ .+.||.+ +|+. ...++.+|.+.|.++|+++ T Consensus 286 t~~~y~~-------------~~~----~aa~~g~~as~nf~~~~g~~T~~fkq~~~Gi~--a~~l~~t~a~al~~~~~N~ 346 (501) T protein:vir:36 286 TLPLYGD-------------QAT----AGAVMGYAASINFQLRNGRTVLAFRQFNAGVP--ATVHDLPTANALRSNNYTY 346 (501) T ss_pred EEEEcCC-------------CCH----HHHHHHHHHhcCcccCcceeeeeccccCCCcC--cCcCCHHHHHHHHhcCCcE Confidence 7665421 011 223345555555444 4456665 4543 3468999999999999998 Q ss_pred EEEeC--CCeEEEE-EeeeccCCCcccceEEeehhhHHHHHHHHHHHhhhcCC--Cc-cCCHHHHHHHHHHHHHHHHHHH Q lcl|NC_011270. 447 IEKTP--RNLVHVR-HGVTTDPTSLHTREWNIIGQQDVMVYRIRDYLDADGLI--GM-PIYDTTIVQVKASAEAALVWLV 520 (581) Q Consensus 447 l~~~~--~~~v~i~-~~itT~~td~~~~~i~v~R~~d~i~~~ir~~~~~~~fi--G~-~n~~~~r~~ik~~i~~~L~~l~ 520 (581) +.... +..+.+. +|+.+ . .|..|.+++-.|++...|+..+.. .|. +| |=++.+...|++.++..|++-. T Consensus 347 y~~~~~~~~~~~~~~~G~~s--G--~~~wiD~~~g~dWL~~~iq~~l~~-ll~~~~KIPytd~G~~~l~a~i~~~l~~av 421 (501) T protein:vir:36 347 IGAYANAANNYTIAYDGKLS--G--KFLWVDTYLDQIYLNAELQRAEFE-AMLAYNSLPYNEDGYTGLYRAGVDVIDAAV 421 (501) T ss_pred EEEEecccceeeEEEcCeee--c--cchhhhHHHhHHHHHHHHHHHHHH-HHhcCCCCccChhhHHHHHHHHHHHHHHHH Confidence 75443 3445554 55322 2 456688888889998888888753 454 34 7799999999999999999999 Q ss_pred hCCceeCCccceeEE----e-----ecC------CC-------------------EEEEEEEEEecCceeEEEEEEEEEe Q lcl|NC_011270. 521 DNNIIRGYRNLKARQ----I-----ERQ------PD-------------------VIEVRYEWRPAYPLNYIVVRYSIAP 566 (581) Q Consensus 521 ~~gaI~~~~~~~~~~----~-----~~~------~~-------------------~~~v~i~v~pv~~~e~I~~~~~~~~ 566 (581) +.|+|..-......+ . ++. .| ...+.+.+.-...+++|-| T Consensus 422 ~nG~I~~Gv~~~~~q~~~I~~~~g~~~~~~~v~~~Gyy~~~~~~~~~~~~R~~R~~p~~~~~y~~~gaIh~v~i------ 495 (501) T protein:vir:36 422 TSGIIRAGVTLTNSQLQQIDKAARVAGAGQLVQTRGWYFLIGDPANPGQARQNRTTPACTLWYSDGGSIQSLTI------ 495 (501) T ss_pred hCceeecCCCCCcccceeecccccccccccceeccceEEeeCcccCChhhhhhcccCcEEEEEEeCCceeEEEe------ Confidence 999997532211100 0 010 01 1223333333334444433 Q ss_pred ccceEEEEEeecccC Q lcl|NC_011270. 567 ETGDITSTIEGTTSF 581 (581) Q Consensus 567 ~tg~~~~~~~~~~~~ 581 (581) +++-. T Consensus 496 ----------~s~~v 500 (501) T protein:vir:36 496 ----------GSNAV 500 (501) T ss_pred ----------eeeee Confidence 22222 Done!