Query lcl|NC_020477.1_cdsid_YP_007517407.1 [gene=I907_gp17] [protein=hypothetical protein] [protein_id=YP_007517407.1] [location=10733..11125] Match_columns 130 No_of_seqs 104 out of 190 Neff 6.4 Searched_HMMs 1612 Date Thu Nov 7 16:03:45 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_17 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_17_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99848 Length: 172 100.0 3.7E-43 2.3E-46 253.2 13.6 125 1-130 3-172 (172) 2 protein:vir:1993 Length: 141 # 100.0 2.2E-40 1.4E-43 238.0 14.1 123 1-130 2-140 (141) 3 protein:vir:79074 Length: 150 100.0 3.1E-40 2E-43 237.1 13.2 125 1-130 2-150 (150) 4 protein:vir:107864 Length: 150 100.0 3.9E-40 2.4E-43 236.7 13.3 125 1-130 2-150 (150) 5 protein:vir:79253 Length: 138 100.0 1.6E-39 1E-42 233.3 14.1 121 1-127 2-138 (138) 6 protein:vir:99222 Length: 138 100.0 1.6E-39 1E-42 233.3 14.1 121 1-127 2-138 (138) 7 protein:vir:103846 Length: 138 100.0 1.1E-38 6.9E-42 228.7 13.5 121 1-130 2-138 (138) 8 protein:vir:98481 Length: 136 93.1 0.0068 4.2E-06 32.5 10.7 114 1-129 3-136 (136) 9 protein:vir:2505 Length: 128 # 92.4 0.0005 3.1E-07 38.7 3.5 110 1-130 6-127 (128) 10 protein:vir:80389 Length: 172 87.3 0.0046 2.8E-06 33.4 4.5 117 1-130 16-156 (172) 11 protein:vir:2432 Length: 124 # 85.7 0.035 2.2E-05 28.6 8.5 111 1-124 2-124 (124) 12 protein:vir:94761 Length: 132 81.7 0.04 2.5E-05 28.2 7.1 113 1-130 3-123 (132) 13 protein:vir:9576 Length: 131 # 80.9 0.036 2.2E-05 28.5 6.5 110 1-130 3-122 (131) 14 protein:vir:4831 Length: 105 # 74.3 0.16 9.9E-05 25.0 8.1 97 1-99 1-105 (105) 15 protein:vir:486 Length: 107 # 73.1 0.17 0.00011 24.7 8.1 96 1-103 1-107 (107) 16 protein:vir:9761 Length: 140 # 72.6 0.11 6.7E-05 25.9 6.7 116 1-125 3-140 (140) 17 protein:vir:1640 Length: 132 # 70.5 0.13 7.9E-05 25.5 6.6 110 1-130 3-123 (132) 18 protein:vir:79050 Length: 133 65.8 0.12 7.3E-05 25.7 5.4 116 1-130 1-133 (133) 19 protein:vir:1329 Length: 122 # 65.5 0.25 0.00015 23.9 7.1 115 1-130 2-119 (122) 20 protein:vir:4512 Length: 107 # 65.2 0.29 0.00018 23.5 8.2 96 1-101 1-107 (107) 21 protein:vir:7773 Length: 123 # 62.2 0.29 0.00018 23.5 6.9 111 1-124 2-123 (123) 22 protein:vir:81159 Length: 95 # 61.8 0.35 0.00021 23.1 9.1 91 1-102 2-95 (95) 23 protein:vir:43 Length: 131 # N 60.3 0.37 0.00023 22.9 8.9 99 1-108 2-131 (131) 24 protein:vir:95176 Length: 172 57.2 0.19 0.00012 24.6 4.9 118 1-130 18-172 (172) 25 protein:vir:102083 Length: 96 54.0 0.51 0.00032 22.2 9.2 96 1-111 1-96 (96) 26 protein:vir:107614 Length: 96 54.0 0.51 0.00032 22.2 9.2 96 1-111 1-96 (96) 27 protein:vir:102863 Length: 96 54.0 0.51 0.00032 22.2 9.2 96 1-111 1-96 (96) 28 protein:vir:105005 Length: 96 54.0 0.51 0.00032 22.2 9.2 96 1-111 1-96 (96) 29 protein:vir:4857 Length: 104 # 53.1 0.54 0.00033 22.1 9.8 97 1-102 1-104 (104) 30 protein:vir:99002 Length: 158 52.2 0.19 0.00012 24.6 4.0 117 1-130 3-132 (158) 31 protein:vir:94955 Length: 170 51.0 0.36 0.00022 23.0 5.4 111 1-125 15-170 (170) 32 protein:vir:97267 Length: 172 50.4 0.22 0.00014 24.2 4.1 121 1-130 17-169 (172) 33 protein:vir:4458 Length: 107 # 49.4 0.64 0.0004 21.6 7.3 92 1-94 1-107 (107) 34 protein:vir:78478 Length: 149 48.9 0.66 0.00041 21.6 7.1 118 1-130 2-147 (149) 35 protein:vir:78254 Length: 149 48.9 0.66 0.00041 21.6 7.1 118 1-130 2-147 (149) 36 protein:vir:4228 Length: 125 # 48.1 0.68 0.00042 21.5 8.6 110 1-124 2-125 (125) 37 protein:vir:95004 Length: 169 48.1 0.41 0.00025 22.7 5.2 113 1-126 16-169 (169) 38 protein:vir:6243 Length: 122 # 47.9 0.69 0.00043 21.5 6.9 115 1-130 2-119 (122) 39 protein:vir:80967 Length: 131 43.9 0.83 0.00051 21.0 9.4 99 1-108 2-131 (131) 40 protein:vir:104088 Length: 125 43.8 0.83 0.00051 21.0 7.7 110 1-124 2-125 (125) 41 protein:vir:98900 Length: 132 43.1 0.35 0.00021 23.1 4.0 114 1-130 2-132 (132) 42 protein:vir:94507 Length: 113 39.8 1 0.00062 20.6 8.5 99 2-123 1-113 (113) 43 protein:vir:106596 Length: 128 36.8 1.2 0.00071 20.2 6.9 98 1-123 1-128 (128) 44 protein:vir:78383 Length: 169 36.1 0.68 0.00042 21.5 4.4 113 1-126 16-169 (169) 45 protein:vir:93592 Length: 108 35.4 1.2 0.00076 20.1 7.8 96 1-102 3-108 (108) 46 protein:vir:3970 Length: 110 # 32.9 1.4 0.00087 19.8 6.7 96 2-123 1-110 (110) 47 protein:vir:99796 Length: 110 31.3 1.5 0.00094 19.6 8.2 97 2-123 1-110 (110) 48 protein:vir:97145 Length: 110 31.3 1.5 0.00094 19.6 8.2 97 2-123 1-110 (110) 49 protein:vir:96221 Length: 110 31.3 1.5 0.00094 19.6 8.2 97 2-123 1-110 (110) 50 protein:vir:96390 Length: 110 31.3 1.5 0.00094 19.6 8.2 97 2-123 1-110 (110) 51 protein:vir:9311 Length: 110 # 31.3 1.5 0.00094 19.6 8.2 97 2-123 1-110 (110) 52 protein:vir:78849 Length: 110 31.3 1.5 0.00094 19.6 8.2 97 2-123 1-110 (110) 53 protein:vir:103957 Length: 110 31.3 1.5 0.00094 19.6 8.2 97 2-123 1-110 (110) 54 protein:vir:100245 Length: 113 29.5 1.7 0.001 19.4 9.7 93 1-100 2-113 (113) 55 protein:vir:2738 Length: 112 # 28.7 1.7 0.0011 19.3 8.5 97 1-123 1-112 (112) 56 protein:vir:102158 Length: 99 28.3 1.8 0.0011 19.2 9.6 97 1-107 1-99 (99) 57 protein:vir:4904 Length: 113 # 28.2 1.8 0.0011 19.2 6.8 96 2-123 1-113 (113) 58 protein:vir:1026 Length: 107 # 23.0 2.4 0.0015 18.5 8.3 103 1-118 1-107 (107) 59 protein:vir:106583 Length: 105 21.4 2.6 0.0016 18.3 6.8 90 1-101 1-105 (105) 60 protein:vir:1887 Length: 108 # 20.7 2.7 0.0017 18.2 10.5 98 1-113 7-108 (108) 61 protein:vir:192 Length: 108 # 20.7 2.7 0.0017 18.2 10.5 98 1-113 7-108 (108) 62 protein:vir:4954 Length: 104 # 20.6 2.7 0.0017 18.2 9.0 99 1-102 1-104 (104) 63 protein:vir:9877 Length: 114 # 20.6 2.7 0.0017 18.2 8.3 94 1-123 1-114 (114) 64 protein:vir:3615 Length: 110 # 20.4 2.8 0.0017 18.2 7.7 97 2-123 1-110 (110) No 1 >protein:vir:99848 Length: 172 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164077;genbank:gi:56692609;genbank:GeneID:3192576 Probab=100.00 E-value=3.7e-43 Score=253.19 Aligned_cols=125 Identities=24% Similarity=0.405 Sum_probs=109.5 Q ss_pred CCCCHHHH----------HHhccc---------------------------CCCCcCHHHHHHHHHHHHHHHHHHHhhh- Q lcl|NC_020477. 1 MYAIPDDL----------RLVMNN---------------------------LSKQVTDELLAKYIQEASNYIDARLGVA- 42 (130) Q Consensus 1 MY~t~edl----------~l~d~~---------------------------~~g~~d~~~i~~Al~dAs~~IDgyL~~R- 42 (130) ||||.+|| ||++++ .+|.+|+++|++||+||+++|||||++| T Consensus 3 mYaT~~dl~~r~g~~el~qLt~~~~~~~~~~~l~d~~~~~~~~~~~~~~~~~~g~~d~~~i~~Al~dA~aeIDgYL~~R~ 82 (172) T protein:vir:99 3 VYITLPELAERPGAEELSQAATPRPLQAVDSELLDALLRGLPVDRWTPEEIEVGHATVEVINSAVSDAQGYIDGFLQRRG 82 (172) T ss_pred ccccHHHHHhhcCHHHHHHHhccccccCCHHHHHHHhhcchhhhhcccccccccccCHHHHHHHHHHHHHHHHHHHhccc Confidence 99999996 566654 2589999999999999999999999999 Q ss_pred ccCCCcccchHHHHHHHHHHHHHhhcCCC---CCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCC---CCCceee Q lcl|NC_020477. 43 YKTPFVKVPPIIHDITVDLARFFFAEDHY---TSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKL---PAGFATT 116 (130) Q Consensus 43 Y~lPl~~vP~~L~~~a~dIArY~L~~~~~---~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~---~~~~~~~ 116 (130) |.|||++||++|+++|||||||+||++++ ..+|++++||++||| ||++|++||++||++... +++.+.+ T Consensus 83 Y~lPL~~vP~~L~~~a~dIArY~L~~~r~~~~~~~e~v~~rY~~Ai~-----~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v 157 (172) T protein:vir:99 83 YSLPLAKRYGVVTGWTRAIARYLLHQDRLGPGAEKDPIVRDYRDALK-----FLQLIAEGKFSLGPDDPLTPPGGGVPQV 157 (172) T ss_pred ccCCCcccchHHHHHHHHHHHHHHHhccCCcccCCHHHHHHHHHHHH-----HHHHHhcCccccCCCCCCCCCCCCceee Confidence 99999999999999999999999998764 358999999999995 999999999999875433 3344677 Q ss_pred CCCCcccCcccCC-C Q lcl|NC_020477. 117 TDGEQIFTLDQPE-W 130 (130) Q Consensus 117 ~~~~r~f~R~~~r-w 130 (130) ++++|+|||++|| | T Consensus 158 ~~~~r~F~rd~L~gf 172 (172) T protein:vir:99 158 LAPARTFSHDTLKDY 172 (172) T ss_pred ecCCCccChhhccCC Confidence 8999999999955 6 No 2 >protein:vir:1993 Length: 141 # NCBI annotation: Hypothetical protein # Family: family:all:348 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050640;genbank:gi:9633527;genbank:GeneID:2636296 Probab=100.00 E-value=2.2e-40 Score=238.00 Aligned_cols=123 Identities=19% Similarity=0.333 Sum_probs=108.9 Q ss_pred CCCCHHHH----------HHh-cccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcC Q lcl|NC_020477. 1 MYAIPDDL----------RLV-MNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAED 69 (130) Q Consensus 1 MY~t~edl----------~l~-d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~ 69 (130) =|||.+|| +|+ |++.+|++|+++|++||+||++||||||++||.|||+++|++|+++|||||+|+||++ T Consensus 2 ~Y~T~~Dl~~~~ge~~l~~Lt~d~~~~g~~d~~~i~~Al~dA~~eIdgyL~~RY~lPl~~~P~~L~~~a~dIA~Y~L~~~ 81 (141) T protein:vir:19 2 NYATVNDLCARYTRTRLDILTRPKTADGQPDDAVAEQALADASAFIDGYLAARFVLPLTVVPSLLKRQCCVVAWFYLNES 81 (141) T ss_pred CcCCHHHHHHhcCHHHHHHHhcCCCCccccCHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHhcC Confidence 58888886 466 5566899999999999999999999999999999999999999999999999999987 Q ss_pred CCCCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCC-----CCCceeeCCCCcccCcccCCC Q lcl|NC_020477. 70 HYTSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKL-----PAGFATTTDGEQIFTLDQPEW 130 (130) Q Consensus 70 ~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~-----~~~~~~~~~~~r~f~R~~~rw 130 (130) + .+|++++||++||+ ||++|++|+++||++..+ +.+.+.+++++|+|+|++.-| T Consensus 82 ~--~~e~i~~rY~~Ai~-----~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~~r~f~r~~~G~ 140 (141) T protein:vir:19 82 Q--PTEQITATYRDTVR-----WLEQVRDGKTDPGVESRTAASPEGEDLVQVQSDPPVFSRKQKGF 140 (141) T ss_pred C--CChHHHHHHHHHHH-----HHHHHhcCccccCCCCCCCCCCCCCceeEeecCCcccCcccccC Confidence 6 47999999999995 999999999999865432 234467889999999999999 No 3 >protein:vir:79074 Length: 150 # NCBI annotation: gp10, conserved hypothetical protein # Family: family:all:348 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111210;genbank:gi:134288797;genbank:GeneID:4960748 Probab=100.00 E-value=3.1e-40 Score=237.15 Aligned_cols=125 Identities=21% Similarity=0.372 Sum_probs=108.0 Q ss_pred CCCCHHHH----------HHhcccCC-------CCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHH Q lcl|NC_020477. 1 MYAIPDDL----------RLVMNNLS-------KQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLAR 63 (130) Q Consensus 1 MY~t~edl----------~l~d~~~~-------g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIAr 63 (130) =|||.+|| +|+|++.+ +++|+++|++||+||+++|||||++||.|||++||.+|+++|||||+ T Consensus 2 ~Y~T~~Dl~~r~ge~~l~~Ltd~~~~~~~~~~~~~~d~~~i~~Al~dA~~~IDgyL~~RY~lPl~~vP~~L~~~a~dIA~ 81 (150) T protein:vir:79 2 RYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESAVRQAEEIVDAHLRGRYNLPLSPVPTVIKDVTVNLAR 81 (150) T ss_pred CcCCHHHHHHhcCHHHHHHHhcccccccccccccccCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHH Confidence 47777775 67777643 68999999999999999999999999999999999999999999999 Q ss_pred HHhhcCCC---CCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCC---CCCceeeCCCCcccCcccCC-C Q lcl|NC_020477. 64 FFFAEDHY---TSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKL---PAGFATTTDGEQIFTLDQPE-W 130 (130) Q Consensus 64 Y~L~~~~~---~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~---~~~~~~~~~~~r~f~R~~~r-w 130 (130) |+||.++. ..+|++++||++||| ||++|++||++||+++.. .++.+.+.+++|+|||++|| | T Consensus 82 Y~L~~~~~~~~~~~e~v~~rY~~Ai~-----~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v~~~~r~f~r~~l~g~ 150 (150) T protein:vir:79 82 HWLYARRPEGAALPDTVSQTFKASMH-----MLEKIRDNKLTIGDPSGPATPEPGEMKVRARRRQFDADLLERF 150 (150) T ss_pred HHHHhcccCCCCCCHHHHHHHHHHHH-----HHHHHhcCccccCCCCccCCCCCCceeeecCCCccChhhccCC Confidence 99998754 358999999999995 999999999999875422 33455788999999999954 7 No 4 >protein:vir:107864 Length: 150 # NCBI annotation: gp36 # Family: family:all:348 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024709;genbank:gi:48696946;genbank:GeneID:2845952 Probab=100.00 E-value=3.9e-40 Score=236.65 Aligned_cols=125 Identities=20% Similarity=0.366 Sum_probs=108.4 Q ss_pred CCCCHHHH----------HHhcccCCC-------CcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHH Q lcl|NC_020477. 1 MYAIPDDL----------RLVMNNLSK-------QVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLAR 63 (130) Q Consensus 1 MY~t~edl----------~l~d~~~~g-------~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIAr 63 (130) =|||.+|| +|+|++.+| ++|+++|++||+||+++|||||++||.|||++||.+|+++|||||| T Consensus 2 ~Y~T~~Dl~~r~ge~el~~Ltd~~~~g~~~~~~~~~d~~~i~~Al~dA~~eIDgYL~~RY~lPl~~vP~~L~~~a~dIAr 81 (150) T protein:vir:10 2 RYCTLADLKLAVPERTLIELTNDTTTDYGAPAPTTINTDIVESSVRQAEEIVDAHLRGRYNLPLSPVPTVIKDVTVNLAR 81 (150) T ss_pred CcCCHHHHHHhcCHHHHHHHhcccccCccccchhhcCHHHHHHHHHHHHHHHHHHHhhhccCCcccccHHHHHHHHHHHH Confidence 47777776 678877654 8999999999999999999999999999999999999999999999 Q ss_pred HHhhcCCC---CCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCC---CCCceeeCCCCcccCcccC-CC Q lcl|NC_020477. 64 FFFAEDHY---TSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKL---PAGFATTTDGEQIFTLDQP-EW 130 (130) Q Consensus 64 Y~L~~~~~---~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~---~~~~~~~~~~~r~f~R~~~-rw 130 (130) |+||.+++ ..+|++++||++||| ||++|++||++||+++.. .++.+.+.+++|+|||++| .| T Consensus 82 Y~L~~~~~~~~~~~e~v~~rY~~Ai~-----~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~v~~~~r~f~r~~l~gf 150 (150) T protein:vir:10 82 HWLYARRPEGAALPDTVSQTFKASMH-----MLEKIRDNKLTIGDPSGPATPEPGEMKVRARRRQFDADLLERF 150 (150) T ss_pred HHHHhcccccCCCCHHHHHHHHHHHH-----HHHHHhcCcccCCCCCCCCCCCCceeeeecCCCccChhhccCC Confidence 99998754 458999999999995 999999999999876432 2344567899999999995 57 No 5 >protein:vir:79253 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469165;genbank:gi:157835007;genbank:GeneID:5648834 Probab=100.00 E-value=1.6e-39 Score=233.26 Aligned_cols=121 Identities=22% Similarity=0.251 Sum_probs=106.2 Q ss_pred CCCCHHHH----------HHhcccC--CCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc Q lcl|NC_020477. 1 MYAIPDDL----------RLVMNNL--SKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE 68 (130) Q Consensus 1 MY~t~edl----------~l~d~~~--~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~ 68 (130) =|||.+|| +|+|++. +|++|+++|++||+||+++|||||++||.|||++||++|+++|||||+|+||+ T Consensus 2 ~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~~ 81 (138) T protein:vir:79 2 SYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLHI 81 (138) T ss_pred CCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHhc Confidence 58887776 5777764 58999999999999999999999999999999999999999999999999998 Q ss_pred CCCCCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCC----CCCceeeCCCCcccCccc Q lcl|NC_020477. 69 DHYTSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKL----PAGFATTTDGEQIFTLDQ 127 (130) Q Consensus 69 ~~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~----~~~~~~~~~~~r~f~R~~ 127 (130) ++. .++.+++||++||| ||++|++||++||+++.+ +++++++++++|+|+||= T Consensus 82 ~~~-~~e~i~~rY~~Ai~-----~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~r~F~Rd~ 138 (138) T protein:vir:79 82 VLK-EENPVYKTAEHLRK-----LLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) T ss_pred CCC-CcHHHHHHHHHHHH-----HHHHHhcCcccCCCCCCCcCCCCCCceeeecCCCCCCCCC Confidence 764 46779999999995 999999999999876533 344567889999999986 No 6 >protein:vir:99222 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950460;genbank:gi:119953661;genbank:GeneID:4643082 Probab=100.00 E-value=1.6e-39 Score=233.26 Aligned_cols=121 Identities=22% Similarity=0.251 Sum_probs=106.2 Q ss_pred CCCCHHHH----------HHhcccC--CCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc Q lcl|NC_020477. 1 MYAIPDDL----------RLVMNNL--SKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE 68 (130) Q Consensus 1 MY~t~edl----------~l~d~~~--~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~ 68 (130) =|||.+|| +|+|++. +|++|+++|++||+||+++|||||++||.|||++||++|+++|||||+|+||+ T Consensus 2 ~YaT~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~aeIdgYL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~~ 81 (138) T protein:vir:99 2 SYCTLADLIEQYSEQKIREVSDRVNKPATTIDTVIVDRAIADADSEIDLHLHGRYQLPLASVPTALKRIACGLAYANLHI 81 (138) T ss_pred CCCCHHHHHHhcCHHHHHHHhccCccccCcccHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHhc Confidence 58887776 5777764 58999999999999999999999999999999999999999999999999998 Q ss_pred CCCCCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCC----CCCceeeCCCCcccCccc Q lcl|NC_020477. 69 DHYTSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKL----PAGFATTTDGEQIFTLDQ 127 (130) Q Consensus 69 ~~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~----~~~~~~~~~~~r~f~R~~ 127 (130) ++. .++.+++||++||| ||++|++||++||+++.+ +++++++++++|+|+||= T Consensus 82 ~~~-~~e~i~~rY~~Ai~-----~L~~Ia~Gk~~Lg~~~~~~~~~~~~~~~~~~~~r~F~Rd~ 138 (138) T protein:vir:99 82 VLK-EENPVYKTAEHLRK-----LLSGIANGKLSLALDADGKPAPVANTVQISEGRNDWGADW 138 (138) T ss_pred CCC-CcHHHHHHHHHHHH-----HHHHHhcCcccCCCCCCCcCCCCCCceeeecCCCCCCCCC Confidence 764 46779999999995 999999999999876533 344567889999999986 No 7 >protein:vir:103846 Length: 138 # NCBI annotation: hypothetical protein # Family: family:all:348 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938245;genbank:gi:38229150;genbank:GeneID:2648161 Probab=100.00 E-value=1.1e-38 Score=228.66 Aligned_cols=121 Identities=24% Similarity=0.308 Sum_probs=105.7 Q ss_pred CCCCHHHH----------HHhcccC--CCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc Q lcl|NC_020477. 1 MYAIPDDL----------RLVMNNL--SKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE 68 (130) Q Consensus 1 MY~t~edl----------~l~d~~~--~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~ 68 (130) =|||.+|| +|+|++. +|++|+++|++||++|+++|||||++||.|||++||.+|+++|||||+|+||+ T Consensus 2 ~Y~T~~dl~~r~ge~~l~~Ltd~~~~~~~~~d~~~i~~Al~dA~~eIdgyL~~RY~lPl~~vP~~L~~~a~dIA~Y~L~~ 81 (138) T protein:vir:10 2 SYCTQADLVEQYGEASIRQLSDRVNKPATTIDPAVVAQAIADADAEIDLHLHARYQLPLAQVPVVLKRVACVLAFANLHT 81 (138) T ss_pred CcCCHHHHHHhcCHHHHHHHhcccCCCcCccCHHHHHHHHHHHHHHHHHHHhhcccCCccccchHHHHHHHHHHHHHHhc Confidence 58887776 5777764 57999999999999999999999999999999999999999999999999997 Q ss_pred CCCCCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCC----CCCceeeCCCCcccCcccCCC Q lcl|NC_020477. 69 DHYTSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKL----PAGFATTTDGEQIFTLDQPEW 130 (130) Q Consensus 69 ~~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~----~~~~~~~~~~~r~f~R~~~rw 130 (130) ++ ..+|++++||++||+ ||++|++|+++||+++.+ +++++++++++|+|+|| | T Consensus 82 ~~-~~~e~~~~rY~~Ai~-----~L~~Ia~G~~~Lg~~~~~~~~~~~~~~~~~s~~r~Fg~d---~ 138 (138) T protein:vir:10 82 QV-KDDHPAILDAERKRK-----LLGGISSGKLSLALTSSGTPAPIANTVQISSQRNDFGGT---W 138 (138) T ss_pred CC-CCChHHHHHHHHHHH-----HHHHHhcCcccCCCCCCcccCCCCCceeeecCCccCCCC---C Confidence 65 357899999999995 999999999999876532 34456788899999996 4 No 8 >protein:vir:98481 Length: 136 # NCBI annotation: ORF3 # Family: family:all:32440 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958281;genbank:gi:41057255;uniprot:Q38596;genbank:GeneID:2732865 Probab=93.06 E-value=0.0068 Score=32.48 Aligned_cols=114 Identities=19% Similarity=0.204 Sum_probs=60.7 Q ss_pred CCCCHHHHHHhcccC-C-CCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcCCCCCchHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNL-S-KQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAEDHYTSQKPNL 78 (130) Q Consensus 1 MY~t~edl~l~d~~~-~-g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~e~v~ 78 (130) -|+|.+|++...+-. + ++.+.+.++.-|+|||..|-.+. +-+-++-|..+++++|++++=.+....+..++.. T Consensus 3 ~fAtv~Dl~~rw~~~~~dee~~ra~~~~lL~dAS~~ir~~~----p~~~~~~~~~~~~V~~~~V~R~~~np~G~~s~Ta- 77 (136) T protein:vir:98 3 AYATVEDYQARAAVTLPDGSPRRAQVEAYLDDASALMARHI----PTGHTPDPGTLRAICVAVVRRVMANPGGYRQRTI- 77 (136) T ss_pred ccCCHHHHHHHhccCCCCchhHHHHHHHHHHHHHHHHHHhC----CCCCCCChhHHHHHHHHHHHHHhhCCCCcccccc- Confidence 699999999888743 2 23334567888999999987764 4444566899999999999744432222233332 Q ss_pred HHHHHHHHHHHHHHHHHHHcCcccc--------CCCCCC-----CCCceeeCCC-----CcccCcccCC Q lcl|NC_020477. 79 DEYHIKLKERIEKLLDDIISGVLVL--------DPDTKL-----PAGFATTTDG-----EQIFTLDQPE 129 (130) Q Consensus 79 ~rY~~Aik~~~~a~L~~Ia~G~~~L--------~~~~~~-----~~~~~~~~~~-----~r~f~R~~~r 129 (130) --|-..+- + .|.+-| |+.... .+-+...+++ +-.|+.+=+| T Consensus 78 G~ys~s~t-----~-----~G~Lylt~~E~~~Lg~~rqr~~~~d~a~si~~~~~~~~~~~dp~~~~~~~ 136 (136) T protein:vir:98 78 GQYAETLG-----E-----DGGLYLTEDEKGQLQPPDQTAPDADAAYSLDLDPGTRAWVDDPAGCGWPR 136 (136) T ss_pred hhHHHhhh-----c-----CCCcccChHHHHHhCCCCCcccccccceecccCCCcCCcCCCCCCCCCCC Confidence 23554441 2 355432 222111 1101111111 1123332222 No 9 >protein:vir:2505 Length: 128 # NCBI annotation: hypothetical protein # Family: family:all:28222 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569746;genbank:gi:18496896;genbank:GeneID:932265 Probab=92.35 E-value=0.0005 Score=38.68 Aligned_cols=110 Identities=10% Similarity=0.157 Sum_probs=58.4 Q ss_pred CCCCHHHHHHhcc-cCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcCCCCCchHHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMN-NLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAEDHYTSQKPNLD 79 (130) Q Consensus 1 MY~t~edl~l~d~-~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~e~v~~ 79 (130) -++|.+|+...-+ +++.. +.+.+..-|++|+..|+|||. +|.+| .++|..++++||.|+.=-|.......++ T Consensus 6 alAtvdDv~~~lrr~Lt~d-E~~~a~~Ll~eAsdlI~g~l~-~~~vp-~~~p~~v~rVvA~ivarAltr~~~~~pe---- 78 (128) T protein:vir:25 6 ALATSQDVKRALRRDLTEA-EQTDLSELLAEATDLVVGYLH-PYPVP-TPTPGPIKRVVASMVAAVLTRPTQILPE---- 78 (128) T ss_pred hccCHHHHHHHhcCCCCHH-HHHHHHHHHhcchheeeeecC-CCCCC-CCCCchHHHHHHHHHHHHhhCCCccCCC---- Confidence 6889999876663 33322 445666779999999999997 78888 6889999999999988777543211121 Q ss_pred HHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCcc---------cC--cccCCC Q lcl|NC_020477. 80 EYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQI---------FT--LDQPEW 130 (130) Q Consensus 80 rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~---------f~--R~~~rw 130 (130) .+.+..|-++-+.....+++..--++..+. |+ =.+-|+ T Consensus 79 -------------~~S~TAgpfs~~ft~~~~~~g~yLTaa~k~~Lrp~R~~~~sV~l~sery 127 (128) T protein:vir:25 79 -------------TQSLTADGFGVTFTPGGNSPGPYLSAALKQRLRPYRTGMVAVEMGSERY 127 (128) T ss_pred -------------ceeeecccccccccCCCCCCCceEcHHHHhhcccccceeeEeecccccC Confidence 111122322211111111111111111000 00 000011 No 10 >protein:vir:80389 Length: 172 # NCBI annotation: BcepGomrgp09 # Family: family:all:703 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210229;genbank:gi:146329921;genbank:GeneID:5123498 Probab=87.34 E-value=0.0046 Score=33.41 Aligned_cols=117 Identities=14% Similarity=0.099 Sum_probs=57.0 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHH----HHhhhc------------------cCCCcccchHHHHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDA----RLGVAY------------------KTPFVKVPPIIHDIT 58 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDg----yL~~RY------------------~lPl~~vP~~L~~~a 58 (130) =|+|.++...-....--..+++-.+.||..|+..||+ |.+.|- .+|-..||.-|+..+ T Consensus 16 SYvt~~~a~aY~~~rg~~~~~d~~e~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~g~~~~~~~IP~~v~~A~ 95 (172) T protein:vir:80 16 TYAGADFVIAYAQARGVTVDADEAERLILEAMDYIESFRRRWKGERNTREQGLTWPRHDAVVDGFVIPSDVIPKELQSAV 95 (172) T ss_pred ccccHHHHHHHHHHcCCCcCHHHHHHHHHHHHHHHhhccCccccccCCccccccccccCcccCcccccccchhHHHHHHH Confidence 7999998864433333345555679999999999999 333322 245567899999999 Q ss_pred HHHHHHHhhcCCCCCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCC-CcccC-cccCCC Q lcl|NC_020477. 59 VDLARFFFAEDHYTSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDG-EQIFT-LDQPEW 130 (130) Q Consensus 59 ~dIArY~L~~~~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~-~r~f~-R~~~rw 130 (130) |.+|.+.+-...... ... ..+++ -++| |-++..-......++...+++ .+.|. =+.|-+ T Consensus 96 ~elA~~~~~g~~~~~--~~~---~~~v~------~ekV--G~i~~eY~~~~~~~~~~~~~~~~~~~~~v~~LL~ 156 (172) T protein:vir:80 96 AAAVIEQVNGFELQQ--SQD---QWAVR------IEKV--DVIEVQYAAGGGGQSASANAPMKPTFPKIDALLN 156 (172) T ss_pred HHHHHHHhcCCccCc--CCC---Cceee------EEec--cceEEeeecccCccccccccCCccchHHHHHHHh Confidence 999975553211111 110 11232 2444 433332111111111110000 00000 000110 No 11 >protein:vir:2432 Length: 124 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046834;genbank:gi:9630402;genbank:GeneID:1261576 Probab=85.68 E-value=0.035 Score=28.55 Aligned_cols=111 Identities=15% Similarity=0.153 Sum_probs=62.7 Q ss_pred CCCCHHHHHHhc-ccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCC-----CcccchHHHHHHHHHHHHHhhcCCCCCc Q lcl|NC_020477. 1 MYAIPDDLRLVM-NNLSKQVTDELLAKYIQEASNYIDARLGVAYKTP-----FVKVPPIIHDITVDLARFFFAEDHYTSQ 74 (130) Q Consensus 1 MY~t~edl~l~d-~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lP-----l~~vP~~L~~~a~dIArY~L~~~~~~~~ 74 (130) =|+|.+|++... ++.+.+ ....++.-|+|||..|- .|++-. -+..|..++.++|++..=-+...++..+ T Consensus 2 ~~At~~Dv~~rw~r~Lt~~-E~~~ve~lL~dAs~~ir----~r~P~l~~~~~~~~~~~~v~~V~a~~V~R~~rnP~G~~s 76 (124) T protein:vir:24 2 AYATADDVVTLWAKEPEPE-VMALIERRLEQVERMIR----RRIPDLDARVSSDIFRADLIDIEADAVLRLVRNPEGYLS 76 (124) T ss_pred CCCCHHHHHHHhCCCCCHH-HHHHHHHHHHHHHHHHH----hcCCCcchhcCCCCChhhHHHHHHHHHHHHhhCCCCcee Confidence 699999999877 666543 44568999999999886 466522 2245789999999987765543222222 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHcCccccCCC------CCCCCCceeeCCCCcccC Q lcl|NC_020477. 75 KPNLDEYHIKLKERIEKLLDDIISGVLVLDPD------TKLPAGFATTTDGEQIFT 124 (130) Q Consensus 75 e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~------~~~~~~~~~~~~~~r~f~ 124 (130) +.. --|-..+. .+...|++-|... +....+........-.=+ T Consensus 77 ~T~-G~Ys~sl~-------~~~~~g~Lylt~~E~~~Lg~~r~~~~~~i~p~~~~~~ 124 (124) T protein:vir:24 77 ETD-GAYTYQLQ-------ADLSQGKLVILDEEWTTLGVNRLSRMSTLVPNIVMPT 124 (124) T ss_pred ccc-chhHHhhh-------hcccCCceeeCHHHHHhhCcccccceeEeecceeeCC Confidence 222 55666552 2455677755321 111112221111111111 No 12 >protein:vir:94761 Length: 132 # NCBI annotation: unknown # Family: family:all:1271 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996708;genbank:gi:45597423;genbank:GeneID:2769029 Probab=81.74 E-value=0.04 Score=28.22 Aligned_cols=113 Identities=15% Similarity=0.169 Sum_probs=56.8 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCc---c----cchHHHHHHHHHHHHHhhcCCCCC Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFV---K----VPPIIHDITVDLARFFFAEDHYTS 73 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~---~----vP~~L~~~a~dIArY~L~~~~~~~ 73 (130) -|||.+|++...++++.. -.++++.-|++||..|..=.-.++..|.. + .+.+++++||++++=-|-... . T Consensus 3 ~fAtv~Dl~~r~r~L~~d-E~~ra~~LL~dAs~~iR~~~~~~~~~~~~~~~~~~d~~~~~~k~V~~~~V~Ral~~~~--~ 79 (132) T protein:vir:94 3 PFATVDDLTMLWRPLKGD-EKERAEKLLEIVSDTLREEADKVGRDLDVMISEKPSYFSSVVKSVTVDIVARTLMTST--D 79 (132) T ss_pred CcCCHHHHHHHhccCChh-HHHHHHHHHHHHHHHHHHHHhhhccccccccCCCCccchhHHHHHHHHHHHHHhcCCC--C Confidence 799999999888765433 24788899999999998766666554322 1 257899999999987775321 1 Q ss_pred chHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceee-CCCCcccCcccCCC Q lcl|NC_020477. 74 QKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATT-TDGEQIFTLDQPEW 130 (130) Q Consensus 74 ~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~-~~~~r~f~R~~~rw 130 (130) .+.+.+-=. ..|-.+-...-..|.|..-. .+.-+..+-...|| T Consensus 80 ~~g~tq~S~--------------TaG~ys~S~T~~np~G~lylt~~e~~~LGl~~~r~ 123 (132) T protein:vir:94 80 QEPMTQTTE--------------SALGYSVSGSYLVPGGGLFIKNSELSRLGLKKQRF 123 (132) T ss_pred CCCceeeee--------------ecccceeeeeeecCCCCceeChHHHHhhCCCCCce Confidence 111100000 00100000000011111111 11112222222333 No 13 >protein:vir:9576 Length: 131 # NCBI annotation: gp42 # Family: family:all:1271 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862881;genbank:gi:32469473;genbank:GeneID:1461318 Probab=80.92 E-value=0.036 Score=28.53 Aligned_cols=110 Identities=12% Similarity=0.135 Sum_probs=56.0 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhc------cCCCcccchHHHHHHHHHHHHHhhcC-CCC- Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAY------KTPFVKVPPIIHDITVDLARFFFAED-HYT- 72 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY------~lPl~~vP~~L~~~a~dIArY~L~~~-~~~- 72 (130) =|||.+|++...++++.. ....++.-|++||..|..-+-... ..+-+..+..++++||++++.-|-.. ++. T Consensus 3 ~fAtv~D~~~rwr~Lt~~-E~~ra~~LL~~As~~ir~~~p~~~~~l~~~~~~~~~~~~~~~~V~~~~V~Ral~~~~~~~G 81 (131) T protein:vir:95 3 NFATVEDLKKLWRALKFD-EEKRAEALLEVVSHSLRVEAKKVGKDLDGLVATDPSFTMVVKSVTVDVVARTLMTSTDQEP 81 (131) T ss_pred ccCCHHHHHHHhcCCCHH-HHHHHHHHHHHHHHHHHHhhhhccCCccccccCCccchHHHHHHHHHHHHHHhcCCCCCCC Confidence 799999999888765433 345888999999999987654332 12223457899999999999988532 110 Q ss_pred --CchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCcccCcccCCC Q lcl|NC_020477. 73 --SQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIFTLDQPEW 130 (130) Q Consensus 73 --~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f~R~~~rw 130 (130) ...+-.--|-+-. .|+ ...|.+-| +.+.-+..+-...|| T Consensus 82 ~tq~S~TaG~ys~S~-----t~~--~p~g~lyl------------t~~e~~~LGl~~~r~ 122 (131) T protein:vir:95 82 MTQVAESALGYSFSG-----SYL--VPGGGLFI------------KDSELKRLGLKKQRY 122 (131) T ss_pred ceeeeeecccceeee-----eee--cCCCCcee------------ChHHHHHhCCCCCce Confidence 0001001111110 000 00111111 111112222222333 No 14 >protein:vir:4831 Length: 105 # NCBI annotation: ORF27 # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038328;genbank:gi:9634654;genbank:GeneID:1262588 Probab=74.32 E-value=0.16 Score=24.95 Aligned_cols=97 Identities=15% Similarity=0.145 Sum_probs=53.7 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCC-CcccchHHHHHHHHHHHHHhhcCCCCCc----h Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTP-FVKVPPIIHDITVDLARFFFAEDHYTSQ----K 75 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lP-l~~vP~~L~~~a~dIArY~L~~~~~~~~----e 75 (130) |..|.|++..--|= +..-|++.|+..|.-|...|.++++..+..+ ...+|+.++.. |-+-.=++|.+|...+ . T Consensus 1 M~vtLee~K~~LRI-D~dddD~lI~~~i~aA~~yi~~~ig~~~~~~~~~~~~~~~~~A-vl~lv~~~YeNR~~~~~~~~~ 78 (105) T protein:vir:48 1 MSVSKTSIMQTLNL-DETDDTALIPAYIESAKQYIINAVGSDSKFYDLENVQPLFDTA-VIALTSSYFTYRVALTDTVTY 78 (105) T ss_pred CcccHHHHHHHcCC-CCccchHHHHHHHHHHHHHHHHhhCCCCccccccCCchHHHHH-HHHHHHHHHhhhhhccCcccc Confidence 99999999743221 2445888999999999999999998543211 12455555444 4444445565543222 2 Q ss_pred HHHHHHHHHHHHH---HHHHHHHHHcC Q lcl|NC_020477. 76 PNLDEYHIKLKER---IEKLLDDIISG 99 (130) Q Consensus 76 ~v~~rY~~Aik~~---~~a~L~~Ia~G 99 (130) ++..-.+.-|.++ ...|-+-.-+| T Consensus 79 ~ip~~v~sli~~lR~~y~~~~e~~~~g 105 (105) T protein:vir:48 79 PINLTLNSIIGQLRGLYATYSEVVANG 105 (105) T ss_pred hhhHHHHHHHHHHhhhhhhhhhcccCC Confidence 2222222222111 11244444455 No 15 >protein:vir:486 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543093;swissprot:trembl:q8w626;genbank:gi:18249905;uniprot:Q8W626;genbank:GeneID:929692 Probab=73.08 E-value=0.17 Score=24.74 Aligned_cols=96 Identities=11% Similarity=0.087 Sum_probs=53.6 Q ss_pred CCCCHHHHHHhcccCCC-CcCHHHHHHHHHHHHHHHHHHHhhhccCCCc----------ccchHHHHHHHHHHHHHhhcC Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSK-QVTDELLAKYIQEASNYIDARLGVAYKTPFV----------KVPPIIHDITVDLARFFFAED 69 (130) Q Consensus 1 MY~t~edl~l~d~~~~g-~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~----------~vP~~L~~~a~dIArY~L~~~ 69 (130) |+.|.+++..-=+-..+ .-|++.|+..|.-|++.|-+|++.+...+-. .+|+.++ .|+-+..-++|.+ T Consensus 1 M~vtL~e~K~hLRid~D~~ddD~li~~~i~aA~~~i~~~~~r~l~~~~~~~~~~~~~~~~~~~~ik-~Avlllv~~~Y~N 79 (107) T protein:vir:48 1 MLLKEEEIKSHLRLDDGLYSDGDFLKLLAQAVQKRTETYLNRKLYAPEETIPEDDPDGMHLTDDVR-LAMLMLVSHFYEN 79 (107) T ss_pred CCCCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHHhccccccccccccccCccccccchhHH-HHHHHHHHHHHhh Confidence 99999999754443222 3467899999999999999999876533221 2455544 4555666677776 Q ss_pred CCCCchHHHHHHHHHHHHHHHHHHHHHHcCcccc Q lcl|NC_020477. 70 HYTSQKPNLDEYHIKLKERIEKLLDDIISGVLVL 103 (130) Q Consensus 70 ~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L 103 (130) |-..++.-...---.++ ..|...+ ..+| T Consensus 80 Re~v~~~~~~~iP~~v~----~LL~~yR--~~~l 107 (107) T protein:vir:48 80 RSTITDVEKLETPMSFR----WLAGPYR--IVPL 107 (107) T ss_pred hhhhccccccccCHHHH----HHHHHhh--ccCC Confidence 53222111000001121 2333333 1222 No 16 >protein:vir:9761 Length: 140 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795523;genbank:gi:28876281;genbank:GeneID:1257822 Probab=72.63 E-value=0.11 Score=25.87 Aligned_cols=116 Identities=14% Similarity=0.123 Sum_probs=62.4 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhh-ccCCCc-----ccchHHHHHHHHHHHHHhhcC-C--C Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVA-YKTPFV-----KVPPIIHDITVDLARFFFAED-H--Y 71 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~R-Y~lPl~-----~vP~~L~~~a~dIArY~L~~~-~--~ 71 (130) =|||.+|++...+.++.. -.+.++.-|++||..|...+-.. +.+|-. ..+.+++.+||+|.+=-|-.. . + T Consensus 3 ~fATv~Dv~~rwr~Lt~d-E~~ra~~LL~dAS~~iR~~~p~~g~~~~~~~~~~~~~~~~~k~V~~~mV~Ral~~~~d~~G 81 (140) T protein:vir:97 3 NFATTDDVILLWRPLSVD-ELKRANALLKVVSDTLRMEADKVGKDLDKTMVDKPYFVNVIKSVTVDIVARTLMTSTQGEP 81 (140) T ss_pred cCCCHHHHHHHhcCCCHh-HHHHHHHHHHHHHHHHHHhhhhccCCcchhcccCccchhHHHHHHHHHHHHHhcCCCCCCc Confidence 699999999888766543 24688999999999998877533 555521 235688999999987655321 1 1 Q ss_pred --CCchHHHHHHHHHHHHHHHHHHHHHHcCcccc--------CCCCCCC---CCceeeCCCCcccCc Q lcl|NC_020477. 72 --TSQKPNLDEYHIKLKERIEKLLDDIISGVLVL--------DPDTKLP---AGFATTTDGEQIFTL 125 (130) Q Consensus 72 --~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L--------~~~~~~~---~~~~~~~~~~r~f~R 125 (130) +.++ -.--|-+-. .|+ ...|.+-| |+..+.- .-.+.....+--|+| T Consensus 82 ~tq~S~-TaG~ys~S~-----T~~--np~G~lylt~~e~~~LGl~~~r~~~i~~~g~~~~~~~~~~~ 140 (140) T protein:vir:97 82 MSQESQ-SALGYTWSG-----TYL--VPGGGLFIKDNELKRLGLKKQRYGGIELYGEIKRDNDYFDR 140 (140) T ss_pred ceeeee-eccchhhee-----eee--cCCCCceeChHHHHHhCCCCCceeeecccCccccCcccccC Confidence 1111 112232222 121 11233222 1111000 001122334566777 No 17 >protein:vir:1640 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:1271 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695061;genbank:gi:23455752;genbank:GeneID:955488 Probab=70.53 E-value=0.13 Score=25.49 Aligned_cols=110 Identities=15% Similarity=0.189 Sum_probs=54.8 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhcc-CC---C---cccchHHHHHHHHHHHHHhhcC-CCC Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYK-TP---F---VKVPPIIHDITVDLARFFFAED-HYT 72 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~-lP---l---~~vP~~L~~~a~dIArY~L~~~-~~~ 72 (130) =|||.+|++...+.++.. ..+.++.-|++||..|-.=+-.+.. ++ - ...+..++++||++++=-|-.. .+. T Consensus 3 ~fAtv~Dv~~r~r~L~~~-E~~ra~~lL~dAs~~ir~~~p~~~~~l~a~~~e~~~~~~~~~~~V~~~~V~Ral~~~~~~~ 81 (132) T protein:vir:16 3 PFATVDDLTMLWRPLKGD-EKERAEKLLEIVSDSLREEADKVGRDLYAMIAEKPSYFASVVKSVTVDIVARTLMTSTDQE 81 (132) T ss_pred ccCCHHHHHHHhcCCCHh-HHHHHHHHHHHHHHHHHHhhhhhccccccccccccccchhHHHHHHHHHHHHHhcCCCCCC Confidence 799999999888765443 2468899999999999765543332 21 1 1235679999999988666532 110 Q ss_pred ---CchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCcccCcccCCC Q lcl|NC_020477. 73 ---SQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIFTLDQPEW 130 (130) Q Consensus 73 ---~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f~R~~~rw 130 (130) ...+-.-.|-.-. .|+ ...|.+-|. .+.-...+-...|| T Consensus 82 G~tq~S~TaG~ys~S~-----t~~--~p~G~lylt------------~~e~~~LG~~~~r~ 123 (132) T protein:vir:16 82 PMTQTTESALGYSVSG-----SYL--VPGGGLFIK------------NSELSRLGLKKQRF 123 (132) T ss_pred Cceeeeeeccchheee-----eee--cCCCcceeC------------hHHHHhhCCCCCce Confidence 0011111121111 011 111222111 00111111111233 No 18 >protein:vir:79050 Length: 133 # NCBI annotation: hypothetical protein # Family: family:all:6416 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110727;genbank:gi:134287344;genbank:GeneID:4955224 Probab=65.80 E-value=0.12 Score=25.68 Aligned_cols=116 Identities=15% Similarity=0.044 Sum_probs=62.8 Q ss_pred CCCC----HHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcCCC-CCch Q lcl|NC_020477. 1 MYAI----PDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAEDHY-TSQK 75 (130) Q Consensus 1 MY~t----~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~~~-~~~e 75 (130) |=-. .++..-...-...+-|..+++-|++++...|--|+. +..+|.-|..+.+++|.-.+-.... ..+- T Consensus 1 ~~~~i~e~i~~~Lk~~~~~~~~~d~~iL~fa~e~~~n~I~N~cN------i~eiP~~L~~v~~~mai~~fl~~kk~~~~~ 74 (133) T protein:vir:79 1 MGNNIIDDIEKRLESFGYILKDGDKWLIDFVREKIENIIKLDCN------IKTMPIELKEIEADMIVGEFLFTKKNMGQL 74 (133) T ss_pred CCchHHHHHHHHHHHhCCCCCccchHHHHHHHHHHHHHHhhhcC------hhhcchhHHHHHHHHHHHHHHhcccccCCC Confidence 3222 211111222222344677888899999999999997 4789999999999988764432221 1111 Q ss_pred HHHHHHH-HHHHHHHHHHHHHHHcCccccCCCCC----CCCCc------eeeCCCCcccCcc-cCCC Q lcl|NC_020477. 76 PNLDEYH-IKLKERIEKLLDDIISGVLVLDPDTK----LPAGF------ATTTDGEQIFTLD-QPEW 130 (130) Q Consensus 76 ~v~~rY~-~Aik~~~~a~L~~Ia~G~~~L~~~~~----~~~~~------~~~~~~~r~f~R~-~~rw 130 (130) +. .-++ ++ --+.|..|..+...... .+... .-.....+-|.|- ++|| T Consensus 75 ~l-~~~D~~~-------~v~sIkeGDTsv~f~~~~~s~t~eq~l~s~i~~L~~~~k~~l~~yRkLrW 133 (133) T protein:vir:79 75 DI-ESINFEA-------VEKSISEGDTKVDFAIGSGSQTPEQRFDSLIAYLTAYGKNKILTFRCLRW 133 (133) T ss_pred Cc-ccccchh-------hhhheecccceeecccCCCccchhHHHHHHHHHHhhcccchhhccccccC Confidence 11 1111 11 13778889666544311 11110 1133445555554 5999 No 19 >protein:vir:1329 Length: 122 # NCBI annotation: gp37 # Family: family:all:11657 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047928;swissprot:trembl:q9zxb0;genbank:gi:9631146;uniprot:Q9ZXB0;genbank:GeneID:2715909 Probab=65.50 E-value=0.25 Score=23.91 Aligned_cols=115 Identities=20% Similarity=0.182 Sum_probs=69.4 Q ss_pred CCCCHHHHHHhcccC-CCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcCCCCCchHHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNL-SKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAEDHYTSQKPNLD 79 (130) Q Consensus 1 MY~t~edl~l~d~~~-~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~e~v~~ 79 (130) -|+|.|+++..|.-. +....++.+..||+-.-+.+..|++..+...=.|.|..++-....+||-+..+--...++ T Consensus 2 ayatieelraldglddsalfsdellsdaidfsvetveaycgrkwdtaedptpetirwcvrtlarqyvldhvsripd---- 77 (122) T protein:vir:13 2 AYATIEELRALDGLDDSALFSDELLSDAIDFSVETVEAYCGRKWDTAEDPTPETIRWCVRTLARQYVLDHVSRIPD---- 77 (122) T ss_pred cchhhhhhhhhcCccchhhhhhhhhhhhhhhhhhhhhhhhCcccCCcCCCChhHHHHHHHHHHHHHHHHHhhhcch---- Confidence 799999999887643 345788899999999999999999999999888999998777778898777542111222 Q ss_pred HHHHHHHHHHHHHHHHHHcCccccCCCCCC--CCCceeeCCCCcccCcccCCC Q lcl|NC_020477. 80 EYHIKLKERIEKLLDDIISGVLVLDPDTKL--PAGFATTTDGEQIFTLDQPEW 130 (130) Q Consensus 80 rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~--~~~~~~~~~~~r~f~R~~~rw 130 (130) .|++ |+ ---|.+.|.-.+.. |.+-..++..-.+ -|-++-| T Consensus 78 ---ralq------lq-sefgsiqlaqaggnwrptslpevnaklnl-yrvrlpf 119 (122) T protein:vir:13 78 ---RALQ------LQ-SEFGSIQLAQAGGNWRPTSLPEVNAKLNL-YRVRLPF 119 (122) T ss_pred ---hhhh------hh-hcccceeeeccCCCcccCcccccccceee-eeeecce Confidence 2442 21 12255655322211 1111111111111 0111222 No 20 >protein:vir:4512 Length: 107 # NCBI annotation: unknown # Family: family:all:363 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599038;genbank:gi:19548996;genbank:GeneID:935218 Probab=65.17 E-value=0.29 Score=23.55 Aligned_cols=96 Identities=17% Similarity=0.062 Sum_probs=51.5 Q ss_pred CCCCHHHHHHhcccC-CCCcCHHHHHHHHHHHHHHHHHHHhhhccC-----CCc-----ccchHHHHHHHHHHHHHhhcC Q lcl|NC_020477. 1 MYAIPDDLRLVMNNL-SKQVTDELLAKYIQEASNYIDARLGVAYKT-----PFV-----KVPPIIHDITVDLARFFFAED 69 (130) Q Consensus 1 MY~t~edl~l~d~~~-~g~~d~~~i~~Al~dAs~~IDgyL~~RY~l-----Pl~-----~vP~~L~~~a~dIArY~L~~~ 69 (130) |+.|.+++..--|-. +-.-|++.|+.-|.-|++.|..|++.++.- |.. .+|+.++. |+-+-.-++|.+ T Consensus 1 M~vtL~e~K~hLRId~D~~ddD~lI~~~i~AA~~~i~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~-AvLllv~~~Y~N 79 (107) T protein:vir:45 1 MLLKMEEIKLQLRLDDDFSDEDELLELLGKAAQSRTENFLNRKLYATADDRPADDPDGLVISDDVKL-ALLLLVSHFYEN 79 (107) T ss_pred CCCCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHHhccccccccccccccccccccCChhHHH-HHHHHHHHHHhh Confidence 999999997544422 224567899999999999999999876532 221 13555554 444444456665 Q ss_pred CCCCchHHHHHHHHHHHHHHHHHHHHHHcCcc Q lcl|NC_020477. 70 HYTSQKPNLDEYHIKLKERIEKLLDDIISGVL 101 (130) Q Consensus 70 ~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~ 101 (130) |...++.-...---.++ +.|...+.=-+ T Consensus 80 Re~~~~~~~~~lp~~v~----~Ll~~~R~~~~ 107 (107) T protein:vir:45 80 RSTVTDVEKMELPMSFN----WLVAPYRLIPL 107 (107) T ss_pred hhhccccchhccchHHH----HHHHHHhhcCC Confidence 53222111111011121 22333222111 No 21 >protein:vir:7773 Length: 123 # NCBI annotation: gp19 # Family: family:all:2817 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817607;genbank:gi:29566037;genbank:GeneID:1259231 Probab=62.15 E-value=0.29 Score=23.50 Aligned_cols=111 Identities=14% Similarity=0.178 Sum_probs=58.4 Q ss_pred CCCCHHHHHHhc-ccCCCCcCHHHHHHHHHHHHHHHHHHHhhhcc-CC-Ccccc---hHHHHHHHHHHHHHhhcCCCCCc Q lcl|NC_020477. 1 MYAIPDDLRLVM-NNLSKQVTDELLAKYIQEASNYIDARLGVAYK-TP-FVKVP---PIIHDITVDLARFFFAEDHYTSQ 74 (130) Q Consensus 1 MY~t~edl~l~d-~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~-lP-l~~vP---~~L~~~a~dIArY~L~~~~~~~~ 74 (130) =|+|.+|++..- ++.+.+ ....++.-|+|||..|-. |++ ++ ..+-| +.+++++|++..=-+...++.-+ T Consensus 2 ~~At~~Dv~ar~~r~LT~~-E~~~ve~lL~dAs~~ir~----r~P~l~~~a~d~~~~~~~~~V~~~~V~R~~rnpeG~~s 76 (123) T protein:vir:77 2 PYATASDVTSRWARQPTDE-ETALINVRLADVERMIKR----RIPDLATKVTDPDYLEDLKQVEADAVLRLVRNPEGYLS 76 (123) T ss_pred CcCCHHHHHHHhCCCCCHH-HHHHHHHHHHHHHHHHHH----hccCcccccCCcchhHHHHHHHHHHHHHHhhCCCCcee Confidence 699999999766 666543 455789999999999866 443 22 12234 67889999987654432222111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHcCccccCCCC-----CCCCCceeeCCCCcccC Q lcl|NC_020477. 75 KPNLDEYHIKLKERIEKLLDDIISGVLVLDPDT-----KLPAGFATTTDGEQIFT 124 (130) Q Consensus 75 e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~-----~~~~~~~~~~~~~r~f~ 124 (130) +.. .-|-..+. .....|++-|.... ..-++..+.......=+ T Consensus 77 ~T~-G~ys~sl~-------~a~~~g~Lylt~~E~~~Lg~~~~~~~~i~p~~~~~~ 123 (123) T protein:vir:77 77 ETD-GNYTYMLR-------SDLASGKLEIFPEEWEILGYRRSRMTVIVPNPVMPT 123 (123) T ss_pred ccc-chhhhhhc-------ccCCCCcceeCHHHHHhhcCCCCceeEEeeceecCC Confidence 111 45655542 24456666553210 11111111111111111 No 22 >protein:vir:81159 Length: 95 # NCBI annotation: putative DNA packaging protein # Family: family:all:316 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285813;genbank:gi:148747734;genbank:GeneID:5247202 Probab=61.82 E-value=0.35 Score=23.11 Aligned_cols=91 Identities=13% Similarity=0.111 Sum_probs=53.2 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcCCCCCc---hHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAEDHYTSQ---KPN 77 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~---e~v 77 (130) |+.|.+++..-=+= ++.-|++.|+.-|+-|...|.+|++.++ .+.|+.++..++-+ .=++|.+|...+ .++ T Consensus 2 m~vtLee~K~~LRI-D~d~dD~lI~~li~aA~~~i~~~~g~~~----~~~~~~~~~Avl~l-v~~~YeNRe~~~~~~~~~ 75 (95) T protein:vir:81 2 MIVTLEEVKNWLRV-DFSDDDALITTLINAAEEYLKNATGTTF----DATNHLAKIFCMTL-IADWYENRELVGRASDQV 75 (95) T ss_pred CcCCHHHHHHHcCC-CCCcchHHHHHHHHHHHHHHHHhhcccc----ccCchHHHHHHHHH-HHHHHhhccccccccccc Confidence 99999998633222 3446889999999999999999998654 45566655554444 445565553221 233 Q ss_pred HHHHHHHHHHHHHHHHHHHHcCccc Q lcl|NC_020477. 78 LDEYHIKLKERIEKLLDDIISGVLV 102 (130) Q Consensus 78 ~~rY~~Aik~~~~a~L~~Ia~G~~~ 102 (130) ..-.+.-|- -|+.-..|.-. T Consensus 76 p~~v~sll~-----~lr~~~~~~~~ 95 (95) T protein:vir:81 76 RPILQSILA-----QLTYAYGGETA 95 (95) T ss_pred cHHHHHHHH-----HhhhccccccC Confidence 333333331 13333333322 No 23 >protein:vir:43 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463469;swissprot:trembl:q9t1b5;genbank:gi:16798791;uniprot:Q9T1B5;genbank:GeneID:922374 Probab=60.34 E-value=0.37 Score=22.92 Aligned_cols=99 Identities=11% Similarity=0.021 Sum_probs=57.3 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccC-CC----cccchHHHHHHHHHHHHHhhcCCCC--- Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKT-PF----VKVPPIIHDITVDLARFFFAEDHYT--- 72 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~l-Pl----~~vP~~L~~~a~dIArY~L~~~~~~--- 72 (130) =|+|.+..+-.. ....+.++..+..+..|+..||.+...||.- -+ +.+|..++..||..|-|.--.+... T Consensus 2 ~Y~d~~~Y~~~y--~g~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~~~~~~~~~~~vk~A~c~q~e~~~~~g~~s~~~ 79 (131) T protein:vir:43 2 PYTTLEFYNDEY--AGEHLEQDEFDKLLKHAERKIDSVTFYRIRKGGIESFSEFIQHQIQLATCNQIEYFKEAGGTSELA 79 (131) T ss_pred CCCCHHHHHHhh--CCCCCCHhHHHHHHHHHHHHHHHHhcccccccCccccchhhHHHHHHHHHHHHHHHHHhHHHhhhh Confidence 699988875333 2245677889999999999999999999862 11 4578889999999998875321100 Q ss_pred ------------------CchHHH-----HHHHHHHHHHHHHHHHHHHcCccccCCCCC Q lcl|NC_020477. 73 ------------------SQKPNL-----DEYHIKLKERIEKLLDDIISGVLVLDPDTK 108 (130) Q Consensus 73 ------------------~~e~v~-----~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~ 108 (130) .++.-. .-+++|. .||+ ..|-+--|++.. T Consensus 80 ~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~-----~~L~--~TGLlyrGV~~~ 131 (131) T protein:vir:43 80 VSKPDNVSIGRTSISDSNFASTATSLNSGLIGSDVR-----SYLA--HTGLLYNGVGVR 131 (131) T ss_pred ccccCeeecCceEEeecccccchhhhchhhhHHHHH-----HHHh--ccCCeecCCCCC Confidence 000000 0122222 2333 123322222222 No 24 >protein:vir:95176 Length: 172 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:703 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293420;genbank:gi:148912841;genbank:GeneID:5228251 Probab=57.22 E-value=0.19 Score=24.59 Aligned_cols=118 Identities=16% Similarity=0.204 Sum_probs=55.4 Q ss_pred CCCCHHHHHHh--cccCCCCcCHHHHHHHHHHHHHHHHHH----Hhhh------------------ccCCCcccchHHHH Q lcl|NC_020477. 1 MYAIPDDLRLV--MNNLSKQVTDELLAKYIQEASNYIDAR----LGVA------------------YKTPFVKVPPIIHD 56 (130) Q Consensus 1 MY~t~edl~l~--d~~~~g~~d~~~i~~Al~dAs~~IDgy----L~~R------------------Y~lPl~~vP~~L~~ 56 (130) =|+|.++...- .+...-..|++..+.||..|+..||+| .+.| -.+|-..||.-|+. T Consensus 18 SYvtv~ea~aY~~~rg~~~~~~~~~ke~aL~~A~dyid~~~~~f~G~r~~~~Q~l~wPR~g~~~~~~~v~~~~IP~~V~~ 97 (172) T protein:vir:95 18 SYVSVADARIYASNRGVELPLDDDELAAMLIRSTDYLEAQACRFQGKPTSTTQALQWPRTGVFLNEDEVPSNVIPKSLIA 97 (172) T ss_pred ccccHHHHHHHHHhcCCcCCCChHHHHHHHHHHHHHhhccCCceeeeecCCcccccCCcCCcccCcccccccchhHHHHH Confidence 79999988633 333333347778899999999999985 2221 12355578999999 Q ss_pred HHHHHHHHHhhcCCCCCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCce-------------eeCCCCccc Q lcl|NC_020477. 57 ITVDLARFFFAEDHYTSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFA-------------TTTDGEQIF 123 (130) Q Consensus 57 ~a~dIArY~L~~~~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~-------------~~~~~~r~f 123 (130) .||.+|.+.+-....+.+.. + ...+| -++| |-++..-....+.++- ....+..-| T Consensus 98 A~~elA~~~~~~~~~~~~~~---~-~~~vk------~~kV--G~I~veY~~~~~~~~~~~~~~v~~LL~p~l~~~~~~~~ 165 (172) T protein:vir:95 98 AQVQLTMAINAGFDLQPNVS---P-QDYVT------REKV--GPIETEYADPLSVGIMPTFTAANALLAPLFGECASNKF 165 (172) T ss_pred HHHHHHHHHHcCccccccCC---c-cccee------EEec--cceEEeeccCCCCCCcccHHHHHHHHhhhhcccCCcce Confidence 99999974443211111111 1 11121 1233 4444321100000000 000001111 Q ss_pred CcccCCC Q lcl|NC_020477. 124 TLDQPEW 130 (130) Q Consensus 124 ~R~~~rw 130 (130) +=+.-|- T Consensus 166 ~~r~~r~ 172 (172) T protein:vir:95 166 ALRTIRV 172 (172) T ss_pred eeEEEeC Confidence 1000001 No 25 >protein:vir:102083 Length: 96 # NCBI annotation: DNA packaging protein # Family: family:all:316 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512316;genbank:gi:89152485;genbank:GeneID:3953076 Probab=54.05 E-value=0.51 Score=22.17 Aligned_cols=96 Identities=7% Similarity=0.032 Sum_probs=53.2 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcCCCCCchHHHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAEDHYTSQKPNLDE 80 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~e~v~~r 80 (130) |..|.|++..--|=. +. |++.|+..|.-|...|.++++.+| ..-|+. ..+||-+-.-++|.+|...++..... T Consensus 1 M~vtLee~K~~LRID-~D-dD~lI~~~i~aA~~~i~~~~g~~~----~e~~~~-~k~Avl~lv~~~YenR~~~~~~~~~~ 73 (96) T protein:vir:10 1 MLVTLEEAKEWIRVD-GD-DDPTITMLIKAAELYIYKATGKTF----TQTNED-AKLLCLFLVADWYGNRLLVGEKASEK 73 (96) T ss_pred CcCCHHHHHHHcCCC-Cc-hhHHHHHHHHHHHHHHHHhhCCCC----CCCcch-HHHHHHHHHHHHHhhhhhccccccch Confidence 999999986433221 12 567999999999999999998654 344444 44566666667777765333222122 Q ss_pred HHHHHHHHHHHHHHHHHcCccccCCCCCCCC Q lcl|NC_020477. 81 YHIKLKERIEKLLDDIISGVLVLDPDTKLPA 111 (130) Q Consensus 81 Y~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~ 111 (130) -.-.+. +.|....-+--+ ..++. T Consensus 74 ip~~v~----sli~qLr~~~~~----~~e~~ 96 (96) T protein:vir:10 74 IRTIVQ----SMILQLQYASEP----QEERK 96 (96) T ss_pred hhHHHH----HHHHHHhhcCCc----ccccC Confidence 222332 344444322211 00111 No 26 >protein:vir:107614 Length: 96 # NCBI annotation: conserved phage protein # Family: family:all:316 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338189;genbank:gi:77020184;genbank:GeneID:3703745 Probab=54.05 E-value=0.51 Score=22.17 Aligned_cols=96 Identities=7% Similarity=0.032 Sum_probs=53.2 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcCCCCCchHHHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAEDHYTSQKPNLDE 80 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~e~v~~r 80 (130) |..|.|++..--|=. +. |++.|+..|.-|...|.++++.+| ..-|+. ..+||-+-.-++|.+|...++..... T Consensus 1 M~vtLee~K~~LRID-~D-dD~lI~~~i~aA~~~i~~~~g~~~----~e~~~~-~k~Avl~lv~~~YenR~~~~~~~~~~ 73 (96) T protein:vir:10 1 MLVTLEEAKEWIRVD-GD-DDPTITMLIKAAELYIYKATGKTF----TQTNED-AKLLCLFLVADWYGNRLLVGEKASEK 73 (96) T ss_pred CcCCHHHHHHHcCCC-Cc-hhHHHHHHHHHHHHHHHHhhCCCC----CCCcch-HHHHHHHHHHHHHhhhhhccccccch Confidence 999999986433221 12 567999999999999999998654 344444 44566666667777765333222122 Q ss_pred HHHHHHHHHHHHHHHHHcCccccCCCCCCCC Q lcl|NC_020477. 81 YHIKLKERIEKLLDDIISGVLVLDPDTKLPA 111 (130) Q Consensus 81 Y~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~ 111 (130) -.-.+. +.|....-+--+ ..++. T Consensus 74 ip~~v~----sli~qLr~~~~~----~~e~~ 96 (96) T protein:vir:10 74 IRTIVQ----SMILQLQYASEP----QEERK 96 (96) T ss_pred hhHHHH----HHHHHHhhcCCc----ccccC Confidence 222332 344444322211 00111 No 27 >protein:vir:102863 Length: 96 # NCBI annotation: conserved phage protein # Family: family:all:316 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338138;genbank:gi:77020236;genbank:GeneID:3703772 Probab=54.05 E-value=0.51 Score=22.17 Aligned_cols=96 Identities=7% Similarity=0.032 Sum_probs=53.2 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcCCCCCchHHHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAEDHYTSQKPNLDE 80 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~e~v~~r 80 (130) |..|.|++..--|=. +. |++.|+..|.-|...|.++++.+| ..-|+. ..+||-+-.-++|.+|...++..... T Consensus 1 M~vtLee~K~~LRID-~D-dD~lI~~~i~aA~~~i~~~~g~~~----~e~~~~-~k~Avl~lv~~~YenR~~~~~~~~~~ 73 (96) T protein:vir:10 1 MLVTLEEAKEWIRVD-GD-DDPTITMLIKAAELYIYKATGKTF----TQTNED-AKLLCLFLVADWYGNRLLVGEKASEK 73 (96) T ss_pred CcCCHHHHHHHcCCC-Cc-hhHHHHHHHHHHHHHHHHhhCCCC----CCCcch-HHHHHHHHHHHHHhhhhhccccccch Confidence 999999986433221 12 567999999999999999998654 344444 44566666667777765333222122 Q ss_pred HHHHHHHHHHHHHHHHHcCccccCCCCCCCC Q lcl|NC_020477. 81 YHIKLKERIEKLLDDIISGVLVLDPDTKLPA 111 (130) Q Consensus 81 Y~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~ 111 (130) -.-.+. +.|....-+--+ ..++. T Consensus 74 ip~~v~----sli~qLr~~~~~----~~e~~ 96 (96) T protein:vir:10 74 IRTIVQ----SMILQLQYASEP----QEERK 96 (96) T ss_pred hhHHHH----HHHHHHhhcCCc----ccccC Confidence 222332 344444322211 00111 No 28 >protein:vir:105005 Length: 96 # NCBI annotation: putative DNA packaging protein phage # Family: family:all:316 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459970;genbank:gi:85701385;genbank:GeneID:3882146 Probab=54.05 E-value=0.51 Score=22.17 Aligned_cols=96 Identities=7% Similarity=0.032 Sum_probs=53.2 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcCCCCCchHHHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAEDHYTSQKPNLDE 80 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~e~v~~r 80 (130) |..|.|++..--|=. +. |++.|+..|.-|...|.++++.+| ..-|+. ..+||-+-.-++|.+|...++..... T Consensus 1 M~vtLee~K~~LRID-~D-dD~lI~~~i~aA~~~i~~~~g~~~----~e~~~~-~k~Avl~lv~~~YenR~~~~~~~~~~ 73 (96) T protein:vir:10 1 MLVTLEEAKEWIRVD-GD-DDPTITMLIKAAELYIYKATGKTF----TQTNED-AKLLCLFLVADWYGNRLLVGEKASEK 73 (96) T ss_pred CcCCHHHHHHHcCCC-Cc-hhHHHHHHHHHHHHHHHHhhCCCC----CCCcch-HHHHHHHHHHHHHhhhhhccccccch Confidence 999999986433221 12 567999999999999999998654 344444 44566666667777765333222122 Q ss_pred HHHHHHHHHHHHHHHHHcCccccCCCCCCCC Q lcl|NC_020477. 81 YHIKLKERIEKLLDDIISGVLVLDPDTKLPA 111 (130) Q Consensus 81 Y~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~ 111 (130) -.-.+. +.|....-+--+ ..++. T Consensus 74 ip~~v~----sli~qLr~~~~~----~~e~~ 96 (96) T protein:vir:10 74 IRTIVQ----SMILQLQYASEP----QEERK 96 (96) T ss_pred hhHHHH----HHHHHHhhcCCc----ccccC Confidence 222332 344444322211 00111 No 29 >protein:vir:4857 Length: 104 # NCBI annotation: putative DNA packaging protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049397;genbank:gi:9632425;genbank:GeneID:1258493 Probab=53.05 E-value=0.54 Score=22.06 Aligned_cols=97 Identities=13% Similarity=0.150 Sum_probs=55.4 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCC---cccchHHHHHHHHHHHHHhhcCCCCCc--- Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPF---VKVPPIIHDITVDLARFFFAEDHYTSQ--- 74 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl---~~vP~~L~~~a~dIArY~L~~~~~~~~--- 74 (130) |..|.+++..--|- +..-|++.|+..|.-|.+.|.++++.. +++ ..+|+.++..++-++- ++|.+|...+ T Consensus 1 M~vtLeevK~~LRI-D~d~dD~li~~~i~aA~~~i~~~ig~~--~~~~~~~~~~~~~~~Avl~lv~-~~Y~NR~~~~~~~ 76 (104) T protein:vir:48 1 MSVSKETIMQTLNL-DETDDTALIPAYIESARQYVVNSVGDD--PKFYNLDSVRALFDTAVIALTS-SYFTYRVALTDTA 76 (104) T ss_pred CcccHHHHHHHcCC-CCccchHHHHHHHHHHHHHHHHhhCCC--CCcccccCCChhHHHHHHHHHH-HHHhhhhhhcccc Confidence 99999999744332 244588999999999999999999742 222 3456555555555544 6666553222 Q ss_pred -hHHHHHHHHHHHHHHHHHHHHHHcCccc Q lcl|NC_020477. 75 -KPNLDEYHIKLKERIEKLLDDIISGVLV 102 (130) Q Consensus 75 -e~v~~rY~~Aik~~~~a~L~~Ia~G~~~ 102 (130) .++-.--+.-|. ..++|......|.-. T Consensus 77 ~~~ip~~v~sli~-~lR~~y~~~~~~~~~ 104 (104) T protein:vir:48 77 TYPVNLTLNSIIG-QLRGLYATYSEERGD 104 (104) T ss_pred cchhhHHHHHHHH-HHHHhhhhhcccCCC Confidence 222222222221 122334444443333 No 30 >protein:vir:99002 Length: 158 # NCBI annotation: gp31 # Family: family:all:32652 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655896;genbank:gi:109521468;genbank:GeneID:4157967 Probab=52.19 E-value=0.19 Score=24.60 Aligned_cols=117 Identities=9% Similarity=0.019 Sum_probs=55.6 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHH---HHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc-CCCC---C Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKY---IQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE-DHYT---S 73 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~A---l~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~-~~~~---~ 73 (130) -|+|.|+++...+..-.+--..+..+| |+++|++.--+=+++..+| ..+|..++.+|+.-|+=.+-. ++.. . T Consensus 3 alasvee~~trl~~~lp~~~~r~~a~a~~vLd~~S~~ar~~~gr~W~~~-~daP~~vr~ivL~aa~R~~~NP~g~~~~~~ 81 (158) T protein:vir:99 3 ALVSVEEFTTFLRVPLPEEGSEKYTQMEFLLTLASDWARELSCKPWLLP-ADAPVTARGIILAASRREWNNPKRVSYVVK 81 (158) T ss_pred ceeeHhhhhhhhcccCChhhhHHHHHHHHHHHHHHHHHHHhcCccCCCC-CcchhHHHHHHHHHHHHHHhcCCceEEeee Confidence 799999998665432222222344555 9999999887777777766 578999999999888755432 1100 0 Q ss_pred chHHHHHHHHH------HHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCcccCcccCCC Q lcl|NC_020477. 74 QKPNLDEYHIK------LKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIFTLDQPEW 130 (130) Q Consensus 74 ~e~v~~rY~~A------ik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f~R~~~rw 130 (130) .+.. .+|-+. +-+...+-|++.. ..+=|.- +..++. -=+++-.-| T Consensus 82 G~~~-~~~~~~g~~~~ffT~~E~~~L~r~~--~s~GG~~------~~~ttR---~d~~~~~~y 132 (158) T protein:vir:99 82 GPQS-ATFMQSAYPPGFFTDAEEAKLRSYG--RSTGNWG------VIETYR---DDEEQLNGY 132 (158) T ss_pred cchh-hhcccccCCCcccCHHHHHHHHHhh--cccCcee------EEEeec---CccccCCce Confidence 0000 001000 0000012345442 2111110 111111 111112234 No 31 >protein:vir:94955 Length: 170 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239280;genbank:gi:66392062;genbank:GeneID:5076600 Probab=51.05 E-value=0.36 Score=23.04 Aligned_cols=111 Identities=13% Similarity=0.170 Sum_probs=59.4 Q ss_pred CCCCHHHHHHhc--c---cCCCCcCHHHHHHHHHHHHHHHHH---HHhhhc------------------cCCCcccchHH Q lcl|NC_020477. 1 MYAIPDDLRLVM--N---NLSKQVTDELLAKYIQEASNYIDA---RLGVAY------------------KTPFVKVPPII 54 (130) Q Consensus 1 MY~t~edl~l~d--~---~~~g~~d~~~i~~Al~dAs~~IDg---yL~~RY------------------~lPl~~vP~~L 54 (130) =|+|.++.+.-. + ..-...|++..+.+|..|+..||+ |.+.|- .+|-..||.-| T Consensus 15 SYvtv~ea~aY~~~r~~~~~w~~~~~~~~e~aL~~A~dyId~~~~f~G~r~~~~Q~l~wPR~g~~~dg~~~~~~~IP~~V 94 (170) T protein:vir:94 15 SYVTVAEANSYFDGSYGRPLWTSASEDEKASLVISASRYLDQMMAWIGAPTNPEQSMWWPCKNAVIGGMTLSQVSIPVKV 94 (170) T ss_pred ceecHHHHHHHHHhhccccccCCCCHHHHHHHHHHHHHHhccccccccccCCcchhhcccccCcccCccccccchhhHHH Confidence 899999986422 1 122467888899999999999997 233321 13556789999 Q ss_pred HHHHHHHHHHHhhcCCCCCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCce-------------------e Q lcl|NC_020477. 55 HDITVDLARFFFAEDHYTSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFA-------------------T 115 (130) Q Consensus 55 ~~~a~dIArY~L~~~~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~-------------------~ 115 (130) +..+|.+|.+.+-..... ... +.+++ -++| |-++..-....++... . T Consensus 95 ~~Aq~elA~~~~~~~~~~--~~~----~~~v~------~~kV--G~i~veY~~~~~~~~~~~~v~~LL~p~l~~~~~g~~ 160 (170) T protein:vir:94 95 KIAVFELAYFMLESGAAL--SFA----DQTID------SVKV--GTIRVEFTKNSTDAGLPTFVEAMLSGFGSPVLYGSN 160 (170) T ss_pred HHHHHHHHHHHHhCcccC--ccc----cccee------eEec--ceeEEEecCCCCCCccHHHHHHHhhhhhcccccccc Confidence 999999999887432211 111 11232 2444 5554432111110000 0 Q ss_pred eCCCCcccCc Q lcl|NC_020477. 116 TTDGEQIFTL 125 (130) Q Consensus 116 ~~~~~r~f~R 125 (130) ....-++|-- T Consensus 161 ~~~~~~~~r~ 170 (170) T protein:vir:94 161 AARSIDLVRA 170 (170) T ss_pred ccceeeeecC Confidence 0000111111 No 32 >protein:vir:97267 Length: 172 # NCBI annotation: hypothetical protein ORF024 # Family: family:all:703 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294532;genbank:gi:149408253;genbank:GeneID:5237132 Probab=50.39 E-value=0.22 Score=24.21 Aligned_cols=121 Identities=12% Similarity=0.086 Sum_probs=55.4 Q ss_pred CCCCHHHHHHhcccC----CCCcCHHHHHHHHHHHHHHHHH---HHhhhc-c-----------------CCCcccchHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNL----SKQVTDELLAKYIQEASNYIDA---RLGVAY-K-----------------TPFVKVPPIIH 55 (130) Q Consensus 1 MY~t~edl~l~d~~~----~g~~d~~~i~~Al~dAs~~IDg---yL~~RY-~-----------------lPl~~vP~~L~ 55 (130) =|+|.++...-.... ++. +++-.+++|..|+..||+ |.+.|= . +|...||.-|+ T Consensus 17 SYvtv~~a~aY~~~rg~~~~a~-~~~~ke~aLi~A~~yiD~~~~f~G~r~~~~~Q~l~WPRtg~~d~~~~~~~~IP~~v~ 95 (172) T protein:vir:97 17 AYISVEEFKTYHTDRGNSFAGS-TDPQIEAAVIRATDYLDQRFNFVGKKRLGRDQTTEWPRTDAWDRDRYYINDIPPEVK 95 (172) T ss_pred ccccHHHHHHHHHhcCcccCCC-CcHHHHHHHHHHHHHHhhhhhcccCCCCCcchhhhcccCCCCCCcccccccccHHHH Confidence 899999876433221 222 334478899999999997 334342 1 24456899999 Q ss_pred HHHHHHHHHHhhcCCCCCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCC-CCCCCcee-e---C--CCCcccCcccC Q lcl|NC_020477. 56 DITVDLARFFFAEDHYTSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDT-KLPAGFAT-T---T--DGEQIFTLDQP 128 (130) Q Consensus 56 ~~a~dIArY~L~~~~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~-~~~~~~~~-~---~--~~~r~f~R~~~ 128 (130) ..||.+|.+-|-..-....+.-... .++ .-|++.-|.++..-.. ..+.++.. . . =.++-+++..- T Consensus 96 ~A~~elA~~al~~~l~~d~~~~~~~--~~v------~~kr~kvg~i~~~y~~~~~~~~~~p~~~~v~aLL~p~gl~~~~~ 167 (172) T protein:vir:97 96 EACAEYALRALAAELNPDPERNASG--VAV------LSKSEAVGPISESVTFVGGAVFQMPKYPAADQKLVRAGLVRSGG 167 (172) T ss_pred HHHHHHHHHHHhccccccccccccc--ccc------eeeeeeecceeeEeeccCCCCCccccHHHHHHHHhhhccccCcc Confidence 9999999988754321111110000 000 0122222444332100 00000000 0 0 00011222222 Q ss_pred CC Q lcl|NC_020477. 129 EW 130 (130) Q Consensus 129 rw 130 (130) +. T Consensus 168 ~~ 169 (172) T protein:vir:97 168 TL 169 (172) T ss_pred ee Confidence 22 No 33 >protein:vir:4458 Length: 107 # NCBI annotation: hypothetical protein # Family: family:all:363 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700381;genbank:gi:23505453;genbank:GeneID:955660 Probab=49.40 E-value=0.64 Score=21.65 Aligned_cols=92 Identities=8% Similarity=0.023 Sum_probs=49.7 Q ss_pred CCCCHHHHHHhcccCCC-CcCHHHHHHHHHHHHHHHHHHHhhhccCCCc----------ccchHHHHHHHHHHHHHhhcC Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSK-QVTDELLAKYIQEASNYIDARLGVAYKTPFV----------KVPPIIHDITVDLARFFFAED 69 (130) Q Consensus 1 MY~t~edl~l~d~~~~g-~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~----------~vP~~L~~~a~dIArY~L~~~ 69 (130) |+.|.+++..-=|-... .-|+..|+..|.-|++.|-+|++.++.-.-. .+|+.++. |+-+-.-++|.+ T Consensus 1 M~vtLee~K~hLRId~D~~dDD~lI~~~i~AA~~~i~~~~~r~l~~~~~~~~~~~~~~~~~~~~~~~-AiLllv~~~Y~N 79 (107) T protein:vir:44 1 MLLSVEEIKAQLRLDEDFEADERYLQLLARAVQKRTETYLNRKLYAPDETIPDSDPDGLLLQDDIRL-GMLMLISHFYEN 79 (107) T ss_pred CCCCHHHHHHHcCCCCCCchhHHHHHHHHHHHHHHHHHhhcCccccccccccccccccccchhhHHH-HHHHHHHHHHhh Confidence 99999999755443222 3457799999999999999999877532111 13455554 555555567765 Q ss_pred CCCCchH----HHHHHHHHHHHHHHHHHH Q lcl|NC_020477. 70 HYTSQKP----NLDEYHIKLKERIEKLLD 94 (130) Q Consensus 70 ~~~~~e~----v~~rY~~Aik~~~~a~L~ 94 (130) |...++. +-.-.+.-|. ..+-|=+ T Consensus 80 Re~~~~~~~~~lP~~v~~Ll~-~yR~~p~ 107 (107) T protein:vir:44 80 RSSVTEVEKLDMPQSFGWLVG-PYRYFPQ 107 (107) T ss_pred hhhhccccccccCHHHHHHHH-HhhhcCC Confidence 5322211 1111111111 0011111 No 34 >protein:vir:78478 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491587;genbank:gi:157786410;genbank:GeneID:5625630 Probab=48.90 E-value=0.66 Score=21.59 Aligned_cols=118 Identities=13% Similarity=0.182 Sum_probs=60.1 Q ss_pred CCCCHHHHHHhc-ccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCC-cccc---hHHHHHHHHHHHHHhhcCCCCCch Q lcl|NC_020477. 1 MYAIPDDLRLVM-NNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPF-VKVP---PIIHDITVDLARFFFAEDHYTSQK 75 (130) Q Consensus 1 MY~t~edl~l~d-~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl-~~vP---~~L~~~a~dIArY~L~~~~~~~~e 75 (130) =|+|.+|++..- +..+. .....++.-|++||..|-.-+- .|+- .+.| +.++.++|++.+=-+....+..++ T Consensus 2 afAtv~Dve~rw~r~LT~-eE~~~ae~lL~dAs~~IR~~iP---~La~~~~dp~~~a~v~~V~~~mV~R~~rnpeG~~S~ 77 (149) T protein:vir:78 2 AYAEPSDVVARLGRPLTD-DEETQVETFLEDAEIEIRSRIP---DLDDKAEDEDYLKRVIKVEASAVTRLIRNPDGYIGE 77 (149) T ss_pred CcCCHHHHHHHhCCCCCH-HHHHHHHHHHHHHHHHHHHhcc---ccccccCCcchhhHHHHHHHHHHHHHhcCCCCeeee Confidence 699999999766 66553 2234789999999999865331 2221 2233 568899999887655432222122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHcCccccCCC------CCCCCCce---------------eeC-CCCcccCccc-CCC Q lcl|NC_020477. 76 PNLDEYHIKLKERIEKLLDDIISGVLVLDPD------TKLPAGFA---------------TTT-DGEQIFTLDQ-PEW 130 (130) Q Consensus 76 ~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~------~~~~~~~~---------------~~~-~~~r~f~R~~-~rw 130 (130) .. ..|-..+. .....|.+-|..+ ...+.|.- .+. -.=.+|-..+ +-| T Consensus 78 T~-G~YS~slt-------~~np~G~LylT~~E~a~LG~~r~~G~~~i~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 147 (149) T protein:vir:78 78 TD-GNYSYQLN-------WRLNTGAIEITDKEWAQLGLSKNVGVLNVRPKTPLERSGEYPAFGSVEWQVFQQSSPLYW 147 (149) T ss_pred ec-chhhhhhh-------ccCCCCceeeCHHHHHhhCCcccccceeecccCccccCCCCCcccceeeeeeeccCcccc Confidence 22 45555442 1333455433210 00010110 111 1124555444 344 No 35 >protein:vir:78254 Length: 149 # NCBI annotation: gp16 # Family: family:all:2817 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491668;genbank:gi:157786492;genbank:GeneID:5625732 Probab=48.90 E-value=0.66 Score=21.59 Aligned_cols=118 Identities=13% Similarity=0.182 Sum_probs=60.1 Q ss_pred CCCCHHHHHHhc-ccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCC-cccc---hHHHHHHHHHHHHHhhcCCCCCch Q lcl|NC_020477. 1 MYAIPDDLRLVM-NNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPF-VKVP---PIIHDITVDLARFFFAEDHYTSQK 75 (130) Q Consensus 1 MY~t~edl~l~d-~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl-~~vP---~~L~~~a~dIArY~L~~~~~~~~e 75 (130) =|+|.+|++..- +..+. .....++.-|++||..|-.-+- .|+- .+.| +.++.++|++.+=-+....+..++ T Consensus 2 afAtv~Dve~rw~r~LT~-eE~~~ae~lL~dAs~~IR~~iP---~La~~~~dp~~~a~v~~V~~~mV~R~~rnpeG~~S~ 77 (149) T protein:vir:78 2 AYAEPSDVVARLGRPLTD-DEETQVETFLEDAEIEIRSRIP---DLDDKAEDEDYLKRVIKVEASAVTRLIRNPDGYIGE 77 (149) T ss_pred CcCCHHHHHHHhCCCCCH-HHHHHHHHHHHHHHHHHHHhcc---ccccccCCcchhhHHHHHHHHHHHHHhcCCCCeeee Confidence 699999999766 66553 2234789999999999865331 2221 2233 568899999887655432222122 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHcCccccCCC------CCCCCCce---------------eeC-CCCcccCccc-CCC Q lcl|NC_020477. 76 PNLDEYHIKLKERIEKLLDDIISGVLVLDPD------TKLPAGFA---------------TTT-DGEQIFTLDQ-PEW 130 (130) Q Consensus 76 ~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~------~~~~~~~~---------------~~~-~~~r~f~R~~-~rw 130 (130) .. ..|-..+. .....|.+-|..+ ...+.|.- .+. -.=.+|-..+ +-| T Consensus 78 T~-G~YS~slt-------~~np~G~LylT~~E~a~LG~~r~~G~~~i~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 147 (149) T protein:vir:78 78 TD-GNYSYQLN-------WRLNTGAIEITDKEWAQLGLSKNVGVLNVRPKTPLERSGEYPAFGSVEWQVFQQSSPLYW 147 (149) T ss_pred ec-chhhhhhh-------ccCCCCceeeCHHHHHhhCCcccccceeecccCccccCCCCCcccceeeeeeeccCcccc Confidence 22 45555442 1333455433210 00010110 111 1124555444 344 No 36 >protein:vir:4228 Length: 125 # NCBI annotation: predicted 14.0Kd protein # Family: family:all:2817 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039683;swissprot:sw:q05225;genbank:gi:9625449;uniprot:Q05225;genbank:GeneID:2942926 Probab=48.15 E-value=0.68 Score=21.51 Aligned_cols=110 Identities=15% Similarity=0.173 Sum_probs=60.5 Q ss_pred CCCCHHHHHHhc-ccCCCCcCHHHHHHHHHHHHHHHHHHHhhhcc-CC-----CcccchHHHHHHHHHHHHHhhcC-CCC Q lcl|NC_020477. 1 MYAIPDDLRLVM-NNLSKQVTDELLAKYIQEASNYIDARLGVAYK-TP-----FVKVPPIIHDITVDLARFFFAED-HYT 72 (130) Q Consensus 1 MY~t~edl~l~d-~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~-lP-----l~~vP~~L~~~a~dIArY~L~~~-~~~ 72 (130) -|+|.+|++..- +..+.+ ....|+.-|++|+..|-. |++ |+ -+..+..++.++.+..+= |..+ ++. T Consensus 2 ~~A~~eDV~a~w~r~lt~~-e~~~v~~~L~~Ae~~Ir~----riPdL~~r~~~~~~~~~~v~~Vea~aV~R-v~RNpeGy 75 (125) T protein:vir:42 2 AYATAEDVVTLWAKEPEPE-VMALIERRLQQIERMIKR----RIPDLDVKAAASATFRADLIDIEADAVLR-LVRNPEGY 75 (125) T ss_pred CcccHhHHHHHhCCCCChH-HHHHHHHHHHHHHHHHHH----hCCCchhhhcccCcchhhHHHHHHHHHHH-HHhCCCcc Confidence 699999998665 554443 567889999999998743 333 21 235577788888776654 4433 221 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHcCccccCCC------CCCCCCceeeCCCCcccC Q lcl|NC_020477. 73 SQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPD------TKLPAGFATTTDGEQIFT 124 (130) Q Consensus 73 ~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~------~~~~~~~~~~~~~~r~f~ 124 (130) -++.. ..|-.-+. .+.+.|++.+..+ +....|..+.....-.=+ T Consensus 76 ~s~T~-G~Ys~~l~-------~~~~~g~L~it~eEw~~L~p~~~~g~~~i~P~~~~~~ 125 (125) T protein:vir:42 76 LSETD-GAYTYQLQ-------ADLSQGKLTILDEEWEILGVNSQKRMAVIVPNVVMPT 125 (125) T ss_pred ccccc-hhHHHhhh-------cccccCceeeCHHHHHhhCccccccceeecccceeCC Confidence 11111 55655542 3667788876432 111222222221111111 No 37 >protein:vir:95004 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224025;genbank:gi:62327312;genbank:GeneID:5176832 Probab=48.14 E-value=0.41 Score=22.70 Aligned_cols=113 Identities=13% Similarity=0.152 Sum_probs=57.4 Q ss_pred CCCCHHHHHHhc--ccCCCCcCHHHHHHHHHHHHHHHHHH----Hhhhc------------------cCCCcccchHHHH Q lcl|NC_020477. 1 MYAIPDDLRLVM--NNLSKQVTDELLAKYIQEASNYIDAR----LGVAY------------------KTPFVKVPPIIHD 56 (130) Q Consensus 1 MY~t~edl~l~d--~~~~g~~d~~~i~~Al~dAs~~IDgy----L~~RY------------------~lPl~~vP~~L~~ 56 (130) =|+|.++...-. +-..-..|+...+.+|..|+..||++ ++.|- .+|-..||.-|+. T Consensus 16 SYvt~~ea~aY~~~rg~~~~~dd~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~~~~~~IP~~V~~ 95 (169) T protein:vir:95 16 SYVSLEDGRALAAKYGLELPEDDIAAEASLRNGAVYVGLFESQMCGRRVSANQALAFPRTGIDLHGFPQPSNVIPSLVIQ 95 (169) T ss_pred ccccHHHHHHHHHHcCCcCCCCHHHHHHHHHHHHHHhhccccccccccCCcchhhccccCCceecccccccccchHHHHH Confidence 799999886433 33333346778899999999999983 33321 2456688999999 Q ss_pred HHHHHHHHHhhcCCCCCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCcee--------------eCCCC-- Q lcl|NC_020477. 57 ITVDLARFFFAEDHYTSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFAT--------------TTDGE-- 120 (130) Q Consensus 57 ~a~dIArY~L~~~~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~--------------~~~~~-- 120 (130) .||.+|.+-+-......+..- + +++ -++ ..|-++..-....+.++.. ..++. T Consensus 96 A~~elA~~~~~g~~~~~~~~~-~----~v~------~e~-v~G~i~veY~~~~~~~~~~~~~a~~~LL~p~l~g~~g~~~ 163 (169) T protein:vir:95 96 AQVMAAVEYGAGTDVRGSTDG-R----EVQ------TER-VEGAVTVSYFKNGYSGGTVSITAADDALRPLLCGSNNAYS 163 (169) T ss_pred HHHHHHHHHHcCccccCCCCc-c----cee------eee-eccceeEeecCCCCcCccccHHHHHHhhhhhcccCCCcce Confidence 999999999853211111100 0 111 111 1244433211111110000 00000 Q ss_pred -cccCcc Q lcl|NC_020477. 121 -QIFTLD 126 (130) Q Consensus 121 -r~f~R~ 126 (130) ++| |. T Consensus 164 i~~~-rg 169 (169) T protein:vir:95 164 FNVF-RG 169 (169) T ss_pred eeee-cC Confidence 111 11 No 38 >protein:vir:6243 Length: 122 # NCBI annotation: gp37 # Family: family:all:11657 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813697;swissprot:trembl:q859c0;genbank:gi:29366757;uniprot:Q859C0;genbank:GeneID:1258898 Probab=47.85 E-value=0.69 Score=21.47 Aligned_cols=115 Identities=18% Similarity=0.204 Sum_probs=67.6 Q ss_pred CCCCHHHHHHhcc-cCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcCCCCCchHHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMN-NLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAEDHYTSQKPNLD 79 (130) Q Consensus 1 MY~t~edl~l~d~-~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~e~v~~ 79 (130) -|+|.|+++.... +......++.+..||+-.-+.+.-|++..+...=.|.|.+++-....+||-+..+--...++ T Consensus 2 ayatieelralegiddaslfpdellsdaidfsvetvevycgqkwdtaenptpevirwcvrtlarqyvldhvsripd---- 77 (122) T protein:vir:62 2 AYATIEELRALEGIDDASLFPDELLSDAIDFSVETVEVYCGQKWDTAENPTPEVIRWCVRTLARQYVLDHVSRIPD---- 77 (122) T ss_pred ccchhhhhHhhccccccccchhhhhhhhhhhhhhhhhhhcCcccCCcCCCchHHHHHHHHHHHHHHHHHHhhhcch---- Confidence 7999999986543 23356677899999999999999999999999888999988777778898777542111222 Q ss_pred HHHHHHHHHHHHHHHHHHcCccccCCCCC--CCCCceeeCCCCcccCcccCCC Q lcl|NC_020477. 80 EYHIKLKERIEKLLDDIISGVLVLDPDTK--LPAGFATTTDGEQIFTLDQPEW 130 (130) Q Consensus 80 rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~--~~~~~~~~~~~~r~f~R~~~rw 130 (130) .|++ |+ ---|.+.|.-.+. .|.+-..++..-.+ -|-++-| T Consensus 78 ---ralq------lq-sefgsiqlaqaggtwrptslpevnaklnl-yrvrlpf 119 (122) T protein:vir:62 78 ---RALQ------LQ-SEFGSIQLAQAGGTWRPTSLPEVNAKLNL-YRVRLPF 119 (122) T ss_pred ---hhhh------hh-hcccceeeeccCCccccCcCcccccceee-eEeecce Confidence 2442 21 1235565532211 11111111111111 0111222 No 39 >protein:vir:80967 Length: 131 # NCBI annotation: gp8 # Family: family:all:3882 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468394;genbank:gi:157324968;genbank:GeneID:5601402 Probab=43.90 E-value=0.83 Score=21.04 Aligned_cols=99 Identities=12% Similarity=0.031 Sum_probs=57.5 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCC-C----cccchHHHHHHHHHHHHHhhcCCCC--- Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTP-F----VKVPPIIHDITVDLARFFFAEDHYT--- 72 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lP-l----~~vP~~L~~~a~dIArY~L~~~~~~--- 72 (130) =|+|.+..+-.. ..+.+..+..+..+..|+..||.+...|+.-- + +.+|..++..||..|-|.--.+... T Consensus 2 ~Y~d~~~Y~~~y--~G~~i~e~~F~~l~~rAs~~ID~~T~~ri~~~~~d~~~~~~~~~vk~A~c~q~e~~~~~g~~~~~~ 79 (131) T protein:vir:80 2 PYTTLEFYTNEY--AGEHLEQDEFAKLLKHAERKIDSVTFYRIRKSGIEAFSEFIQHQIQLATCNQIEYFKEAGGTSELA 79 (131) T ss_pred CCCCHHHHHHhh--CCCCCchhHHHHHHHHHHHHHHHHhcccccccccccCchhHHHHHHHHHHHHHHHHHHhhhhhhhc Confidence 599998876433 22446667799999999999999999998632 1 4578889999999998765321100 Q ss_pred ------------------CchHH-----HHHHHHHHHHHHHHHHHHHHcCccccCCCCC Q lcl|NC_020477. 73 ------------------SQKPN-----LDEYHIKLKERIEKLLDDIISGVLVLDPDTK 108 (130) Q Consensus 73 ------------------~~e~v-----~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~ 108 (130) .++.. .+.+++|. .||+ ..|-+--|++.. T Consensus 80 ~~~~~S~svG~~Svs~~~~~~~~~~~~~~~~~~~a~-----~~L~--~TGLlyrGV~~~ 131 (131) T protein:vir:80 80 VSKPDNVSIGRTSISDSNFASTATSLNSGLVGSDVR-----SYLA--HTGLLYNGVGVR 131 (131) T ss_pred ccccCeeeeCceEEeeccccchhhhhhhhhhHHHHH-----HHHh--ccCCeecCCCCC Confidence 00000 00122222 2333 223332232222 No 40 >protein:vir:104088 Length: 125 # NCBI annotation: gp20 # Family: family:all:2817 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655599;genbank:gi:109392470;genbank:GeneID:4156956 Probab=43.85 E-value=0.83 Score=21.03 Aligned_cols=110 Identities=15% Similarity=0.141 Sum_probs=58.8 Q ss_pred CCCCHHHHHHhc-ccCCCCcCHHHHHHHHHHHHHHHHHHHhhhcc-CC-----CcccchHHHHHHHHHHHHHhhcCC-CC Q lcl|NC_020477. 1 MYAIPDDLRLVM-NNLSKQVTDELLAKYIQEASNYIDARLGVAYK-TP-----FVKVPPIIHDITVDLARFFFAEDH-YT 72 (130) Q Consensus 1 MY~t~edl~l~d-~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~-lP-----l~~vP~~L~~~a~dIArY~L~~~~-~~ 72 (130) =|+|.+|++..- +..+.+ ....|+.-|++|+..|-. |++ |+ -+..+..++.++.+..+ ++..+. +. T Consensus 2 a~A~~~Dv~~~w~r~lT~~-E~~~v~~~L~~Ae~~Ir~----riP~L~~r~~a~~~~~~~v~~Vea~aV~-Rv~rNPeGy 75 (125) T protein:vir:10 2 AYANAQDVVTLWAKEPEPE-VMELIERRLAQVERMIKR----RIPNLDLKVAADATFQADLIDIEADAVL-RLVRNPEGY 75 (125) T ss_pred CcCCHHHHHHHhCCCCCHH-HHHHHHHHHHHHHHHHHH----hCCChhhhhhcCCCccccHHHHHHHHHH-HHhcCCCcc Confidence 699999998776 555543 567889999999998743 332 21 34567777777666544 454432 21 Q ss_pred CchHHHHHHHHHHHHHHHHHHHHHHcCccccCCC------CCCCCCceeeCCCCcccC Q lcl|NC_020477. 73 SQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPD------TKLPAGFATTTDGEQIFT 124 (130) Q Consensus 73 ~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~------~~~~~~~~~~~~~~r~f~ 124 (130) -++ -.-.|-.-+. .+.++|++-|..+ +....+........-.=+ T Consensus 76 ~s~-T~G~Ys~~l~-------~~~~~g~L~it~~Ew~~Lg~~r~s~~~~i~p~~~~~~ 125 (125) T protein:vir:10 76 ISE-TDGAYTYQLQ-------TDLSQGRLTILDDEWTTLGVNRLSRMSVIAPNIVMPT 125 (125) T ss_pred ccc-ccchhHHhhh-------cccccCceeeCHHHHHhhccccccceeeeecccccCC Confidence 111 1145555442 3567788766422 222222222111111111 No 41 >protein:vir:98900 Length: 132 # NCBI annotation: hypothetical protein # Family: family:all:3882 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164420;genbank:gi:56694910;genbank:GeneID:3197290 Probab=43.09 E-value=0.35 Score=23.11 Aligned_cols=114 Identities=14% Similarity=0.125 Sum_probs=64.9 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCC-Cc----ccchHHHHHHHHHHHHHhhcCCCCCch Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTP-FV----KVPPIIHDITVDLARFFFAEDHYTSQK 75 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lP-l~----~vP~~L~~~a~dIArY~L~~~~~~~~e 75 (130) -|+|.+..+- -....++++..++.+..|+..||.+...||.-. |. .++..++..+|..+-|.-- .+....| T Consensus 2 ~Y~t~~~Y~~---~~G~~i~e~~F~~l~~rAs~~ID~iT~~ri~~~~~~~d~~~~~~~vk~A~c~qiey~~~-~G~~sae 77 (132) T protein:vir:98 2 PYLTYEEFMD---LNGRDIDDKKFEKLLPKASAIIDGVTGHFYQKVDMEKDNAWRVNQFKLALCAQIEYFDA-LGATTFE 77 (132) T ss_pred CCCCHHHHHh---hcCCCCCHHHHHHHHHHHHHHHHHHhcccccCCCccccChHHHHHHHHHHHHHHHHHHh-ccchhhh Confidence 6999999852 122357888899999999999999999999642 32 3445678888877776532 2211112 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeC------------CCCcccCcccCCC Q lcl|NC_020477. 76 PNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTT------------DGEQIFTLDQPEW 130 (130) Q Consensus 76 ~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~------------~~~r~f~R~~~rw 130 (130) .... -+..+.-|+.++.-....+...+... .+-.+.=|.=.+| T Consensus 78 ~~~~------------~~~S~svG~~Svs~~s~~~~~~~~~~~~~~~~~a~~~L~~tGLLyrGV~~~ 132 (132) T protein:vir:98 78 EINN------------SPQTFQAGRTSVSNASRYNPSGANESKPLVAEDVYIYLQGTGLLFQGVKTW 132 (132) T ss_pred hccC------------ccceeeeCcEEEEeeccCCcccccccccchHHHHHHHHhhcCCccccCCCC Confidence 2111 14556777777654322211111000 0112233444556 No 42 >protein:vir:94507 Length: 113 # NCBI annotation: putative DNA packaging protein # Family: family:all:372 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223891;genbank:gi:62327103;genbank:GeneID:5075523 Probab=39.82 E-value=1 Score=20.58 Aligned_cols=99 Identities=16% Similarity=0.207 Sum_probs=61.9 Q ss_pred CCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc--CCCCCc----- Q lcl|NC_020477. 2 YAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE--DHYTSQ----- 74 (130) Q Consensus 2 Y~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~--~~~~~~----- 74 (130) -.+.+++.+.-.- +....++.++..|.+|+..|=.||+ +|.+-...+|.-|..+.+++|..+.-. +++..+ T Consensus 1 M~~L~~vK~~lgi-~d~~~D~lL~~iI~~a~~~i~~~l~-~~~~~~~~iP~~l~~Iv~evavkryNR~g~EG~~S~SeeG 78 (113) T protein:vir:94 1 MALLDSIKLRIGI-EDTKQDDLLTDIISDVQARVLAYVN-QDGLVQSELPNGLDFVIKDVTIRIYNKIGDEGKESSSEGN 78 (113) T ss_pred CchHHHHHHHhCC-CCCchhhHHHHHHHHHHHHHHHHhC-CccchhhhhhhHHHHHHHHHHHHHhcccCCccceeeecCc Confidence 3555665533222 2233346899999999999999998 566666789999999999998877643 222110 Q ss_pred ------h-HHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 75 ------K-PNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 75 ------e-~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) + .--+-|.+-|. +|++.-.++ ..+.|+| T Consensus 79 ~S~sf~~~~df~~y~~~l~----~~~~~~~~~-----------------~~g~rF~ 113 (113) T protein:vir:94 79 VSNTWDTPADLSEYSDVLD----VYRKSYKRR-----------------SAGMRFI 113 (113) T ss_pred eeeeecCccchhhHHHHHH----HHHhhccCC-----------------CCCceeC Confidence 1 12467777775 476642111 1234555 No 43 >protein:vir:106596 Length: 128 # NCBI annotation: ORF042 # Family: family:all:372 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239495;genbank:gi:66395254;genbank:GeneID:4555750 Probab=36.82 E-value=1.2 Score=20.25 Aligned_cols=98 Identities=17% Similarity=0.275 Sum_probs=58.1 Q ss_pred CCCC-----------------HHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHH Q lcl|NC_020477. 1 MYAI-----------------PDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLAR 63 (130) Q Consensus 1 MY~t-----------------~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIAr 63 (130) |-.| .+++.+.-. .+....++.++..|++|...|-.||+... ..+|.-|..+..++|. T Consensus 1 ~~~~~~~~~~~~~~~~~~m~~Le~vK~~Lg-I~d~~~D~lL~~lI~~a~~~i~~~l~~~~----~~iP~~L~~Iv~evaV 75 (128) T protein:vir:10 1 MVVTKLHKSLLQLNSGEVMNYLDDVKSRIG-LNDNEQDKQLNSIINNVAAELLSRLPVDT----ISIPDKLQFIVVEVST 75 (128) T ss_pred CcccccccchheecHHHHHHHHHHHHHHhC-CCCcchhhHHHHHHHHHHHHHHHHcCCCh----hhhhhhHHHHHHHHHH Confidence 3333 333332211 12344457899999999999999998322 3578999999988877 Q ss_pred HHhhc--CCCCC-----------chHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 64 FFFAE--DHYTS-----------QKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 64 Y~L~~--~~~~~-----------~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) -+.-. +++.. .+..-+.|.+-|. +|++. .|+.. .+..++| T Consensus 76 kryNR~g~EG~~S~SeeG~S~tf~dnd~~~Y~~~L~----~y~~~--~~~~~--------------kG~v~F~ 128 (128) T protein:vir:10 76 KRYNRIGAEGMSTDSQDGRSNTFERNDFEEYQSIID----ALYPK--LDSSE--------------RGSVNFY 128 (128) T ss_pred HHhcccCccCcceeeeCceeeeeccCCcchhHHHHH----HHHhh--ccCCC--------------CCceeeC Confidence 66532 22211 1334667888876 57764 22211 2345666 No 44 >protein:vir:78383 Length: 169 # NCBI annotation: hypothetical protein # Family: family:all:703 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110841;genbank:gi:134288602;genbank:GeneID:5179646 Probab=36.06 E-value=0.68 Score=21.52 Aligned_cols=113 Identities=14% Similarity=0.157 Sum_probs=55.1 Q ss_pred CCCCHHHHHHhc--ccCCCCcCHHHHHHHHHHHHHHHHHH----Hhhhc------------------cCCCcccchHHHH Q lcl|NC_020477. 1 MYAIPDDLRLVM--NNLSKQVTDELLAKYIQEASNYIDAR----LGVAY------------------KTPFVKVPPIIHD 56 (130) Q Consensus 1 MY~t~edl~l~d--~~~~g~~d~~~i~~Al~dAs~~IDgy----L~~RY------------------~lPl~~vP~~L~~ 56 (130) =|+|.++...-. +...-..|+...+.+|..|+..||+| ++.|- .+|-..||.-|+. T Consensus 16 SYvtv~~a~aY~~~rg~~~~~d~~~~e~aL~~A~~yid~~~~~f~G~r~~~~Q~l~wPRtg~~~~g~~~~~~~IP~~v~~ 95 (169) T protein:vir:78 16 SYVSLEDGRALAAKYGLELPEDDTAAEAALRNGAVYVGLFESQMCGRRVSANQALAFPRTGVTLHGFPQPSNVIPPLVIQ 95 (169) T ss_pred ccccHHHHHHHHHHcCCcCCCChHHHHHHHHHHHHHhhhccccceeeeCCcccccccccCCceecccccccccchHHHHH Confidence 799999886332 32333346788899999999999974 33321 2345678999999 Q ss_pred HHHHHHHHHhhcCCCCCchHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCcee--------------eCCC--- Q lcl|NC_020477. 57 ITVDLARFFFAEDHYTSQKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFAT--------------TTDG--- 119 (130) Q Consensus 57 ~a~dIArY~L~~~~~~~~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~--------------~~~~--- 119 (130) .||.+|.+.+-......+.. ...++ .|+ -.|-++..-....+.++.. ..++ T Consensus 96 A~~elA~~~~~g~~~~~~~~-----~~~v~------~e~-v~G~i~veY~~~~~~~~~~~~~~~~~LL~p~l~~~~g~~~ 163 (169) T protein:vir:78 96 AQVMAAVEYGAGTDVRGSTD-----GREVQ------TER-VEGAVTVSYFKNGYSGGTVSITTADDALRPLLCGSNNAYS 163 (169) T ss_pred HHHHHHHHHhcCcccCCCCC-----cceeE------EEE-ecCceeEeecCCCCCCCcccHHHHHHHhhhhcccCCCcce Confidence 99999998775321111110 00111 011 0133332111000000000 0000 Q ss_pred CcccCcc Q lcl|NC_020477. 120 EQIFTLD 126 (130) Q Consensus 120 ~r~f~R~ 126 (130) -++| |. T Consensus 164 i~~~-rg 169 (169) T protein:vir:78 164 FNVF-RG 169 (169) T ss_pred eeee-cC Confidence 0111 11 No 45 >protein:vir:93592 Length: 108 # NCBI annotation: hypothetical protein # Family: family:all:1879 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449297;genbank:gi:157166045;uniprot:Q6H9U4;genbank:GeneID:5580414 Probab=35.44 E-value=1.2 Score=20.09 Aligned_cols=96 Identities=8% Similarity=0.109 Sum_probs=50.4 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhh-ccCC------C-cccchHHHHHHHHHHHHHhhcCCCC Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVA-YKTP------F-VKVPPIIHDITVDLARFFFAEDHYT 72 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~R-Y~lP------l-~~vP~~L~~~a~dIArY~L~~~~~~ 72 (130) |+.|.|++..-=|- ++..|++.|+..|.-|++.|-+||... +..+ . .++|+.++..++- -.-++|.+|-. T Consensus 3 ~~vtLeevK~hLRI-d~d~dD~li~~~i~aA~~~v~~~l~~~~~~~~~~~~~~~~~~~~~~i~~AvLl-Lv~~~YenRe~ 80 (108) T protein:vir:93 3 ALLTLEEIKAHLRV-DHDADDDMLMDKVRQATAVLLAYIQGSRDKVIREDGELIPGEALTRMKGAAMR-LTGMLYRNPDL 80 (108) T ss_pred cCCCHHHHHHHcCC-CCCcChHHHHHHHHHHHHHHHHHhccccccccccccccccccCChHHHHHHHH-HHHHHHhcccc Confidence 99999999643332 234588999999999999999999543 2221 1 1345555555444 44556665532 Q ss_pred CchHHHHHHH--HHHHHHHHHHHHHHHcCccc Q lcl|NC_020477. 73 SQKPNLDEYH--IKLKERIEKLLDDIISGVLV 102 (130) Q Consensus 73 ~~e~v~~rY~--~Aik~~~~a~L~~Ia~G~~~ 102 (130) .++...+-.+ -.++ +.|...++=.+- T Consensus 81 ~~~~~~~~~elP~~v~----~Ll~~~R~p~~~ 108 (108) T protein:vir:93 81 AEREELLQGELPFSVS----VLIYDLRCPTVL 108 (108) T ss_pred ccccccccccCCHHHH----HHHHHccccccC Confidence 2211000000 0111 223333322222 No 46 >protein:vir:3970 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663678;genbank:gi:21716115;genbank:GeneID:951203 Probab=32.87 E-value=1.4 Score=19.79 Aligned_cols=96 Identities=19% Similarity=0.252 Sum_probs=60.5 Q ss_pred CCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc--CCCCC------ Q lcl|NC_020477. 2 YAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE--DHYTS------ 73 (130) Q Consensus 2 Y~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~--~~~~~------ 73 (130) -.+.+++.+. .|..+++.++..|++|...|=.||+. ....+|+-|..+.+++|..+.-. +++.. T Consensus 1 M~iL~~vK~~----lgi~~D~lL~~li~~a~~~i~~~l~~----~~~~iP~~l~~iv~evav~ryNR~g~EG~~S~SeeG 72 (110) T protein:vir:39 1 MAITDDLKKL----LGGSSDERLEVIEKRTRERLLLILSS----NIKEVPPELEYVVLDVSLKRFNRIGQEGMQSYSQEG 72 (110) T ss_pred CchHHHHHHh----cCCChhHHHHHHHHHHHHHHHHHhCC----ChhhhhhHHHHHHHHHHHHHhccccccccceeecCC Confidence 2344555432 34456789999999999999999873 24568999999999998877643 22111 Q ss_pred -----chHHHHHHHHHHHHHHHHHHHH-HHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 74 -----QKPNLDEYHIKLKERIEKLLDD-IISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 74 -----~e~v~~rY~~Aik~~~~a~L~~-Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) .+..-+.|.+-|. +|++. ...|.-.+ +.-++| T Consensus 73 ~S~sf~~~d~~~y~~~l~----~y~~~~~~~~~~~~--------------g~~~f~ 110 (110) T protein:vir:39 73 LSMTFSESDFDEYADEIE----SWRKSKETEGDKKI--------------GRFRLY 110 (110) T ss_pred eeeeecccCcchhHHHHH----HHhhhccccccCcc--------------eeeeeC Confidence 1334567877776 47644 23333322 234666 No 47 >protein:vir:99796 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004309;genbank:gi:122891763;genbank:GeneID:4712351 Probab=31.29 E-value=1.5 Score=19.60 Aligned_cols=97 Identities=13% Similarity=0.194 Sum_probs=59.0 Q ss_pred CCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc--CCCCC------ Q lcl|NC_020477. 2 YAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE--DHYTS------ 73 (130) Q Consensus 2 Y~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~--~~~~~------ 73 (130) -.+.+++.+.-. .+....++.++..|++|...|-.||+ +-...+|.-|..+.+++|..+.-. +++.. T Consensus 1 M~~L~~vK~~lg-I~d~~~D~lL~~ii~~a~~~i~~~l~----~~~~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:99 1 MTTLADVKKRIG-LKDEKQDEQLEEIIKSCESQLLSMLP----IEVEQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhC-CCCCchhHHHHHHHHHHHHHHHHHhc----cchhhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 355666653322 22344567899999999999999996 112468999999999998777642 22211 Q ss_pred -----chHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 74 -----QKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 74 -----~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) .+..-+.|.+-|. +|++. .|+.. .|. .+++ T Consensus 76 ~S~sf~d~d~~~y~~~l~----~y~~~--~~~~~--------kG~------v~Fl 110 (110) T protein:vir:99 76 RSNAYELNDFKEYEAIID----NYFNA--RTRTK--------KGR------AVFF 110 (110) T ss_pred eeeeecccccchHHHHHH----HHHhh--cCCCC--------Cce------eeeC Confidence 1345677888886 57653 12211 112 2333 No 48 >protein:vir:97145 Length: 110 # NCBI annotation: ORF049 # Family: family:all:372 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239728;genbank:gi:66394913;genbank:GeneID:5130878 Probab=31.29 E-value=1.5 Score=19.60 Aligned_cols=97 Identities=13% Similarity=0.194 Sum_probs=59.0 Q ss_pred CCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc--CCCCC------ Q lcl|NC_020477. 2 YAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE--DHYTS------ 73 (130) Q Consensus 2 Y~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~--~~~~~------ 73 (130) -.+.+++.+.-. .+....++.++..|++|...|-.||+ +-...+|.-|..+.+++|..+.-. +++.. T Consensus 1 M~~L~~vK~~lg-I~d~~~D~lL~~ii~~a~~~i~~~l~----~~~~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:97 1 MTTLADVKKRIG-LKDEKQDEQLEEIIKSCESQLLSMLP----IEVEQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhC-CCCCchhHHHHHHHHHHHHHHHHHhc----cchhhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 355666653322 22344567899999999999999996 112468999999999998777642 22211 Q ss_pred -----chHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 74 -----QKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 74 -----~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) .+..-+.|.+-|. +|++. .|+.. .|. .+++ T Consensus 76 ~S~sf~d~d~~~y~~~l~----~y~~~--~~~~~--------kG~------v~Fl 110 (110) T protein:vir:97 76 RSNAYELNDFKEYEAIID----NYFNA--RTRTK--------KGR------AVFF 110 (110) T ss_pred eeeeecccccchHHHHHH----HHHhh--cCCCC--------Cce------eeeC Confidence 1345677888886 57653 12211 112 2333 No 49 >protein:vir:96221 Length: 110 # NCBI annotation: ORF044 # Family: family:all:372 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239573;genbank:gi:66395333;genbank:GeneID:5132767 Probab=31.29 E-value=1.5 Score=19.60 Aligned_cols=97 Identities=13% Similarity=0.194 Sum_probs=59.0 Q ss_pred CCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc--CCCCC------ Q lcl|NC_020477. 2 YAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE--DHYTS------ 73 (130) Q Consensus 2 Y~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~--~~~~~------ 73 (130) -.+.+++.+.-. .+....++.++..|++|...|-.||+ +-...+|.-|..+.+++|..+.-. +++.. T Consensus 1 M~~L~~vK~~lg-I~d~~~D~lL~~ii~~a~~~i~~~l~----~~~~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:96 1 MTTLADVKKRIG-LKDEKQDEQLEEIIKSCESQLLSMLP----IEVEQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhC-CCCCchhHHHHHHHHHHHHHHHHHhc----cchhhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 355666653322 22344567899999999999999996 112468999999999998777642 22211 Q ss_pred -----chHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 74 -----QKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 74 -----~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) .+..-+.|.+-|. +|++. .|+.. .|. .+++ T Consensus 76 ~S~sf~d~d~~~y~~~l~----~y~~~--~~~~~--------kG~------v~Fl 110 (110) T protein:vir:96 76 RSNAYELNDFKEYEAIID----NYFNA--RTRTK--------KGR------AVFF 110 (110) T ss_pred eeeeecccccchHHHHHH----HHHhh--cCCCC--------Cce------eeeC Confidence 1345677888886 57653 12211 112 2333 No 50 >protein:vir:96390 Length: 110 # NCBI annotation: ORF048 # Family: family:all:372 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239650;genbank:gi:66395410;genbank:GeneID:5132866 Probab=31.29 E-value=1.5 Score=19.60 Aligned_cols=97 Identities=13% Similarity=0.194 Sum_probs=59.0 Q ss_pred CCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc--CCCCC------ Q lcl|NC_020477. 2 YAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE--DHYTS------ 73 (130) Q Consensus 2 Y~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~--~~~~~------ 73 (130) -.+.+++.+.-. .+....++.++..|++|...|-.||+ +-...+|.-|..+.+++|..+.-. +++.. T Consensus 1 M~~L~~vK~~lg-I~d~~~D~lL~~ii~~a~~~i~~~l~----~~~~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:96 1 MTTLADVKKRIG-LKDEKQDEQLEEIIKSCESQLLSMLP----IEVEQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhC-CCCCchhHHHHHHHHHHHHHHHHHhc----cchhhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 355666653322 22344567899999999999999996 112468999999999998777642 22211 Q ss_pred -----chHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 74 -----QKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 74 -----~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) .+..-+.|.+-|. +|++. .|+.. .|. .+++ T Consensus 76 ~S~sf~d~d~~~y~~~l~----~y~~~--~~~~~--------kG~------v~Fl 110 (110) T protein:vir:96 76 RSNAYELNDFKEYEAIID----NYFNA--RTRTK--------KGR------AVFF 110 (110) T ss_pred eeeeecccccchHHHHHH----HHHhh--cCCCC--------Cce------eeeC Confidence 1345677888886 57653 12211 112 2333 No 51 >protein:vir:9311 Length: 110 # NCBI annotation: phi Mu50B-like protein # Family: family:all:372 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803289;genbank:gi:29028599;genbank:GeneID:1258047 Probab=31.29 E-value=1.5 Score=19.60 Aligned_cols=97 Identities=13% Similarity=0.194 Sum_probs=59.0 Q ss_pred CCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc--CCCCC------ Q lcl|NC_020477. 2 YAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE--DHYTS------ 73 (130) Q Consensus 2 Y~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~--~~~~~------ 73 (130) -.+.+++.+.-. .+....++.++..|++|...|-.||+ +-...+|.-|..+.+++|..+.-. +++.. T Consensus 1 M~~L~~vK~~lg-I~d~~~D~lL~~ii~~a~~~i~~~l~----~~~~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:93 1 MTTLADVKKRIG-LKDEKQDEQLEEIIKSCESQLLSMLP----IEVEQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhC-CCCCchhHHHHHHHHHHHHHHHHHhc----cchhhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 355666653322 22344567899999999999999996 112468999999999998777642 22211 Q ss_pred -----chHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 74 -----QKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 74 -----~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) .+..-+.|.+-|. +|++. .|+.. .|. .+++ T Consensus 76 ~S~sf~d~d~~~y~~~l~----~y~~~--~~~~~--------kG~------v~Fl 110 (110) T protein:vir:93 76 RSNAYELNDFKEYEAIID----NYFNA--RTRTK--------KGR------AVFF 110 (110) T ss_pred eeeeecccccchHHHHHH----HHHhh--cCCCC--------Cce------eeeC Confidence 1345677888886 57653 12211 112 2333 No 52 >protein:vir:78849 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285363;genbank:gi:148717891;genbank:GeneID:5246980 Probab=31.29 E-value=1.5 Score=19.60 Aligned_cols=97 Identities=13% Similarity=0.194 Sum_probs=59.0 Q ss_pred CCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc--CCCCC------ Q lcl|NC_020477. 2 YAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE--DHYTS------ 73 (130) Q Consensus 2 Y~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~--~~~~~------ 73 (130) -.+.+++.+.-. .+....++.++..|++|...|-.||+ +-...+|.-|..+.+++|..+.-. +++.. T Consensus 1 M~~L~~vK~~lg-I~d~~~D~lL~~ii~~a~~~i~~~l~----~~~~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:78 1 MTTLADVKKRIG-LKDEKQDEQLEEIIKSCESQLLSMLP----IEVEQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhC-CCCCchhHHHHHHHHHHHHHHHHHhc----cchhhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 355666653322 22344567899999999999999996 112468999999999998777642 22211 Q ss_pred -----chHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 74 -----QKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 74 -----~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) .+..-+.|.+-|. +|++. .|+.. .|. .+++ T Consensus 76 ~S~sf~d~d~~~y~~~l~----~y~~~--~~~~~--------kG~------v~Fl 110 (110) T protein:vir:78 76 RSNAYELNDFKEYEAIID----NYFNA--RTRTK--------KGR------AVFF 110 (110) T ss_pred eeeeecccccchHHHHHH----HHHhh--cCCCC--------Cce------eeeC Confidence 1345677888886 57653 12211 112 2333 No 53 >protein:vir:103957 Length: 110 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873994;genbank:gi:118430769;genbank:GeneID:4525451 Probab=31.29 E-value=1.5 Score=19.60 Aligned_cols=97 Identities=13% Similarity=0.194 Sum_probs=59.0 Q ss_pred CCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc--CCCCC------ Q lcl|NC_020477. 2 YAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE--DHYTS------ 73 (130) Q Consensus 2 Y~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~--~~~~~------ 73 (130) -.+.+++.+.-. .+....++.++..|++|...|-.||+ +-...+|.-|..+.+++|..+.-. +++.. T Consensus 1 M~~L~~vK~~lg-I~d~~~D~lL~~ii~~a~~~i~~~l~----~~~~~iP~~l~~iv~ev~vkryNR~g~EG~~S~S~eG 75 (110) T protein:vir:10 1 MTTLADVKKRIG-LKDEKQDEQLEEIIKSCESQLLSMLP----IEVEQIPERFSYMIKEVAVKRYNRIGAEGMTSEAVDG 75 (110) T ss_pred CchHHHHHHHhC-CCCCchhHHHHHHHHHHHHHHHHHhc----cchhhhhhHHHHHHHHHHHHHhcccCccccceeecCc Confidence 355666653322 22344567899999999999999996 112468999999999998777642 22211 Q ss_pred -----chHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 74 -----QKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 74 -----~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) .+..-+.|.+-|. +|++. .|+.. .|. .+++ T Consensus 76 ~S~sf~d~d~~~y~~~l~----~y~~~--~~~~~--------kG~------v~Fl 110 (110) T protein:vir:10 76 RSNAYELNDFKEYEAIID----NYFNA--RTRTK--------KGR------AVFF 110 (110) T ss_pred eeeeecccccchHHHHHH----HHHhh--cCCCC--------Cce------eeeC Confidence 1345677888886 57653 12211 112 2333 No 54 >protein:vir:100245 Length: 113 # NCBI annotation: gp74 # Family: family:all:363 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355410;genbank:gi:77864700;genbank:GeneID:3725967 Probab=29.54 E-value=1.7 Score=19.39 Aligned_cols=93 Identities=13% Similarity=0.033 Sum_probs=52.4 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhc-cCCC---------------cccchHHHHHHHHHHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAY-KTPF---------------VKVPPIIHDITVDLARF 64 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY-~lPl---------------~~vP~~L~~~a~dIArY 64 (130) |+.|.+++..--+-. +..|++.|+.-|.-|++.|-.||+.++ ..+. .++|+.++..+. +-.- T Consensus 2 ~~vtLee~K~hLRvd-~d~dD~lI~~li~AA~~~ve~~l~r~l~~~~~~~~~~~~~~~~~~~~~~~p~~i~~AvL-llv~ 79 (113) T protein:vir:10 2 ALVELKLALGFVRAN-AGVEDDVVQMLLDAATQSAVDYLNRQVFETEDAMTTAIEAGTAGQNPMVVNAAIRAAIL-KITA 79 (113) T ss_pred CCCCHHHHHHHcCCC-CCcchHHHHHHHHHHHHHHHHHhCccccccccccccccccccccccccccChHHHHHHH-HHHH Confidence 999999987443322 335889999999999999999998663 2211 135766665444 4455 Q ss_pred HhhcCCCCCch-HHHHHHHHHHHHHHHHHHHHHH--cCc Q lcl|NC_020477. 65 FFAEDHYTSQK-PNLDEYHIKLKERIEKLLDDII--SGV 100 (130) Q Consensus 65 ~L~~~~~~~~e-~v~~rY~~Aik~~~~a~L~~Ia--~G~ 100 (130) ++|.+|-..++ ...+- --.++ ..|...+ -|. T Consensus 80 ~~Y~nRe~~~~~~~~~l-P~~v~----~Ll~~yR~~~g~ 113 (113) T protein:vir:10 80 ELYANREDTAFGPITEL-PLNAR----ALLRPHRIIPGV 113 (113) T ss_pred HHHhhhhhhchhhhhcc-CHHHH----HHHHHhhhhcCC Confidence 56665432221 11110 00111 2333332 244 No 55 >protein:vir:2738 Length: 112 # NCBI annotation: hypothetical protein # Family: family:all:372 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695111;genbank:gi:23455880;genbank:GeneID:955641 Probab=28.72 E-value=1.7 Score=19.29 Aligned_cols=97 Identities=15% Similarity=0.113 Sum_probs=56.4 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcC--CCCC----- Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAED--HYTS----- 73 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~--~~~~----- 73 (130) |=-+.+.+...-....|..|++.++..|.+|+..|=.||+ ...+|.-|..+.+++|..+.-.. ++.. T Consensus 1 ~~l~~~~~L~~iK~~lg~~dD~lL~~ii~~a~~~i~~~l~------~~~iP~~l~~Iv~evavkryNR~g~EG~~S~See 74 (112) T protein:vir:27 1 MTLDKDKVIKNVSVDLNTNDDALLKILLERVVNHFKSEYG------VEEIDDKLAFIFEDCVIKRFNRRGAEGAKSESVD 74 (112) T ss_pred CcchhHHHHHHHHhhcCCChhHHHHHHHHHHHHHHHHhcC------ccccchhHHHHHHHHHHHHhcccCccccceeecC Confidence 3222222222223345667789999999999999999885 46899999999999988876432 2111 Q ss_pred --------chHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 74 --------QKPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 74 --------~e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) .+.--.-|.+-|. +|++. .|+.. . +..+++ T Consensus 75 G~S~sf~d~~~df~~Y~~~l~----~~~~~--~~~~~--------~------G~v~Fl 112 (112) T protein:vir:27 75 GHSMSYYDNENEFKPYDDMLQ----RLYGT--SGQAK--------E------GEVLFL 112 (112) T ss_pred ceeeeecccccchhhhHHHHH----HHHhh--cCCCC--------C------ceeeeC Confidence 1123456777775 46542 12111 1 112222 No 56 >protein:vir:102158 Length: 99 # NCBI annotation: uncharacterized phage protein (possible DNA packaging) # Family: family:all:316 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699940;genbank:gi:110804046;genbank:GeneID:4206702 Probab=28.27 E-value=1.8 Score=19.23 Aligned_cols=97 Identities=15% Similarity=0.150 Sum_probs=51.9 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcCCCCCchH--HH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAEDHYTSQKP--NL 78 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~e~--v~ 78 (130) |..|.+++..--+= ++.-|++.|+.-|.-|...|.+|++.++ .+.++.++ .|+-+-.-++|.+|....+. .. T Consensus 1 M~vtLee~K~~LRI-D~d~dD~lI~~~i~aA~~~i~~~~~~~~----~~~~~~~k-~Avl~lv~~~YenR~~~~~~~~~~ 74 (99) T protein:vir:10 1 MILSVDEVKNYLRV-DYDEDDILIQDLIESAEDYLYNATGKKF----TEKNKLAK-RYCLALVYDWYKDKGMNIRATKNT 74 (99) T ss_pred CcCCHHHHHHHcCC-CCCcchHHHHHHHHHHHHHHHHhhCCCC----CCCChHHH-HHHHHHHHHhHhcchhhhhhhhcc Confidence 99999999733222 3556889999999999999999997554 34444544 44555556667665422211 00 Q ss_pred HHHHHHHHHHHHHHHHHHHcCccccCCCC Q lcl|NC_020477. 79 DEYHIKLKERIEKLLDDIISGVLVLDPDT 107 (130) Q Consensus 79 ~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~ 107 (130) ...+ -+..-+.+.|..++.+--. .+ T Consensus 75 ~~~~-~lp~~v~sli~qlr~~~~~---~~ 99 (99) T protein:vir:10 75 TVSE-KVKYTLQSILLQLKFCKEE---DT 99 (99) T ss_pred chhh-hhhHHHHHHHHHHhhccCC---CC Confidence 0000 0111112333333321110 00 No 57 >protein:vir:4904 Length: 113 # NCBI annotation: gp113 # Family: family:all:372 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056682;genbank:gi:9635017;genbank:GeneID:1262667 Probab=28.22 E-value=1.8 Score=19.23 Aligned_cols=96 Identities=18% Similarity=0.135 Sum_probs=54.1 Q ss_pred CCCHHHHHH--hcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhcC--CCCCc--- Q lcl|NC_020477. 2 YAIPDDLRL--VMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAED--HYTSQ--- 74 (130) Q Consensus 2 Y~t~edl~l--~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~~--~~~~~--- 74 (130) --+.++.+. .-....|.-|+..++..|.+|+..+=.||+ ...+|.-|..+.+++|..+.-.. ++..+ T Consensus 1 m~~l~~~~~L~~vK~~lgi~dD~lL~~li~~a~~~i~~~l~------~~~iP~~l~~Iv~evavkryNR~g~EG~~S~Se 74 (113) T protein:vir:49 1 MMALDKEKVIQNVSVDLNINDDNLLGILLERIVNHFKAEYG------VDEVDDNLAFIFEDCLVKRFNRRGAEGARSESI 74 (113) T ss_pred CcchhHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHHHHhC------ccccchHHHHHHHHHHHHHhcccCccccceeec Confidence 111111111 112234566788999999999999999986 36899999999999988876432 21111 Q ss_pred ----------hHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 75 ----------KPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 75 ----------e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) +.--.-|.+-|. +|++. .|+.. .+..+++ T Consensus 75 eG~S~sf~d~~~df~eY~~~l~----~~~~~--~~~~~--------------~G~v~Fl 113 (113) T protein:vir:49 75 DGHSMSYYDNENEFDPYDNMLQ----RLYGT--SGQAK--------------EGEVLFL 113 (113) T ss_pred CceeeeecccccccchhHHHHH----HHHhh--cCCCC--------------CcceeeC Confidence 122345677775 46542 12111 1122333 No 58 >protein:vir:1026 Length: 107 # NCBI annotation: Orf46 # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076680;genbank:gi:13095789;genbank:GeneID:920344 Probab=22.98 E-value=2.4 Score=18.53 Aligned_cols=103 Identities=16% Similarity=0.147 Sum_probs=58.3 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCC-C---cccchHHHHHHHHHHHHHhhcCCCCCchH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTP-F---VKVPPIIHDITVDLARFFFAEDHYTSQKP 76 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lP-l---~~vP~~L~~~a~dIArY~L~~~~~~~~e~ 76 (130) |=.|.++|...-+--. . |+..+..+|.-|.+.|-+.++.....| | +.+++...-.|.-+|-.+ |.+|.. +.+ T Consensus 1 M~vtld~iK~sLriD~-d-Dd~~l~~~l~aA~~YIk~Aig~d~~~~~Fy~~e~~~~lfd~Avl~La~~~-Y~nR~a-t~~ 76 (107) T protein:vir:10 1 MSVTVDDLLDQLSEDD-D-RKPQLQIYFDTATAYVKNAVSSDTVDAPFFNVENVSPIYDVAVLSYSMDL-WINRST-TMP 76 (107) T ss_pred CeecHHHHHHHhcCCC-C-chHHHHHHHHHHHHHHhhhcCcccccCCccccccchhHHHHHHHHHHHHH-hhcccc-eee Confidence 9999999976554432 2 889999999999999999999876554 4 357888888888888776 444432 222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCC Q lcl|NC_020477. 77 NLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTD 118 (130) Q Consensus 77 v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~ 118 (130) +----..-|.+ |+. .-..--+.....+ .++. T Consensus 77 vp~~v~siI~Q-----LRg----~y~~~~e~~~~~~--~~~~ 107 (107) T protein:vir:10 77 PTTAVDHMVGQ-----LRG----LYSSWKEAQDGQN--LQTE 107 (107) T ss_pred cchHHHHHHHH-----Hhh----hhcccccccCCCc--ccCC Confidence 21111122222 221 1111001111110 0011 No 59 >protein:vir:106583 Length: 105 # NCBI annotation: putative protein # Family: family:all:6481 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958586;genbank:gi:41179246;genbank:GeneID:2717119 Probab=21.39 E-value=2.6 Score=18.30 Aligned_cols=90 Identities=14% Similarity=0.068 Sum_probs=53.3 Q ss_pred CCCCHHHHHHhc--ccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc--CCCCCc-- Q lcl|NC_020477. 1 MYAIPDDLRLVM--NNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE--DHYTSQ-- 74 (130) Q Consensus 1 MY~t~edl~l~d--~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~--~~~~~~-- 74 (130) |---.+.++..- --.++..+++.++..|++|.+.|=+|+.. ..+|+.|..+++++|..+.-. +++..+ T Consensus 1 ~~~~~~~~e~ik~L~~~~d~~~DelL~~lieda~~~vl~y~nr------~~ip~~l~~~v~evav~~fNR~G~EG~tS~S 74 (105) T protein:vir:10 1 MLNVDQLTEIVSALSTRLENVNNALLTELVKESIAQVLDYTGQ------KKLVGSMDIYVKKLAVINYNRLGIEGETQRS 74 (105) T ss_pred CCchHHHHHHHHHHhccCCCchhHHHHHHHHHHHHHHHHHcCC------cccchhHHHHHHHHHHHHhcccCCcccceee Confidence 544444443111 12346788899999999999999999863 477899999999988777643 222111 Q ss_pred ---------hHHHHHHHHHHHHHHHHHHHHHHcCcc Q lcl|NC_020477. 75 ---------KPNLDEYHIKLKERIEKLLDDIISGVL 101 (130) Q Consensus 75 ---------e~v~~rY~~Aik~~~~a~L~~Ia~G~~ 101 (130) ..+-+-|.+.|++. .+-.-|+. T Consensus 75 egGvS~sy~~~~~~~~~~~l~~y-----R~~~v~~~ 105 (105) T protein:vir:10 75 EGGITNYLETGIPKDIRQGLNSY-----RIAKVKKL 105 (105) T ss_pred cCCeeeeeeccCcHHHHHHHHHH-----hhhcccCC Confidence 12334455545421 11122222 No 60 >protein:vir:1887 Length: 108 # NCBI annotation: gp6 # Family: family:all:6764 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037667;genbank:gi:9634125;genbank:GeneID:1262472 Probab=20.70 E-value=2.7 Score=18.20 Aligned_cols=98 Identities=14% Similarity=0.107 Sum_probs=55.1 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhh-ccCCCcccchHHHHHHHHHHHHHhhcCCCCCc-hHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVA-YKTPFVKVPPIIHDITVDLARFFFAEDHYTSQ-KPNL 78 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~R-Y~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~-e~v~ 78 (130) ++.|.+++..--+- +...|++.|+.-|.-|++.|-+|+... |..| ..+|+.++. |+=+-.-++|.+|-..+ .+.. T Consensus 7 ~~vtLee~K~hLRi-d~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~-~~~p~~ik~-AiLllv~~~YenRE~~~~~~~~ 83 (108) T protein:vir:18 7 DVISLSLFKQQIEF-EEDDRDELITLYAQAAFDYCMRWCDEPAWKVA-ADIPAAVKG-AVLLVFADMFEHRTAQSEVQLY 83 (108) T ss_pred cccCHHHHHHHcCC-CCCcchHHHHHHHHHHHHHHHHHhCCcccccc-cccchHHHH-HHHHHHHHHHhcccccccchhh Confidence 88999998644332 355789999999999999999999754 3333 356777664 55555556676653222 2211 Q ss_pred HHHHHHHHHHHHHHHH--HHHcCccccCCCCCCCCCc Q lcl|NC_020477. 79 DEYHIKLKERIEKLLD--DIISGVLVLDPDTKLPAGF 113 (130) Q Consensus 79 ~rY~~Aik~~~~a~L~--~Ia~G~~~L~~~~~~~~~~ 113 (130) .- .+++ .+|. +-=.|+.. ..+|+ T Consensus 84 ~~--~~~~----~LL~pYR~~~g~~~------~~~~~ 108 (108) T protein:vir:18 84 EN--AAAE----RMMFIHRNWRGKAE------SEEGS 108 (108) T ss_pred hh--HHHH----HHHHHHHhcCCCCC------cccCC Confidence 10 1222 2333 22234331 11222 No 61 >protein:vir:192 Length: 108 # NCBI annotation: Gp6 # Family: family:all:6764 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037702;genbank:gi:9634167;genbank:GeneID:1262532 Probab=20.70 E-value=2.7 Score=18.20 Aligned_cols=98 Identities=14% Similarity=0.107 Sum_probs=55.1 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhh-ccCCCcccchHHHHHHHHHHHHHhhcCCCCCc-hHHH Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVA-YKTPFVKVPPIIHDITVDLARFFFAEDHYTSQ-KPNL 78 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~R-Y~lPl~~vP~~L~~~a~dIArY~L~~~~~~~~-e~v~ 78 (130) ++.|.+++..--+- +...|++.|+.-|.-|++.|-+|+... |..| ..+|+.++. |+=+-.-++|.+|-..+ .+.. T Consensus 7 ~~vtLee~K~hLRi-d~dddD~lI~~~i~AA~~~v~~~~~~~~~~~~-~~~p~~ik~-AiLllv~~~YenRE~~~~~~~~ 83 (108) T protein:vir:19 7 DVISLSLFKQQIEF-EEDDRDELITLYAQAAFDYCMRWCDEPAWKVA-ADIPAAVKG-AVLLVFADMFEHRTAQSEVQLY 83 (108) T ss_pred cccCHHHHHHHcCC-CCCcchHHHHHHHHHHHHHHHHHhCCcccccc-cccchHHHH-HHHHHHHHHHhcccccccchhh Confidence 88999998644332 355789999999999999999999754 3333 356777664 55555556676653222 2211 Q ss_pred HHHHHHHHHHHHHHHH--HHHcCccccCCCCCCCCCc Q lcl|NC_020477. 79 DEYHIKLKERIEKLLD--DIISGVLVLDPDTKLPAGF 113 (130) Q Consensus 79 ~rY~~Aik~~~~a~L~--~Ia~G~~~L~~~~~~~~~~ 113 (130) .- .+++ .+|. +-=.|+.. ..+|+ T Consensus 84 ~~--~~~~----~LL~pYR~~~g~~~------~~~~~ 108 (108) T protein:vir:19 84 EN--AAAE----RMMFIHRNWRGKAE------SEEGS 108 (108) T ss_pred hh--HHHH----HHHHHHHhcCCCCC------cccCC Confidence 10 1222 2333 22234331 11222 No 62 >protein:vir:4954 Length: 104 # NCBI annotation: putative DNA packaging protein # Family: family:all:734 # ACLAME annotation(s): phi:0000018 - phage genome packaging # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049930;genbank:gi:9632901;genbank:GeneID:1262077 Probab=20.64 E-value=2.7 Score=18.19 Aligned_cols=99 Identities=14% Similarity=0.062 Sum_probs=51.2 Q ss_pred CCCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCC-CcccchHHHHHHHHHHHHHhhcCCCCCc----h Q lcl|NC_020477. 1 MYAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTP-FVKVPPIIHDITVDLARFFFAEDHYTSQ----K 75 (130) Q Consensus 1 MY~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lP-l~~vP~~L~~~a~dIArY~L~~~~~~~~----e 75 (130) |-.|.+++..--|- +..-|++.|+..|.-|...|.++++...... ...+|+..+. ||-+-.=++|.+|...+ . T Consensus 1 M~vtLeeiK~~LRI-D~dddD~li~~~i~aA~~yi~~aig~~~~~~~~~~~~~~~~~-Avl~Lv~~~YeNR~~~~~~~~~ 78 (104) T protein:vir:49 1 MSVSKTSIMQTLNL-DETDDTALIPAYIESAKQYIINAVGSDSKFYDLDSVRALFDT-AVIALTSSYFTYRVALTDTATY 78 (104) T ss_pred CcccHHHHHHHcCC-CCccchHHHHHHHHHHHHHHHHhhCCCCccccccCCChHHHH-HHHHHHHHHHhhchhccccccc Confidence 99999999643332 3445888999999999999999998653311 1345655554 44444445565553222 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHcCccc Q lcl|NC_020477. 76 PNLDEYHIKLKERIEKLLDDIISGVLV 102 (130) Q Consensus 76 ~v~~rY~~Aik~~~~a~L~~Ia~G~~~ 102 (130) ++.---+.-|.++ +........+.-. T Consensus 79 ~vp~~v~sli~qL-r~~y~~~~e~~~~ 104 (104) T protein:vir:49 79 PVNLTLNSIIGQL-RGLYATYSEERGD 104 (104) T ss_pred hhhHHHHHHHHHH-HHhhhhhhhccCC Confidence 2222222222211 0000111110000 No 63 >protein:vir:9877 Length: 114 # NCBI annotation: hypothetical protein # Family: family:all:2716 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795639;genbank:gi:28876402;genbank:GeneID:1257933 Probab=20.62 E-value=2.7 Score=18.18 Aligned_cols=94 Identities=22% Similarity=0.321 Sum_probs=56.4 Q ss_pred CCCCHH----HHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHH--HHHhhcCCCCCc Q lcl|NC_020477. 1 MYAIPD----DLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLA--RFFFAEDHYTSQ 74 (130) Q Consensus 1 MY~t~e----dl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIA--rY~L~~~~~~~~ 74 (130) |=.|-+ ++.+.-.- +....++.++..|.+|+..|-.||+ ...+|.-|.-+..++| ||+=..+++..+ T Consensus 1 m~~~~~~~L~~vK~~Lgi-~d~~~D~lL~~ii~~~~~~i~~~l~------~~~iP~~L~~Iv~ev~vkryNR~g~EG~~S 73 (114) T protein:vir:98 1 MDETKQAIIDRVRVRLAD-ETSLKEELLEELTQTAIDRINLKVG------DVVFNPLFNSIAVDVVVKMYRRMYFEGIDT 73 (114) T ss_pred CchhHHHHHHHHHHHhCC-CCCchhhHHHHHHHHHHHHHHHhhC------ccccchHHHHHHHHHHHHHhcccCccccce Confidence 776643 33322222 2334458899999999999999996 3678888877776655 555333332111 Q ss_pred -----------hHHHHHHHHHHHHHHHHHHHH---HHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 75 -----------KPNLDEYHIKLKERIEKLLDD---IISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 75 -----------e~v~~rY~~Aik~~~~a~L~~---Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) +..-..|.+-|. +|++. ..+|++. |++ T Consensus 74 ~S~eG~S~tf~dndf~ey~~~l~----~y~~~~~~~~~g~~v------------------~Fl 114 (114) T protein:vir:98 74 EKADTISTKFIENVLAEYGEELA----SYKKDRLAILNKKVV------------------RFL 114 (114) T ss_pred eeccceeeeeeccccchhHHHHH----HHHhhhhhhhcCcee------------------ecC Confidence 344677888886 58773 3344421 111 No 64 >protein:vir:3615 Length: 110 # NCBI annotation: ORF38 # Family: family:all:372 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112701;genbank:gi:13786569;genbank:GeneID:921067 Probab=20.43 E-value=2.8 Score=18.15 Aligned_cols=97 Identities=19% Similarity=0.256 Sum_probs=59.6 Q ss_pred CCCHHHHHHhcccCCCCcCHHHHHHHHHHHHHHHHHHHhhhccCCCcccchHHHHHHHHHHHHHhhc--CCCCCc----- Q lcl|NC_020477. 2 YAIPDDLRLVMNNLSKQVTDELLAKYIQEASNYIDARLGVAYKTPFVKVPPIIHDITVDLARFFFAE--DHYTSQ----- 74 (130) Q Consensus 2 Y~t~edl~l~d~~~~g~~d~~~i~~Al~dAs~~IDgyL~~RY~lPl~~vP~~L~~~a~dIArY~L~~--~~~~~~----- 74 (130) -.+.+.+.+. .|..+++.++..|++|...|=.||+. ....+|.-|..+.+++|..+.-. +++..+ T Consensus 1 M~~L~~vK~~----lg~~~D~lL~~li~~a~~~i~~~~~~----~~~eiP~~l~~iv~evav~ryNR~g~EG~~S~SeeG 72 (110) T protein:vir:36 1 MAITDDLKML----LGGSLDERLEVIEKRTRDRLLLILGS----DIKEVPPELEYVVLDVSLKRFNRIGQEGMQSYSQEG 72 (110) T ss_pred ChhHHHHHhh----cCCChhHHHHHHHHHHHHHHHHHhCC----ChhhhhhHHHHHHHHHHHHHhccccccccceeecCC Confidence 3444444432 34557789999999999999999874 34578999999999998877643 222111 Q ss_pred ------hHHHHHHHHHHHHHHHHHHHHHHcCccccCCCCCCCCCceeeCCCCccc Q lcl|NC_020477. 75 ------KPNLDEYHIKLKERIEKLLDDIISGVLVLDPDTKLPAGFATTTDGEQIF 123 (130) Q Consensus 75 ------e~v~~rY~~Aik~~~~a~L~~Ia~G~~~L~~~~~~~~~~~~~~~~~r~f 123 (130) +..-+.|.+-|. +|++.=..+. ....+..++| T Consensus 73 ~S~sf~~~d~~~y~~~l~----~y~~~~~~~~-------------~~~~g~~~f~ 110 (110) T protein:vir:36 73 LSMTFSESDFDEYADEIE----SWRKSRETEG-------------DKKIGRFRLY 110 (110) T ss_pred ceeeecccCcchHHHHHH----HHHhhhcccc-------------CCcceeeeeC Confidence 233467777775 4765321110 0112235666 Done!