GENSCAN 1.0    Date run: 5-Nov-116    Time: 00:50:09

Sequence gi568815586r:54192377_54395248 : 202872 bp : 47.92% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   8084   8265  182  2  2   77   50   130 0.743   7.61
 1.02 Intr +  13859  13923   65  0  2   93   47    39 0.203  -1.26
 1.03 Intr +  17936  17985   50  1  2  102  110     5 0.505   1.58
 1.04 Intr +  19156  19327  172  1  1   96   98    47 0.875   6.45
 1.05 Term +  26190  26261   72  2  0   90   55    49 0.324  -0.29
 1.06 PlyA +  26593  26598    6                               1.05

 2.07 PlyA -  29181  29176    6                               1.05
 2.06 Term -  49529  49379  151  1  1  126   43   134 0.996  10.08
 2.05 Intr -  53839  53739  101  1  2   81   81    42 0.758   1.71
 2.04 Intr -  59851  59665  187  0  1  112   99    80 0.999  11.19
 2.03 Intr -  65316  65138  179  0  2  121   80   175 0.994  18.72
 2.02 Intr -  67297  67133  165  1  0  117   94   -12 0.767   2.46
 2.01 Init -  71778  71728   51  0  0   57   23    67 0.339  -1.74
 2.00 Prom -  75864  75825   40                              -8.86

 3.00 Prom +  77859  77898   40                              -5.26
 3.01 Init +  88432  88446   15  0  0   83  105    -2 0.834   1.28
 3.02 Intr +  89010  89126  117  2  0   90   83   110 0.999  11.36
 3.03 Intr +  89419  89565  147  0  0   48   81   123 0.990   8.03
 3.04 Intr +  89714  89924  211  1  1   98   89   231 0.991  22.69
 3.05 Intr +  90018  90110   93  1  0   85   69    47 0.844   2.44
 3.06 Intr +  90197  90289   93  0  0   61   68    91 0.947   4.34
 3.07 Intr +  90424  90498   75  2  0  104  115    85 0.999  12.29
 3.08 Intr +  90703  90858  156  2  0   52   66   152 0.942   9.38
 3.09 Intr +  91436  91591  156  0  0   75   86    94 0.982   7.88
 3.10 Term +  94577  94590   14  0  2  113   49     9 0.649  -2.14
 3.11 PlyA +  94692  94697    6                               1.05

 4.04 PlyA -  95723  95718    6                               1.05
 4.03 Term - 101005  99998 1008  1  0  122   49  1028 0.999  94.40
 4.02 Intr - 102851 102759   93  2  0   76  109    66 0.681   7.66
 4.01 Init - 104643 104560   84  0  0   68   98    60 0.517   5.82
 4.00 Prom - 107040 107001   40                              -9.16

 5.00 Prom + 111323 111362   40                              -2.06
 5.01 Init + 131429 131473   45  1  0   64   89    21 0.416  -1.46
 5.02 Intr + 132707 132805   99  1  0   54   54   126 0.623   6.11
 5.03 Intr + 148171 148239   69  0  0   90  100    84 0.701   9.18
 5.04 Intr + 149830 149911   82  0  1   71   48   162 0.954   9.71
 5.05 Intr + 150849 150940   92  1  2  106   78    81 0.994   8.61
 5.06 Intr + 153084 153139   56  2  2  106   83    28 0.913   1.88
 5.07 Intr + 155391 155468   78  0  0   41  110   100 0.928   6.17
 5.08 Intr + 155624 155675   52  2  1  117   98    19 0.957   4.71
 5.09 Intr + 157244 157282   39  1  0   99   80    44 0.847   3.12
 5.10 Term + 158100 158147   48  2  0  120   49    44 0.711   1.00
 5.11 PlyA + 159451 159456    6                               1.05

 6.02 PlyA - 159984 159979    6                               1.05
 6.01 Sngl - 171475 170285 1191  1  0   49   42  1079 0.967  95.31
 6.00 Prom - 175159 175120   40                              -6.56

 7.11 PlyA - 176787 176782    6                               1.05
 7.10 Term - 178110 177880  231  0  0  110   43   167 0.991  10.87
 7.09 Intr - 178345 178250   96  1  0   75   77   103 0.978   8.11
 7.08 Intr - 178720 178551  170  2  2   98   94   140 0.995  15.27
 7.07 Intr - 179266 179097  170  0  2   57   86   115 0.859   7.79
 7.06 Intr - 179744 179653   92  2  2  127   28    12 0.824  -2.11
 7.05 Intr - 181495 181470   26  2  2  107  113     3 0.762   2.44
 7.04 Intr - 181759 181597  163  1  1   58   89    45 0.824   1.15
 7.03 Intr - 183578 183468  111  2  0   57  110    52 0.775   4.88
 7.02 Intr - 192188 192052  137  0  2  101  103   129 0.898  15.99
 7.01 Init - 198868 198859   10  1  1   80  109    23 0.297   2.97

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:54192377_54395248|GENSCAN_predicted_peptide_1|180_aa XVPSALKTEEANDISCPTWKVKVQKTEGTRLLHILRSVISCRKKHPTCRKKHSTRMTQSL PSLLLKKARLRSWEGRIELEKGICPSGAYSEVVLGAFIRGAVDRVFGNSSRRKILKLEPT PNLRKEKVLSRMGSVEKKARKGIPPIPARQKHKVNSKPYLLHHGQTQAHNSQRFGYTKLL >gi568815586r:54192377_54395248|GENSCAN_predicted_CDS_1|543_bp nnggtcccctcagcattgaagacagaagaggcaaatgacatttcatgtcccacttggaaa gtcaaggtgcagaaaacagaggggaccagactccttcatatcctgcgcagcgtcatctcc tgccgcaagaagcaccccacctgccgcaagaagcactccacccgcatgactcagtctttg cccagtctcctgctgaagaaagccaggctcaggtcatgggaaggtcggattgagctggag aaagggatatgcccatcaggtgcttactcagaggtggtgctgggtgctttcataagaggg gcagtggacagagtctttggaaatagctcgaggagaaagattctgaagctggagcccact ccaaatctcaggaaagagaaggtcctgagtaggatggggagtgtggaaaagaaggcccgg aagggaataccacccatcccagcaagacaaaaacataaagtcaattcaaagccttatctt ctgcatcatggccagactcaggcacacaacagccagcgcttcggttacacaaaactgctc tga >gi568815586r:54192377_54395248|GENSCAN_predicted_peptide_2|277_aa MRVEPELDKADRYQGPYCSTAASCYGNGLGQLRSGAQAAGWGQGSCQPAKTQTWEGRPLL LAQRERKNWGDPGPGGLSLSGGTVCDMGKKTKRTADSSSSEDEEEYVVEKVLDRRVVKGQ VEYLLKWKGFSEEHNTWEPEKNLDCPELISEFMKKYKKMKEGENNKPREKSESNKRKSNF SNSADDIKSKKKREQSNDIARGFERGLEPEKIIGATDSCGDLMFLMKWKDTDEADLVLAK EANVKCPQIVIAFYEERLTWHAYPEDAENKEKETAKS >gi568815586r:54192377_54395248|GENSCAN_predicted_CDS_2|834_bp atgagagtagaacctgaactggacaaagcagacagataccagggaccctattgtagtact gctgccagttgctatggcaacggactgggacagctgcgcagtggcgcccaggcagctggg tgggggcaggggagctgccagcccgccaagacccagacctgggaggggaggcccctcctg ttagcccagagggaaagaaagaactggggcgatccgggacctggtggccttagtctttca ggtggaacggtgtgcgacatgggaaagaaaaccaagcggacagctgacagttcttcttca gaggatgaggaggagtatgttgtggagaaggtgctagacaggcgcgtggttaagggacaa gtggaatatctactgaagtggaaaggcttttctgaggagcacaatacttgggaacctgag aaaaacttggattgccctgagctaatttctgaatttatgaaaaagtataagaagatgaag gagggtgaaaataataaacccagggagaagtcagaaagtaacaagaggaaatccaatttc 
tcaaacagtgccgatgacatcaaatctaaaaaaaagagagagcagagcaatgatatcgct cggggctttgagagaggactggaaccagaaaagatcattggggcaacagattcctgtggt gatttaatgttcctaatgaaatggaaagacacagatgaagctgacctggttcttgcaaaa gaagctaatgtgaaatgtccacaaattgtgatagcattttatgaagagagactgacatgg catgcatatcctgaggatgcggaaaacaaagagaaagaaacagcaaagagctaa >gi568815586r:54192377_54395248|GENSCAN_predicted_peptide_3|358_aa MSKSESPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCVVMRDPNTKRSRGFGFV TYATVEEVDAAMNARPHKVDGRVVEPKRAVSREDSQRPGAHLTVKKIFVGGIKEDTEEHH LRDYFEQYGKIEVIEIMTDRGSGKKRGFAFVTFDDHDSVDKIVIQKYHTVNGHNCEVRKA LSKQEMASASSSQRGRSGSGNFGGGRGGGFGGNDNFGRGGNFSGRGGFGGSRGGGGYGGS GDGYNGFGNDGGYGGGGPGYSGGSRGYGSGGQGYGNQGSGYGGSGSYDSYNNGGGGGFGG GSGSNFGGGGSYNDFGNYNNQSSNFGPMKGGNFGGRSSGPYGGGGQYFAKPRNQGTSL >gi568815586r:54192377_54395248|GENSCAN_predicted_CDS_3|1077_bp atgtctaagtcagagtctcctaaagagcccgaacagctgaggaagctcttcattggaggg ttgagctttgaaacaactgatgagagcctgaggagccattttgagcaatggggaacgctc acggactgtgtggtaatgagagatccaaacaccaagcgctccaggggctttgggtttgtc acatatgccactgtggaggaggtggatgcagctatgaatgcaaggccacacaaggtggat ggaagagttgtggaaccaaagagagctgtctccagagaagattctcaaagaccaggtgcc cacttaactgtgaaaaagatatttgttggtggcattaaagaagacactgaagaacatcac ctaagagattattttgaacagtatggaaaaattgaagtgattgaaatcatgactgaccga ggcagtggcaagaaaaggggctttgcctttgtaacctttgacgaccatgactccgtggat aagattgtcattcagaaataccatactgtgaatggccacaactgtgaagttagaaaagcc ctgtcaaagcaagagatggctagtgcttcatccagccaaagaggtcgaagtggttctgga aactttggtggtggtcgtggaggtggtttcggtgggaatgacaacttcggtcgtggagga aacttcagtggtcgtggtggctttggtggcagccgtggtggtggtggatatggtggcagt ggggatggctataatggatttggtaatgatggtggttatggaggaggcggccctggttac tctggaggaagcagaggctatggaagtggtggacagggttatggaaaccagggcagtggc tatggcgggagtggcagctatgacagctataacaacggaggcggaggcggctttggcggt ggtagtggaagcaattttggaggtggtggaagctacaatgattttgggaattacaacaat cagtcttcaaattttggacccatgaagggaggaaattttggaggcagaagctctggcccc tatggcggtggaggccaatactttgcaaaaccacgaaaccaaggaacgtccttgtga >gi568815586r:54192377_54395248|GENSCAN_predicted_peptide_4|394_aa 
MSDRRDQEDLAKSTWNLLLTAGKTDARKQSRNRVIQLSTSELGEMELTWQEIMSITELQG LNAPSEPSFEPQAPAPYLGPPPPTTYCPCSIHPDSGFPLPPPPYELPASTSHVPDPPYSY GNMAIPVSKPLSLSGLLSEPLQDPLALLDIGLPAGPPKPQEDPESDSGLSLNYSDAESLE LEGTEAGRRRSEYVEMYPVEYPYSLMPNSLAHSNYTLPAAETPLALEPSSGPVRAKPTAR GEAGSRDERRALAMKIPFPTDKIVNLPVDDFNELLARYPLTESQLALVRDIRRRGKNKVA AQNCRKRKLETIVQLERELERLTNERERLLRARGEADRTLEVMRQQLTELYRDIFQHLRD ESGNSYSPEEYALQQAADGTIFLVPRGTKMEATD >gi568815586r:54192377_54395248|GENSCAN_predicted_CDS_4|1185_bp atgtctgaccgtagagatcaggaagaccttgccaaatctacttggaatctcttgctgacg gcagggaaaacagatgcaaggaagcagagcaggaacagggtgatacagctgtccacttca gagctaggagagatggaactgacttggcaggagatcatgtccatcaccgagctgcagggt ctgaatgctccaagtgagccatcatttgagccccaagccccagctccataccttggacct ccaccacccacaacttactgcccctgctcaatccacccagattctggcttcccacttcct ccaccaccttatgagctcccagcatccacatcccatgtcccagatcccccatactcctat ggcaacatggccataccagtctccaagccactgagcctctcaggcctgctcagtgagccg ctccaagaccccttagccctcctggacattgggctgccagcagggccacctaagccccaa gaagacccagaatccgactcaggattatccctcaactatagcgatgctgaatctcttgag ctggaggggacagaggctggtcggcggcgcagcgaatatgtagagatgtacccagtggag tacccctactcactcatgcccaactccttggcccactccaactataccttgccagctgct gagacccccttggccttagagccctcctcaggccctgtgcgggctaagcccactgcacgg ggggaggcagggagtcgggatgaacgtcgggccttggccatgaagattccttttcctacg gacaagattgtcaacttgccggtagatgactttaatgagctattggcaaggtacccgctg acagagagccagctagcgctagtccgggacatccgacgacggggcaaaaacaaggtggca gcccagaactgccgcaagaggaagctggaaaccattgtgcagctggagcgggagctggag cggctgaccaatgaacgggagcggcttctcagggcccgcggggaggcagaccggaccctg gaggtcatgcgccaacagctgacagagctgtaccgtgacattttccagcaccttcgggat gaatcaggcaacagctactctcctgaagagtacgcgctgcaacaggctgccgatgggacc atcttccttgtgccccgggggaccaagatggaggccacagactga >gi568815586r:54192377_54395248|GENSCAN_predicted_peptide_5|219_aa MTRSRLTSTSASQVQAGPNGNVWPISGGVSFAAPRRHQLRGKMEALILEPSLYTVKAILI LDNDGDRLFAKYYDDTYPSVKEQKAFEKNIFNKTHRTDSEIALLEGLTVVYKSSIDLYFY VIGSSYENELMLMAVLNCLFDSLSQMLRKNVEKRALLENMEGLFLAVDEIVDGGVILESD PQQVVHRVALRGEDVPLTEQTVSQVLQSAKEQIKWSLLR 
>gi568815586r:54192377_54395248|GENSCAN_predicted_CDS_5|660_bp atgacgagatctcgcctcacctcaacctccgcctcccaggttcaagcggggcctaatggg aacgtatggccaatcagcggcggcgtttcttttgcggctccacgtcggcaccagctgcgg ggcaagatggaggcgctgattttggaaccttccctgtatactgtcaaagccatcctgatt ctggacaatgatggagatcgactttttgccaagtactatgacgacacctaccccagtgtc aaggagcaaaaggcctttgagaagaacattttcaacaagacccatcggactgacagtgaa attgccctcttggaaggcctgacagtggtatacaaaagcagtatagatctctatttctat gtgattggcagctcctatgaaaatgagctgatgcttatggctgttctgaactgtctcttc gactcattgagccagatgctgaggaaaaatgtagaaaagcgagcactgctggagaacatg gaggggctgttcttggctgtggatgaaattgtagatggaggggtgatcctagagagtgat ccccagcaggtggtacaccgggtggcattaaggggtgaagatgtcccccttacggagcag accgtgtctcaggtgctgcagtcagccaaagaacagatcaagtggtcactccttcggtga >gi568815586r:54192377_54395248|GENSCAN_predicted_peptide_6|396_aa MWNSSDANFSCYHESVLGYRYVAVSWGVVVAVTGTVGNVLTLLALAIQPKLRTRFNLLIA NLTLADLLYCTLLQPFSVDTYLHLHWRTGATFCRVFGLLLFASNSVSILTLCLIALGRYL LIAHPKLFPQVFSAKGIVLALVSTWVVGVASFAPLWPIYILVPVVCTCSFDRIRGRPYTT ILMGIYFVLGLSSVGIFYCLIHRQVKRAAQALDQYKLRQASIHSNHVARTDEAMPGRFQE LDSRLASGGPSEGISSEPVSAATTQTLEGDSSEVGDQINSKRAKQMAEKSPPEASAKAQP IKGARRAPDSSSEFGKVTRMCFAVFLCFALSYIPFLLLNILDARVQAPRVVHMLAANLTW LNGCINPVLYAAMNRQFRQAYGSILKRGPRSFHRLH >gi568815586r:54192377_54395248|GENSCAN_predicted_CDS_6|1191_bp atgtggaacagctctgacgccaacttctcctgctaccatgagtctgtgctgggctatcgt tatgttgcagttagctggggggtggtggtggctgtgacaggcaccgtgggcaatgtgctc accctactggccttggccatccagcccaagctccgtacccgattcaacctgctcatagcc aacctcacactggctgatctcctctactgcacgctccttcagcccttctctgtggacacc tacctccacctgcactggcgcaccggtgccaccttctgcagggtatttgggctcctcctt tttgcctccaattctgtctccatcctgaccctctgcctcatcgcactgggacgctacctc ctcattgcccaccctaagctttttccccaagttttcagtgccaaggggatagtgctggca ctggtgagcacctgggttgtgggcgtggccagctttgctcccctctggcctatttatatc ctggtacctgtagtctgcacctgcagctttgaccgcatccgaggccggccttacaccacc atcctcatgggcatctactttgtgcttgggctcagcagtgttggcatcttctattgcctc atccaccgccaggtcaaacgagcagcacaggcactggaccaatacaagttgcgacaggca agcatccactccaaccatgtggccaggactgatgaggccatgcctggtcgtttccaggag 
ctggacagcaggttagcatcaggaggacccagtgaggggatttcatctgagccagtcagt gctgccaccacccagaccctggaaggggactcatcagaagtgggagaccagatcaacagc aagagagctaagcagatggcagagaaaagccctccagaagcatctgccaaagcccagcca attaaaggagccagaagagctccggattcttcatcggaatttgggaaggtgactcgaatg tgttttgctgtgttcctctgctttgccctgagctacatccccttcttgctgctcaacatt ctggatgccagagtccaggctccccgggtggtccacatgcttgctgccaacctcacctgg ctcaatggttgcatcaaccctgtgctctatgcagccatgaaccgccaattccgccaagca tatggctccattttaaaaagagggccccggagtttccataggctccattag >gi568815586r:54192377_54395248|GENSCAN_predicted_peptide_7|401_aa MILGSLSRAGPLPLLRQPPIMQPPLDLKQILPFPLEPAPTLGLFSNYSTMDPVQKAVLSH TFGGPLLKTKRPVISCNICQIRFNSQSQAEAHYKGNRHARRVKGIEAAKTRGREPGVREP GDPAPPGSTPTNGDGVAPRPVPIPPPPPKGSFVYFLGAGQDSRKGLSWQEAQTGWQATEG IPETGQGVTKGEGGTPAPASLPGGSKEEEEKAKRLLYCALCKVAVNSLSQLEAHNKGTKH KTILEARSGLGPIKAYPRLGPPTPGEPEAPAQDRTFHCEICNVKVNSEVQLKQHISSRRH RDGVAGKPNPLLSRHKKSRGAGELAGTLTFSKELPKSLAGGLLPSPLAVAAVMAAAAGSP LSLRPAPAAPLLQGPPITHPLLHPAPGPIRTAHGPILFSPY >gi568815586r:54192377_54395248|GENSCAN_predicted_CDS_7|1206_bp atgatcctcggcagcctgagccgggcagggcccctgcctctgctacggcagcccccgatc atgcagcccccactggacctcaagcagatcctgcccttcccactcgagccagcccctacc cttggcctcttcagcaactacagcaccatggaccctgtgcagaaggctgtgctctcccac acttttgggggacccttgctcaagaccaagcggcccgtcatttcctgtaatatctgtcaa atccgcttcaattctcagagccaggctgaggcgcactacaaaggtaatcgccacgcccga cgagtcaaaggcattgaggctgccaagaccagaggcagggagcctggcgtccgagaacct ggagacccagctcccccaggcagcaccccaacaaatggggatggtgtagcaccccgtcca gtccctatacccccacccccacccaagggctctttcgtgtacttcctgggggctgggcaa gactccaggaaggggctgtcctggcaggaagcccagactggctggcaggccactgagggc attccggagactggtcagggtgtaaccaagggtgaaggggggactccagccccggcttcc ttgcctgggggtagcaaggaagaggaggagaaagccaagcggctgctctactgtgctctg tgcaaggtggctgtgaactccctgtcccagcttgaggcacataacaaaggtactaagcac aagacaattctggaggcccgaagtgggctcgggcccatcaaagcttaccctcggctgggg cctcccaccccgggggaaccagaggctcctgcccaggaccgaactttccactgtgagatc tgcaatgtcaaggtcaactcggaggtccaactgaaacagcacatctccagccggcggcac cgagacggcgtggccgggaagcccaacccactactgagccgtcacaagaagtctaggggc 
gccggggagctggcgggcacgctgactttctccaaggagctgcccaagtccctggcgggc ggcctgctccccagccccctggcggtggctgcagtgatggcagcggcagcaggctcgccg ctgtccctgcgcccggctccagccgcacctcttctccagggaccgccgatcactcaccct ctgcttcacccggcccccggacccatccgaactgcgcacggacccatcctcttctccccg tactga