GENSCAN 1.0 Date run: 6-Nov-116 Time: 18:35:27

Sequence gi568815596r:27681618_27990345 : 308728 bp : 40.83% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    631    742  112  1  1   78   98    73 0.962   6.76
 1.02 Intr +   3420   3660  241  2  1   81   45   274 0.552  18.60
 1.03 Intr +   7083   7150   68  1  2   30  106    51 0.233  -1.19
 1.04 Intr +  12068  12137   70  1  1  101   71    58 0.169   3.24
 1.05 Intr +  34717  34836  120  2  0  103   60    46 0.066   2.85
 1.06 Intr +  35230  35401  172  2  1   37   87    70 0.197  -0.12
 1.07 Intr +  38607  38775  169  0  1   77   78   153 0.953  12.23
 1.08 Intr +  43420  43524  105  0  0  110   78    41 0.926   4.79
 1.09 Intr +  46000  46068   69  0  0   97  113    95 0.896  11.36
 1.10 Term +  49048  49383  336  0  0  106   42    98 0.421   0.59
 1.11 PlyA +  51792  51797    6                               1.05

 2.05 PlyA -  53029  53024    6                               1.05
 2.04 Term -  54494  54379  116  0  2   74   45   151 0.841   7.15
 2.03 Intr -  54931  54745  187  1  1   62   11   145 0.295   2.54
 2.02 Intr -  65133  64977  157  2  1   97   76    66 0.239   5.39
 2.01 Init -  65357  65275   83  2  2   82   81    78 0.907   6.99
 2.00 Prom -  65700  65661   40                              -6.75

 3.00 Prom +  66562  66601   40                              -7.95
 3.01 Init +  67103  67132   30  1  0   72  113    24 0.414   1.74
 3.02 Intr +  71226  71330  105  2  0   46  116    63 0.637   4.39
 3.03 Intr +  71374  71469   96  0  0   83   75    37 0.655   1.19
 3.04 Intr +  75381  75461   81  2  0   87   42    62 0.056   0.42
 3.05 Intr +  86329  86421   93  0  0  103   92    30 0.785   4.14
 3.06 Term +  86956  87120  165  0  0   79   36    92 0.645   0.03
 3.07 PlyA +  87453  87458    6                               1.05

 4.12 PlyA -  89022  89017    6                               1.05
 4.11 Term -  90192  89668  525  0  0   74   48   264 0.116  14.27
 4.10 Intr -  94827  94680  148  2  1   38   80    87 0.131   2.22
 4.09 Intr - 117505 117379  127  2  1   91   49    70 0.088   2.22
 4.08 Intr - 119519 119306  214  2  1   79  -20   156 0.024   1.27
 4.07 Intr - 136172 135987  186  2  0   87   87    48 0.102   3.56
 4.06 Intr - 146138 145950  189  2  0   95  127    83 0.994  11.76
 4.05 Intr - 151160 151069   92  0  2  115   94    96 0.983  11.69
 4.04 Intr - 161614 161450  165  2  0   53   76   112 0.954   5.51
 4.03 Intr - 165487 165425   63  2  0  112  115    58 0.986   8.67
 4.02 Intr - 166480 166417   64  1  1   62   98    42 0.596  -0.03
 4.01 Init - 169758 169600  159  0  0   61   91    78 0.526   5.27
 4.00 Prom - 172967 172928   40                              -3.95

 5.03 PlyA - 173375 173370    6                               1.05
 5.02 Term - 189642 189348  295  2  1   71   43   207 0.439   8.29
 5.01 Init - 208728 208640   89  0  2   87   92   111 0.180  11.62
 5.00 Prom - 211248 211209   40                              -7.25

 6.02 PlyA - 211377 211372    6                               1.05
 6.01 Sngl - 215226 215020  207  0  0   88   44   275 0.803  18.04
 6.00 Prom - 217314 217275   40                              -5.05

 7.00 Prom + 222899 222938   40                              -3.95
 7.01 Init + 222993 223027   35  2  2   74   87    21 0.182  -0.01
 7.02 Intr + 226081 226148   68  1  2  102   78    34 0.061   1.43
 7.03 Intr + 236637 236697   61  1  1   97  103    34 0.066   2.67
 7.04 Term + 244178 244301  124  2  1  128   48   105 0.172   7.38
 7.05 PlyA + 245586 245591    6                               1.05

 8.00 Prom + 254859 254898   40                              -3.65
 8.01 Init + 286033 286168  136  0  1   53   57   138 0.794   7.55
 8.02 Term + 286635 287173  539  1  2   26   49   246 0.845   7.92
 8.03 PlyA + 289688 289693    6                               1.05

 9.04 PlyA - 290331 290326    6                               1.05
 9.03 Term - 290596 290582   15  1  0  112   32     9 0.375  -5.04
 9.02 Intr - 290983 290851  133  0  1   45  116    42 0.132   2.43
 9.01 Init - 296733 296621  113  0  2   90   35   122 0.308   6.83
 9.00 Prom - 306879 306840   40                              -2.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  30902  30957   56  1  2   73   72    65 0.807   4.21
S.002 Sngl -  90162  89668  495  0  0   88   48   253 0.833  17.10
S.003 Term + 129332 129487  156  1  0   68   49   134 0.800   4.45

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:27681618_27990345|GENSCAN_predicted_peptide_1|487_aa
XTETQTTGAENKAKKLTLPLFGAMKGGSKFKLKTGTVGKLPPKRPELPPTLMRMKDEPEV
EEEEEEEEEEEKEKEEHEKKKLEDGSLSRPQPEIEPEAAVQEMRPPTDLTHFKETQTHAS
KNEYEKSRGELKKKKTPGPGKLPPTLSSKYPEDDPDYCVWVPPEAPDQALLLCWGRPRTV
SCLGALIAFLGNPAKGPPSSCSPGDDKTETQKMNQLGSVHRASVWKSQNPDPDPNCSVPA
LTPNQPVPLSLVLFHPAFLSKKEGARLEERRCAGTAFVRALKGAEWALTQLYDSFTVLLS
PSSGHILWTISAQPQVEEGLCFLLSSALSEKDPTGDLNPLNMVPGERNVGMRKPHRSLWG
WGHDGREGGKKGELPKRKGFPSQLSPQERRQLYGPGPSPEDAFLTVGRRDGMLVRVSPEV
VGKQAPFLTPPQVSGFGWKLPSSSSDKQPAPANQGSRQPTLLLNFLGVKLQKIQLLEEIV
EVMVDDK

>gi568815596r:27681618_27990345|GENSCAN_predicted_CDS_1|1464_bp
nngactgaaactcagactacaggtgcagaaaacaaagctaaaaagcttacattgcctcta
tttggtgccatgaaaggaggaagcaaattcaaattaaaaactggaacagtagggaagtta
ccccccaagcgtccagaactccctccaactctaatgagaatgaaagatgagcctgaagta
gaagaggaggaggaagaggaagaggaagaagagaaagaaaaggaggagcatgaaaagaaa
aaactggaggatggaagcctcagtaggccacagccagagatagagccagaagcagcagtg
caggaaatgaggcctcccacagatctcacacattttaaagaaacccaaacccatgcatca
aagaatgaatatgagaaaagcagaggtgaattgaagaaaaagaaaacacctggtccaggc
aaacttccaccaacactttcttccaaatatcctgaagatgacccagactactgtgtgtgg
gtcccacctgaagcccctgaccaggccctgctcctctgctggggtcggccccgcacagta
agctgcctgggagccctcattgccttcctgggcaacccggccaaaggccccccgagcagc
tgcagccctggagatgacaaaactgagacacaaaagatgaatcagcttggcagtgttcac
agagccagcgtgtggaaaagccagaatccagacccagacccgaactgcagcgtcccggct
cttacccctaaccagccagtgcctctcagcttagtgctcttccacccggcttttctttct
aaaaaggaaggggcaaggctggaagaacgccgctgtgctggcacagccttcgtcagagcc
ctgaaaggggctgagtgggctctgactcagctctatgactcattcactgtattgctgtct
ccatcctctgggcacatcctctggaccatttctgctcagcctcaagtggaggaggggctt
tgtttcctgctctcctctgccctttctgaaaaggaccccacaggtgatctgaaccccctg
aatatggtgcctggggagaggaatgtgggcatgaggaaacctcataggtcgctgtgggga
tggggccacgacggaagagaaggtggtaaaaagggcgagttgcctaagagaaaagggttt
ccatcacagctttctcctcaagagagaaggcagctgtatgggcccggtcccagccctgag
gatgccttcctgacagtggggaggagggacgggatgttagttagggtctccccagaagtg
gttggtaagcaggctcctttcttgacccctccccaagtctcagggtttggctggaagctc
cccagctccagctctgacaaacaacccgctcctgcaaaccagggatctaggcagcccacc
ttgctcttaaacttccttggggttaaattacagaagattcagctgctagaagagatagtt
gaggtgatggtagatgacaaataa

>gi568815596r:27681618_27990345|GENSCAN_predicted_peptide_2|180_aa
MALMMRTEQILPESPESWKPGGSRFCRKNRTRAPPSSRIPATIGQVLAETGGGLCLSLHV
NSGPQCHAPSLTPFPEKDYQECKLGVGRSFYLSRKIRYLDFYVNFSNFQKPVRTEQRISA
DCQGANSDDYDIPAALLAWSLPGRGKGGEQADTCSHKMNERMNNEQPNHRCLRWKLAAVR

>gi568815596r:27681618_27990345|GENSCAN_predicted_CDS_2|543_bp
atggcactgatgatgaggactgagcagatcctcccagagagccccgagagctggaaacct
ggaggcagcagattctgcaggaaaaatagaaccagagcacccccaagttcaaggatccca
gccacaatagggcaggtgctggctgagaccggaggaggattatgtttaagtctacatgtt
aatagtggaccccaatgccatgctccttcccttaccccattcccagaaaaggactatcag
gaatgcaagcttggtgttggcagatccttttatttgtcaagaaaaatcagatatctggat
ttttatgtgaacttttccaattttcaaaagcctgtgaggacagaacaacgcatatctgca
gactgtcagggtgcaaactcggatgactatgacattcctgcagcgctcttggcctggtcc
ctgcctgggagggggaaaggtggggagcaggcagatacctgcagccacaaaatgaatgaa
cgaatgaacaacgaacagcccaaccaccgctgcctcaggtggaagttggcagctgtgcgg
tag

>gi568815596r:27681618_27990345|GENSCAN_predicted_peptide_3|189_aa
MSSAWMLLPTLQITSLCSPSDSTPTEGEGDEEICQEGASRQYLMKGSFFPRATTRFGRRF
WARLPGRGGSQALGPLRGGKGKGRRGTADEIGRHKRVFLKDKAKVDLGGSGTEGRENNSR
HSPLVEKVRSVFVKNAPGTGQGARITDSVGKNSGCTSQSQVKQDVGVTMGWMMGSQGSAS
TAEEEGYSF

>gi568815596r:27681618_27990345|GENSCAN_predicted_CDS_3|570_bp
atgagtagtgcatggatgctcctgcccacgctacagataacatccctttgttctcccagt
gactccacacccacagaaggagagggggatgaagaaatctgccaggagggtgcctcccgc
cagtatctgatgaagggaagcttcttccccagggccactacccgtttcgggaggcgtttc
tgggctaggcttcctggcaggggaggctctcaggcactggggccgctgaggggtgggaaa
gggaagggcagacggggaacagctgatgaaataggcagacacaagcgtgtgttcctcaaa
gacaaagccaaggttgacttgggaggaagtgggactgaagggagagaaaataactcaagg
cacagtcccttagtagagaaggttcggtctgtatttgtcaaaaacgcccctgggacaggt
cagggagccaggatcactgattctgtaggcaagaacagtggctgtacttcccaatcacaa
gtcaagcaagatgtaggtgtgacaatgggctggatgatgggcagccaggggtctgccagc
acagcagaggaggaggggtacagcttctag

>gi568815596r:27681618_27990345|GENSCAN_predicted_peptide_4|643_aa
MGLIRNVIGGVEMRGLTRARVGKVDLKLILEKQVRFDLVEERGQRTSKFSEMMVGKDSFG
NDYIENLKQNDISTEFTYQTKDAATGTASIIVNNEGQNIIVIVAGANLLLNTEDLRAAAN
VISRAKVMVCQLEITPATSLEALTMARRSGVKTLFNPAPAIADLDPQFYTLSDVFCCNES
EAEILTGLTVGSAADAGEAALVLLKRGCQVVIITLGAEGCVVLSQTEPEPKHIPTEKVKA
VDTTPWKMGLLVTCHDTGVYNYSFSVDLQKLPDLEKALLTLYSILQPQLNEIIHVYPGTV
LPSTMIPLLHLLSMGGSPLPSGRLAVVDSGMLHQVGITEEVQAVPASSNPVPILSCSLTR
PSSRPLSRRLSQCSPSARRSWDSVCSSPHRSAPILLTVLGSVSAYLSYGPGLALGLRTGR
PTIIVLKLNNLLITVKHAGRTAKLFGTLGNGGRPRGKHLHHEHVQGIQQPHLQKTAERNM
VTTPGQSPSPAKRPPTNNNCFRGTARFWKRTSGSRPRPRPPPGTCLQSSGSFLGFSERVC
FAFVVLEVGILIILERDLQLRVRAPALTPISGSSAFSVRCFPYLFHEIIATTCIRPTRTT
VYCTSAAPAYQPLAYFLKMLPPPTQLPLRLVKPYSHSSESLRH

>gi568815596r:27681618_27990345|GENSCAN_predicted_CDS_4|1932_bp
atgggcttgattaggaatgtcataggaggtgtggagatgagaggattaacaagggccaga
gtagggaaggtagaccttaagctgatacttgaaaaacaggtgaggtttgatttagtggag
gagagagggcaaaggacttctaagttctcagaaatgatggttggcaaagattcttttggc
aatgattatatagaaaacttaaaacagaatgatatttctacagaatttacatatcagact
aaagatgctgctacaggaactgcttctataattgtcaataatgaaggccagaatatcatt
gtcatagtggctggagcaaatttacttttgaatacggaggatctgagggcagcagccaat
gtcattagcagagccaaagtcatggtctgccagctcgaaataactccagcaacttctttg
gaagccctaacaatggcccgcaggagtggagtgaaaaccttgttcaatccagcccctgcc
attgctgacctggatccccagttctacaccctctcagatgtgttctgctgcaatgaaagt
gaggctgagattttaactggcctcacggtgggcagcgctgcagatgctggggaggctgca
ttagtgctcttgaaaaggggctgccaggtggtaatcattaccttaggggctgaaggatgt
gtggtgctgtcacagacagaacctgagccaaagcacattcccacagagaaagtcaaggct
gtggataccacgccatggaagatgggcctacttgtgacttgtcatgatacaggcgtctat
aattactccttttctgtagaccttcagaagctgccagatcttgaaaaggccttgcttacc
ttgtacagtatactgcagcctcagttaaatgaaataatccatgtataccctgggacagtg
cttccttccaccatgattcctctgctacatctcctctccatgggtggcagccctctgccc
agtggccggctagctgtggtagacagtggaatgctccatcaagttggcattactgaggaa
gtgcaggctgtccctgccagcagtaaccctgttcccatcctgtcctgcagccttacccgc
cccagctctaggcctctgtccagacgcctctctcagtgttctccctctgcacggcgttcc
tgggacagtgtgtgctcctcacctcatcgctcagcacccatcctcctgacagttcttggt
tcggtcagtgcctacctcagctatggtccaggccttgcactaggtctcaggacagggaga
ccaacaatcatcgtcctcaaacttaacaatttactcatcacagtaaaacatgctgggaga
actgccaagctatttggtaccctgggcaatgggggcaggccccgaggcaaacatcttcat
cacgaacacgtgcagggcattcagcagccccacttacagaagaccgcggagaggaacatg
gtgaccacacctgggcagtctccgtcaccggccaaaaggcccccaaccaacaacaactgc
ttccggggcacggcccggttctggaagaggacatccggcagtcgcccgaggccacgcccc
ccgcctggaacctgcctccagtcgtctggctcgttcctcggtttctcagagcgagtgtgt
tttgcttttgtcgtgctggaggttggaatcctcatcattctggaaagggatttacagctc
agggttcgcgccccggccctgactcctatttctgggtcaagtgctttcagcgttaggtgc
tttccatatttgtttcacgaaataatcgcaaccacctgcattagacccacaagaacaact
gtgtattgtacttcggccgctccagcgtatcagcccctcgcatattttcttaaaatgttg
cctcccccgacccaactccctctccgcctggttaagccttactctcactcttctgagtcc
cttcggcattaa

>gi568815596r:27681618_27990345|GENSCAN_predicted_peptide_5|127_aa
MAASGEPQRQWQEEVAAVVVVGSCMTDLVSEHEEQKQSLKATASSNVSVLRVSLLVAKCI
AKAKKSFTVGEELTLPAAKDICLELLGDLAVKKVAQVPHSAGTITRQIDEIAEDIEAQLL
EWINESL

>gi568815596r:27681618_27990345|GENSCAN_predicted_CDS_5|384_bp
atggcggcgtctggggaaccccagaggcagtggcaagaggaggtggcggcggtggtagtg
gtgggctcctgcatgaccgacctggtcagtgaacacgaagaacagaagcaatcattgaag
gccacagcttcatcaaatgtgtctgtactgagagtatcattgttagtggctaaatgcatt
gctaaagctaagaagtcctttactgttggtgaagagttgaccctgcctgctgctaaggac
atttgtcttgaacttttaggagatcttgcagttaaaaaggtggcacaagttcctcattca
gctggcaccataactagacaaattgatgaaatagcagaggacattgaggcacaattgtta
gagtggattaatgagtcactgtga

>gi568815596r:27681618_27990345|GENSCAN_predicted_peptide_6|68_aa
MADPAEQCGPSWAQAAPDPAGHKLLAQLLEPSEEENMVCILYDKMYKNFMEEVDTIDNET
YQWKKGKL

>gi568815596r:27681618_27990345|GENSCAN_predicted_CDS_6|207_bp
atggcggacccagctgagcaatgtggacccagttgggcacaagctgctccagacccagct
gggcacaagctgctggcccagttgctggagcctagtgaagaggaaaatatggtgtgcatc
ctctatgacaagatgtacaagaacttcatggaggaggtggacacaatagataatgagacc
taccagtggaagaaggggaaactttga

>gi568815596r:27681618_27990345|GENSCAN_predicted_peptide_7|95_aa
MKDREQNRDNASYWQPPFWYLSLNVTTLGTSLDVTPYNYCSTFSDCEFDYFRYLIAILAL
MRLSVGLICQVGYTLLNSHITEHCLLTEDCEVNVY

>gi568815596r:27681618_27990345|GENSCAN_predicted_CDS_7|288_bp
atgaaagacagagagcaaaacagagacaatgcaagctactggcagccaccattctggtat
ctgtctctgaatgtgactactttaggtacctcacttgatgtaaccccttacaactactgt
tctactttctctgactgtgaatttgactactttagatacctcatcgccattcttgctctc
atgcgcctgagtgtgggccttatctgccaggtgggctacactctgctcaactcccacatc
acagaacactgtctcctcactgaagactgtgaagtgaatgtttactga

>gi568815596r:27681618_27990345|GENSCAN_predicted_peptide_8|224_aa
MWDALELPRDLLNGFDQNAHNDMDNKSQAEVVSDGDEELIGNWSKVASAVAEREQCRAWA
MASEGASPEPWQLPCGVEPASAQKSRIGVWEPLSRFQKLYGNAWMPRQKFPAGVGPSWRT
SARAVQKGNVGLEPPHRVPTGAPPSGAVRRGLPSSRPQNGRSTDSLHHVPRKATDIQHQP
VKAAGRTAVPCKATGVELSKTMGIHLLHQHDLGVFTQCLYPHFI

>gi568815596r:27681618_27990345|GENSCAN_predicted_CDS_8|675_bp
atgtgggacgctttggaacttcctagagacttgttgaatggctttgaccaaaatgctcat
aatgatatggacaataaaagccaagctgaggtggtttcagatggagatgaggaactcatt
gggaactggagcaaagttgcttcagctgtggctgaaagggaacaatgtagagcttgggcc
atggcttcagagggtgcaagccccgagccttggcagcttccatgtggtgttgagcctgcg
agtgcacagaagtcaagaattggggtttgggaacctctgtctagatttcagaagttgtat
ggaaatgcctggatgcccaggcagaagtttcctgcaggagtggggccctcatggagaacc
tctgctagggcagtgcagaagggaaatgtggggttggagcccccacacagagttcctact
ggggcaccgcctagtggagctgtgagaagagggctaccatcctccagaccccagaatggt
agatccactgacagcttgcaccatgtgcctagaaaagccacagacattcaacaccagccc
gtgaaggcagctgggaggacggctgtaccctgcaaagccacaggggtggagctgtccaag
accatgggaatccacctcttgcatcagcatgacctgggtgtatttacccaatgcctgtac
cctcattttatctag

>gi568815596r:27681618_27990345|GENSCAN_predicted_peptide_9|86_aa
MVPASASGKDLSKLTIMAGGEGRAGTSQDESGSESKRSTSQFRLATFQRLHHHMWLAATI
LDSTDLQETLIRKEGNKGNEGKVKYA

>gi568815596r:27681618_27990345|GENSCAN_predicted_CDS_9|261_bp
atggtgccagcatctgcttctggcaaagacctcagcaagcttacaatcatggcaggaggt
gaagggagagcaggcacatcacaagatgagagtggcagtgagagcaagaggagcacatct
caattcagactagccacatttcaaagactccatcaccacatgtggctagcagcgactata
ttggacagcacagatctacaggaaacactgattaggaaagaaggaaacaaagggaatgaa
gggaaggttaaatatgcctag