GENSCAN 1.0	Date run: 8-Nov-116	Time: 17:42:19

Sequence gi568815589r:110144160_110351464 : 207305 bp : 42.64% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   8387   8594  208  1  1   70    9   152 0.205   4.54
 1.02 Intr +   9041   9198  158  0  2   62   38    69 0.203  -1.89
 1.03 Intr +  11160  11334  175  2  1   99   69    42 0.432   1.99
 1.04 Intr +  12160  12338  179  2  2   51   84   188 0.968  13.42
 1.05 Intr +  17489  17614  126  1  0   21   56   120 0.017   2.06
 1.06 Term +  24240  24338   99  2  0  126   37   207 0.993  16.55
 1.07 PlyA +  25302  25307    6                               1.05

 2.13 PlyA -  25905  25900    6                               1.05
 2.12 Term -  29562  29308  255  0  0   21   54   235 0.899   8.00
 2.11 Intr -  29987  29871  117  2  0  101    3    97 0.759   2.24
 2.10 Intr -  31737  31640   98  1  2   94   31    64 0.705   0.11
 2.09 Intr -  32298  32227   72  1  0   58  101    47 0.508   1.46
 2.08 Intr -  34281  34155  127  0  1   65   76    58 0.694   1.63
 2.07 Intr -  34524  34375  150  0  0   80   98    80 0.437   7.64
 2.06 Intr -  45654  45568   87  0  0   76   78    79 0.169   4.95
 2.05 Intr -  45985  45809  177  1  0   62   68    81 0.178   2.69
 2.04 Intr -  57315  56966  350  1  2  124   80   209 0.419  17.85
 2.03 Intr -  63463  63228  236  0  2  161   97   116 0.769  16.21
 2.02 Intr -  70134  69923  212  0  2   28   38   167 0.534   2.49
 2.01 Init -  83342  83235  108  2  0   83   33   106 0.210   4.87
 2.00 Prom -  85745  85706   40                              -3.65

 3.02 PlyA -  86505  86500    6                               1.05
 3.01 Sngl -  88487  87459 1029  2  0   23   42   416 0.851  27.33
 3.00 Prom -  89321  89282   40                              -6.15

 4.09 PlyA -  89485  89480    6                               1.05
 4.08 Term -  90506  89713  794  0  2  -60   43   514 0.007  24.57
 4.07 Intr -  96824  96656  169  2  1   50   93   137 0.541   9.00
 4.06 Intr - 100684 100619   66  1  0  105  102    27 0.633   3.98
 4.05 Intr - 107303 107199  105  2  0   67   84    74 0.541   4.39
 4.04 Intr - 112380 112253  128  1  2  -22  107   134 0.452   3.78
 4.03 Intr - 112985 112697  289  2  1   68   -1   295 0.426  14.50
 4.02 Intr - 115373 115242  132  2  0   43   39   142 0.331   4.72
 4.01 Init - 126991 126962   30  1  0   62  110    73 0.567   6.69
 4.00 Prom - 129414 129375   40                              -4.05

 5.00 Prom + 135663 135702   40                              -2.15
 5.01 Init + 136550 136628   79  1  1  114   65    23 0.805   3.87
 5.02 Intr + 137773 137981  209  2  2   85   21    80 0.127  -1.13
 5.03 Intr + 148032 148381  350  2  2   32   82   313 0.042  18.33
 5.04 Intr + 164062 164122   61  1  1   57   91    49 0.010  -0.08
 5.05 Term + 169664 169768  105  1  0  115   53    29 0.013  -0.47
 5.06 PlyA + 169846 169851    6                               1.05

 6.04 PlyA - 170646 170641    6                               1.05
 6.03 Term - 172384 172212  173  2  2   64   35   130 0.860   2.31
 6.02 Intr - 173511 173445   67  0  1   51   98    53 0.654   0.16
 6.01 Init - 176894 176757  138  2  0   89   90    88 0.985   9.29
 6.00 Prom - 177272 177233   40                              -5.35

 7.00 Prom + 178312 178351   40                              -6.25
 7.01 Init + 185839 185917   79  0  1   98    1   130 0.165   6.57
 7.02 Term + 196794 196969  176  1  2   79   41   185 0.686   9.84
 7.03 PlyA + 197029 197034    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  17489  17694  206  0  2   21   42   254 0.924  10.65
S.002 Sngl +  28356  28640  285  2  0   55   37   189 0.857   5.81
S.003 Sngl -  90456  89713  744  0  0   95   43   466 0.984  38.74

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589r:110144160_110351464|GENSCAN_predicted_peptide_1|314_aa
MKKNVIELPGLWHRLFASHSFNDLQDPNSGAVCSVHYWGKSIRSAVTVLMERTSGCKLWT
SEEVVRCGTVILHPLYNILSAYIPSFNSAIINTGFGLQTVTVTVLCNLGWSWHLSLCGHD
IQVLAASTTKGIIGKIIIVILEDTSQKCIWGSVSINPPHEGPEARARHFLGGRDGNGLGE
GKIEKVKPPPSPTTEGPSLQPDLAPEEAAGTQRPKNLMQTLMEDYETHKSKRRERMDDSS
DLLLMEQQVTRRLWGSEKPTGREKAIWLQTLANPKWPPTTLLVLEATRVNRRKSALALRW
EAGIYANQEEEDNE

>gi568815589r:110144160_110351464|GENSCAN_predicted_CDS_1|945_bp
atgaaaaagaatgtgattgagctcccaggcctgtggcacagactatttgcatcacattca
tttaatgatcttcaagacccaaacagtggtgctgtctgcagtgtccactattgggggaag
agcatccggagtgctgtcaccgtcctaatggagaggacctctggctgcaaactctggacc
tctgaggaagttgtcaggtgtggaacagtaatccttcaccctctttacaacattctttct
gcatacatcccatcatttaattcagccatcatcaacactgggtttgggctccagactgtg
actgtcactgtcttatgtaacttgggctggagttggcatctgagcctctgtggtcatgac
atccaggttttagccgcatcaaccacaaagggcataattggaaaaatcataattgttata
cttgaagatacttctcagaagtgcatttggggctctgtgtcaataaacccaccccatgaa
gggccagaagccagggccaggcatttcttgggagggagggatggaaatggtcttggtgag
gggaaaatagagaaagtcaaacctcctccatcccccaccactgaaggccccagcttgcag
cctgacttagcccctgaagaggctgccggaacccagcggcccaagaatctgatgcagacc
ctcatggaagactatgagacacacaaatctaaaaggcgcgagagaatggatgatagtagt
gatcttcttttaatggaacaacaggttacaagacggctttggggttcagagaagcctacg
ggaagagaaaaggctatttggctacagactctagcaaatccaaaatggcccccaacgacc
ttgctggtcctcgaggccacacgggttaatcgaagaaagagcgcactggctttgcgctgg
gaagcagggatctatgccaaccaggaggaagaagacaacgaataa

>gi568815589r:110144160_110351464|GENSCAN_predicted_peptide_2|662_aa
MAEDKNEQVASYMDGSRQREREPVQGISSSMKSSDLAQRRHIPTVSIPNSRNQKLPEAAV
RATPGTDTESFLPHSVSQMIPLNQLTLKKVIQAPPLDEGMASSVAKGSCPTQLCLLFSRI
RMEGLPCPCPALPHFWQLRSHLMAEGSRTQAPGKGPPLSIQFLRAQYEGLKRQQRTQAHL
LVLPKGGNTPAPAESMVNAVWINKERRSSLSLEEADSEVEGRLEEAAQGCLQAPKSPWHT
HLEMHCLVQTSPQDTSHQVHHRGKLVGSDQRLPPEGDTHLFETNQMTQQGTGIPEAAQLP
CQNQKCADDFMSWSFSNFSIILEMDWKSYEDQRECSDYNEAGFRARAWNDLGKKLLENLS
QASSIGVTWKHVQNENYLAPPRPVDSETQEHCPFYDSALTQTVRKGVATSNMTVVPRIPK
RILQCYHANQHPSQELPLNKCPAFLGVEVTDESNNHRATKDASCGLQNLRESSYPSFEFG
GSATVGGASFENLSGVKVLLHQGPQNGSSRLIPVVQSPPTLNLEQRELSIQSLDNSPIKT
WAPITALALIPESLFPGQYSAGRWCKPISTSHACEAANVPQNTPRCSEQEDMVGNPGEKK
AGRVTNAEREKKAAGKEVPGRGGAEGKTRKTDGQGSGAQGSVMGWSSRRLKREGGSRRED
AS

>gi568815589r:110144160_110351464|GENSCAN_predicted_CDS_2|1989_bp
atggcagaagacaaaaacgagcaagttgcatcttacatggatggcagcaggcaaagagag
agagaacctgtgcaggggatctcctcctcaatgaaatcatcagatcttgctcagaggagg
cacatcccaacagtcagcattccaaacagcaggaatcagaagctgccagaagccgcagtt
agggctacaccaggaacagacacagagtcatttctaccacattctgttagccaaatgata
ccactcaaccaactcacacttaagaaagtgatacaggctccacctcttgatgagggaatg
gcaagctcagttgccaaagggtcctgccccacccagctgtgtttgcttttctcccggatc
cggatggaggggttgccctgcccgtgcccagccctgccccacttctggcagcttaggtct
cacttaatggctgagggctccaggactcaggcccctgggaaagggcccccactcagcatc
cagttcctgcgagcccagtatgaaggcttgaagaggcagcagaggacccaggcccacctc
ctggtgcttccaaaaggaggaaacacacctgctcctgcagaatcgatggtcaatgctgtt
tggattaacaaggagagaagaagctcactgtccctggaagaggcagattctgaggtggag
gggaggctggaggaggctgcccagggctgccttcaggcccccaagtctccatggcacacg
cacctggagatgcattgtttggtccaaacctccccccaggacaccagtcatcaagtacat
cataggggcaagcttgtgggatctgatcaaaggctccctcctgaaggagacacacacttg
tttgaaaccaatcagatgactcagcaaggaaccggaatcccagaggctgcccagcttcca
tgccagaaccagaagtgtgctgatgacttcatgtcttggagtttctctaatttcagcatt
attcttgagatggactggaagtcatatgaagaccagagagaatgcagtgattacaacgag
gcaggcttcagggcaagggcctggaatgatttgggcaagaaactcctggaaaacctctct
caggccagcagcatcggcgtcacctggaaacatgtccaaaatgaaaattatttggcccca
cccagacctgttgactcagaaactcaagagcattgtcccttttatgattctgctttaact
caaactgtcagaaagggtgtagcaacaagcaatatgactgtggtgcccaggattccaaaa
agaatccttcagtgttaccatgccaaccagcatccatcccaagagttgcctctgaataag
tgcccagcattcttaggtgtggaggtgacagatgagtcaaacaatcatagggcaacaaag
gatgcatcctgtggcctgcagaacctcagggaaagctcatatccttcctttgaatttgga
gggtcggccacagttggaggggcctcgttcgagaatctaagtggagtcaaagttctgctt
caccaaggcccacaaaatggtagcagtagactcattccagttgtccagagtccacccact
ttgaacttagaacaaagagaactctccatccagagtttggataactctcccatcaagact
tgggctcctatcacagccttggctttgatccctgagagcctctttcctggacagtacagt
gcggggagatggtgcaagcccatttcgacttcacacgcatgtgaagctgccaatgtgcca
cagaacacccccagatgctcagaacaggaagatatggttggaaatcctggggagaaaaag
gctggaagagttacaaatgcggagagagagaaaaaggcagctggaaaggaagtaccagga
agaggaggggccgagggaaagactaggaaaactgatggtcaggggtcaggggcccagggc
tcggtgatgggatggagcagccggagactgaagcgggagggaggaagtagaagggaggac
gcttcctga

>gi568815589r:110144160_110351464|GENSCAN_predicted_peptide_3|342_aa
MFFETNENKDTTYQNLWDTFKAVCKGKFIALNAHKRKQKRSKIDTLTSQLKELEKQEQTH
SKASRRQEITKIRAELKKSETQKTLQKINESRSWFFEKINKIDRPLARLIKKKREKNQID
AIKHDRGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNR
PITGSEIEAIINSSPTKKSPGPDGFTAEFYQRYKLELVPFLLKLFQSIEKKGILSNSFYE
ASIILIPKPGRDTTKRENFRPISLMNIDAKILNKILANRIQQHNKKLIHHDQVGFIPGMQ
GWFNICKSINVIQHINRTSNKNHMIISIDAEKAFDKIQQPSC

>gi568815589r:110144160_110351464|GENSCAN_predicted_CDS_3|1029_bp
atgttctttgaaaccaacgagaacaaagacacaacataccagaatctctgggacacattc
aaagcagtgtgtaaagggaaatttatagcactaaatgcccacaagagaaagcagaaaaga
tctaaaattgacaccctaacatcacaattaaaagaactagagaagcaagagcaaacacac
tcaaaagctagcagaaggcaagaaataactaagatcagagcagaactgaagaaatcagaa
acacaaaaaacccttcaaaaaatcaatgaatccaggagctggttttttgaaaagatcaac
aaaattgatagaccgctagcaagactaataaagaagaaaagagagaagaatcagatagat
gcaataaaacatgatagaggggatatcaccaccgatcccacagaaatacaaactaccatc
agagaatactataaacacctctatgcaaataaactagaaaatctagaagaaatggataaa
ttcctcgacacatacaccctcccaagactaaaccaggaagaagttgaatctctgaataga
ccaataacaggctctgaaattgaggcaataattaatagctcaccaaccaaaaaaagtcca
ggaccagatggattcacagccgaattctaccagaggtacaagttggagctggtaccattc
cttctgaaactattccaatcaatagaaaaaaagggaatcctctctaactcattttatgag
gccagcatcatcctgataccaaagcctggcagagacacaacaaaaagagagaattttaga
ccaatatccctgatgaacatcgatgcaaaaatcctcaataaaatactagcaaaccgaatc
cagcagcacaacaaaaagcttatccaccatgatcaagtgggcttcatccctgggatgcaa
ggctggttcaacatatgcaaatcaataaatgtaatccagcatataaacagaaccagcaac
aaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaattcaacaacct
tcatgctaa

>gi568815589r:110144160_110351464|GENSCAN_predicted_peptide_4|570_aa
MSYESMAVVQITKPYLKRLILYVCLGVVQECAVSSSTFGYDELGGQEPLRDTWDLSDGQE
PNQSPGRFYRTKPLCQAGAAGSAVKMKEQKEVTEKRTVTVNSGEEARTYTPRYFPVTVTQ
HFVGFTWLGGPGRGGPFRGIQPCLGGPHLELYKGREQAASLEALFGALDPFPSVLTAARQ
TPAAKMVKQIESKTAFQEALDAAGDKLVVVDFSATWCGPCKMIKPFFHDVASECEVKCMP
TFQFFKKGQKATEGGELKPEQDWVFVVESSSEWGCREADSTFEGLIILMQYRIKAGEVER
RSQEEADEWLTRITDAEKSLKDKMELKTKARELRDECTSLSSPCDQLEERVSVMEDQMNE
MKQEKFREKRIKRNEQSLQEISDYVKRPNLYLIDVPESDGENGTKLENTLQDIILENFPN
LARQANIQIQEIQRMPKRYSSSRATPGHIIVRFTKVEMKEKMLRAAREKSRVTHKGKPST
LTADLSAETLQARREWGPIFDILKEKNFQSRISYPAKLSFISEGEIKSFTDKQMLRDFVT
TRPALKELLKEALNMERNNQYQLLQKHAKL

>gi568815589r:110144160_110351464|GENSCAN_predicted_CDS_4|1713_bp
atgagttacgagagtatggctgtggtccagattacaaagccttacctcaaaagactgatt
ctgtatgtctgtctgggtgtggtccaggaatgtgcagtatcatcgagcacctttggctat
gatgaattaggtggtcaagaaccactcagagacacgtgggatctgtctgatggtcaagag
cccaaccagtcccccggtcgtttctaccgcactaaaccgctgtgtcaagctggagcagcg
ggctcagcagtgaaaatgaaagaacagaaggaggttacagagaagagaacggtcacggta
aattccggagaggaggcaaggacgtacacaccgagatacttcccggtcaccgttactcag
cactttgtggggttcacgtggctgggggggccggggcgtggcggcccttttcgaggaatc
cagccctgcctgggcggtccccatctcgagctttataaagggagagagcaagcagcgagt
cttgaagctctgtttggtgctttggatccatttccatcggtccttacagccgctcgtcag
actccagcagccaagatggtgaagcagatcgagagcaagactgcttttcaggaagccttg
gacgctgcaggtgataaacttgtagtagttgacttctcagccacgtggtgtgggccttgc
aaaatgatcaagcctttctttcatgatgttgcttcagagtgtgaagtcaaatgcatgcca
acattccagttttttaagaagggacaaaaggcaactgaaggtggagagttgaagccggag
caggattgggtctttgtagttgagtccagtagtgagtggggatgcagagaagctgacagt
acttttgagggattgattatactgatgcagtatagaatcaaggctggagaagtagagagg
agaagccaagaagaggctgatgagtggctaacgagaataaccgatgcagagaagtcctta
aaggacaagatggagctgaaaaccaaggcacgagaactacgtgatgaatgcacaagcctc
agtagcccatgcgatcaactggaagaaagggtatcagtgatggaagatcaaatgaatgaa
atgaagcaagagaagtttagagaaaaaagaataaaaagaaatgaacaaagcctccaagaa
atatcggactatgtgaaaagaccaaatctatatctgattgatgtacctgaaagtgacggg
gagaatggaaccaagttggaaaacactctgcaggatattatcctggagaacttccccaat
ctagcaaggcaggccaacattcaaattcaggaaatacagagaatgccaaaaagatactct
tcgagcagagcaactccaggacacataattgtcagattcaccaaagttgaaatgaaggaa
aaaatgttaagggcagccagagagaaaagtcgggttacccacaaagggaagcccagcaca
ctaacagctgatctctcagcagaaactctacaagccagaagagagtgggggccaatattc
gacattcttaaagaaaagaattttcaatccagaatttcatatccagccaaactaagcttc
ataagtgaaggagaaataaaatcctttacagacaagcaaatgctgagagattttgtcacc
accaggcctgccctaaaagagctcctgaaggaagcactaaacatggaaaggaacaaccag
taccagctactgcaaaaacatgccaaattgtaa

>gi568815589r:110144160_110351464|GENSCAN_predicted_peptide_5|267_aa
MPNQTYQLSILEGSQRGMSLCVHVHAVFSSPNRESGRVDGIARPNQRQRQGWNNSDTFIF
PMAVITSTTAMAQPTITECLPAVKFLSVSKSCEKEDPPAAHALSGKPAANRRKNHPLPAP
QGCSKQGGARDHHQASETQPHFHVGKAAATPASPGRPQRPHLVLQRREPAALPALPDAPA
RGPASCSSPVRAPGPSHQVPGRSVVGDKFLTSRVLLASNKEGKISCFHDDGRQDVRLHKL
LKISFPNNLAPTCLGINHPSHERSDET

>gi568815589r:110144160_110351464|GENSCAN_predicted_CDS_5|804_bp
atgcccaaccaaacttatcaactttccatattggaaggaagccagcgtggcatgtctctg
tgtgtgcatgtgcatgcggttttctcttcaccaaatagagagagtggcagggtagatggc
attgctaggcccaatcaaagacagagacaaggatggaataattctgatacattcattttc
cccatggcagtgataaccagcacaacagcaatggcacaaccaacaattactgaatgtctt
ccagctgtcaagttcctttcagtttccaagtcttgtgaaaaggaagatcccccggctgcc
cacgcactgtccgggaaaccagcagcaaataggcggaagaaccatccactgcctgctccc
cagggctgcagcaagcaaggtggagccagagaccaccaccaggcaagtgagacccagccc
cacttccacgtgggcaaagctgccgccaccccggccagtccaggacggccgcagaggccc
cacctcgtcctgcaaaggagggagcctgccgcgctcccggctctgccagacgctcctgcc
cgagggcctgcgtcctgcagttccccagtgcgggctcccggtccctcccaccaggtgccc
ggtcggtcggtcgtaggggacaaatttctaaccagcagagttttacttgcttccaacaag
gaaggcaagatttcctgctttcatgatgatggaaggcaggatgtgaggcttcacaagctg
ctaaaaataagctttcctaataacttggcacctacctgtctaggaataaaccaccctagc
catgagagatcagatgaaacctga

>gi568815589r:110144160_110351464|GENSCAN_predicted_peptide_6|125_aa
MADYGPSLVAEVFMQRLNEHLAGMQYKQWLKLCPSSRAKKSLSKFQPRHWGDPPVGESGA
AKKKLTCAAPPKTLIKPLAVAKPVLSLVKVKGEVFKYLQRDADGSGEQMGGEESEKGAQI
GSNSN

>gi568815589r:110144160_110351464|GENSCAN_predicted_CDS_6|378_bp
atggcagattatgggccatccctggttgctgaagttttcatgcagagattgaatgaacat
ctggcagggatgcaatataagcagtggctgaagttatgtcctagttctagggcaaagaag
agcctaagcaaattccagcctagacattggggtgacccacctgtgggagaatctggggct
gcaaagaagaaacttacctgtgcagctcctccaaaaactttaatcaagccccttgcagtt
gcaaagcctgtcttgagtcttgtgaaagtcaaaggggaagtgttcaaatacctgcaacgt
gatgcagatggctctggggaacagatgggtggggaggagtcagagaaaggggcccagata
ggatcaaacagtaattaa

>gi568815589r:110144160_110351464|GENSCAN_predicted_peptide_7|84_aa
MVLVVGLGEEEVAAATAAAVTTVACKSFEKKMVHNERSDTAATPTNPGHVPGVSGSQARL
GQYVYLSVLGLVDFFMEFAAKKLA

>gi568815589r:110144160_110351464|GENSCAN_predicted_CDS_7|255_bp
atggtgctggtagtgggtctaggggaagaggaagtggcagcagccacagcagcagcagtg
acaacagtggcctgtaaatcatttgaaaagaagatggttcataacgaacgctcagacaca
gctgctacaccaacaaaccctggtcatgtccctggggtttctggaagtcaagcaaggctg
ggccagtacgtctacctttcagtgctagggcttgtagactttttcatggaatttgctgct
aagaagcttgcctag