GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:19:43 Sequence gi568815583r:36794757_37198200 : 403444 bp : 38.18% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.10 PlyA - 937 932 6 1.05 1.09 Term - 13708 13589 120 1 0 102 37 90 0.584 2.79 1.08 Intr - 16421 16392 30 2 0 77 86 39 0.467 0.01 1.07 Intr - 16829 16698 132 2 0 19 64 124 0.726 3.02 1.06 Intr - 22532 22421 112 1 1 96 115 46 0.816 7.56 1.05 Intr - 23369 23180 190 0 1 83 81 229 0.668 19.42 1.04 Intr - 31109 31016 94 2 1 11 94 79 0.074 -0.58 1.03 Intr - 31413 31194 220 2 1 84 91 70 0.069 4.28 1.02 Intr - 42310 42100 211 2 1 50 31 144 0.013 1.95 1.01 Init - 44372 44336 37 2 1 88 106 -1 0.458 1.92 1.00 Prom - 45529 45490 40 -8.05 2.08 PlyA - 46391 46386 6 1.05 2.07 Term - 47370 47193 178 2 1 78 39 153 0.669 5.58 2.06 Intr - 51244 51199 46 2 1 108 96 58 0.643 5.15 2.05 Intr - 56849 56801 49 2 1 97 37 40 0.236 -2.87 2.04 Intr - 67365 67261 105 0 0 103 115 95 0.449 13.19 2.03 Intr - 85554 85441 114 0 0 83 37 59 0.404 0.02 2.02 Intr - 85790 85688 103 1 1 67 78 90 0.768 5.16 2.01 Init - 86213 86035 179 2 2 54 75 110 0.800 5.08 2.00 Prom - 86859 86820 40 -9.55 3.19 PlyA - 86976 86971 6 1.05 3.18 Term - 88195 87974 222 1 0 7 55 202 0.272 4.73 3.17 Intr - 91184 91071 114 2 0 45 67 100 0.092 3.32 3.16 Intr - 97703 97441 263 0 2 121 34 97 0.027 3.88 3.15 Intr - 100484 100395 90 0 0 69 96 88 0.406 6.75 3.14 Intr - 102471 102328 144 1 0 100 -22 134 0.685 2.93 3.13 Intr - 103232 103115 118 2 1 53 70 99 0.919 3.72 3.12 Intr - 103664 103608 57 2 0 58 105 59 0.787 2.76 3.11 Intr - 106336 106244 93 1 0 55 93 72 0.480 3.64 3.10 Intr - 121111 121057 55 0 1 72 92 44 0.006 1.26 3.09 Intr - 128900 128828 73 0 1 128 37 64 0.026 2.85 3.08 Intr - 144201 144069 133 0 1 24 57 110 0.002 0.80 3.07 Intr - 148402 148342 61 0 1 81 89 25 0.239 -0.28 3.06 Intr - 155644 155568 77 1 2 53 105 96 0.173 5.19 3.05 Intr - 192794 192718 77 0 2 50 98 58 0.013 1.32 3.04 Intr - 208363 208264 100 1 1 39 107 55 0.202 1.26 3.03 Intr - 232719 232630 90 0 0 45 110 44 0.132 1.57 3.02 Intr - 242203 242058 146 2 2 94 98 115 0.679 12.18 3.01 Init - 245821 245677 145 1 1 57 44 83 0.136 1.13 3.00 Prom - 250543 250504 40 -7.45 4.10 PlyA - 250825 250820 6 -0.45 4.09 Term - 251494 251310 185 2 2 83 54 184 0.248 11.22 4.08 Intr - 287484 287386 99 1 0 80 74 56 0.162 2.56 4.07 Intr - 294462 294360 103 0 1 63 65 51 0.263 -0.87 4.06 Intr - 298974 298825 150 0 0 101 103 150 0.746 17.24 4.05 Intr - 299919 299771 149 1 2 107 121 -39 0.306 0.33 4.04 Intr - 300776 300620 157 2 1 102 40 66 0.156 1.86 4.03 Intr - 300858 300808 51 0 0 116 87 43 0.282 5.19 4.02 Intr - 301674 301533 142 2 1 34 42 163 0.257 5.73 4.01 Init - 303416 303211 206 2 2 87 32 186 0.227 9.40 4.00 Prom - 305279 305240 40 -12.43 5.00 Prom + 306106 306145 40 -7.45 5.01 Init + 306841 307041 201 0 0 79 -7 174 0.380 5.92 5.02 Intr + 314937 315138 202 2 1 42 91 186 0.328 12.24 5.03 Intr + 317646 317701 56 1 2 86 81 66 0.562 3.48 5.04 Intr + 345388 345561 174 0 0 83 58 65 0.273 2.21 5.05 Intr + 351032 351206 175 1 1 49 110 107 0.881 7.59 5.06 Intr + 352392 352509 118 1 1 61 86 123 0.840 8.00 5.07 Intr + 356027 356169 143 2 2 76 27 56 0.088 -2.62 5.08 Term + 368442 368650 209 1 2 26 47 149 0.020 1.12 5.09 PlyA + 368796 368801 6 1.05 6.05 PlyA - 370287 370282 6 1.05 6.04 Term - 370949 370746 204 2 0 58 35 144 0.458 2.49 6.03 Intr - 372217 372028 190 0 1 84 19 87 0.229 0.07 6.02 Intr - 382571 382436 136 0 1 46 90 50 0.001 -0.29 6.01 Init - 403052 403043 10 2 1 100 98 4 0.394 3.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 128900 128824 77 0 2 128 55 70 0.865 4.72 S.002 Init + 212374 212496 123 0 0 55 76 80 0.811 3.72 S.003 Term + 219848 219988 141 1 0 84 43 128 0.849 4.85 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:36794757_37198200|GENSCAN_predicted_peptide_1|381_aa MDEHMVNISYLKKNGHLKIEIHREITHDNKGRDWSNEAASQGMLKIVSKPPETRRGKEGF PYSSQRESGFANALILDFWSLEFELALLQSSTAPGCWIAVAVPTFYPEATTNGRREVVSP FVVLNQQRIPFPEALQKTFYLVSSNLHHMPALHQLMKVHGHLRDREHLNESDLGAERKCG SGDEWLLGRLAAEFIVGRKYKLLWKIWCSSFGDICLAISITNGEEVVVKPESQGQASPVA VQEQALQDASRTLIHQYDYTFDWTKLKQKAAQQAASSSGRGSRPKPQQREKEPKSNEKMQ GTCPCGGKLAKANAGEDDMEKVKSEVRKHMEAGESSLADNEQDLQGQAMLWHKVTMFVGK QALSRMPLSRLQSSSWINPYQ >gi568815583r:36794757_37198200|GENSCAN_predicted_CDS_1|1146_bp atggatgagcatatggtgaacattagttatcttaaaaaaaacggtcatctgaagatagag atacacagggagataacgcatgataacaaaggcagagattggagtaatgaagctgcaagc caaggaatgctaaagattgtcagcaaaccaccagaaactagaagaggcaaggaagggttc ccttacagttctcagagggagagtggctttgccaatgccttgattttagacttctggtct ctggaatttgagttggctcttctgcagtctagcactgctcctggttgctggattgctgta gcagttcccacattttatccagaggctaccacaaatggtagaagagaggttgtttctccc tttgtggttcttaatcagcaaagaatacctttcccagaagccctccagaagactttctat ttggtttcatccaatttgcatcatatgcccgcgctacaccagttaatgaaagtacatggc catctaagggacagggaacacctaaatgaaagtgaccttggggcagaaagaaagtgtggc agtggtgatgagtggctgctgggacggctagcagcagaattcattgtcggaaggaaatat aaactgctttggaagatctggtgtagctccttcggagacatctgtctggcgatcagcatc accaacggtgaagaagtggtagtgaagccagaatctcagggccaggcatccccagttgct gtacaagagcaggctttacaagatgcttcaaggaccctgatccatcaatatgactacaca tttgattggacaaagttaaagcagaaagcagcacagcaggcagcctcttctagtgggagg ggcagcaggcccaaaccccaacagagagaaaaagagcccaagagtaatgagaaaatgcag ggtacatgtccatgtggagggaaactggcaaaggccaacgcaggggaagatgacatggaa aaagtgaaaagtgaagtgagaaagcatatggaagcaggggaatcaagccttgcagataat gaacaggatcttcagggtcaagctatgctgtggcataaggtgacaatgttcgtggggaaa caggctttgagcaggatgcccctttcccggttgcagtccagctcctggataaatccatac caatag >gi568815583r:36794757_37198200|GENSCAN_predicted_peptide_2|257_aa MLKFIPETPQISRCHKKSMQTHPLLTEQTDPVRVPVETGIGDLNSARAMGDVTPRLPEWR PGSHARMDKRLTGHTGPGAAVKRNGGGQSPRRALPPPPATQIPGHARTGVQPLTGKLPGK EPRQERRRLEEGGFPVAKSNQQPWDLEAQWYSPYRLVSTSKEQSRETKQESKEVSDFVQA TQQAIISFGEALAPGKPEGGEQGNSVSMLTEGGLLLSLTAAVSFVGSGSPIMCGNIIKRS QCSNWHAEDTDKDSVNK >gi568815583r:36794757_37198200|GENSCAN_predicted_CDS_2|774_bp atgctaaaatttattcctgaaacaccgcaaatttccagatgccacaaaaaatcaatgcaa acgcaccccctcctaactgagcagacagacccggtgagggttccagttgaaactggaatt ggagatctgaactccgctagagcaatgggagacgttaccccgaggcttccggagtggagg ccaggctcccacgcccggatggacaaaaggctcacaggccacactggtcccggagccgct gtgaaacgcaatggagggggacagtctcctcgccgcgctctgccgcccccacctgctacc caaatcccgggacacgcccggacaggtgtgcagcctctaactggaaagctcccgggaaaa gagccacgccaggaaaggaggcggctggaggagggggggttcccagtggccaaatccaac cagcagccatgggatctagaagcccagtggtatagtccatataggttggtctccaccagc aaagaacagagcagagaaacgaaacaagagagcaaggaagtctctgattttgtccaggcc acacagcaggctatcatctcctttggtgaggctctagctcccggaaaaccagaaggagga gagcaaggaaattctgtttccatgttaacagagggaggtctcctcctgtcactgactgct gctgttagctttgtgggctctgggagcccaattatgtgtggaaacattataaaaaggagc cagtgctcaaactggcatgcagaagacacggacaaagattccgtcaacaaataa >gi568815583r:36794757_37198200|GENSCAN_predicted_peptide_3|685_aa MAFGLALQKSPVGTQLFKEALFLACLEVTVCTYVTHSAVPEEQDLHPLGDGLDNSVASPG TGDDDDPDKDKKRQKKRGIFPKVATNIMRAWLFQHLTIIGKISISSFHMTTVCKCIPWRS GVIGNYRQVGLLWAPSVGAKLNAQGCPAAVVSHSRKNLCGELQPDSDFPLKSLQPVSFHP KKGLLQHPYPSEEQKKQLAQDTGLTILQVNNWKKVEDNSLQILHNYPIATDSSCSGDSME EGSVFDKRRCSVVRERERAGHQQLGSNSSSNNLCDLERMIYINMLPGHTKESQLVVIYQF LILIFTTFALNFGSAMVIISPSLRKAIHAESSIPAILDSSSECSGMAAFQKLQGEEPSLT LLSISAGFKSAHRRPDRVGPDCPPPKGLQLPRNTELPAHTEFFWLISKSLNCRSCRRAEV FLASTYYCCVGDKTSDSLIDGFADLTIPCESRVLEPVSQGAAYSPEGQPMGSFVLDGQQH MGIRPAGLQSMPGDYVSQGGPMGMSMAQPSYTPPQMTPHPTQLRHGPPMHSYLPSHPHHP AMMMHGGPPTHPGMTMSAQSPTMLNSVDPNVGGQLKSGNQEYGIELMLGAVQGQQKPKDL RILRPRDLMGSKLICRGIYEHHVTRGRTGNESRNNLRLAAAPPAPPTSTLGKGAGNAALT RISLPTATRPPHNECQSTPRNDFCF >gi568815583r:36794757_37198200|GENSCAN_predicted_CDS_3|2058_bp atggcctttggtttagctctgcaaaaaagccctgtgggtactcagctcttcaaagaagct ctgttcctggcatgtttagaagttactgtgtgcacatacgtaacccactcagctgttcca gaggaacaagaccttcaccctttaggggatggtttagacaacagtgtagcttcacctggt acaggtgacgatgatgatccggataaggacaaaaaacgccagaagaaaagaggcattttc cccaaagtagcaacaaatatcatgagagcatggctcttccagcatctcacaatcataggg aaaatttccatttcttcgttccacatgaccacagtttgcaagtgtattccatggagaagt ggagtgattgggaattacaggcaggttggtttgctctgggctccctctgtgggtgccaaa ctaaatgctcaaggctgtcctgcagctgtggtcagtcattcaaggaaaaatctttgtgga gaactccaacctgattctgacttccctttgaaaagtttacaaccagtgtcatttcacccc aagaagggcttgctgcagcatccgtacccttccgaagagcagaagaaacagttagcgcaa gacacaggacttacaattctccaagtaaacaactggaaaaaagttgaagataactcattg caaatattacacaattatcccatagcaactgattcaagctgcagcggggacagcatggag gaaggttcagtgtttgacaaaagacgatgcagtgtggtgcgtgaaagagaacgggctgga catcagcagcttgggtctaattccagctcaaacaacttgtgtgatcttgagcgcatgatc tatatcaacatgcttcctggtcatactaaagaatctcagctagtggtgatctaccagttt ctcatcctcatctttacaacatttgccctgaattttggttcagccatggttatcatatcc ccttccctgcgcaaagcaatccatgcagaatcttcaattccagctattcttgattcctcc tcagagtgttcagggatggcagctttccagaaacttcaaggagaagagccttctctgacc ctcctgagcatctctgctggtttcaagagtgctcacaggaggcccgaccgagtggggcct gactgtcctcctccaaagggacttcagcttcctcggaacacagagctgccagcacacact gaatttttctggctaatttcaaagagcttaaactgcaggagttgccgaagagcagaagtc tttctggccagcacttattactgctgtgttggtgataaaacttcagacagcctaattgat ggctttgctgacctaacaataccttgtgaaagccgagtgctagagcctgtgagccaagga gcagcatatagtccagagggtcagcccatggggagctttgtgttggatggtcagcaacac atggggatccggcctgcaggtttgcagagcatgccaggggactacgtttctcagggtggt cctatgggaatgagtatggcacagccaagttacactcctccccagatgaccccacaccct actcaattaagacatggacccccaatgcattcatatttgccaagccatccccaccaccca gccatgatgatgcacggaggaccccctacccaccctggaatgactatgtcagcacagagc cccacaatgttaaattctgtagatcccaatgttggcggacagttgaaatctggtaaccag gaatatggcattgaattaatgctgggagctgttcaaggtcagcagaagcccaaagacttg agaattctaaggcccagagacttgatgggaagcaagttaatctgccgcgggatttatgag caccatgtaacgcgagggcgcacaggtaacgagagccggaacaatctcaggctggcagcc gcaccgcctgcaccgccaacctccactcttgggaaaggcgctggcaatgcagccctgact cggataagtcttccaaccgcaaccagacccccacataacgagtgtcaatctacaccgcgg aacgacttttgtttctga >gi568815583r:36794757_37198200|GENSCAN_predicted_peptide_4|413_aa MDGVGVPASMYGDPHAPRPIPPVHHLNHGPPLHATQHYGAHAPHPNVMPASMGSAVNDAL KRDKDAIYGHPLFPLLALVFEKCELATCTPREPGVAGGDVCSSDSFNEDIAVFAKQVRAE KPLFSSNPELDNLPLPSPMPRSTHFLLELWGVRTEPSFGALGREGGREQLRAPARELASE EQDERAQLSHHHPHPTPNPKTGPQSHRALTPFPELFFQMIQAIQVLRFHLLELEKVHELC DNFCHRYISCLKGKMPIDLVIDERDGSSKSDHEELSGSSTNLADHCKVRPYGMLFKASGA YSDKVLLLSSVHQLKEAWLGAQRLIFHILAITLTLQAFLTPSQLATPGITQHCCRNLPSQ ASVELQQPQSGLAKPVRMEVLSVSSVESLRKSCVNRGRITKNEPLQLRAFGAI >gi568815583r:36794757_37198200|GENSCAN_predicted_CDS_4|1242_bp atggacggagtaggggttcccgcttccatgtacggagaccctcacgcgccgcggccgatc cccccggttcaccacctgaaccacgggccgccgctccacgccacacagcactacggcgcg cacgccccgcaccccaatgtcatgccggccagtatgggatccgctgtcaacgacgccttg aagcgggacaaggacgcgatctatgggcacccgttgtttcctctgttagctctggtcttt gagaagtgcgagctggcgacctgcactccccgggaacctggagtggctggcggagacgtc tgctcctccgactccttcaacgaggacatcgcggtcttcgccaagcaggttcgcgccgaa aagccacttttttcctcaaatccagagctggacaatttgcctttgccctctcccatgccc agaagcacccactttctgctggagctctggggagttagaacagaaccctcttttggagct ctagggagagagggaggaagggagcagcttagagcccccgcacgggagctggcgagcgag gagcaggatgaacgcgcccaactttctcaccaccatcctcaccccactcccaaccccaag acaggacctcagagtcacagagctctaactccattccctgaacttttctttcagatgata caagcaatacaagtactaaggtttcatcttttggagttagaaaaggtccacgaactgtgc gataacttctgccaccgatacattagctgtttgaaggggaaaatgcccatcgacctcgtc attgatgaaagagacggcagctccaagtcagatcatgaagaactttcaggctcctccaca aatctcgctgaccattgtaaggtgagaccttatggtatgctattcaaagcctctggggcc tattccgataaagttctacttctgagctctgtccaccaactcaaggaagcatggctggga gcccagcgcctcatttttcacattcttgcaattacgttaacgcttcaagcatttctgact ccaagtcagctagcaacaccaggcataactcaacactgttgtagaaatctcccatcacag gccagcgtggagctgcagcagccacagtctgggctggccaagccagtgagaatggaagtg ctttcggtctcttcagtggagagcctgcgtaaatcctgtgtaaatagagggaggataaca aagaatgagcccctgcagctccgagcatttggagccatttga >gi568815583r:36794757_37198200|GENSCAN_predicted_peptide_5|425_aa MGPAPKELFQKGKGNEGNYPTPFGWKLVVAGALSSLARAAPGFTSARLVSRRPRVNHYEP GVGGGATKRTAITFSHPKVLQELRERGEWLKRLITGGEGGARGGPKERGVKKEGPLPGLY LEKALNRFSAIFGKGSSFATHTELDSRFKRLSKNSGSDNELSILQLEKVEPRQIPYYVSQ IQQHRLILHALASPRLDCFLRCSDAAMRSRLDWADSTEGATLVLLAGPFLKDGWEQLLKK IPECHKSSRAWNSSKQVQFIDSYIHIQKTAGTPYTVSLENIGFDDDAIGPWCFVIFHSTK AATPRADKRPITRIMSQHILKWLSVRPEELQDLANMYLGGPEEYVWKLEPGGVELGGPIR MKGKDLDERRTAIYPPFRHAGFGEEEKGCHPTGPQKDGPFLINQRRERTTFSVSKQVNSS HTGRA >gi568815583r:36794757_37198200|GENSCAN_predicted_CDS_5|1278_bp atggggccagctccgaaagaacttttccaaaaaggaaaaggaaacgaagggaactacccc acaccgtttgggtggaaactagttgtggccggtgccctgtcctccctggcccgagctgca ccaggcttcacctctgcgcggctggtttcccggcggccgcgcgtcaatcactacgagccc ggagttgggggcggagccacgaagagaactgccatcactttctcacatcccaaggtgttg caggaactcagagagaggggagagtggctgaaacgcttaattaccggaggcgaaggaggg gcgcgcgggggaccgaaggagaggggagtgaagaaggaaggtcctttaccaggcttgtac ctggagaaagctttgaatcgcttctctgctatatttggaaaaggaagtagttttgcaact cacactgagttagattcgagatttaaacggttgtcgaagaacagtggatcagacaatgag ctctccattttgcaacttgagaaagtggagcctaggcagataccatattatgtatcccaa attcagcagcataggctgatattacatgcattggcatcaccgagattagactgctttcta cggtgtagtgatgcagcaatgagatcacggctggattgggctgacagtaccgagggtgcc accttagtgctcctcgctggaccatttttgaaagatggatgggagcagctgctcaaaaag atcccagagtgccacaaaagctctagagcctggaattcctcaaagcaagtccaatttata gattcctacattcatatacagaaaacagctgggaccccatacacagtatctttggaaaat ataggatttgatgatgatgctattggtccatggtgctttgtcattttccatagcacaaaa gccgcaacgcctcgagcggacaaaagaccaataactagaattatgtcacaacatattcta aaatggttaagtgtcaggcctgaggagctgcaagacctagccaatatgtacctgggaggt ccagaagagtatgtatggaaactggaacctggaggtgttgaattagggggcccaataaga atgaaagggaaggatcttgatgaaaggaggacagctatttatccaccatttaggcatgca ggctttggagaagaggagaaaggttgtcatcccactggccctcagaaagatgggcccttt ctcatcaaccagaggagggaaaggactacctttagtgtgtcaaagcaggtcaatagcagc cacacaggtcgggcatga >gi568815583r:36794757_37198200|GENSCAN_predicted_peptide_6|179_aa MQNSINAVDFSLHCSTYKVSIQKKIFFQETKQSIEPDPQINQMLKISDRAQIFRDPQLKP RWLHSPSSTVIPELQIGSPQNSETAKSSFGYLLLNSLLSTWFSYLLAFSFTQAFSVACRL NVCEGYKKKLSSSGTVKKCGGSAEQPEDTRALPYYTVCHSDVNNGCEVIKQIVLGSSSH >gi568815583r:36794757_37198200|GENSCAN_predicted_CDS_6|540_bp atgcagaacagcataaatgctgttgactttagtcttcactgttctacatataaagtcagc attcaaaagaaaatcttctttcaagagactaagcaatcaatagaaccagacccacagata aaccagatgttgaaaatatcagacagggctcagatattcagggaccctcaattgaagcca aggtggcttcatagcccctcctccacagtcattccagaacttcagattgggtctccccag aactctgaaactgccaaatcttcatttggctatctacttcttaacagtttgctttctact tggtttagctatctcttggcatttagcttcactcaagctttctctgtcgcttgtcgactt aatgtgtgtgagggctataaaaagaagctaagtagttctggcacagtaaagaagtgtgga ggcagtgctgaacagccagaggacacaagagctctaccatattatactgtttgtcactct gatgtgaacaatggctgtgaagttattaaacagattgtcctgggttcttcctctcattag