GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:36:05 Sequence gi568815591r:77693975_77894373 : 200399 bp : 39.36% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2496 3081 586 2 1 94 90 504 0.900 44.61 1.02 Intr + 3756 3861 106 1 1 102 56 80 0.873 4.55 1.03 Intr + 8957 9065 109 2 1 42 80 78 0.570 1.77 1.04 Intr + 17010 17121 112 2 1 40 28 57 0.003 -6.07 1.05 Intr + 42436 42552 117 2 0 128 106 84 0.594 13.72 1.06 Intr + 55450 56090 641 2 2 63 110 463 0.543 36.85 1.07 Intr + 71521 71658 138 0 0 91 69 56 0.881 3.74 1.08 Intr + 74687 74829 143 1 2 53 95 119 0.863 7.33 1.09 Intr + 79173 79340 168 0 0 67 21 173 0.978 6.64 1.10 Intr + 84364 84472 109 1 1 98 68 61 0.999 4.47 1.11 Term + 84556 85194 639 0 0 68 32 457 0.999 31.32 1.12 PlyA + 85240 85245 6 1.05 2.00 Prom + 88034 88073 40 -3.95 2.01 Init + 100287 100364 78 2 0 51 109 50 0.255 4.51 2.02 Term + 104965 105411 447 0 0 21 41 356 0.870 18.43 2.03 PlyA + 106984 106989 6 1.05 3.05 PlyA - 108379 108374 6 1.05 3.04 Term - 138475 138368 108 1 0 58 48 103 0.196 0.83 3.03 Intr - 152352 152168 185 1 2 17 77 140 0.235 4.39 3.02 Intr - 154510 154312 199 1 1 -21 32 153 0.011 -3.20 3.01 Init - 159996 159940 57 0 0 90 86 56 0.069 6.96 3.00 Prom - 165921 165882 40 -5.85 4.07 PlyA - 166485 166480 6 1.05 4.06 Term - 177875 177182 694 1 1 32 49 342 0.728 16.75 4.05 Intr - 178257 178003 255 2 0 54 68 307 0.711 20.94 4.04 Intr - 178685 178565 121 0 1 73 98 34 0.941 1.53 4.03 Intr - 179406 179304 103 0 1 24 19 98 0.499 -4.67 4.02 Intr - 179726 179560 167 0 2 59 50 167 0.776 8.76 4.01 Init - 180118 179971 148 1 1 24 72 100 0.736 2.30 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 160433 160523 91 1 1 83 67 48 0.843 2.93 S.002 Term - 165168 165056 113 1 2 31 49 148 0.879 2.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:77693975_77894373|GENSCAN_predicted_peptide_1|955_aa MAEPPSPVHCVAAAAPTATVSEKEPFGKLQLSSRDPPGSLSAKKVRTEEKKAPRRVNGEG GSGGNSRQLQPPAAPSPQSYGSPASWSFAPLSAAPSPSSSRSSFSFSAGTAVPSSASASL SQPVPRKLLVPPTLLHAQPHHLLLPAAAAAASANAKSRRPKEKREKERRRHGLGGAREAG GASREENGEVKPLPRANDKTRSFDDFSPDQAAAECLSIRKRRILKGSCQTGYLRKALFSD TVTLGIKASTYGFWGGMIQSIVKGLAETFNFWIHIWELNDLQYISALFACSEEMPDYHLE KQLADKIKDKIKERDKEKEREKKKHKVMNEIKKENGEVKILLKSGKEKPKTNIEDLQIKK VKKKKKKKHKENEKRKRPKMYSKSIQTICSGLLTDVEDQAAKGILNDNIKDYVGKNLDTK NYDSKIPENSEFPFVSLKEPRVQNNLKRLDTLEFKQLIHIEHQPNGGASVIHAYSNELSH LSPMEMERFAEEFVGLVFSENENSAAFYVMGIVHGAATYLPDFLDYFSFNFPNSPVKMEI LGKKDIETTTMSNFHAQVKRTYSHGTYRAGPMRQISLVGAVDEEVGDYFPEFLDMLEESP FLKCTLPWGTLSSLKLQSRKDSDDGPIMWVRPGEQMIPVADMPKSPFKRKRTTNEIKNLQ YLPRTSEPREMLFEDRTRAHADHIGQGFERQTTAAVGVLKAVHCGEWPDQPRITKDVICF HAEDFLEVVQRMQLDLHEPPLSQCVQWVDDAKLNQLRREGIRYARIQLYDNDIYFIPRNV VHQFKTVSAVCSLAWHIRLKLYHSEEDTSQNTATHETGTSSDSTSSVLGPHTDNMICAVS KASLDSVFSDKLHSKYELQQIKHEPIASVRIKEEPVNVNIPEKTTALNNMDGKNVKAKLD HVQFAEFKIDMDSKFENSNKDLKEELCPGNLSLVDTRQHSSAHSNQDKKDDDILC >gi568815591r:77693975_77894373|GENSCAN_predicted_CDS_1|2868_bp atggcggaaccgccgagccccgtgcactgtgtcgctgccgcggcccccaccgccaccgtc tcggagaaagaaccgtttggcaagctgcaactctcctcccgggaccctccgggttctctg tccgccaagaaggtccggactgaggagaagaaggcaccgcggagagtgaacggagaaggg ggcagcggcgggaacagcaggcagctgcagccgccggcagcaccttcgcctcagagctat ggcagccccgcgtcttggagctttgcccctctgtctgctgctccctccccgtcctcttct cggagcagtttctctttctccgctggcacggccgttccctcctcagcctccgcttccttg tctcagccggtgccgcgcaaactgctggtccctcctacgctgctgcacgctcagcctcac catctcctcctgcccgccgccgccgccgctgcctcggctaacgccaagtcgcgcagacct aaggagaagcgggagaaggagaggaggaggcacggtctcggtggggcccgagaggccggc ggggcctcccgggaggagaacggggaggtgaagccgctgccccgagcaaatgataaaacc aggagctttgacgatttttcgccagatcaagcggctgcagaatgtttgagcattagaaaa aggcgaattcttaagggttcttgtcagacaggttacctccgaaaagccttattttcagat acagtcacattgggcattaaggcttcaacatatggattttggggaggcatgattcagtcc atagtgaagggcttggcagagaccttcaacttctggatacatatttgggaactgaatgac ttgcagtatatttcagccctttttgcctgctcagaagagatgcccgattatcatttagaa aaacaattggctgataaaatcaaagacaaaattaaagagagagacaaagaaaaagaaaga gaaaaaaagaaacataaagtaatgaatgagatcaagaaagagaatggagaagtaaagatt ttgctgaaaagtgggaaggagaaaccaaaaacaaatatagaagacttacaaattaaaaag gtaaagaagaaaaagaaaaagaaacacaaagagaatgaaaaacggaagcgtccgaaaatg tatagcaaatctattcagaccatctgctcaggattgctaactgatgttgaagatcaagca gccaaaggcatcctaaatgataacataaaagattacgttgggaagaatttggataccaag aactatgattccaaaattccagagaacagtgagtttccatttgtctcattaaaggagcca cgagttcagaataacctcaaaaggttggacactttggaatttaaacaactcattcatata gagcaccagcctaatggaggtgcatcggttatccatgcctacagtaacgaactctcccac ctgtctcctatggagatggagaggtttgcagaagagtttgtgggtctagtgttcagtgaa aatgaaaactctgcagctttctacgtgatgggtattgttcatggggcagctacttattta cctgactttttagactatttttcatttaattttcccaattcaccagtgaaaatggagata ttgggaaagaaagatatagagacaacgactatgtccaattttcatgctcaggtaaaaaga acgtattctcatggtacttacagagctggcccaatgagacaaataagcttggtgggagca gttgatgaagaagtaggagattatttccctgagttccttgacatgttggaagagtcacca tttttaaaatgtacactgccatgggggacgctatctagtctaaaattacagagtcgaaaa gatagtgatgatggtcccatcatgtgggttcgtccaggagaacaaatgatccctgtggct gatatgccaaagtcacctttcaaaaggaaaagaactaccaatgaaataaaaaatcttcag tacctacctcgaacaagtgagccccgtgagatgctctttgaagacaggacaagagctcat gcagatcatataggacaaggttttgaacgacagactacagctgctgttggagtgctgaag gctgtgcactgtggagagtggcctgatcaaccccgtataaccaaagatgtaatttgtttt catgctgaagatttcttagaagtagttcaacgaatgcagttagatttacatgaacctcca ctgtcccagtgtgtccaatgggttgatgatgcaaaactgaatcaactgaggagggaaggc attcgctatgccaggattcagctatatgataatgacatttattttattccaaggaatgtt gttcatcagttcaagacagtttcagctgtatgcagtttagcatggcatattcggctcaaa ttatatcactcagaggaggacacttctcagaatacagctactcatgaaacaggcacatca tcagattccacatcatctgttcttggacctcacactgacaacatgatttgtgctgtaagc aaagcctccttggattctgttttttcagataaacttcattctaaatatgaattacagcag attaaacatgaacctattgcatctgtaagaatcaaggaagaacctgtgaatgttaatatt cctgaaaagactacagcactgaataatatggatggcaagaatgttaaagcaaaattggat catgttcaatttgcagaatttaagattgacatggattctaaatttgaaaatagcaacaaa gatttaaaggaagaattgtgccctggaaatctaagtctagttgatacaaggcaacacagt tcagcacattcaaatcaagataaaaaagacgatgacattttgtgctaa >gi568815591r:77693975_77894373|GENSCAN_predicted_peptide_2|174_aa MRNQFQGAFSSSFNTNMIKKSSVKSQPPPSGAALAPTVRAASPPRSGGQCGGRRVLTEFR GWIGTREESGRRDGGGGSGANASPSVAGDFPGSLPYLRAGVSRDPFSSVGRWRRLSRLPP ELARSSSLDGDGPKPSAPPAPRAPLPGPCPLLPSLGAVLRGRRVLEGRSGVSGG >gi568815591r:77693975_77894373|GENSCAN_predicted_CDS_2|525_bp atgaggaaccagttccaaggtgctttctcatccagtttcaacaccaacatgatcaagaag agtagtgtgaaaagccagccgcctcctagcggcgcggcgctcgctcctacggtaagagcg gcatcacctccgcgctcgggtgggcagtgtggaggtcgtcgagttctgacagaatttcga ggatggatcgggacgagggaggagagcggtcgtcgggatggaggtgggggctccggggcc aacgcgtcgccctccgtcgccggcgactttcccggcagtctcccgtacctgagggccggg gtcagccgcgaccccttctcatcagtcggccggtggcggaggctcagccgccttcctccc gagctcgcgcggtcctcaagtcttgatggtgacgggccgaagcccagcgcgccacccgcc cctcgggccccgcttccaggcccctgtcctttgcttccttcccttggcgctgtgctgagg ggtcgtcgggtcctggaagggcgttcaggggtctcgggtggctag >gi568815591r:77693975_77894373|GENSCAN_predicted_peptide_3|182_aa MTSPNELNKAPVTNSRVTKVYEKVLHIMDHQRNANQNYSEISSYLSQNGFSPEDRQQQMP ARMWRNGNPCTSLTGMQISTATMENTIIGIHAPFYASHWQGKQNAMIGLDLSGFTPSETS DTHSLAVRPVERNEFLNKIEALPPGKKHGQSRYSWLIHKQLVDPLLPSSGVFCGWEVMLK LF >gi568815591r:77693975_77894373|GENSCAN_predicted_CDS_3|549_bp atgacctcaccaaatgaactaaataaggcaccagtgaccaattcccgagtaacaaaggta tatgaaaaggtgcttcacatcatggatcatcagagaaatgcaaatcaaaactacagtgag atatcatcttacctcagtcaaaatggcttttctccagaagacaggcaacaacaaatgccg gcaaggatgtggagaaatgggaatccctgtacatcgttgacaggaatgcaaattagtaca gccactatggagaacaccataattggaatacatgccccattctatgccagtcactggcaa gggaaacagaatgccatgattgggttagacttatcaggattcaccccctctgagacttcg gatacacattctttggctgtaagacctgtggagaggaatgaattccttaacaaaatagag gctcttccaccaggaaaaaagcacggacaaagtcgatacagctggcttattcacaaacag cttgtggatccattacttcccagttcgggggtcttctgtggttgggaagtaatgctcaaa ctcttttga >gi568815591r:77693975_77894373|GENSCAN_predicted_peptide_4|495_aa MWRTKEHNEVGWLLLSSVDKVMKENYELRDSISWLQKQILSLKSAKIALEGTASSSPVVA TSPPRPMLPSAFPPLSEEINLVLPEATVMASPEAAVRQDNTDSPQKSAEQAWEWMLRVWD NGGRNTELDQAEFMDLGPLSMSGLTVGSAVTQLQNLNYNGNNWILRWQGPSGSNQPSKAS HPIIVQWAHEQSGHGGRDGGYAWAQQHGLPLTKADLAAAIAECPICQQQRPTLSPPYGTI PRGDQPVTWWQIDYIEPFPSRKGQSIAFDQGTHFTATEVWQWAHAYGIHWSYHAPHHSEA AGLTQQWNTLSKSQLQRQLGDNTLQGWGKVYALNQHPIYGTVSPIARIHRFRNQGVELEV APLTITPSDTLVKFLLPVPETLRSPGLEVLVPEGGTLPRGDTTPLNWKLRLPPGHFGLPL PLSQQAKKGVTVIGLDYQDEISLLLHNGGKEEYSWNTQDPLRRLLISPCRVIKVNGKLQQ PNSGRTTNDPETIQE >gi568815591r:77693975_77894373|GENSCAN_predicted_CDS_4|1488_bp atgtggagaaccaaggaacataatgaagttggttggttgctcctaagttcagtggacaaa gtgatgaaagaaaattatgaactcagggattctatctcctggcttcagaagcagatactg agcctcaaatctgctaagattgccctggaaggaacagcttcctcatccccagtagtggca acatcccctccccgacccatgctgccatcagcctttccacctttgtctgaagagataaac cttgtgctgcctgaggcaacagtgatggcctcccctgaggcagctgtgaggcaagataac actgattctcctcagaaatctgcagaacaggcatgggaatggatgttaagggtgtgggat aacggcggaaggaacacagagttggatcaagctgaatttatggatttgggcccactaagt atgtcaggtctaacagtgggaagtgcagtcactcaactacaaaatttaaattacaatggg aataactggatcctgaggtggcaggggccaagtggcagcaatcaaccatcaaaggcaagc caccccatcatcgtccaatgggcccatgaacaaagtggccatggtggcagggatggaggt tacgcatgggctcagcaacatggacttccactcaccaaggctgatctggctgcagccatt gctgagtgcccaatttgccagcagcagagaccaacactgagccctccatatggcaccatt cctcggggtgatcagccagttacctggtggcagattgattatattgaaccttttccatca cggaaagggcagagcattgcctttgaccaaggcactcactttacagctacagaagtgtgg cagtgggctcatgcttatggaattcactggtcttaccatgctccccatcattctgaagca gctggattgacacaacagtggaataccctttcgaagtcacaattacaacggcaactaggt gacaatactttgcagggctggggcaaagtgtatgctctgaatcagcatccaatatatggt actgtttctccaatagccaggattcacaggttcaggaatcaaggggtagaattggaagta gcaccacttaccatcacccctagtgatacactagtaaaatttttgcttcctgttcctgag acattacgttctcctggcctggaggtcttagttccagagggaggaacactgccaagagga gacacaactccattaaactggaagttaagattgccacctggacactttgggctcccccta cctttaagtcaacaggctaagaagggagttacagtgattggcctggactatcaagatgaa atcagtctactactccacaacggaggtaaggaagagtattcatggaatacacaagatcca ttaaggcgtctcttaatatcaccatgccgtgtgattaaggtcaatgggaaactacaacag cccaattcaggcaggactacaaatgaccccgagaccattcaggaatga