GENSCAN 1.0    Date run: 5-Nov-116    Time: 14:59:29

Sequence gi568815591f:77596470_77879165 : 282696 bp : 39.95% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   4195   4337  143  0  2   69   87    78 0.398   4.03
 1.02 Intr +  10766  10832   67  2  1   41   78    48 0.243  -2.91
 1.03 Intr +  14479  14577   99  0  0  104   66   158 0.999  14.59
 1.04 Intr +  22011  22096   86  2  2   98   72   122 0.974   9.30
 1.05 Intr +  30236  31206  971  2  2   82   85   759 0.984  64.71
 1.06 Intr +  39262  39380  119  2  2    6  100   100 0.528   2.26
 1.07 Intr +  57390  57528  139  2  1   12   81   122 0.005   3.02
 1.08 Intr +  66461  66722  262  0  1  -86  106   201 0.053   0.12
 1.09 Term +  76103  76199   97  2  1   42   48   143 0.130   2.16
 1.10 PlyA +  77622  77627    6                                 1.05

 2.00 Prom +  87490  87529   40                                -4.85
 2.01 Init + 100001 100586  586  1  1   94   90   504 0.900  44.61
 2.02 Intr + 101261 101366  106  0  1  102   56    80 0.873   4.55
 2.03 Intr + 106462 106570  109  1  1   42   80    78 0.570   1.77
 2.04 Intr + 114515 114626  112  1  1   40   28    57 0.003  -6.07
 2.05 Intr + 139941 140057  117  1  0  128  106    84 0.594  13.72
 2.06 Intr + 152955 153595  641  1  2   63  110   463 0.543  36.85
 2.07 Intr + 169026 169163  138  2  0   91   69    56 0.881   3.74
 2.08 Intr + 172192 172334  143  0  2   53   95   119 0.863   7.33
 2.09 Intr + 176678 176845  168  2  0   67   21   173 0.978   6.64
 2.10 Intr + 181869 181977  109  0  1   98   68    61 0.999   4.47
 2.11 Term + 182061 182699  639  2  0   68   32   457 0.999  31.32
 2.12 PlyA + 182745 182750    6                                 1.05

 3.00 Prom + 185539 185578   40                                -3.95
 3.01 Init + 197792 197869   78  1  0   51  109    50 0.255   4.51
 3.02 Term + 202470 202916  447  2  0   21   41   356 0.870  18.43
 3.03 PlyA + 204489 204494    6                                 1.05

 4.05 PlyA - 205884 205879    6                                 1.05
 4.04 Term - 235980 235873  108  0  0   58   48   103 0.196   0.83
 4.03 Intr - 249857 249673  185  0  2   17   77   140 0.235   4.39
 4.02 Intr - 252015 251817  199  0  1  -21   32   153 0.011  -3.20
 4.01 Init - 257501 257445   57  2  0   90   86    56 0.069   6.96
 4.00 Prom - 263426 263387   40                                -5.85

 5.07 PlyA - 263990 263985    6                                 1.05
 5.06 Term - 275380 274687  694  0  1   32   49   342 0.728  16.75
 5.05 Intr - 275762 275508  255  1  0   54   68   307 0.711  20.94
 5.04 Intr - 276190 276070  121  2  1   73   98    34 0.943   1.53
 5.03 Intr - 276911 276809  103  2  1   24   19    98 0.500  -4.67
 5.02 Intr - 277231 277065  167  2  2   59   50   167 0.778   8.76
 5.01 Init - 277623 277476  148  0  1   24   72   100 0.837   2.30

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  42750  42824   75  2  0   93   43    98 0.886   2.66
S.002 Init + 257938 258028   91  0  1   83   67    48 0.843   2.93
S.003 Term - 262673 262561  113  0  2   31   49   148 0.879   2.94

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:77596470_77879165|GENSCAN_predicted_peptide_1|660_aa
ESRRLYQFHYVNWPDHDVPSSFDSILDMISLMRKYQEHEDVPICIHCSAGCGRTGAICAI
DYTWNLLKAGEQYELVHRAIAQLFEKQLQLYEIHGAQKIADGVNEINTENMVSSIEPEKQ
DSPPPKPPRTRSCLVEGDAKEEILQPPEPHPVPPILTPSPPSAFPTVTTVWQDNDRYHPK
PVLHMVSSEQHSADLNRNYSKSTELPGKNESTIEQIDKKLERNLSFEIKKVPLQEGPKSF
DGNTLLNRGHAIKIKSASPCIADKISKPQELSSDLNVGDTSQNSCVDCSVTQSNKVSVTP
PEESQNSDTPPRPDRLPLDEKGHVTWSFHGPENAIPIPDLSEGNSSDINYQTRKTVSLTP
SPTTQVETPDLVDHDNTSPLFRTPLSFTNPLHSDDSDSDERNSDGAVTQNKTNISTASAT
VSAATSTESISTRKVLPMSIARHNIAGTTHSGAEKEISEIQGVNNNKISFFSDTPVRSEW
SELQSQERSEQKKSESLGVLLACKEAQAKPWTDERPRGEIPNKSQPTSHGRKSYSSALMQ
NEKKEKGRKEGRKEGTKEGDREREEGERGRKEGRKETERGKKEREEGRRERKEGEKKEGE
RKKEGKKEKERKEKEKEKERKKKEEERRNKHAEILAKTAVDETAKGEPTIMVGVKLDELK
>gi568815591f:77596470_77879165|GENSCAN_predicted_CDS_1|1983_bp
gaatctcgtaggctgtatcagtttcattatgtgaactggccagaccatgatgttccttca
tcatttgattctattctggacatgataagcttaatgaggaaatatcaagaacatgaagat
gttcctatttgtattcattgcagtgcaggctgtggaagaacaggtgccatttgtgccata
gattatacgtggaatttactaaaagctggggagcaatatgaacttgttcatagagctatt
gcccaactgtttgaaaaacagctacaactatatgaaattcatggagctcagaaaattgct
gatggagtgaatgaaattaacactgaaaacatggtcagctccatagagcctgaaaaacaa
gattctcctcctccaaaaccaccaaggacccgcagttgccttgttgaaggggatgctaaa
gaagaaatactgcagccaccggaacctcatccagtgccacccatcttgacaccttctccc
ccttcagcttttccaacagtcactactgtgtggcaggacaatgatagataccatccaaag
ccagtgttgcatatggtttcatcagaacaacattcagcagacctcaacagaaactatagt
aaatcaacagaacttccagggaaaaatgaatcaacaattgaacagatagataaaaaattg
gaacgaaatttaagttttgagattaagaaggtccctctccaagagggaccaaaaagtttt
gatgggaacacacttttgaataggggacatgcaattaaaattaaatctgcttcaccttgt
atagctgataaaatctctaagccacaggaattaagttcagatctaaatgtcggtgatact
tcccagaattcttgtgtggactgcagtgtaacacaatcaaacaaagtttcagttactcca
ccagaagaatcccagaattcagacacacctccaaggccagaccgcttgcctcttgatgag
aaaggacatgtaacgtggtcatttcatggacctgaaaatgccatacccatacctgattta
tctgaaggcaattcctcagatatcaactatcaaactaggaaaactgtgagtttaacacca
agtcctacaacacaagttgaaacacctgatcttgtggatcatgataacacttcaccactc
ttcagaacacccctcagttttactaatccacttcactctgatgactcagactcagatgaa
agaaactctgatggtgctgtgacccagaataaaactaatatttcaacagcaagtgccaca
gtttctgctgccactagtactgaaagcatttctactaggaaagtattgccaatgtccatt
gctagacataatatagcaggaacaacacattcaggtgctgaaaaagaaatttcagaaatt
caaggtgttaataacaataagatttcatttttctcagatacacctgtaagatcggaatgg
agtgaacttcaaagtcaggaacgatctgaacaaaaaaagtctgaatctcttggggtcctg
ctggcatgtaaagaagctcaggctaagccatggacggatgagagaccacgtggggagatt
cccaacaagtcccaacccacatcacatggaagaaagtcatacagcagcgccctcatgcag
aatgaaaagaaagaaaaaggaaggaaggaaggaaggaaggaaggaacgaaggaaggagac
agagagagggaagaaggagagagaggaaggaaggagggaaggaaggagacagagagagga
aagaaggagagagaggaaggaaggagagagaggaaagaaggagagaagaaggagggagag
agaaagaaagaaggaaagaaagaaaaagaaagaaaagaaaaagagaaagaaaaagaaaga
aagaaaaaagaggaagaaagaagaaacaaacatgcggaaatactggcaaagacagctgta
gatgaaacagccaagggagagccaacaattatggttggtgttaagttggatgaattgaaa
taa
>gi568815591f:77596470_77879165|GENSCAN_predicted_peptide_2|955_aa
MAEPPSPVHCVAAAAPTATVSEKEPFGKLQLSSRDPPGSLSAKKVRTEEKKAPRRVNGEG
GSGGNSRQLQPPAAPSPQSYGSPASWSFAPLSAAPSPSSSRSSFSFSAGTAVPSSASASL
SQPVPRKLLVPPTLLHAQPHHLLLPAAAAAASANAKSRRPKEKREKERRRHGLGGAREAG
GASREENGEVKPLPRANDKTRSFDDFSPDQAAAECLSIRKRRILKGSCQTGYLRKALFSD
TVTLGIKASTYGFWGGMIQSIVKGLAETFNFWIHIWELNDLQYISALFACSEEMPDYHLE
KQLADKIKDKIKERDKEKEREKKKHKVMNEIKKENGEVKILLKSGKEKPKTNIEDLQIKK
VKKKKKKKHKENEKRKRPKMYSKSIQTICSGLLTDVEDQAAKGILNDNIKDYVGKNLDTK
NYDSKIPENSEFPFVSLKEPRVQNNLKRLDTLEFKQLIHIEHQPNGGASVIHAYSNELSH
LSPMEMERFAEEFVGLVFSENENSAAFYVMGIVHGAATYLPDFLDYFSFNFPNSPVKMEI
LGKKDIETTTMSNFHAQVKRTYSHGTYRAGPMRQISLVGAVDEEVGDYFPEFLDMLEESP
FLKCTLPWGTLSSLKLQSRKDSDDGPIMWVRPGEQMIPVADMPKSPFKRKRTTNEIKNLQ
YLPRTSEPREMLFEDRTRAHADHIGQGFERQTTAAVGVLKAVHCGEWPDQPRITKDVICF
HAEDFLEVVQRMQLDLHEPPLSQCVQWVDDAKLNQLRREGIRYARIQLYDNDIYFIPRNV
VHQFKTVSAVCSLAWHIRLKLYHSEEDTSQNTATHETGTSSDSTSSVLGPHTDNMICAVS
KASLDSVFSDKLHSKYELQQIKHEPIASVRIKEEPVNVNIPEKTTALNNMDGKNVKAKLD
HVQFAEFKIDMDSKFENSNKDLKEELCPGNLSLVDTRQHSSAHSNQDKKDDDILC
>gi568815591f:77596470_77879165|GENSCAN_predicted_CDS_2|2868_bp
atggcggaaccgccgagccccgtgcactgtgtcgctgccgcggcccccaccgccaccgtc
tcggagaaagaaccgtttggcaagctgcaactctcctcccgggaccctccgggttctctg
tccgccaagaaggtccggactgaggagaagaaggcaccgcggagagtgaacggagaaggg
ggcagcggcgggaacagcaggcagctgcagccgccggcagcaccttcgcctcagagctat
ggcagccccgcgtcttggagctttgcccctctgtctgctgctccctccccgtcctcttct
cggagcagtttctctttctccgctggcacggccgttccctcctcagcctccgcttccttg
tctcagccggtgccgcgcaaactgctggtccctcctacgctgctgcacgctcagcctcac
catctcctcctgcccgccgccgccgccgctgcctcggctaacgccaagtcgcgcagacct
aaggagaagcgggagaaggagaggaggaggcacggtctcggtggggcccgagaggccggc
ggggcctcccgggaggagaacggggaggtgaagccgctgccccgagcaaatgataaaacc
aggagctttgacgatttttcgccagatcaagcggctgcagaatgtttgagcattagaaaa
aggcgaattcttaagggttcttgtcagacaggttacctccgaaaagccttattttcagat
acagtcacattgggcattaaggcttcaacatatggattttggggaggcatgattcagtcc
atagtgaagggcttggcagagaccttcaacttctggatacatatttgggaactgaatgac
ttgcagtatatttcagccctttttgcctgctcagaagagatgcccgattatcatttagaa
aaacaattggctgataaaatcaaagacaaaattaaagagagagacaaagaaaaagaaaga
gaaaaaaagaaacataaagtaatgaatgagatcaagaaagagaatggagaagtaaagatt
ttgctgaaaagtgggaaggagaaaccaaaaacaaatatagaagacttacaaattaaaaag
gtaaagaagaaaaagaaaaagaaacacaaagagaatgaaaaacggaagcgtccgaaaatg
tatagcaaatctattcagaccatctgctcaggattgctaactgatgttgaagatcaagca
gccaaaggcatcctaaatgataacataaaagattacgttgggaagaatttggataccaag
aactatgattccaaaattccagagaacagtgagtttccatttgtctcattaaaggagcca
cgagttcagaataacctcaaaaggttggacactttggaatttaaacaactcattcatata
gagcaccagcctaatggaggtgcatcggttatccatgcctacagtaacgaactctcccac
ctgtctcctatggagatggagaggtttgcagaagagtttgtgggtctagtgttcagtgaa
aatgaaaactctgcagctttctacgtgatgggtattgttcatggggcagctacttattta
cctgactttttagactatttttcatttaattttcccaattcaccagtgaaaatggagata
ttgggaaagaaagatatagagacaacgactatgtccaattttcatgctcaggtaaaaaga
acgtattctcatggtacttacagagctggcccaatgagacaaataagcttggtgggagca
gttgatgaagaagtaggagattatttccctgagttccttgacatgttggaagagtcacca
tttttaaaatgtacactgccatgggggacgctatctagtctaaaattacagagtcgaaaa
gatagtgatgatggtcccatcatgtgggttcgtccaggagaacaaatgatccctgtggct
gatatgccaaagtcacctttcaaaaggaaaagaactaccaatgaaataaaaaatcttcag
tacctacctcgaacaagtgagccccgtgagatgctctttgaagacaggacaagagctcat
gcagatcatataggacaaggttttgaacgacagactacagctgctgttggagtgctgaag
gctgtgcactgtggagagtggcctgatcaaccccgtataaccaaagatgtaatttgtttt
catgctgaagatttcttagaagtagttcaacgaatgcagttagatttacatgaacctcca
ctgtcccagtgtgtccaatgggttgatgatgcaaaactgaatcaactgaggagggaaggc
attcgctatgccaggattcagctatatgataatgacatttattttattccaaggaatgtt
gttcatcagttcaagacagtttcagctgtatgcagtttagcatggcatattcggctcaaa
ttatatcactcagaggaggacacttctcagaatacagctactcatgaaacaggcacatca
tcagattccacatcatctgttcttggacctcacactgacaacatgatttgtgctgtaagc
aaagcctccttggattctgttttttcagataaacttcattctaaatatgaattacagcag
attaaacatgaacctattgcatctgtaagaatcaaggaagaacctgtgaatgttaatatt
cctgaaaagactacagcactgaataatatggatggcaagaatgttaaagcaaaattggat
catgttcaatttgcagaatttaagattgacatggattctaaatttgaaaatagcaacaaa
gatttaaaggaagaattgtgccctggaaatctaagtctagttgatacaaggcaacacagt
tcagcacattcaaatcaagataaaaaagacgatgacattttgtgctaa
>gi568815591f:77596470_77879165|GENSCAN_predicted_peptide_3|174_aa
MRNQFQGAFSSSFNTNMIKKSSVKSQPPPSGAALAPTVRAASPPRSGGQCGGRRVLTEFR
GWIGTREESGRRDGGGGSGANASPSVAGDFPGSLPYLRAGVSRDPFSSVGRWRRLSRLPP
ELARSSSLDGDGPKPSAPPAPRAPLPGPCPLLPSLGAVLRGRRVLEGRSGVSGG
>gi568815591f:77596470_77879165|GENSCAN_predicted_CDS_3|525_bp
atgaggaaccagttccaaggtgctttctcatccagtttcaacaccaacatgatcaagaag
agtagtgtgaaaagccagccgcctcctagcggcgcggcgctcgctcctacggtaagagcg
gcatcacctccgcgctcgggtgggcagtgtggaggtcgtcgagttctgacagaatttcga
ggatggatcgggacgagggaggagagcggtcgtcgggatggaggtgggggctccggggcc
aacgcgtcgccctccgtcgccggcgactttcccggcagtctcccgtacctgagggccggg
gtcagccgcgaccccttctcatcagtcggccggtggcggaggctcagccgccttcctccc
gagctcgcgcggtcctcaagtcttgatggtgacgggccgaagcccagcgcgccacccgcc
cctcgggccccgcttccaggcccctgtcctttgcttccttcccttggcgctgtgctgagg
ggtcgtcgggtcctggaagggcgttcaggggtctcgggtggctag
>gi568815591f:77596470_77879165|GENSCAN_predicted_peptide_4|182_aa
MTSPNELNKAPVTNSRVTKVYEKVLHIMDHQRNANQNYSEISSYLSQNGFSPEDRQQQMP
ARMWRNGNPCTSLTGMQISTATMENTIIGIHAPFYASHWQGKQNAMIGLDLSGFTPSETS
DTHSLAVRPVERNEFLNKIEALPPGKKHGQSRYSWLIHKQLVDPLLPSSGVFCGWEVMLK
LF
>gi568815591f:77596470_77879165|GENSCAN_predicted_CDS_4|549_bp
atgacctcaccaaatgaactaaataaggcaccagtgaccaattcccgagtaacaaaggta
tatgaaaaggtgcttcacatcatggatcatcagagaaatgcaaatcaaaactacagtgag
atatcatcttacctcagtcaaaatggcttttctccagaagacaggcaacaacaaatgccg
gcaaggatgtggagaaatgggaatccctgtacatcgttgacaggaatgcaaattagtaca
gccactatggagaacaccataattggaatacatgccccattctatgccagtcactggcaa
gggaaacagaatgccatgattgggttagacttatcaggattcaccccctctgagacttcg
gatacacattctttggctgtaagacctgtggagaggaatgaattccttaacaaaatagag
gctcttccaccaggaaaaaagcacggacaaagtcgatacagctggcttattcacaaacag
cttgtggatccattacttcccagttcgggggtcttctgtggttgggaagtaatgctcaaa
ctcttttga
>gi568815591f:77596470_77879165|GENSCAN_predicted_peptide_5|495_aa
MWRTKEHNEVGWLLLSSVDKVMKENYELRDSISWLQKQILSLKSAKIALEGTASSSPVVA
TSPPRPMLPSAFPPLSEEINLVLPEATVMASPEAAVRQDNTDSPQKSAEQAWEWMLRVWD
NGGRNTELDQAEFMDLGPLSMSGLTVGSAVTQLQNLNYNGNNWILRWQGPSGSNQPSKAS
HPIIVQWAHEQSGHGGRDGGYAWAQQHGLPLTKADLAAAIAECPICQQQRPTLSPPYGTI
PRGDQPVTWWQIDYIEPFPSRKGQSIAFDQGTHFTATEVWQWAHAYGIHWSYHAPHHSEA
AGLTQQWNTLSKSQLQRQLGDNTLQGWGKVYALNQHPIYGTVSPIARIHRFRNQGVELEV
APLTITPSDTLVKFLLPVPETLRSPGLEVLVPEGGTLPRGDTTPLNWKLRLPPGHFGLPL
PLSQQAKKGVTVIGLDYQDEISLLLHNGGKEEYSWNTQDPLRRLLISPCRVIKVNGKLQQ
PNSGRTTNDPETIQE
>gi568815591f:77596470_77879165|GENSCAN_predicted_CDS_5|1488_bp
atgtggagaaccaaggaacataatgaagttggttggttgctcctaagttcagtggacaaa
gtgatgaaagaaaattatgaactcagggattctatctcctggcttcagaagcagatactg
agcctcaaatctgctaagattgccctggaaggaacagcttcctcatccccagtagtggca
acatcccctccccgacccatgctgccatcagcctttccacctttgtctgaagagataaac
cttgtgctgcctgaggcaacagtgatggcctcccctgaggcagctgtgaggcaagataac
actgattctcctcagaaatctgcagaacaggcatgggaatggatgttaagggtgtgggat
aacggcggaaggaacacagagttggatcaagctgaatttatggatttgggcccactaagt
atgtcaggtctaacagtgggaagtgcagtcactcaactacaaaatttaaattacaatggg
aataactggatcctgaggtggcaggggccaagtggcagcaatcaaccatcaaaggcaagc
caccccatcatcgtccaatgggcccatgaacaaagtggccatggtggcagggatggaggt
tacgcatgggctcagcaacatggacttccactcaccaaggctgatctggctgcagccatt
gctgagtgcccaatttgccagcagcagagaccaacactgagccctccatatggcaccatt
cctcggggtgatcagccagttacctggtggcagattgattatattgaaccttttccatca
cggaaagggcagagcattgcctttgaccaaggcactcactttacagctacagaagtgtgg
cagtgggctcatgcttatggaattcactggtcttaccatgctccccatcattctgaagca
gctggattgacacaacagtggaataccctttcgaagtcacaattacaacggcaactaggt
gacaatactttgcagggctggggcaaagtgtatgctctgaatcagcatccaatatatggt
actgtttctccaatagccaggattcacaggttcaggaatcaaggggtagaattggaagta
gcaccacttaccatcacccctagtgatacactagtaaaatttttgcttcctgttcctgag
acattacgttctcctggcctggaggtcttagttccagagggaggaacactgccaagagga
gacacaactccattaaactggaagttaagattgccacctggacactttgggctcccccta
cctttaagtcaacaggctaagaagggagttacagtgattggcctggactatcaagatgaa
atcagtctactactccacaacggaggtaaggaagagtattcatggaatacacaagatcca
ttaaggcgtctcttaatatcaccatgccgtgtgattaaggtcaatgggaaactacaacag
cccaattcaggcaggactacaaatgaccccgagaccattcaggaatga