GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:42:17

Sequence gi568815597f:214983386_215335142 : 351757 bp : 36.75% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1329   1494  166  2  1   66   48    94 0.187   3.06
 1.02 Intr +   9306   9424  119  1  2   96   66    41 0.020   1.96
 1.03 Term +  22372  22521  150  0  0   53   45   163 0.351   5.43
 1.04 PlyA +  22758  22763    6                               1.05

 2.03 PlyA -  22820  22815    6                               1.05
 2.02 Term -  46990  46662  329  2  2   15   42   218 0.491   3.79
 2.01 Init -  54872  54809   64  2  1   49  105    45 0.496   3.66
 2.00 Prom -  58500  58461   40                              -6.05

 3.00 Prom +  59576  59615   40                              -2.75
 3.01 Init +  74260  74274   15  0  0   89  116     7 0.352   4.01
 3.02 Intr +  89650  89772  123  0  0   24  107    82 0.297   3.56
 3.03 Term +  90523  90678  156  0  0   51   42   102 0.376  -1.15
 3.04 PlyA +  91181  91186    6                               1.05

 4.04 PlyA -  91453  91448    6                               1.05
 4.03 Term -  92201  92095  107  0  2   17   41   131 0.665  -1.11
 4.02 Intr -  93217  93100  118  1  1   93  103    69 0.911   8.02
 4.01 Init -  93655  93581   75  1  0   43   59    85 0.912   2.24
 4.00 Prom -  95014  94975   40                              -5.75

 5.00 Prom +  95139  95178   40                              -5.45
 5.01 Init +  95260  95278   19  0  1   71  100    -2 0.465  -0.47
 5.02 Intr +  98737  98874  138  2  0   87   44   202 0.848  15.21
 5.03 Intr +  98960  99115  156  0  0   66   42   178 0.447  10.06
 5.04 Intr + 102983 103293  311  0  2   85   95   249 0.423  20.41
 5.05 Intr + 133149 133266  118  2  1   71  121    23 0.080   3.02
 5.06 Intr + 142092 142206  115  1  1   66   23   107 0.043   0.59
 5.07 Intr + 160940 161020   81  2  0   60   94    73 0.013   2.93
 5.08 Term + 163793 163844   52  2  1  110   41    51 0.206  -1.48
 5.09 PlyA + 165061 165066    6                               1.05

 6.03 PlyA - 167355 167350    6                               1.05
 6.02 Term - 176091 175934  158  1  2   72   54    92 0.722   1.31
 6.01 Init - 177598 177559   40  1  1   67  116    18 0.542   2.90
 6.00 Prom - 179404 179365   40                              -3.85

 7.02 PlyA - 179504 179499    6                               1.05
 7.01 Sngl - 180330 179824  507  0  0   51   48   299 0.727  17.99
 7.00 Prom - 181741 181702   40                              -7.65

 8.00 Prom + 184993 185032   40                              -4.75
 8.01 Init + 185173 185266   94  0  1   83   42    30 0.464  -1.51
 8.02 Intr + 185814 185974  161  1  2   26   68   113 0.624   1.89
 8.03 Intr + 188612 188798  187  1  1   81  115   122 0.839  12.54
 8.04 Term + 199056 199219  164  1  2   39   49   123 0.168   0.62
 8.05 PlyA + 201094 201099    6                               1.05

 9.00 Prom + 201252 201291   40                              -4.25
 9.01 Init + 206835 206868   34  2  1   68  116    29 0.057   3.79
 9.02 Intr + 211568 211707  140  0  2   95   78   118 0.051  10.76
 9.03 Term + 251443 251760  318  0  0   96   41   309 0.628  20.90
 9.04 PlyA + 252836 252841    6                               1.05

10.00 Prom + 261227 261266   40                              -6.75
10.01 Init + 267713 267790   78  1  0   32  110    22 0.057  -0.09
10.02 Intr + 270047 270207  161  1  2   87   59   124 0.071   7.26
10.03 Intr + 281144 281190   47  2  2   81   83    35 0.134  -0.57
10.04 Term + 286389 286681  293  1  2   45   45   383 0.898  24.22
10.05 PlyA + 287012 287017    6                               1.05

11.00 Prom + 288987 289026   40                              -6.55
11.01 Init + 289043 289121   79  1  1   84  115    16 0.537   5.17
11.02 Intr + 307010 307074   65  0  2  118  102    78 0.905   9.72
11.03 Term + 311277 311351   75  2  0   26   49   116 0.416  -1.64
11.04 PlyA + 313507 313512    6                               1.05

12.03 PlyA - 313802 313797    6                               1.05
12.02 Term - 315801 315712   90  0  0   96   40    95 0.181   2.24
12.01 Init - 326237 326115  123  2  0   54   45   102 0.340   2.72
12.00 Prom - 336412 336373   40                              -4.95

13.02 PlyA - 337021 337016    6                               1.05
13.01 Sngl - 341248 340859  390  1  0   87   53   209 0.735  13.27
13.00 Prom - 342960 342921   40                              -3.85

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_1|144_aa
MVVWVGPRCVQPSNLVPYIPDTPAMTKRGQGTAQTVASEGGSPNPWQLPHGVESEEYKLE
QGGNRATFAVSQHLQQCLAHARDFMRIYWKTKMKNETDLQGNIHTQKGNSMGRWLVSTSD
SVLEITDKKPWRKNGIKDQKHDDS

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_1|435_bp
atggttgtgtgggttgggcccaggtgtgtgcagcctagcaacttggtgccctacatcccg
gacactccagccatgactaaaaggggccaaggtacagctcagactgttgcttcagagggt
gggagccccaatccttggcagcttccacatggtgttgagtctgaggaatataaacttgag
caaggtggaaatcgtgccacttttgctgtatcccagcatctacagcagtgcctggcacat
gctagggacttcatgcgtatttattggaaaacaaaaatgaaaaacgaaactgatctacag
ggtaacattcacacacaaaaaggaaacagtatgggacgatggcttgttagtacaagtgac
tctgttttggaaattacggacaagaaaccttggaggaagaacggcattaaagatcaaaag
catgatgactcctga

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_2|130_aa
MNEDYHREGLLAEILKALEGTETIKKRERQATSWEKIFAMEIPDKGLYSKIKELLKLSNE
KTNNPIKKRVKDRNRHLIREDTQMTNKRMKRCFTLHVIRKMQIKITIRYYLLEWPKSRTL
RAPNAGKDVE

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_2|393_bp
atgaatgaggactaccatagggaaggcttacttgctgagattcttaaagccctggagggg
actgagactatcaagaaaagggaaagacaagccacaagctgggagaaaatatttgcaatg
gaaatacctgataaaggactgtattccaaaataaaagaactcttaaaactcagcaatgag
aaaacaaacaacccaattaaaaaacgggtcaaagaccgtaacagacacctcatcagagaa
gatacacaaatgacaaataagcgtatgaaaagatgcttcacattacatgtcatcaggaaa
atgcaaattaagataacaataagatactacctcttagaatggcccaaatccagaacactg
agagcaccaaatgctggcaaggatgtggaataa

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_3|97_aa
MEETKGEIWTQRQIHTKGKGCEDTQREGGHETGALRLQAKEHQGLLGCSLYPEGNGKLVR
DFNEKFNTSSQILFVKVPHDSVWKMDWRSVEWRPEER

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_3|294_bp
atggaagaaacaaagggggaaatttggacacagagacagatacacacaaagggaaaagga
tgtgaagacacacagagagaaggtggccatgagactggagcgctgcgtctacaagccaag
gaacaccaaggattgctgggatgtagtctctatcctgaaggcaatgggaagctagtgcgg
gattttaatgagaaattcaatacttcaagccagattttgtttgtaaaagttccacatgac
tcagtgtggaaaatggactggagatcggtggagtggaggcctgaggagaggtga

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_4|99_aa
MLRGEEKLPESEAIQSKQNHDDQVQLSLDSLSVVEASFTIINQPSREVHVSKPAGGAPEA
FQQPLNQYLSSTYYVAERDFPVFGDIVVKKMQQILVIME

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_4|300_bp
atgctaagaggagaggagaagctgcctgagagtgaagcaatacagagcaaacagaaccat
gatgaccaagtccagctctctcttgattctctttctgtggtagaagccagtttcaccatc
ataaatcaaccctccagagaggttcatgtgagtaagcctgcaggtggagctcctgaggcc
ttccaacagccattaaaccagtacttatcaagcacctactatgtagcagaaagagatttt
ccagtatttggggatatagtggtcaaaaagatgcaacaaatcctcgtcatcatggagtaa

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_5|329_aa
MEITSQDVFALAPGAFPILFGPGRSADYSAQLMSSTAGGEVGRAGDSLTLPEGLGGHEGE
EDALVQGSKDSRPRAWSHTRSHTPNAGREGDRMQRSVMWMPDSEMAAPDLLDPKSAAQNS
KPRLSFSTKPTVLASRVESDTTINVMKWKTVSTIFLVVVLYLIIGATVFKALEQPHEISQ
RTTIVIQKQTFISQHSCVNSTELDELIQKINLHFQKSKKKAVHFISLFKNSFDTRLHNPQ
MEGNISEVPKSYSGNLNTDPTVKAMFVNENTWTQEGEHHTPGPVVGDVAFHLNDPTSAVI
TQIQDLYSYTSFRASYSLRPINLASMTGA

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_5|990_bp
atggagattacaagtcaagatgtcttcgccctggctccaggtgcgttccccatccttttc
ggtcccggacgttctgcagattatagcgcccagctcatgtccagcaccgccggaggtgaa
gtggggcgagctggtgactccttaacccttcccgaaggtctggggggccacgagggcgag
gaggacgccctagtgcaggggtccaaggactcgcgtcccagagcttggtcccacacccgg
tcccacaccccgaacgctggtcgggaaggagataggatgcagcgcagtgtgatgtggatg
ccagacagcgagatggcggcacctgacttgctggatcctaaatctgccgctcagaactcc
aaaccgaggctctcgttttccacgaaacccacagtgcttgcttcccgggtggagagtgac
acgaccattaatgttatgaaatggaagacggtctccacgatattcctggtggttgtcctc
tatctgatcatcggagccaccgtgttcaaagcattggagcagcctcatgagatttcacag
aggaccaccattgtgatccagaagcaaacattcatatcccaacattcctgtgtcaattcg
acggagctggatgaactcattcagaaaataaatttacactttcaaaagagcaagaaaaaa
gctgtacacttcattagtctgtttaaaaatagctttgacactcgattgcataatccccaa
atggaaggaaatatatcagaagtgcccaaaagctattcaggaaacctaaacacagacccc
acagtgaaagctatgtttgtgaatgagaacacttggacacaggaaggggaacatcacaca
ccagggcctgttgtgggtgatgtagcctttcatctaaatgacccaacttcagctgtcatc
actcagattcaggatctgtatagttacacatcattcagagcctcatatagcttgcgaccc
ataaaccttgcatctatgacaggggcatga

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_6|65_aa
MTRSGGYQGRIKEVDPDRGKKELEKTQVKFLSTQCLLNGTQLQKSPQIVNNPEVPICSAQ
RTRRS

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_6|198_bp
atgaccaggtctgggggataccagggtaggatcaaggaagtagatcctgataggggaaag
aaggaattagagaagacgcaagtaaaattcttgtccacccagtgtctcctcaatggaaca
cagcttcagaaatctccacaaattgtaaacaatccagaagttcctatatgctctgcgcag
aggacaaggagaagttga

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_7|168_aa
MHHINRNNDKNHMIISTDAEKAFNKIQLPFMLKTLNKLDTDVTYLKIIRAIYDKPTANIT
LNGQKRESLPLKTGTRQGCPLSPLLFNTVLEVLARAIRQEKEIKGIQVGKEEAKLPLFAD
HVIVYLENSIVSAQNLLKLISNFSKVSGYKINVQKSQAYTNNRQRAKS

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_7|507_bp
atgcatcacataaacagaaacaatgacaaaaaccacatgattatttcaacagatgcagaa
aaggccttcaataaaattcaactccccttcatgctaaaaactctcaataaactggatacc
gatgtaacatatctcaaaataataagagctatttatgacaaacccacagccaatatcaca
ctgaatgggcaaaagcgggaatctctccctttgaaaaccggcacaagacaaggatgccct
ctctcaccactcctattcaacacagtattggaagttctggccagggcaatcaggcaagag
aaagaaataaagggtattcaagtaggaaaagaggaagccaaattgcctctgtttgcagat
cacgtgattgtatatttagaaaactccatcgtctcagcccaaaatctccttaagttgata
agcaacttcagcaaagtctcaggatataaaatcaatgtgcaaaaatcacaagcatacacc
aataatagacagagagccaaatcatga

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_8|201_aa
MDEAGSHLPQQTNTGTENQTPHVLAHKWELNRFGNISPRTEGGKIFCIIYALLGIPLFGF
LLAGVGDQLGTIFGKGIAKVEDTFIKWNVSQTKIRIISTIIFILFGCVLFVALPAIIFKH
IEGWSALDAIYFVVITLTTIGFGDYVAVERAHCTTISGEHPGHSVTTHTDWYQVTTLPLP
ASLLAQEKLQQQQHSPPYPGL

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_8|606_bp
atggatgaagctggaagccatctgcctcagcaaactaacacaggaacagaaaaccaaaca
ccacatgttctcgctcataagtgggagttgaacagatttggaaacatctcaccacgcaca
gaaggcggcaaaatattctgtatcatctatgccttactgggaattcccctctttggtttt
ctcttggctggagttggagatcagctaggcaccatatttggaaaaggaattgccaaagtg
gaagatacgtttattaagtggaatgttagtcagaccaagattcgcatcatctcaacaatc
atatttatactatttggctgtgtactctttgtggctctgcctgcgatcatattcaaacac
atagaaggctggagtgccctggacgccatttattttgtggttatcactctaacaactatt
ggatttggtgactacgttgcagtggaaagggcccactgcactacaatctcaggggagcat
cctgggcactcagtaacgacacatacagattggtaccaggtcaccactctgcccctgcct
gcaagtctccttgcccaagagaaactgcagcagcagcagcattccccaccctacccaggc
ctgtga

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_9|163_aa
MAGNAGERTEQGGSDIEYLDFYKPVVWFWILVGLAYFAAVLSMIGDWLRVISKKTKEEVG
EFRAHAAEWTANVTAEFKETRRRLSVEIYDKFQRATSIKRKLSAELAGNHNQELTPCRRT
LSVNHLTSERDVLPPLLKTESIYLNGLTPHCAGEEIAVIENIK

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_9|492_bp
atggctgggaacgctggagagaggacagagcaaggtggatccgatattgaatatctggac
ttctataagcctgtcgtgtggttctggatccttgtagggcttgcttactttgctgctgtc
ctgagcatgattggagattggctccgagtgatatctaaaaagacaaaagaagaggtggga
gagttcagagcacacgctgctgagtggacagccaacgtcacagccgaattcaaagaaacc
aggaggcgactgagtgtggagatttatgacaagttccagcgggccacctccatcaagcgg
aagctctcggcagaactggctggaaaccacaatcaggagctgactccttgtaggaggacc
ctgtcagtgaaccacctgaccagcgagagggatgtcttgcctcccttactgaagactgag
agtatctatctgaatggtttgacgccacactgtgctggtgaagagattgctgtgattgag
aacatcaaatag

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_10|192_aa
MTKSIFVTKELTWYGLREKVSDKNVRSVIKDLIQKLKETSKIALHFVLRLNSKFSEFLNA
LILLLKTCVLITSVQLVEIRAPEPSLPRSLSKPFGEIKPFDEKVLVQLVLVHWSLEKRPC
RLETFVYGKGVSRQLVLVFSGREHNGADLQVREKLRTGFTCCVETNKQTGVLWSQNNNNN
NNNNNNNNNNKL

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_10|579_bp
atgactaaaagcatatttgtgacaaaggaattaacttggtatggcttgagggagaaagtg
agtgataaaaatgtaaggagtgtcataaaggaccttattcagaagctaaaggaaacttct
aaaattgcactgcattttgtgcttcgtctgaactctaaattctcagagtttctcaatgct
ttgatccttcttttaaaaacctgtgtgctcattacatctgtacagcttgtggaaattagg
gcaccagagccttctcttcccaggtccctctcaaagccctttggagaaattaagccattt
gatgagaaggtgcttgttcagctagtgctggtgcattggagcttggagaaaaggccctgt
agacttgagacctttgtctacggaaaaggggtcagcagacagttggtgctggtgttttca
ggcagagaacataatggagctgatctgcaagtgcgggaaaaactacgaacgggtttcacc
tgctgcgttgaaacaaacaagcaaactggtgtactatggagtcaaaacaacaacaacaac
aacaacaacaacaacaacaacaacaacaacaaactctga

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_11|72_aa
MEYYSAIKKKKNPVIDSNMNGIGRHYGLSGGCPYLTPEVDPGWYKPVKLLSSGTPPGSHG
KELGKIPVFLAE

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_11|219_bp
atggaatactattcagccataaaaaagaaaaaaaatcctgtcattgacagtaacatgaat
ggaattggaagacattatggactcagtggaggatgcccctacctaactccagaggtagat
cctggttggtataaaccagtcaagcttttatcttcaggaactccaccaggttctcatggt
aaagaactgggaaaaattcctgtgtttctggcagaataa

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_12|70_aa
MVKKKSKVQKREEFLCMLEMKKRQLPGNSHTQMLIPAARYKVDFNRLIIPSFDEDVEQLE
LSNNADGNVK

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_12|213_bp
atggtgaaaaagaaaagcaaggttcaaaaacgagaagagtttctgtgtatgttagaaatg
aaaaagcgtcagttacctgggaacagccatacccagatgctcatacctgcagcaagatat
aaggtagattttaatagattgataattccgagttttgatgaggatgtggagcaactggaa
ctttcaaataatgctgatgggaatgtaaaatga

>gi568815597f:214983386_215335142|GENSCAN_predicted_peptide_13|129_aa
MPTVIWTIKSRLKCSQFGDEEFVGNWSKSHSCYALAKGLVAFGPCPRDLWNFELERHDLG
YLVEEISKQQRIQEEAEHKSLENLQANDAIENKNPFSGKKFKPAAAICISNKDPNVNPQD
NGEMSPGHV

>gi568815597f:214983386_215335142|GENSCAN_predicted_CDS_13|390_bp
atgccgacagtgatatggacaattaagtccaggctgaagtgttctcagtttggagatgag
gaatttgttgggaactggagtaaaagtcactcttgttatgctttagcaaagggactggtg
gcatttggcccctgccctagagatctatggaactttgaacttgagagacatgatttaggg
tatctggtagaagaaatttctaagcagcaaagaattcaagaggaagcagagcataaaagt
ttggaaaatttgcaggctaatgatgcaatagaaaataaaaacccattttctgggaagaaa
ttcaagcctgctgcagctatttgcataagtaacaaggacccaaatgttaatccccaagac
aatggggaaatgtctccaggacatgtctga