GENSCAN 1.0 Date run: 3-Jul-118 Time: 15:00:08

Sequence gi568815597f:214986368_215335142 : 348775 bp : 36.68% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  13143  13271  129  2  0   92   44    70 0.250   3.20
 1.02 Term +  19390  19539  150  0  0   53   45   163 0.541   5.43
 1.03 PlyA +  19776  19781    6                               1.05

 2.03 PlyA -  19838  19833    6                               1.05
 2.02 Term -  44008  43680  329  2  2   15   42   218 0.491   3.79
 2.01 Init -  51890  51827   64  2  1   49  105    45 0.496   3.66
 2.00 Prom -  55518  55479   40                              -6.05

 3.00 Prom +  56594  56633   40                              -2.75
 3.01 Init +  71278  71292   15  0  0   89  116     7 0.352   4.01
 3.02 Intr +  86668  86790  123  0  0   24  107    82 0.297   3.56
 3.03 Term +  87541  87696  156  0  0   51   42   102 0.376  -1.15
 3.04 PlyA +  88199  88204    6                               1.05

 4.04 PlyA -  88471  88466    6                               1.05
 4.03 Term -  89219  89113  107  0  2   17   41   131 0.665  -1.11
 4.02 Intr -  90235  90118  118  1  1   93  103    69 0.911   8.02
 4.01 Init -  90673  90599   75  1  0   43   59    85 0.912   2.24
 4.00 Prom -  92032  91993   40                              -5.75

 5.00 Prom +  92157  92196   40                              -5.45
 5.01 Init +  92278  92296   19  0  1   71  100    -2 0.465  -0.47
 5.02 Intr +  95755  95892  138  2  0   87   44   202 0.848  15.21
 5.03 Intr +  95978  96133  156  0  0   66   42   178 0.447  10.06
 5.04 Intr + 100001 100311  311  0  2   85   95   249 0.423  20.41
 5.05 Intr + 130167 130284  118  2  1   71  121    23 0.080   3.02
 5.06 Intr + 139110 139224  115  1  1   66   23   107 0.043   0.59
 5.07 Intr + 157958 158038   81  2  0   60   94    73 0.013   2.93
 5.08 Term + 160811 160862   52  2  1  110   41    51 0.206  -1.48
 5.09 PlyA + 162079 162084    6                               1.05

 6.03 PlyA - 164373 164368    6                               1.05
 6.02 Term - 173109 172952  158  1  2   72   54    92 0.722   1.31
 6.01 Init - 174616 174577   40  1  1   67  116    18 0.542   2.90
 6.00 Prom - 176422 176383   40                              -3.85

 7.02 PlyA - 176522 176517    6                               1.05
 7.01 Sngl - 177348 176842  507  0  0   51   48   299 0.727  17.99
 7.00 Prom - 178759 178720   40                              -7.65

 8.00 Prom + 182011 182050   40                              -4.75
 8.01 Init + 182191 182284   94  0  1   83   42    30 0.464  -1.51
 8.02 Intr + 182832 182992  161  1  2   26   68   113 0.624   1.89
 8.03 Intr + 185630 185816  187  1  1   81  115   122 0.839  12.54
 8.04 Term + 196074 196237  164  1  2   39   49   123 0.168   0.62
 8.05 PlyA + 198112 198117    6                               1.05

 9.00 Prom + 198270 198309   40                              -4.25
 9.01 Init + 203853 203886   34  2  1   68  116    29 0.057   3.79
 9.02 Intr + 208586 208725  140  0  2   95   78   118 0.051  10.76
 9.03 Term + 248461 248778  318  0  0   96   41   309 0.627  20.90
 9.04 PlyA + 249854 249859    6                               1.05

10.00 Prom + 258245 258284   40                              -6.75
10.01 Init + 264731 264808   78  1  0   32  110    22 0.057  -0.09
10.02 Intr + 267065 267225  161  1  2   87   59   124 0.071   7.26
10.03 Intr + 278162 278208   47  2  2   81   83    35 0.134  -0.57
10.04 Term + 283407 283699  293  1  2   45   45   383 0.898  24.22
10.05 PlyA + 284030 284035    6                               1.05

11.00 Prom + 286005 286044   40                              -6.55
11.01 Init + 286061 286139   79  1  1   84  115    16 0.537   5.17
11.02 Intr + 304028 304092   65  0  2  118  102    78 0.905   9.72
11.03 Term + 308295 308369   75  2  0   26   49   116 0.416  -1.64
11.04 PlyA + 310525 310530    6                               1.05

12.03 PlyA - 310820 310815    6                               1.05
12.02 Term - 312819 312730   90  0  0   96   40    95 0.181   2.24
12.01 Init - 323255 323133  123  2  0   54   45   102 0.340   2.72
12.00 Prom - 333430 333391   40                              -4.95

13.02 PlyA - 334039 334034    6                               1.05
13.01 Sngl - 338266 337877  390  1  0   87   53   209 0.735  13.27
13.00 Prom - 339978 339939   40                              -3.85

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_1|92_aa
MASQRSTKRHSPAIVSCIPAWPVWHCLYKWCGKTASKQPSGEEETDLQGNIHTQKGNSMG
RWLVSTSDSVLEITDKKPWRKNGIKDQKHDDS

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_1|279_bp
atggcttcacaaagatccacaaagagacacagtcctgccatcgtcagctgcatccctgct
tggcctgtgtggcattgtttgtacaaatggtgtggcaaaactgcatctaaacagccatct
ggggaagaagaaactgatctacagggtaacattcacacacaaaaaggaaacagtatggga
cgatggcttgttagtacaagtgactctgttttggaaattacggacaagaaaccttggagg
aagaacggcattaaagatcaaaagcatgatgactcctga

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_2|130_aa
MNEDYHREGLLAEILKALEGTETIKKRERQATSWEKIFAMEIPDKGLYSKIKELLKLSNE
KTNNPIKKRVKDRNRHLIREDTQMTNKRMKRCFTLHVIRKMQIKITIRYYLLEWPKSRTL
RAPNAGKDVE

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_2|393_bp
atgaatgaggactaccatagggaaggcttacttgctgagattcttaaagccctggagggg
actgagactatcaagaaaagggaaagacaagccacaagctgggagaaaatatttgcaatg
gaaatacctgataaaggactgtattccaaaataaaagaactcttaaaactcagcaatgag
aaaacaaacaacccaattaaaaaacgggtcaaagaccgtaacagacacctcatcagagaa
gatacacaaatgacaaataagcgtatgaaaagatgcttcacattacatgtcatcaggaaa
atgcaaattaagataacaataagatactacctcttagaatggcccaaatccagaacactg
agagcaccaaatgctggcaaggatgtggaataa

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_3|97_aa
MEETKGEIWTQRQIHTKGKGCEDTQREGGHETGALRLQAKEHQGLLGCSLYPEGNGKLVR
DFNEKFNTSSQILFVKVPHDSVWKMDWRSVEWRPEER

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_3|294_bp
atggaagaaacaaagggggaaatttggacacagagacagatacacacaaagggaaaagga
tgtgaagacacacagagagaaggtggccatgagactggagcgctgcgtctacaagccaag
gaacaccaaggattgctgggatgtagtctctatcctgaaggcaatgggaagctagtgcgg
gattttaatgagaaattcaatacttcaagccagattttgtttgtaaaagttccacatgac
tcagtgtggaaaatggactggagatcggtggagtggaggcctgaggagaggtga

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_4|99_aa
MLRGEEKLPESEAIQSKQNHDDQVQLSLDSLSVVEASFTIINQPSREVHVSKPAGGAPEA
FQQPLNQYLSSTYYVAERDFPVFGDIVVKKMQQILVIME

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_4|300_bp
atgctaagaggagaggagaagctgcctgagagtgaagcaatacagagcaaacagaaccat
gatgaccaagtccagctctctcttgattctctttctgtggtagaagccagtttcaccatc
ataaatcaaccctccagagaggttcatgtgagtaagcctgcaggtggagctcctgaggcc
ttccaacagccattaaaccagtacttatcaagcacctactatgtagcagaaagagatttt
ccagtatttggggatatagtggtcaaaaagatgcaacaaatcctcgtcatcatggagtaa

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_5|329_aa
MEITSQDVFALAPGAFPILFGPGRSADYSAQLMSSTAGGEVGRAGDSLTLPEGLGGHEGE
EDALVQGSKDSRPRAWSHTRSHTPNAGREGDRMQRSVMWMPDSEMAAPDLLDPKSAAQNS
KPRLSFSTKPTVLASRVESDTTINVMKWKTVSTIFLVVVLYLIIGATVFKALEQPHEISQ
RTTIVIQKQTFISQHSCVNSTELDELIQKINLHFQKSKKKAVHFISLFKNSFDTRLHNPQ
MEGNISEVPKSYSGNLNTDPTVKAMFVNENTWTQEGEHHTPGPVVGDVAFHLNDPTSAVI
TQIQDLYSYTSFRASYSLRPINLASMTGA

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_5|990_bp
atggagattacaagtcaagatgtcttcgccctggctccaggtgcgttccccatccttttc
ggtcccggacgttctgcagattatagcgcccagctcatgtccagcaccgccggaggtgaa
gtggggcgagctggtgactccttaacccttcccgaaggtctggggggccacgagggcgag
gaggacgccctagtgcaggggtccaaggactcgcgtcccagagcttggtcccacacccgg
tcccacaccccgaacgctggtcgggaaggagataggatgcagcgcagtgtgatgtggatg
ccagacagcgagatggcggcacctgacttgctggatcctaaatctgccgctcagaactcc
aaaccgaggctctcgttttccacgaaacccacagtgcttgcttcccgggtggagagtgac
acgaccattaatgttatgaaatggaagacggtctccacgatattcctggtggttgtcctc
tatctgatcatcggagccaccgtgttcaaagcattggagcagcctcatgagatttcacag
aggaccaccattgtgatccagaagcaaacattcatatcccaacattcctgtgtcaattcg
acggagctggatgaactcattcagaaaataaatttacactttcaaaagagcaagaaaaaa
gctgtacacttcattagtctgtttaaaaatagctttgacactcgattgcataatccccaa
atggaaggaaatatatcagaagtgcccaaaagctattcaggaaacctaaacacagacccc
acagtgaaagctatgtttgtgaatgagaacacttggacacaggaaggggaacatcacaca
ccagggcctgttgtgggtgatgtagcctttcatctaaatgacccaacttcagctgtcatc
actcagattcaggatctgtatagttacacatcattcagagcctcatatagcttgcgaccc
ataaaccttgcatctatgacaggggcatga

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_6|65_aa
MTRSGGYQGRIKEVDPDRGKKELEKTQVKFLSTQCLLNGTQLQKSPQIVNNPEVPICSAQ
RTRRS

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_6|198_bp
atgaccaggtctgggggataccagggtaggatcaaggaagtagatcctgataggggaaag
aaggaattagagaagacgcaagtaaaattcttgtccacccagtgtctcctcaatggaaca
cagcttcagaaatctccacaaattgtaaacaatccagaagttcctatatgctctgcgcag
aggacaaggagaagttga

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_7|168_aa
MHHINRNNDKNHMIISTDAEKAFNKIQLPFMLKTLNKLDTDVTYLKIIRAIYDKPTANIT
LNGQKRESLPLKTGTRQGCPLSPLLFNTVLEVLARAIRQEKEIKGIQVGKEEAKLPLFAD
HVIVYLENSIVSAQNLLKLISNFSKVSGYKINVQKSQAYTNNRQRAKS

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_7|507_bp
atgcatcacataaacagaaacaatgacaaaaaccacatgattatttcaacagatgcagaa
aaggccttcaataaaattcaactccccttcatgctaaaaactctcaataaactggatacc
gatgtaacatatctcaaaataataagagctatttatgacaaacccacagccaatatcaca
ctgaatgggcaaaagcgggaatctctccctttgaaaaccggcacaagacaaggatgccct
ctctcaccactcctattcaacacagtattggaagttctggccagggcaatcaggcaagag
aaagaaataaagggtattcaagtaggaaaagaggaagccaaattgcctctgtttgcagat
cacgtgattgtatatttagaaaactccatcgtctcagcccaaaatctccttaagttgata
agcaacttcagcaaagtctcaggatataaaatcaatgtgcaaaaatcacaagcatacacc
aataatagacagagagccaaatcatga

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_8|201_aa
MDEAGSHLPQQTNTGTENQTPHVLAHKWELNRFGNISPRTEGGKIFCIIYALLGIPLFGF
LLAGVGDQLGTIFGKGIAKVEDTFIKWNVSQTKIRIISTIIFILFGCVLFVALPAIIFKH
IEGWSALDAIYFVVITLTTIGFGDYVAVERAHCTTISGEHPGHSVTTHTDWYQVTTLPLP
ASLLAQEKLQQQQHSPPYPGL

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_8|606_bp
atggatgaagctggaagccatctgcctcagcaaactaacacaggaacagaaaaccaaaca
ccacatgttctcgctcataagtgggagttgaacagatttggaaacatctcaccacgcaca
gaaggcggcaaaatattctgtatcatctatgccttactgggaattcccctctttggtttt
ctcttggctggagttggagatcagctaggcaccatatttggaaaaggaattgccaaagtg
gaagatacgtttattaagtggaatgttagtcagaccaagattcgcatcatctcaacaatc
atatttatactatttggctgtgtactctttgtggctctgcctgcgatcatattcaaacac
atagaaggctggagtgccctggacgccatttattttgtggttatcactctaacaactatt
ggatttggtgactacgttgcagtggaaagggcccactgcactacaatctcaggggagcat
cctgggcactcagtaacgacacatacagattggtaccaggtcaccactctgcccctgcct
gcaagtctccttgcccaagagaaactgcagcagcagcagcattccccaccctacccaggc
ctgtga

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_9|163_aa
MAGNAGERTEQGGSDIEYLDFYKPVVWFWILVGLAYFAAVLSMIGDWLRVISKKTKEEVG
EFRAHAAEWTANVTAEFKETRRRLSVEIYDKFQRATSIKRKLSAELAGNHNQELTPCRRT
LSVNHLTSERDVLPPLLKTESIYLNGLTPHCAGEEIAVIENIK

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_9|492_bp
atggctgggaacgctggagagaggacagagcaaggtggatccgatattgaatatctggac
ttctataagcctgtcgtgtggttctggatccttgtagggcttgcttactttgctgctgtc
ctgagcatgattggagattggctccgagtgatatctaaaaagacaaaagaagaggtggga
gagttcagagcacacgctgctgagtggacagccaacgtcacagccgaattcaaagaaacc
aggaggcgactgagtgtggagatttatgacaagttccagcgggccacctccatcaagcgg
aagctctcggcagaactggctggaaaccacaatcaggagctgactccttgtaggaggacc
ctgtcagtgaaccacctgaccagcgagagggatgtcttgcctcccttactgaagactgag
agtatctatctgaatggtttgacgccacactgtgctggtgaagagattgctgtgattgag
aacatcaaatag

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_10|192_aa
MTKSIFVTKELTWYGLREKVSDKNVRSVIKDLIQKLKETSKIALHFVLRLNSKFSEFLNA
LILLLKTCVLITSVQLVEIRAPEPSLPRSLSKPFGEIKPFDEKVLVQLVLVHWSLEKRPC
RLETFVYGKGVSRQLVLVFSGREHNGADLQVREKLRTGFTCCVETNKQTGVLWSQNNNNN
NNNNNNNNNNKL

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_10|579_bp
atgactaaaagcatatttgtgacaaaggaattaacttggtatggcttgagggagaaagtg
agtgataaaaatgtaaggagtgtcataaaggaccttattcagaagctaaaggaaacttct
aaaattgcactgcattttgtgcttcgtctgaactctaaattctcagagtttctcaatgct
ttgatccttcttttaaaaacctgtgtgctcattacatctgtacagcttgtggaaattagg
gcaccagagccttctcttcccaggtccctctcaaagccctttggagaaattaagccattt
gatgagaaggtgcttgttcagctagtgctggtgcattggagcttggagaaaaggccctgt
agacttgagacctttgtctacggaaaaggggtcagcagacagttggtgctggtgttttca
ggcagagaacataatggagctgatctgcaagtgcgggaaaaactacgaacgggtttcacc
tgctgcgttgaaacaaacaagcaaactggtgtactatggagtcaaaacaacaacaacaac
aacaacaacaacaacaacaacaacaacaacaaactctga

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_11|72_aa
MEYYSAIKKKKNPVIDSNMNGIGRHYGLSGGCPYLTPEVDPGWYKPVKLLSSGTPPGSHG
KELGKIPVFLAE
>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_11|219_bp
atggaatactattcagccataaaaaagaaaaaaaatcctgtcattgacagtaacatgaat
ggaattggaagacattatggactcagtggaggatgcccctacctaactccagaggtagat
cctggttggtataaaccagtcaagcttttatcttcaggaactccaccaggttctcatggt
aaagaactgggaaaaattcctgtgtttctggcagaataa

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_12|70_aa
MVKKKSKVQKREEFLCMLEMKKRQLPGNSHTQMLIPAARYKVDFNRLIIPSFDEDVEQLE
LSNNADGNVK

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_12|213_bp
atggtgaaaaagaaaagcaaggttcaaaaacgagaagagtttctgtgtatgttagaaatg
aaaaagcgtcagttacctgggaacagccatacccagatgctcatacctgcagcaagatat
aaggtagattttaatagattgataattccgagttttgatgaggatgtggagcaactggaa
ctttcaaataatgctgatgggaatgtaaaatga

>gi568815597f:214986368_215335142|GENSCAN_predicted_peptide_13|129_aa
MPTVIWTIKSRLKCSQFGDEEFVGNWSKSHSCYALAKGLVAFGPCPRDLWNFELERHDLG
YLVEEISKQQRIQEEAEHKSLENLQANDAIENKNPFSGKKFKPAAAICISNKDPNVNPQD
NGEMSPGHV

>gi568815597f:214986368_215335142|GENSCAN_predicted_CDS_13|390_bp
atgccgacagtgatatggacaattaagtccaggctgaagtgttctcagtttggagatgag
gaatttgttgggaactggagtaaaagtcactcttgttatgctttagcaaagggactggtg
gcatttggcccctgccctagagatctatggaactttgaacttgagagacatgatttaggg
tatctggtagaagaaatttctaagcagcaaagaattcaagaggaagcagagcataaaagt
ttggaaaatttgcaggctaatgatgcaatagaaaataaaaacccattttctgggaagaaa
ttcaagcctgctgcagctatttgcataagtaacaaggacccaaatgttaatccccaagac
aatggggaaatgtctccaggacatgtctga