GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:26:49 Sequence gi568815585f:20861349_21062195 : 200847 bp : 44.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 1510 1369 142 2 1 92 108 151 0.380 16.91 1.01 Init - 10972 10942 31 1 1 33 113 7 0.162 -2.40 1.00 Prom - 28556 28517 40 -3.96 2.00 Prom + 34672 34711 40 -3.76 2.01 Init + 41413 41556 144 0 0 88 6 225 0.494 12.42 2.02 Term + 49811 49858 48 1 0 32 55 88 0.223 -2.80 2.03 PlyA + 50001 50006 6 1.05 3.03 PlyA - 50108 50103 6 1.05 3.02 Term - 60445 60329 117 1 0 44 52 144 0.450 4.94 3.01 Init - 60986 60903 84 2 0 81 71 44 0.729 2.82 3.00 Prom - 61367 61328 40 -4.26 4.00 Prom + 63467 63506 40 -2.26 4.01 Init + 84006 84063 58 2 1 75 86 66 0.624 6.57 4.02 Term + 84930 85087 158 1 2 29 47 107 0.365 -1.20 4.03 PlyA + 86511 86516 6 1.05 5.03 PlyA - 86596 86591 6 1.05 5.02 Term - 87559 87282 278 2 2 -6 48 264 0.759 8.62 5.01 Init - 88116 87957 160 0 1 43 49 165 0.866 8.09 5.00 Prom - 90277 90238 40 -5.06 6.00 Prom + 90307 90346 40 -5.76 6.01 Init + 99964 100381 418 0 1 60 -115 429 0.968 16.40 6.02 Term + 100414 100850 437 2 2 92 43 368 0.984 28.15 6.03 PlyA + 100900 100905 6 1.05 7.03 PlyA - 101968 101963 6 1.05 7.02 Term - 103514 103405 110 0 2 68 43 90 0.831 1.17 7.01 Init - 106252 106177 76 1 1 93 86 27 0.475 2.33 7.00 Prom - 110307 110268 40 -3.66 8.09 PlyA - 111417 111412 6 1.05 8.08 Term - 114016 113522 495 1 0 112 37 686 0.998 60.47 8.07 Intr - 118449 118343 107 1 2 53 90 56 0.970 2.23 8.06 Intr - 120300 120118 183 1 0 0 83 119 0.022 2.36 8.05 Intr - 122458 121876 583 1 1 115 67 817 0.028 74.55 8.04 Intr - 127956 126533 1424 1 2 126 25 1972 0.974 183.89 8.03 Intr - 130056 129924 133 0 1 104 99 236 0.890 26.52 8.02 Intr - 137733 137311 423 0 0 37 29 198 0.117 2.86 8.01 Init - 157265 157143 123 2 0 52 43 149 0.358 6.99 8.00 Prom - 167066 167027 40 -2.46 9.03 PlyA - 167107 167102 6 1.05 9.02 Term - 171561 171415 147 0 0 80 38 103 0.830 2.40 9.01 Init - 184678 184337 342 1 0 68 94 320 0.949 27.84 9.00 Prom - 189865 189826 40 -6.16 10.00 Prom + 192383 192422 40 -3.26 10.01 Init + 199471 199904 434 0 2 87 48 247 0.747 14.29 10.02 Term + 200004 200244 241 0 1 87 46 151 0.493 6.30 10.03 PlyA + 200551 200556 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 120277 120118 160 1 1 95 83 101 0.941 10.29 S.002 Term - 122458 121865 594 1 0 115 45 825 0.969 75.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:20861349_21062195|GENSCAN_predicted_peptide_1|58_aa MLVVLGQLGKETSKVDYVLFQAATAIMEAVVREWILLEKGSIESLRTFLLTYVLQRPN >gi568815585f:20861349_21062195|GENSCAN_predicted_CDS_1|174_bp atgttagttgtgcttggacaactggggaaagaaactagtaaagtggactatgtcctcttt caagctgccacagccataatggaagcagttgtccgagagtggattctcttggaaaaaggt agcatcgagtctctgcgaacattccttttaacctatgtcttacaaaggcccaan >gi568815585f:20861349_21062195|GENSCAN_predicted_peptide_2|63_aa MRCILGAGAAAGQSPPRPRSLTRSQGRRSRCRHRRRGLGGSGRAAGFRDVGRLSDCAYRQ KVC >gi568815585f:20861349_21062195|GENSCAN_predicted_CDS_2|192_bp atgcgctgcattctgggagcgggcgccgcggccggccaatcgccgccgcgaccccggagc ctcacgcgctcccagggccgtcgcagccgctgtcgacaccggcgtcgcggcctcggcggg tctggccgggctgccgggttccgggatgtgggcagactcagcgactgtgcctaccgccag aaggtttgctga >gi568815585f:20861349_21062195|GENSCAN_predicted_peptide_3|66_aa MVEHSGISRLNRTLQLKIKHHLHLLNSQPLDFGFSQQPEDVLGFDVFDAVWKALAGLTEM ELPGQA >gi568815585f:20861349_21062195|GENSCAN_predicted_CDS_3|201_bp atggtggagcactcaggcatttctaggctcaacagaaccctgcagctgaagattaaacac cacctgcatctactcaactcacagccattggactttggtttttctcaacagccagaggat gtgctgggctttgatgtcttcgatgctgtttggaaagccttagcaggcctgactgagatg gagctgccaggccaggcgtga >gi568815585f:20861349_21062195|GENSCAN_predicted_peptide_4|71_aa MSTFFTGIPNTQYSVWHTAAPEGPGRSRGRQDGRNPPQLAVNRKGDTERGSWHPSSPHRS LLVTATRDRAM >gi568815585f:20861349_21062195|GENSCAN_predicted_CDS_4|216_bp atgtccaccttcttcacgggtatccccaatacacagtacagtgtctggcacacagcagct cctgaagggcctggccgttcacggggcaggcaggacgggcgaaatcccccgcagttagcg gtcaacagaaagggcgacacggaacggggttcctggcacccgagctcgccgcaccgaagt ctcctggtaacagcgacacgggaccgggctatgtga >gi568815585f:20861349_21062195|GENSCAN_predicted_peptide_5|145_aa MRDPNTKHARGFGFVIYATVEEVDAATNARPHKVDGRVVEPKRAVSREDSQRPEIIGYLE QYGKMEVIEITTDRGSGKKRGFAFITFDDYDSVDKIVIQKYHNVNVHNCEVRKPCQSKRW RVLHPAKKVEVVLETLVVIMVVSWE >gi568815585f:20861349_21062195|GENSCAN_predicted_CDS_5|438_bp atgagagatccaaacaccaagcacgccaggggctttgggtttgtcatctatgccactgtg gaggaggtggacgcagccacgaatgcaaggccacacaaggtggatggaagagttgtggaa ccaaagagagctgtctcaagagaagattctcaaagaccagagattattggttatttggaa cagtatgggaaaatggaagtgatcgaaatcacgactgaccgaggcagtggcaagaaaagg ggctttgcttttataacctttgatgactatgactccgtggataagattgtcattcagaaa taccataacgtgaatgtccacaactgtgaagttaggaagccttgtcaaagcaaaagatgg cgagtgcttcatccagccaaaaaggtcgaagtggttctggaaactttggtggtgatcatg gtggtgtcatgggaatga >gi568815585f:20861349_21062195|GENSCAN_predicted_peptide_6|284_aa MSGALDVLQMKEEDVLKFLAAGTHLGGTNLDFQMEQYIYKRKSDGIYIINLKKTWEKLLL AARAIVAIENPADVSVISSRNTGQRAVLKFAAATGATPIAGRFTPGTFTNQIQAAFREPR LLVVTDPRADPPASHGGVLYSPLRYVDIAIPCNNKGARSVGLMWWMLAREVLRMRGTISR EHPWEVMTDLYFYRDPEEIEKEEQAAAEKAVTKEEFQGEWTAPAPEFTATQPEVADWSEG VQVPSVPIQQFPTEDWSAQPATEDWSAAPTVQATEWVGATTDWS >gi568815585f:20861349_21062195|GENSCAN_predicted_CDS_6|855_bp atgtccggagcccttgatgtcctgcaaatgaaggaggaggatgtccttaagttccttgca gcaggaacccacttaggtggcaccaatcttgacttccagatggaacagtacatctataaa aggaaaagtgatggcatctatatcataaatctgaagaagacctgggagaagcttctgctg gcagctcgtgctattgttgccattgaaaaccctgctgatgtcagtgttatatcctccagg aatactggccagagggctgtgctgaagtttgctgcggccactggagccactccaattgct ggccgcttcactcctggaaccttcactaaccagatccaggcagccttccgggagccacgg cttcttgtggttactgaccccagggctgacccaccagcctctcacggaggcgtcttatat tctcctctgcgctatgtggacattgccatcccatgcaacaacaagggagctcgctcagtg ggtttgatgtggtggatgctggctcgggaagttctgcgcatgcgtggcaccatttctcgt gaacacccatgggaggtcatgactgatctctacttctacagagatcctgaagagattgaa aaagaagagcaggctgctgctgaaaaggcggtgaccaaggaggaatttcagggtgaatgg actgctccagctcctgagttcactgctactcagcctgaggttgcagactggtctgaaggt gtgcaggtgccctctgtgcctattcagcagttccctactgaagactggagcgctcagcct gccacggaagactggtctgcagctcccactgttcaggccactgaatgggtaggagcaacc actgactggtcttaa >gi568815585f:20861349_21062195|GENSCAN_predicted_peptide_7|61_aa MREWCRVQGPVCLVLVAILGNHLSKVLEVEKSKIKVPASCEGFVAASSHGGRQKGKGAKG D >gi568815585f:20861349_21062195|GENSCAN_predicted_CDS_7|186_bp atgcgggaatggtgcagggtgcagggtcctgtgtgtcttgttctggtggctatcttgggt aatcacctgtccaaagttctggaggttgagaagtccaagatcaaggtgccagcatcttgt gagggctttgttgctgcatcctcccatggtggaaggcagaagggcaaaggggccaaagga gactga >gi568815585f:20861349_21062195|GENSCAN_predicted_peptide_8|1156_aa MSLSGTWVKDIREATTVFLDLLRKQGVVCQGGPACPFIKGQVTGAPASRGQGCGPGRQNP RPACTRSRGSLSCVDKVVEKVTRALAPCAQGHRDSYPAWTTSPGPPPRVHKVVDNVAGPP PGVHNVAAAPAPCGQGHGALPHQPKVVGPRPASPRIDKALGPHLPKIAGSAALRGGARNG GAEMAGRALKQTGSRSIEAALEYISKMGYLDPRNEQIVRVIKQTSPGKGLMPTPVTRRPS FEGTGDSFASYHQLSGTPYEGPSFGADGPTALEEMPRPYVDYLFPGVGPHGPGHQHQHPP KGYGASVEAAGAHFPLQGAHYGRPHLLVPGEPLGYGVQRSPSFQSKTPPETGGYASLPTK GQGGPPGAGLAFPPPAAGLYVPHPHHKQAGPAAHQLHVLGSRSQVFASDSPPQSLLTPSR NSLNVDLYELGSTSVQQWPAATLARRDSLQKPGLEAPPRAHVAFRPDCPVPSRTNSFNSH QPRPGPPGKAEPSLPAPNTVTAVTAAHILHPVKSVRVLRPEPQTAVGPSHPAWVPAPAPA PAPAPAPAAEGLDAKEEHALALGGAGAFPLDVEYGGPDRRCPPPPYPKHLLLRSKSEQYD LDSLCAGMEQSLRAGPNEPEGGDKSRKSAKGDKGGKDKKQIQTSPVPVRKNSRDEEKRES RIKSYSPYAFKFFMEQHVENVIKTYQQKVNRRLQLEQEMAKAGLCEAEQEQMRKILYQKE SNYNRLKRAKMDKSMFVKIKTLGIGAFGEVCLACKVDTHALYAMKTLRKKDVLNRNQVAH VKAERDILAEADNEWVVKLYYSFQDKDSLYFVMDYIPGGDMMSLLIRMEVFPEHLARFYI AELTLAIESVHKMGFIHRDIKPDNILIDLDGHIKLTDFGLCTGFRWTHNSKYYQKGSHVR QDSMEPSDLWDDVSNCRCGDRLKTLEQRARKQHQRCLAHSLVGTPNYIAPEVLLRKGYTQ LCDWWSVGVILFEMLVGQPPFLAPTPTETQLKVINWENTLHIPAQVKLSPEARDLITKLC CSADHRLGRNGADDLKAHPFFSAIDFSSDIRKQPAPYVPTISHPMDTSNFDPVDEESPWN DASEGSTKAWDTLTSPNNKHPEHAFYEFTFRRFFDDNGYPFRCPKPSGAEASQAESSDLE SSDLVDQTEGCQPVYV >gi568815585f:20861349_21062195|GENSCAN_predicted_CDS_8|3471_bp atgtcactctcaggcacctgggtgaaggacatcagggaagcgaccacagtcttcctggac ctgctgcgcaagcagggtgtggtctgccagggtggtcctgcttgtcctttcatcaaaggc caggtcaccggggcccctgcctcgcgtggacaaggttgtggaccaggtcgccagaacccc cgccctgcgtgcacaaggtcgcggggatccctgtcctgcgtggacaaggtcgtggaaaag gtcaccagggcccttgcgccgtgtgcgcaaggtcaccgggactcctaccctgcgtggaca acgtcgccagggcccccgccccgcgtgcacaaggtcgtggacaatgtcgcagggccccca cccggcgtgcacaatgtcgctgcggcccccgccccgtgtggacaaggtcacggggctctg ccccaccagcccaaggttgtcgggccccgccccgcctcgccccgcattgacaaggccttg gggccccacctgcccaagatcgccgggagcgccgcgctcaggggaggggcacgcaatggt ggggcggagatggctggccgagctctcaagcagactggcagcaggagcatcgaggccgcc ctggagtacatcagcaagatgggctacctggacccgaggaatgagcagattgtgcgggtc attaagcagacctccccaggaaaggggctcatgccaaccccagtgacgcggaggcccagc ttcgaaggaaccggcgattcgtttgcgtcctaccaccagctgagcggtaccccctacgag ggcccaagcttcggcgctgacggccccacggcgctggaggagatgccgcggccgtacgtg gactaccttttccccggagtcggcccccacgggcccggccaccagcaccagcacccaccc aagggctacggtgccagcgtagaggcagcaggggcacacttcccgctgcagggcgcgcac tacgggcggccgcacctgctggtgcctggggaacccctgggctacggagtgcagcgcagc ccctccttccagagcaagacgccgccggagaccgggggttacgccagcctgcccacgaag ggccagggaggaccgccaggcgccggcctcgctttcccaccccctgccgccgggctctac gtgccgcacccacaccacaagcaggccggtcccgcggcccaccagctgcatgtgctgggc tcccgcagccaggtgttcgccagcgacagccccccgcagagcctgctcactccctcgcgg aacagcctcaacgtggacctgtatgaattgggcagcacctccgtccagcagtggccggct gccaccctggcccgccgggactccctgcagaagccgggcctggaggcgccgccgcgcgcg cacgtggccttccggcctgactgcccagtgcccagcaggaccaactccttcaacagccac cagccgcggcccggtccgcctggcaaggccgagccctccctgcccgcccccaacaccgtg acggctgtcacggccgcgcacatcttgcacccggtgaagagcgtgcgtgtgctgaggccg gagccgcagacggctgtggggccctcgcaccccgcctgggtgcccgcgcctgccccggcc cccgcccccgcccccgccccggctgcggagggcttggacgccaaggaggagcatgccctg gcgctgggcggcgcaggcgccttcccgctggacgtggagtacggaggcccagaccggagg tgcccgcctccgccctacccgaagcacctgctgctgcgcagcaagtcggagcagtacgac ctggacagcctgtgcgcaggcatggagcagagcctccgtgcgggccccaacgagcccgag ggcggcgacaagagccgcaaaagcgccaagggggacaaaggcggaaaggataaaaagcag attcagacctctcccgttcccgtccgcaaaaacagcagagacgaagagaagagagagtca cgcatcaagagctactcgccatacgcctttaagttcttcatggagcagcacgtggagaat gtcatcaaaacctaccagcagaaggttaaccggaggctgcagctggagcaagaaatggcc aaagctggactctgtgaagctgagcaggagcagatgcggaagatcctctaccagaaagag tctaattacaacaggttaaagagggccaagatggacaagtctatgtttgtcaagatcaaa accctggggatcggtgcctttggagaagtgtgccttgcttgtaaggtggacactcacgcc ctgtacgccatgaagaccctaaggaaaaaggatgtcctgaaccggaatcaggtggcccac gtcaaggccgagagggacatcctggccgaggcagacaatgagtgggtggtcaaactctac tactccttccaagacaaagacagcctgtactttgtgatggactacatccctggtggggac atgatgagcctgctgatccggatggaggtcttccctgagcacctggcccggttctacatc gcagagctgactttggccattgagagtgtccacaagatgggcttcatccaccgagacatc aagcctgataacattttgatagatctggatggtcacattaaactcacagatttcggcctc tgcactgggttcaggtggactcacaattccaaatattaccagaaagggagccatgtcaga caggacagcatggagcccagcgacctctgggatgatgtgtctaactgtcggtgtggggac aggctgaagaccctagagcagagggcgcggaagcagcaccagaggtgcctggcacattca ctggtggggactccaaactacatcgcacccgaggtgctcctccgcaaagggtacactcaa ctctgtgactggtggagtgttggagtgattctcttcgagatgctggtggggcagccgccc tttttggcacctactcccacagaaacccagctgaaggtgatcaactgggagaacacgctc cacattccagcccaggtgaagctgagccctgaggccagggacctcatcaccaagctgtgc tgctccgcagaccaccgcctggggcggaatggggccgatgacctgaaggcccaccccttc ttcagcgccattgacttctccagtgacatccggaagcagccagccccctacgttcccacc atcagccaccccatggacacctcgaatttcgaccccgtagatgaagaaagcccttggaac gatgccagcgaaggtagcaccaaggcctgggacacactcacctcgcccaataacaagcat cctgagcacgcattttacgaattcaccttccgaaggttctttgatgacaatggctacccc tttcgatgcccaaagccttcaggagcagaagcttcacaggctgagagctcagatttagaa agctctgatctggtggatcagactgaaggctgccagcctgtgtacgtgtag >gi568815585f:20861349_21062195|GENSCAN_predicted_peptide_9|162_aa MRPKTFPATTYSGNSRQRLQEIREGLKQPSKSSVQGLPAGPNSDTSLDAKVLGSKDATRQ QQQMRATPKFGPYQKALREIRYSLLPFANESGTSAAAEVNRQMLQELVNAGCDQAMRFSP TLGSLMLSPPEVHSPASRGPPFESHFHDPSLWPSTAIHPSAR >gi568815585f:20861349_21062195|GENSCAN_predicted_CDS_9|489_bp atgaggccaaagacttttcctgccacgacttattctggaaatagccggcagcgactgcaa gagattcgtgaggggttaaaacagccatccaagtcttcggttcaggggctacccgcagga ccaaacagtgacacttccctggatgccaaagtcctggggagcaaagatgccaccaggcag cagcagcagatgagagccaccccaaagttcggaccttatcagaaagccttgagggaaatc agatattccttgttgccttttgctaatgaatcgggcacctctgcagctgcagaagtgaac cggcaaatgctgcaggaactggtgaacgcaggatgcgaccaggctatgaggttttctccc acactgggttcgctcatgctgtcgccccctgaagtgcattccccagccagccgtggtcca cccttcgagtcccacttccacgacccatccttgtggccttctacggccattcatcccagt gcccgctaa >gi568815585f:20861349_21062195|GENSCAN_predicted_peptide_10|224_aa MAPDPWDRCAAGPGVPRLSGAPGRASSEKPIPRRQRKQRRARRYGLSGLGHRPRPAPPLP RRDPGRPGRRTTKRRSPAAPARRRPPTSPAPLEPAPRAPYGLDYEPPGGGGSGRARTVGA GRRAPSGQTSSAAPPGALPGASAPQGAVYTLGGVLAGGSRASRAGRPGTGGWSCCWPRRD GGAPGGGGGVDGDCSIFPGRSRRGSKAQTQVGHSLHCWHSTGVT >gi568815585f:20861349_21062195|GENSCAN_predicted_CDS_10|675_bp atggccccggatccgtgggaccgctgcgccgccggaccgggggtcccgaggctgtcaggg gcgcccggccgggcgtctagtgaaaagccgatcccccggagacagcggaagcagaggcgc gcccggcgctacggcctttcgggtctaggacaccgcccccggcccgcgccgcccctgccc cgccgggatcccggccgccccggtcgccgcacaacaaagcggcgcagccccgctgccccc gcccgacgccggcctccaacttccccggctcctctggaaccggctccgcgggctccgtac ggcctggactacgagccgccgggcgggggcggctccgggcgggcgcggaccgtgggggca gggcgcagggctccgtccggacaaacttcctcggccgccccgccgggcgcccttcccgga gcctcggcaccacagggcgcggtctacaccctcgggggcgtcctggccggcggctcccgg gcctctcgcgccgggagaccgggcactggtggctggagctgctgctggccccgcagggac gggggagccccgggcggcggcggtggcgtggacggcgactgctccatcttcccggggcgc tcacgccgcggttccaaagcgcagacccaagtgggacattcgctacattgttggcattcc acgggcgtcacgtga