GENSCAN 1.0 Date run: 24-Oct-119 Time: 21:40:04 Sequence gi568815593f:68190729_68397598 : 206870 bp : 40.51% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11062 11127 66 0 0 74 103 46 0.430 6.02 1.02 Term + 13338 13562 225 2 0 33 42 128 0.482 -1.50 1.03 PlyA + 14621 14626 6 1.05 2.05 PlyA - 15386 15381 6 1.05 2.04 Term - 20664 20115 550 2 1 40 49 274 0.162 11.48 2.03 Intr - 24811 24445 367 2 1 102 19 158 0.009 3.58 2.02 Intr - 25140 25020 121 0 1 61 11 167 0.210 5.45 2.01 Init - 26347 26273 75 1 0 69 44 104 0.242 5.24 2.00 Prom - 27069 27030 40 -4.85 3.00 Prom + 31352 31391 40 -2.45 3.01 Init + 35948 36281 334 1 1 92 86 278 0.920 25.50 3.02 Intr + 59422 59694 273 2 0 91 4 119 0.002 0.39 3.03 Intr + 82662 82754 93 1 0 64 86 103 0.355 6.72 3.04 Intr + 83211 83345 135 1 0 96 95 25 0.678 3.62 3.05 Intr + 88874 89005 132 0 0 76 72 200 0.994 16.90 3.06 Intr + 89800 90001 202 2 1 72 111 36 0.874 1.72 3.07 Intr + 90199 90278 80 1 2 65 95 44 0.567 1.08 3.08 Intr + 101531 101633 103 0 1 87 94 94 0.980 8.21 3.09 Intr + 102373 102471 99 1 0 99 121 133 0.999 16.01 3.10 Intr + 102575 102755 181 2 1 100 90 131 0.993 13.45 3.11 Intr + 102981 103106 126 2 0 88 109 141 0.999 16.16 3.12 Intr + 103808 103950 143 1 2 65 81 172 0.999 12.43 3.13 Intr + 104420 104596 177 2 0 116 95 165 0.999 18.11 3.14 Intr + 104692 104760 69 1 0 90 89 111 0.998 8.78 3.15 Intr + 105443 105613 171 2 0 43 31 204 0.946 8.34 3.16 Term + 106684 106873 190 1 1 98 48 167 0.998 9.64 3.17 PlyA + 108004 108009 6 1.05 4.03 PlyA - 108108 108103 6 1.05 4.02 Term - 116089 115940 150 1 0 114 36 98 0.512 4.13 4.01 Init - 125338 125285 54 1 0 77 107 30 0.598 5.19 4.00 Prom - 130331 130292 40 -6.15 5.00 Prom + 131716 131755 40 -9.15 5.01 Sngl + 134700 134945 246 2 0 13 39 624 0.985 44.83 5.02 PlyA + 135614 135619 6 1.05 6.00 Prom + 145979 146018 40 -5.05 6.01 Init + 151160 151249 90 1 0 59 74 44 0.049 0.64 6.02 Intr + 157983 158094 112 2 1 106 85 58 0.314 6.33 6.03 Intr + 179485 179568 84 2 0 104 85 103 0.812 10.47 6.04 Intr + 199123 199253 131 2 2 88 100 119 0.288 12.59 6.05 Term + 201905 202015 111 1 0 60 48 67 0.424 -2.52 6.06 PlyA + 205887 205892 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 75428 75309 120 2 0 71 87 99 0.880 8.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:68190729_68397598|GENSCAN_predicted_peptide_1|96_aa MDWRMEVVMVVEMMVCANTPGQGPGSVGSTDISGSPPVEGTKAGVSFEKFFLSSEIVQHS FLDKGLIIYQCDFALNFYASSANMNILKQILLPRQL >gi568815593f:68190729_68397598|GENSCAN_predicted_CDS_1|291_bp atggattggcgaatggaggtggtgatggtggtggaaatgatggtgtgtgcaaacactccc ggacaggggccaggttctgtcggcagcactgacatctctgggtcccctccagtggaggga acaaaggctggggtttcttttgaaaaattctttctgagcagtgagattgtccaacacagt ttccttgataaagggctcattatctatcagtgtgactttgcccttaatttttacgccagc agtgccaacatgaacatactcaagcaaatccttttacccagacagttgtaa >gi568815593f:68190729_68397598|GENSCAN_predicted_peptide_2|370_aa MKPPNKEQTHTRTPTALQLKGLTVEPGCDPPAVPDPCLPSPTASLPPSPLPLLPPTPLQL LATPLPAGMPRAAVPRAASAAECSGSTRPGPARLLGVSQRRGGGVRGGRRQVQTSLEAGC VSNPSVFQPTPGFFEPLAPQQSGWGGEDSDSRARQSGEALQFLVSPALPSEWAENPRHPP GAPGVKEGKEIGGILAHPRDLWNFKFEGEDLGYLAEEMFKQQTIQEVTWVLLKAFSFKRE THHKSSEILQPDVIERKNPFSEETFKLAAEICISNKALNVNHQDNGENVSRACQRPFPQL LQPWLNGAKVQLGLWLQRVQALSHGNFHMVGPVGAEKSRIEVWKPLPGFQRMYGKAWMSR QKFAAGVGPS >gi568815593f:68190729_68397598|GENSCAN_predicted_CDS_2|1113_bp atgaagcccccaaataaagagcagactcacactcggacaccaacagcgctgcagctcaag ggattaacggtggaacccggctgcgacccgccggccgtgcccgatccgtgcttgccgtct ccgacagctagcctgccgccctcgccgctgccgctgcttcctccaactccgctccagctg ctggcgactcctctccccgcggggatgccccgggcggccgtgccgcgcgctgcctccgct gcggagtgctctggctctacacgtcccgggcccgcccggctgctgggcgtctcccagcgc agaggcggaggggttcggggaggtcgaagacaagttcagacctccctggaggctggctgc gtctctaatccgagtgtctttcagccaaccccaggctttttcgagcccctagcgccccag cagagtggctggggtggggaggactcggattccagagctcgccaaagcggagaggccctg caattcctggtctcgccagcactgccttccgagtgggcggaaaacccgcggcacccgcct ggcgctccgggggtgaaggagggcaaagagattggtggcattttggcccaccctagagat ttgtggaacttcaaatttgagggggaagatttagggtacctggcagaagaaatgtttaag cagcaaaccattcaagaggtgacttgggtgctgttaaaagcattcagttttaaaagggaa acacatcataaaagttcagaaattttgcaacctgatgtgatagaaaggaaaaacccattt tcagaggagacattcaagctggctgcagaaatttgcataagtaacaaggcactaaatgtt aatcaccaagataatggggaaaatgtctctagggcttgccagagaccttttccacagctg ctccaaccgtggctaaatggggccaaggtacagcttgggctgtggcttcagagggtgcaa gccctaagccatggcaacttccacatggttgggcctgtgggtgcagagaagtcaagaatt gaggtttggaaacctctgcccggatttcagaggatgtatggaaaggcctggatgtcaagg cagaagtttgctgcaggggtggggccctcatga >gi568815593f:68190729_68397598|GENSCAN_predicted_peptide_3|835_aa MSAEGYQYRALYDYKKEREEDIDLHLGDILTVNKGSLVALGFSDGQEARPEEIGWLNGYN ETTGERGDFPGTYVEYIGRKKISPPTPKPRPPRPLPVAPGSSKTEADVEQQALAPLQLTV FLFSLGSKELRLHCLLLRLWEQEQRVGACHGTGGPRDRPQHGKAVETHALLSGIATRLLL PAFQPFAFPQLRQLLASDISDETLTLPDLAEQFAPPDIAPPLLIKLVEAIEKKGLECSTL YRTQSSSNLAELRQLLDCGECHRARNANGKDRSWLSVSDTPSVDLEMIDVHVLADAFKRY LLDLPNPVIPAAVYSEMISLAPEVQSSEEYIQLLKKLIRSPSIPHQYWLTLQYLLKHFFK LSQTSSKNLLNARVLSEIFSPMLFRFSAASSDNTENLIKVIEILISTEWNERQPAPALPP KPPKPTTVANNGMNNNMSLQDAEWYWGDISREEVNEKLRDTADGTFLVRDASTKMHGDYT LTLRKGGNNKLIKIFHRDGKYGFSDPLTFSSVVELINHYRNESLAQYNPKLDVKLLYPVS KYQQDQVVKEDNIEAVGKKLHEYNTQFQEKSREYDRLYEEYTRTSQEIQMKRTAIEAFNE TIKIFEEQCQTQERYSKEYIEKFKREGNEKEIQRIMHNYDKLKSRISEIIDSRRRLEEDL KKQAAEYREIDKRMNSIKPDLIQLRKTRDQYLMWLTQKGVRQKKLNEWLGNENTEDQYSL VEDDEDLPHHDEKTWNVGSSNRNKAENLLRGKRDGTFLVRESSKQGCYACSVVVDGEVKH CVINKTATGYGFAEPYNLYSSLKELVLHYQHTSLVQHNDSLNVTLAYPVYAQQRR >gi568815593f:68190729_68397598|GENSCAN_predicted_CDS_3|2508_bp atgagtgctgaggggtaccagtacagagcgctgtatgattataaaaaggaaagagaagaa gatattgacttgcacttgggtgacatattgactgtgaataaagggtccttagtagctctt ggattcagtgatggacaggaagccaggcctgaagaaattggctggttaaatggctataat gaaaccacaggggaaaggggggactttccgggaacttacgtagaatatattggaaggaaa aaaatctcgcctcccacaccaaagccccggccacctcggcctcttcctgttgcaccaggt tcttcgaaaactgaagcagatgttgaacaacaagcacttgctcctctccagctcactgtg tttcttttctctctggggtctaaggagcttaggcttcactgcctgctcctcagattgtgg gagcaggagcagagagtgggtgcctgccacgggacagggggtcctagagacaggccccag catgggaaggcagtggaaacccatgcattgctctctggcattgccacacgtcttctgctg cctgccttccagccctttgcttttccccagctccgtcagctccttgcctctgacatcagt gatgaaactttgactctcccggatcttgcagagcagtttgcccctcctgacattgccccg cctcttcttatcaagctcgtggaagccattgaaaagaaaggtctggaatgttcaactcta tacagaacacagagctccagcaacctggcagaattacgacagcttcttgattgtggtgag tgtcacagagctagaaatgcaaatgggaaagacaggtcttggctttctgtttcagataca ccctccgtggacttggaaatgatcgatgtgcacgttttggctgacgctttcaaacgctat ctcctggacttaccaaatcctgtcattccagcagccgtttacagtgaaatgatttcttta gctccagaagtacaaagctccgaagaatatattcagctattgaagaagcttattaggtcg cctagcatacctcatcagtattggcttacgcttcagtatttgttaaaacatttcttcaag ctctctcaaacctccagcaaaaatctgttgaatgcaagagtactctctgaaattttcagc cctatgcttttcagattctcagcagccagctctgataatactgaaaacctcataaaagtt atagaaattttaatctcaactgaatggaatgaacgacagcctgcaccagcactgcctcct aaaccaccaaaacctactactgtagccaacaacggtatgaataacaatatgtccttacaa gatgctgaatggtactggggagatatctcgagggaagaagtgaatgaaaaacttcgagat acagcagacgggacctttttggtacgagatgcgtctactaaaatgcatggtgattatact cttacactaaggaaagggggaaataacaaattaatcaaaatatttcatcgagatgggaaa tatggcttctctgacccattaaccttcagttctgtggttgaattaataaaccactaccgg aatgaatctctagctcagtataatcccaaattggatgtgaaattactttatccagtatcc aaataccaacaggatcaagttgtcaaagaagataatattgaagctgtagggaaaaaatta catgaatataacactcagtttcaagaaaaaagtcgagaatatgatagattatatgaagaa tatacccgcacatcccaggaaatccaaatgaaaaggacagctattgaagcatttaatgaa accataaaaatatttgaagaacagtgccagacccaagagcggtacagcaaagaatacata gaaaagtttaaacgtgaaggcaatgagaaagaaatacaaaggattatgcataattatgat aagttgaagtctcgaatcagtgaaattattgacagtagaagaagattggaagaagacttg aagaagcaggcagctgagtatcgagaaattgacaaacgtatgaacagcattaaaccagac cttatccagctgagaaagacgagagaccaatacttgatgtggttgactcaaaaaggtgtt cggcaaaagaagttgaacgagtggttgggcaatgaaaacactgaagaccaatattcactg gtggaagatgatgaagatttgccccatcatgatgagaagacatggaatgttggaagcagc aaccgaaacaaagctgaaaacctgttgcgagggaagcgagatggcacttttcttgtccgg gagagcagtaaacagggctgctatgcctgctctgtagtggtggacggcgaagtaaagcat tgtgtcataaacaaaacagcaactggctatggctttgccgagccctataacttgtacagc tctctgaaagaactggtgctacattaccaacacacctcccttgtgcagcacaacgactcc ctcaatgtcacactagcctacccagtatatgcacagcagaggcgatga >gi568815593f:68190729_68397598|GENSCAN_predicted_peptide_4|67_aa MGSGWLASAVLLGENATENLQGSFLRHAEVGAAQTVLGNRQDAPRAEFWSQHHRTSFLEQ YSASATD >gi568815593f:68190729_68397598|GENSCAN_predicted_CDS_4|204_bp atgggttcaggctggcttgcttcagctgtcctcctgggagaaaatgctactgagaacttg cagggcagctttctccggcatgctgaagtgggtgcagcccaaacagtgctaggaaacagg caagatgcacctcgagcagagttctggagccaacatcacagaacctcctttctagaacaa tattctgcttctgccacagactaa >gi568815593f:68190729_68397598|GENSCAN_predicted_peptide_5|81_aa MTEEEEEEEEEKEEKEEEEEEEEKEKEEEEEEEEEEQEEEEEEEEEEEEEEEEEEEKEEE EKEKEKRRRKRRNQFPGLLNQ >gi568815593f:68190729_68397598|GENSCAN_predicted_CDS_5|246_bp atgacagaagaggaagaagaagaggaggaggagaaagaggagaaggaggaagaggaggag gaagaggagaaggagaaagaggaagaggaggaagaagaagaggaggagcaagaggaggag gaagaggaggaagaggaagaagaagaagaagaagaagaagaggaagaaaaagaagaagaa gagaaggagaaagaaaaaagaagaagaaaaagaagaaatcaattcccagggcttctgaat cagtag >gi568815593f:68190729_68397598|GENSCAN_predicted_peptide_6|175_aa MRGHRRKRKWIVDTSHKSDTQTHRTLPGEGAAPAPTAAQRAPGIAQAAAPEGTSRKLWQL PHGVKPTELVKAHLSIGPQWRKRQLSSHTDTVNGQGSCQSVMGDRQTLPTVVIKLQGAMM GISTESMETKGRSVTVDQKICRTRNSTQPDDHLLRENEKALMHSRSEDAEISTAI >gi568815593f:68190729_68397598|GENSCAN_predicted_CDS_6|528_bp atgaggggccatagaaggaagaggaaatggattgtggacacctcccataagtcagataca caaacacacaggactcttccaggagaaggggctgctccagctccaacagcagctcaaagg gctccaggtatagctcaggctgctgctccagagggcacaagccgtaagctttggcaactt ccacatggtgttaagcctacagagctggtcaaggctcatctttccattggtcctcagtgg agaaagaggcagctgtcctcacatacggacaccgttaatggacaaggaagctgtcaatct gtcatgggagacaggcaaacattgcctactgtcgtgattaaactacaaggtgctatgatg ggtatcagcactgagtccatggaaaccaaaggaagaagcgtcacagttgaccagaagata tgcagaacaaggaatagcacacagccagatgatcacttacttagggaaaatgaaaaggcc ttgatgcattcaagatctgaagatgcagaaataagcactgcaatctga