GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:39:18 Sequence gi568815582f:25117204_25328489 : 211286 bp : 44.20% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1195 1328 134 0 2 81 47 80 0.279 2.64 1.02 Intr + 11272 11320 49 1 1 153 22 18 0.139 0.38 1.03 Intr + 13940 13970 31 1 1 99 80 18 0.373 -0.20 1.04 Intr + 15199 15320 122 2 2 130 95 52 0.838 10.31 1.05 Intr + 21552 21641 90 2 0 76 77 23 0.029 0.19 1.06 Intr + 34629 34713 85 2 1 40 94 40 0.001 -0.81 1.07 Intr + 43899 44001 103 1 1 81 42 105 0.380 4.43 1.08 Intr + 47395 47515 121 1 1 81 98 79 0.676 8.70 1.09 Intr + 51909 52010 102 2 0 129 71 141 0.995 16.87 1.10 Intr + 53511 53602 92 2 2 61 80 102 0.957 5.49 1.11 Intr + 57734 57831 98 2 2 103 70 106 0.446 9.95 1.12 Term + 74994 75145 152 1 2 69 37 133 0.521 4.37 1.13 PlyA + 76337 76342 6 1.05 2.00 Prom + 94630 94669 40 -2.66 2.01 Init + 96020 96079 60 1 0 78 47 68 0.687 2.95 2.02 Intr + 99995 100242 248 1 2 90 94 257 0.948 22.76 2.03 Intr + 104254 104380 127 1 1 102 98 138 0.997 16.88 2.04 Intr + 107159 107373 215 1 2 60 92 273 0.874 22.41 2.05 Intr + 109865 109999 135 2 0 91 63 117 0.988 9.18 2.06 Term + 111241 111289 49 1 1 100 48 78 0.622 1.78 2.07 PlyA + 111704 111709 6 1.05 3.08 PlyA - 113782 113777 6 1.05 3.07 Term - 123535 122613 923 2 2 89 45 422 0.941 30.36 3.06 Intr - 127073 126582 492 0 0 41 91 393 0.199 27.87 3.05 Intr - 130187 129504 684 0 0 67 83 516 0.320 40.54 3.04 Intr - 134832 134706 127 0 1 14 82 92 0.650 1.45 3.03 Intr - 135834 135743 92 1 2 95 88 19 0.849 2.31 3.02 Intr - 138189 138003 187 0 1 80 95 148 0.976 13.96 3.01 Init - 139924 139526 399 1 0 86 94 347 0.876 31.77 3.00 Prom - 154588 154549 40 -1.26 4.07 PlyA - 156374 156369 6 1.05 4.06 Term - 159669 159554 116 1 2 67 43 66 0.295 -1.27 4.05 Intr - 162736 162627 110 0 2 72 38 94 0.051 2.73 4.04 Intr - 181232 181161 72 1 0 71 80 41 0.006 0.12 4.03 Intr - 198032 197928 105 1 0 38 110 60 0.501 2.53 4.02 Intr - 206769 206702 68 0 2 104 67 64 0.502 3.60 4.01 Init - 210649 210602 48 1 0 97 110 -1 0.690 3.95 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 26399 26232 168 0 0 42 86 93 0.805 4.42 S.002 Intr - 35074 34913 162 2 0 84 26 116 0.884 4.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:25117204_25328489|GENSCAN_predicted_peptide_1|392_aa VSAFTPHSLLSACFGYGGAVLDPANAEVSQTQDAALMEHWFLEWGFAVSIGYWHDPYIQH FGILALKLDPVGYFARVHGVSQLIKAFLRKTECHCQIVNLGAGMDTTFWRLKHVVQYSKR SIPSLIVTAHKLFGSPCDNFTKAGKHSLLLEVNNRHFEGGAKGGGIYAQPDGHILDSKRY AVIGADLRDLSELEEKLKKCNMNTQLPTLLIAECVLVYMTPEQSANLLKWAANSFERAMF INYEQVNMGDRFGQIMIENLRRRQCDLAGVETCKSLESQKERLLSNGWETASAVDMMELY NRLPRAEVSRIESLEFLDEMELLEQLMRHYCLCWATKGGNELGLKHDSVSIKLQPATCSR EAQLFLQETLEKLHPETLSEIPVIRMFVDLGL >gi568815582f:25117204_25328489|GENSCAN_predicted_CDS_1|1179_bp gtgagtgcattcactccacattctttgttgagcgcctgcttcggttatggaggtgctgtc ctagaccctgccaatgcagaagtgagccaaacacaggatgcagctcttatggagcattgg ttcctggagtgggggtttgcagtaagcattggctactggcatgacccttacatacagcac tttggaatcttggctctcaagttggatcctgtaggatattttgctcgagtccatggtgtc agtcagcttataaaggcatttctacggaagacagaatgtcattgtcaaattgtcaacctt ggggcaggcatggataccaccttctggagattaaagcacgttgtgcagtattcaaaacgc agtataccttcccttattgttacagctcacaagctctttggatccccgtgtgataatttt actaaggctgggaagcacagcctcctgctagaagtcaacaacagacactttgaaggaggg gcaaaaggaggaggaatttatgctcagccagatggacacatactggattcaaagagatat gccgttattggagcagatctccgagacctgtctgaactggaagagaagctaaagaaatgt aacatgaatacacaattgccaacactcctgatagctgaatgtgtgctggtttacatgact ccagagcagtccgcaaacctcctgaagtgggcagccaacagttttgagagagccatgttc ataaactacgaacaggtgaacatgggtgatcggtttgggcagatcatgattgaaaacctg cggagacgccagtgtgacctggcgggagtggagacctgcaagtcattagagtcacagaaa gaacggctcctgtcgaatgggtgggaaacagcatcggccgtcgacatgatggagttgtac aacaggttacctcgagctgaagtgagcaggatagaatcacttgaattcctggatgaaatg gagctgctggagcagctcatgcggcattactgcctttgctgggcaaccaaaggaggaaat gagcttggattgaaacatgattcagtttccataaaattacagccagccacatgctcaaga gaggcacagctgttcctgcaggagaccctggagaagcttcacccagaaaccctcagtgag atccctgtgattcgcatgttcgtggacctgggtttgtag >gi568815582f:25117204_25328489|GENSCAN_predicted_peptide_2|277_aa MEVAWIPESPFGDEPVMDYYIAMCEPEFGNDKAREPSVGGRWRVSWYERFVQPCLVELLG SALFIFIGCLSVIENGTDTGLLQPALAHGLALGLVIATLGNISGGHFNPAVSLAAMLIGG LNLVMLLPYWVSQLLGGMLGAALAKAVSPEERFWNASGAAFVTVQEQGQVAGALVAEIIL TTLLALAVCMGAINEKTKGPLAPFSIGFAVTVDILAGGPVSGGCMNPARAFGPAVVANHW NFHWIYWLGPLLAGLLVGLLIRCFIGDGKTRLILKAR >gi568815582f:25117204_25328489|GENSCAN_predicted_CDS_2|834_bp atggaagtggcctggatccctgagtcaccatttggagacgagcctgtcatggactattat atagccatgtgtgagcctgaatttggcaatgacaaggccagggagccgagcgtgggtggc aggtggcgagtgtcctggtacgaacggtttgtgcagccatgtctggtcgaactgctgggc tctgctctcttcatcttcatcgggtgcctgtcggtcattgagaatgggacggacactggg ctgctgcagccggccctggcccacgggctggctttggggctcgtgattgccacgctgggg aatatcagtggtggacacttcaaccctgcggtgtccctggcagccatgctgatcggaggc ctcaacctggtgatgctcctcccgtactgggtctcacagctgctcggggggatgctcggg gctgccttggccaaggcggtgagtcctgaggagaggttctggaatgcatctggggcggcc tttgtgacagtccaggagcaggggcaggtggcaggggcgttggtggcagagatcatcctg acgacgctgctggccctggctgtatgcatgggtgccatcaatgagaagacaaagggccct ctggccccgttctccatcggctttgccgtcaccgtggatatcctggctgggggccctgtg tctggaggctgcatgaatcccgcccgtgcttttggacctgcggtggtggccaaccactgg aacttccactggatctactggctgggcccactcctggctggcctgcttgttggactgctc attaggtgcttcattggagatgggaagacccgcctcatcctgaaggctcggtga >gi568815582f:25117204_25328489|GENSCAN_predicted_peptide_3|967_aa MAVALDSQIDAPLEVEGCLIMKVEKDPEWASEPILEGSDSSETFRKCFRQFCYEDVTGPH EAFSKLWELCCRWLKPEMRSKEQILELLVIEQFLTILPEKIQAWAQKQCPQSGEEAVALV VHLEKETGRLRQQVSSPVHREKHSPLGAAWEVADFQPEQVETQPRAVSREEPGSLHSGHQ EQLNRKRERRPLPKNARPSPWVPALADEWNTLDQEVTTTRLPAGSQEPVKDVHVARGFSY RKSVHQIPAQRDLYRDFRKENVGNVVSLGSAVSTSNKITRLEQRKEPWTLGLHSSNKRSI LRSNYVKEKSVHAIQVPARSAGKTWREQQQWGLEDEKIAGVHWSYEETKTFLAILKESRF YETLQACPRNSQVYGAVAEWLRECGFLRTPEQCRTKFKSLQKSYRKVRNGHMLEPCAFFE DMDALLNPAARAPSTDKPKEMIPVPRLKRIAISAKEHISLVEEEEAAEDSDDDEIGIEFI RKSEIHGAPVLFQNLSGVHWGYEETKTFLDILRETRFYEALQACHRKSKLYGAVAEQLRE CGFLRTPEQCRTKFKSLQKSYRKVKNGHVLESCAFYKEMDALINSRASAPSPSTPEEVPS PSRQERGGIEVEPQEPTGWEPEETSQEAVIEDSCSERMSEEEIVQEPEFQGPPGLLQSPN DFEIGSSIKEDPTQIVYKDMEQHRALIEKSKRVVSQSTDPSKYRKRECISGRQWENLQGI RQGKPMSQPRDLGKAVVHQRPFVGKRPYRLLKYGESFGRSTRLMCRMTHHKENPYKCGVC GKCFGRSRSLIRHQRIHTGEKPFKCLDCGKSFNDSSNFGAHQRIHTGEKPYRCGECGKCF SQSSSLIIHQRTHTGEKPYQCGECGKSFTNSSHFSAHRRVHTGENPYKCVDCEKSFNNCT RFREHRRIHTGEKPYGCAQCGKRFSKSSVLTKHREVHVREKPLPHPPSLYCPENPHKGKT DEFRKTF >gi568815582f:25117204_25328489|GENSCAN_predicted_CDS_3|2904_bp atggctgtcgccctcgactctcagatcgacgcgcccctggaggttgagggatgcctaata atgaaggtggaaaaggaccctgagtgggcatcagagcccattctggaaggatcggatagc tctgagaccttccgcaaatgcttcaggcaattctgttatgaggatgtgactggaccccat gaagctttcagtaaactctgggaactttgctgccggtggctgaagccagaaatgcgttcc aaggagcaaatacttgagctgctggtgattgagcagtttctcaccattttacccgagaag attcaggcttgggcacagaagcagtgtccgcaaagtggagaggaagcggtggccctggta gtgcatttggagaaagagactggaagactaagacagcaggtcagcagtcccgtgcaccgg gagaagcactccccacttggagcagcgtgggaggtggcagacttccagccagagcaggtg gagacccaacccagggcggtgtctcgggaggaacctggaagcctccactcaggacaccag gaacagctgaaccgaaagcgagaacgtcggcccttacccaagaatgctcggccttctccc tgggttcctgcccttgctgatgaatggaataccctagatcaggaagtgacaaccacacgg cttcctgctgggtcccaggaaccagtgaaagatgtccacgtggccagaggcttttcctac agaaagagtgtgcatcagattcctgcccaaagggacctctaccgggatttcaggaaggag aatgttgggaacgtggtctccctgggaagtgcagtgtctacatctaacaagataacccgg ttggaacagagaaaggagccatggactctaggtctgcattcctctaacaagagaagtatc ctacgaagcaactacgtcaaggaaaagtcagttcatgctattcaggtccctgcaaggagt gcaggaaaaacatggagagagcagcagcagtggggtttagaagatgaaaagatagcaggt gtgcattggagctatgaggaaacaaagactttcctggcaattctcaaagagtctcgcttt tatgaaacacttcaggcctgtccccgaaatagccaagtgtatggtgctgtggctgaatgg ttgcgagaatgtggcttccttagaaccccagaacagtgtcgaaccaagttcaaaagtctc cagaaaagctatcgaaaggtgagaaatggccacatgctagaaccctgcgccttctttgag gacatggatgctttgttgaaccctgcagcccgtgctccgtccactgataaaccaaaggag atgatacctgtccccagactgaagagaattgccatcagtgctaaggaacacatcagcttg gtggaggaggaggaagctgcagaagattctgatgatgatgaaataggcatcgaatttatc cgcaagtctgaaatccatggtgcccctgtcttgtttcagaatctcagtggcgtgcactgg ggctatgaagaaaccaagacttttcttgatatcctccgtgagactcggttttatgaagcg cttcaagcctgtcatcggaagagcaaattgtatggggctgtagctgaacagcttcgagag tgcggcttcctccggacaccagaacagtgccgaaccaagttcaaaagccttcagaagagt taccgcaaggtgaaaaatggccacgtgctagagtcctgcgcgttctacaaggagatggat gccctgattaactctcgggcatctgctccttcccccagcaccccagaggaagtcccatca ccttcaaggcaagaaagagggggtattgaggttgaaccccaggaacctacaggctgggaa cctgaagagacctcacaggaggcagtaatagaagactcttgcagtgagagaatgagcgag gaggaaattgtgcaagagccagagttccagggacctccaggtctactgcagagcccaaat gattttgaaatcggaagtagcatcaaggaggatccaacacagatagtatataaggacatg gaacagcatagggcattaatagaaaagtctaaaagagttgtttcccagagtaccgacccc agcaaatatcgcaaaagggaatgcatctcaggaagacaatgggaaaatcttcaaggaatt agacagggaaagccgatgtctcagcctagagatttagggaaagccgttgtgcatcagagg ccttttgtggggaagagaccctacagacttctcaaatatggagaaagctttggaaggagc actcgtctgatgtgccggatgacccaccacaaggagaatccttacaagtgtggtgtctgt gggaagtgctttggtagaagcaggagcctgatcagacaccaaagaatccacacaggcgaa aaaccttttaaatgtcttgactgtggaaaaagctttaatgactcctcaaattttggtgcc caccagagaatccacacaggagagaaaccctacagatgcggagagtgtggaaaatgcttt agtcagagctctagtcttattatacatcagagaacgcacaccggtgagaagccctatcag tgtggagagtgtgggaaaagtttcaccaacagttctcatttcagcgcccaccggagagtt cacactggggagaatccctacaaatgtgtggactgtgaaaaaagtttcaataactgtacg agatttcgagaacatcggagaatacacactggagagaagccctatggatgtgcccagtgt ggcaaacgtttcagtaagagttctgttcttaccaaacatcgggaagttcatgtgagagaa aagcctctgccacaccctccatctctgtattgccctgagaacccacataagggaaagact gatgaatttaggaaaactttttga >gi568815582f:25117204_25328489|GENSCAN_predicted_peptide_4|172_aa MVEGEGGAKSHLTWQQTHTQSISKPGPLCLPNPAKNDESKIAEEEFLTFISSQKHPLKQP PTHKNTFTRAKKFSYECHLHLNPELLLWELQMAHVRMRSSGETSEMKAINAQILGQLMVS QVALLILAGLLSCPELAGDSKSDVIFGHKLPSAVHFWINSHHYHLKKQTLSW >gi568815582f:25117204_25328489|GENSCAN_predicted_CDS_4|519_bp atggtggaaggtgaaggaggagcaaagtcacatcttacatggcagcagacccatacccaa tccatcagcaagcctggtcccctctgcctgccaaatcctgccaaaaatgatgaaagcaag atagcagaagaagagtttctgacattcatctcctcacagaaacatccacttaaacaacca ccaacacataaaaatacctttacaagagctaagaaattcagctatgaatgccaccttcac ctcaaccctgaactcctcttatgggagctgcagatggcccacgtaagaatgaggtcctct ggggagaccagtgagatgaaagcaatcaatgcacagatcttggggcagctgatggtcagc caggtggctctgctgatcctggctggcttgctctcgtgtccagagcttgctggagacagc aagagtgatgtcatttttgggcacaaactgccctctgctgttcacttttggattaacagc caccattatcacctgaagaagcaaacgctgtcctggtag