GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:48:22 Sequence gi568815597r:108964526_109175066 : 210541 bp : 43.96% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 647 642 6 1.05 1.04 Term - 7047 6905 143 1 2 64 43 86 0.310 -0.21 1.03 Intr - 18926 18757 170 1 2 90 91 121 0.899 12.19 1.02 Intr - 22155 21998 158 0 2 91 116 4 0.815 2.31 1.01 Init - 23099 23094 6 2 0 79 52 4 0.500 -3.36 1.00 Prom - 24552 24513 40 -9.75 2.00 Prom + 24598 24637 40 -2.66 2.01 Sngl + 27757 28311 555 0 0 70 32 454 0.999 34.13 2.02 PlyA + 28324 28329 6 1.05 3.09 PlyA - 30151 30146 6 1.05 3.08 Term - 30742 30730 13 0 1 96 47 -4 0.524 -6.03 3.07 Intr - 31312 31055 258 0 0 88 101 106 0.580 8.48 3.06 Intr - 37877 37699 179 2 2 46 63 118 0.795 3.82 3.05 Intr - 40193 40067 127 1 1 4 75 79 0.150 -1.12 3.04 Intr - 47193 46391 803 0 2 81 103 190 0.206 10.06 3.03 Intr - 49400 49316 85 1 1 114 66 59 0.918 6.12 3.02 Intr - 53076 52993 84 2 0 53 116 28 0.650 1.04 3.01 Init - 58987 58830 158 1 2 68 64 102 0.707 5.18 3.00 Prom - 59356 59317 40 -7.36 4.00 Prom + 65953 65992 40 -2.56 4.01 Init + 76983 77058 76 2 1 95 55 101 0.449 6.76 4.02 Intr + 82476 82698 223 1 1 34 111 124 0.432 6.49 4.03 Intr + 85823 85895 73 2 1 64 79 47 0.625 0.81 4.04 Term + 88670 88708 39 1 0 71 50 31 0.219 -5.21 4.05 PlyA + 89652 89657 6 1.05 5.05 PlyA - 91334 91329 6 1.05 5.04 Term - 100168 99998 171 1 0 57 47 208 0.999 11.43 5.03 Intr - 101707 101610 98 2 2 54 95 118 0.863 8.83 5.02 Intr - 110540 110462 79 2 1 52 101 41 0.557 0.92 5.01 Init - 111422 111396 27 2 0 50 113 46 0.660 2.90 5.00 Prom - 113695 113656 40 -4.86 6.00 Prom + 114236 114275 40 -7.06 6.01 Init + 117797 117873 77 1 2 78 64 0 0.119 -2.84 6.02 Intr + 125794 125957 164 1 2 44 68 82 0.550 1.42 6.03 Intr + 126072 126357 286 1 1 45 93 207 0.858 13.30 6.04 Term + 129892 130054 163 1 1 77 49 96 0.837 2.01 6.05 PlyA + 131543 131548 6 1.05 7.06 PlyA - 132361 132356 6 1.05 7.05 Term - 140123 139795 329 0 2 12 47 283 0.057 11.47 7.04 Intr - 142134 142010 125 2 2 114 82 35 0.974 5.73 7.03 Intr - 142597 142482 116 1 2 12 97 102 0.992 2.75 7.02 Intr - 143512 143396 117 1 0 79 67 79 0.939 5.56 7.01 Init - 149378 149322 57 2 0 101 45 122 0.362 8.53 7.00 Prom - 160008 159969 40 -1.46 8.00 Prom + 164682 164721 40 -3.46 8.01 Init + 171816 172042 227 2 2 51 56 135 0.317 4.54 8.02 Intr + 191843 191955 113 2 2 17 98 61 0.012 0.02 8.03 Intr + 192600 192643 44 1 2 79 92 41 0.951 1.56 8.04 Intr + 197369 197489 121 1 1 97 61 119 0.999 10.17 8.05 Intr + 199974 200166 193 1 1 117 69 260 0.968 25.55 8.06 Term + 202069 202171 103 1 1 44 38 161 0.751 4.45 8.07 PlyA + 203915 203920 6 1.05 9.00 Prom + 205426 205465 40 -2.36 9.01 Init + 206376 206383 8 2 2 90 52 1 0.365 -2.86 9.02 Intr + 207341 207488 148 2 1 139 62 321 0.998 34.84 9.03 Intr + 207963 208043 81 2 0 55 65 86 0.843 2.83 9.04 Intr + 208949 209054 106 1 1 112 84 65 0.989 8.29 9.05 Intr + 209163 209312 150 1 0 131 64 78 0.174 9.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 141573 141503 71 1 2 132 36 75 0.936 4.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:108964526_109175066|GENSCAN_predicted_peptide_1|158_aa MVDDKSKKQFVCINILEDTQAVRAVAFHPAGGLYAVGSNSKTLRVCAYPDVIDPSAHETP KQPVVRFKRNKHHKGSIYCVAWSPCGQLLATGSNDKYVKVLPFNAETCNATGDLTKQLPI MVVGEHKDKVIQCRWHTQDLSFLSSSADRTVTLWTYNG >gi568815597r:108964526_109175066|GENSCAN_predicted_CDS_1|477_bp atggtggatgacaaatcaaaaaagcagtttgtttgtattaatatcctagaagacacacaa gctgttagagcagtggcttttcatccagctggaggtttatatgctgttggttcaaattca aaaactctgagagtatgtgcctatccagatgtaattgatccaagtgcacatgagactcct aagcagccggtggtacgttttaaaaggaataaacatcataaaggatccatttactgtgtg gcctggagtccttgtgggcagttattagcaacaggatcaaatgacaaatacgtcaaagtg ctgcccttcaatgcagagacttgtaacgcaacaggggacctcaccaagcagcttcctatc atggtggtgggggagcacaaggacaaagtgattcagtgcagatggcacacccaggatctt tccttcctgtcatcctctgcagatagaactgtcaccctctggacttacaatgggtag >gi568815597r:108964526_109175066|GENSCAN_predicted_peptide_2|184_aa MVRYSLDPENPTESCKSRGSNLRVHFKNTRETAQAIKGMHIRKATKYLKDVTLQKQCVPF RRYNGGVGRCAQAKQWGWTQGRWPKKSAEFLLHMLKNTESNAELKGLDVDSLVIEHIQVN KAPKMRRRTYRAHGRINPYMSSPCHIEMILTEKEQIVPKPEEEVAQKKRISQKKLKKQKL MARE >gi568815597r:108964526_109175066|GENSCAN_predicted_CDS_2|555_bp atggttcgctattcacttgacccggagaaccccacggaatcatgcaaatcaagaggttcc aatcttcgtgttcactttaagaacactcgtgaaactgctcaggccatcaagggtatgcat atacgaaaagccacgaagtatctgaaagatgtcactttacagaaacagtgtgtaccattc cgacgttacaatggtggagttggcaggtgtgcgcaggccaagcagtggggctggacacaa ggtcggtggcccaaaaagagtgctgaatttttgctgcacatgcttaaaaacacagagagt aatgctgaacttaagggtttagatgtagattctctggtcattgagcatatccaagtgaac aaagcacctaagatgcgccgccggacctacagagctcatggtcggattaacccatacatg agctctccctgccacattgagatgatccttacggaaaaggaacagattgttcctaaacca gaagaggaggttgcccagaagaaaaggatatcccagaagaaactgaagaaacaaaaactt atggcacgggagtaa >gi568815597r:108964526_109175066|GENSCAN_predicted_peptide_3|568_aa MTAEETVNVKEVEIIKLILDFLNSKKLHISMLALEKESGVINGLFSDDMLFLRQLILDGQ WDEVLQFIQPLECMEKFDKKRFRYIILKQKFLEALCVNNAMSAEDEPQHLEFTMQEAVQC LHALEEYCPSKDDYSKLCLLLTLPRLTNHAEFKDWNPSTARVHCFEEACVMVAEFIPADR KLSEAGFKASNNRLFQLVMKGLLYECCVEFCQSKATGEEITESEVLLGIDLLCGNGCDDL DLSLLSWLQNLPSSVFSCAFEQKMLNIHVDKLLKPTKAAYADLLTPLISKLSPYPSSPMR RPQSADAYMTRSLNPALDGLTCGLTSHDKRISDLGNKTSPMSHSFANFHYPGVQNLSRSL MLENTECHSIYEESPERSDTPVDAQRPIGSEILGQSSVSEKEPANGAQNPGPAKQEKNEL RDSTEQFQEYYRQRLRYQQHLEQKEQQRQIYQQMLLEGGVNQEDGPDQQQNLTEQFLNRS IQKLGELNIGMDGLGNEVSALNQQCNGSKGNGSNGSSVTSFTTPPQDSSQRLTHDASNIH TSTPRNPGSTNHIPFLEESPCGSQIWHL >gi568815597r:108964526_109175066|GENSCAN_predicted_CDS_3|1707_bp atgacggctgaagaaacagtgaatgtaaaagaggttgaaatcattaagctaattttggac ttcctgaattcaaagaagcttcacattagtatgctggccctggagaaggaaagtggagtc ataaatggcctgttttcagatgatatgcttttcctgaggcagctaatacttgatggtcaa tgggatgaagttcttcagttcattcagcctctagaatgtatggaaaaatttgacaaaaaa aggtttcgttatattatcctgaagcagaagtttttagaagctttatgtgttaacaacgcg atgtcagcagaagatgagccccagcatctggaatttaccatgcaagaagctgtgcaatgt ttacatgctctagaagaatactgtccttctaaagatgactatagtaagctctgtttgctt ttgactttgcctcgtctgaccaatcatgccgagtttaaggactggaatcccagcaccgca cgagttcactgttttgaagaggcttgtgtcatggttgcagaattcatccctgctgatagg aagctaagtgaagctggttttaaggctagtaacaatcgtttatttcagcttgtaatgaaa ggcctgctttatgaatgctgtgtagaattttgtcagagtaaagcaactggagaagaaatt acagaaagcgaagtgcttcttggcatcgacctcttatgtggtaatggttgtgatgatttg gatctgagtttactgtcatggcttcagaatcttccatcttctgtcttctcttgtgctttt gaacagaaaatgcttaatattcatgttgacaaacttctgaaacctacaaaagctgcatat gctgatcttttgactcctcttatcagcaaactctctccctatccatcatccccaatgaga agacctcaatcagctgatgcctatatgacccgctctctgaatcctgctttagatggcctc acctgtggactaaccagtcatgataagagaatttcagaccttggaaacaaaacttctcca atgtcacactcctttgctaacttccattatccaggggtacaaaacctcagtagaagtctc atgcttgagaatacagaatgtcacagtatttacgaagaatcccctgagcgaagtgataca cctgttgatgcacagaggcctatcggcagtgaaatcttgggccagagttcagtttcagaa aaagagcctgcaaatggagcacagaatccaggaccagctaaacaagaaaaaaatgagctt cgagattcaacagaacaatttcaagaatattataggcaaagattacgctatcaacagcat ttagaacagaaggagcaacagcggcagatataccaacagatgttgcttgaaggaggcgtg aatcaggaggatggtcctgatcagcagcagaatcttactgaacagttccttaataggtcc attcaaaagcttggtgaattaaatattggaatggatggccttggtaatgaggtatcagca ctcaaccagcaatgtaatgggagcaaaggcaatggatctaatggttcttctgtgactagt tttactacaccaccccaagactctagtcagagattaacacatgatgcttcaaatattcat acaagcactcctcgtaatcctggatcaacaaatcacataccttttctggaggaatcacct tgtggaagccaaatttggcatctttga >gi568815597r:108964526_109175066|GENSCAN_predicted_peptide_4|136_aa MGTNLALTCCSVPGRGASLCPQNPTEDVLSSINGIQPARRNVVNRDGYYIEPIAQCAIIM FDVTPRVTYKNVPNWHRDLVPMCEIITMMLCGNKVDIKDSCCNSSQLFKEDKACLDEGLS SDSKEVRSVSARPPII >gi568815597r:108964526_109175066|GENSCAN_predicted_CDS_4|411_bp atgggcacgaacctcgcgctcacttgctgctcagtgccagggcgaggggcgtcgctctgc ccccagaatccaactgaagatgtattaagttcaataaatgggatacagccggccaggaga aatgtggtgaacagagatggttattacatcgaacccatagcccagtgtgccatcataatg tttgatgtaacaccaagagttacgtacaagaatgtgcctaactggcatagagatttggtg ccaatgtgtgaaatcatcaccatgatgttgtgtggcaacaaagtggatattaaggacagt tgctgcaatagcagtcaattgttcaaagaggacaaggcttgcttagatgaaggactgtcc tcagattccaaggaagtgaggagcgtctctgcccggccgcccatcatctga >gi568815597r:108964526_109175066|GENSCAN_predicted_peptide_5|124_aa MADEEEDPTFEEENEEIGGGAEGGQGKRKRLFSKELRCMMYGFGDDQNPYTESVDILEDL VIEFITEMTHKAMSIGRQGRVQVEDIVFLIRKDPRKFARVKDLLTMNEELKRARKAFDEA NYGS >gi568815597r:108964526_109175066|GENSCAN_predicted_CDS_5|375_bp atggcagatgaggaagaagaccccacgtttgaggaagaaaatgaagaaattggaggaggt gcagaaggtggacagggtaaaagaaagagacttttttctaaagaattgcgatgtatgatg tatggctttggggatgaccagaatccttatactgagtcagtggatattcttgaagatctt gtcatagagtttatcactgaaatgactcacaaggcaatgtcaattggaagacaaggtcga gtacaagttgaagatatcgtcttcttgattcgaaaggacccaaggaagtttgccagggtt aaagacttgcttactatgaatgaagaattgaaacgagctagaaaagcatttgatgaagca aattatggatcttga >gi568815597r:108964526_109175066|GENSCAN_predicted_peptide_6|229_aa MELYESNLEGMAGVGLSEMRKAGAGWFLISFATFRRSLTRFPSGVRKSSSRAVTEAAGAA SSPAVLEFSQVEAQDDENLIGARRKDLVTTAKYEAFEIAFFLPYVNTQFPELDNRLATPD SVRRSPLGPCSCVLRLLPVTSARIGKCQAGAPPSPPLLPLNPDPLPRSRASRHDERRCDW NQAACCCGNCLCCNGLLRPVYKMNSKAPKSSTANQGDGDEEPVGDLNPV >gi568815597r:108964526_109175066|GENSCAN_predicted_CDS_6|690_bp atggagctctatgagagtaatttggaaggaatggcaggggtgggattatcagaaatgagg aaagcaggagctggatggttcctcatttcatttgcaacgttcagacgcagtctgactaga tttccatccggagtaaggaagagcagctcaagggcagtcacggaggccgcaggtgcggct tcttcacccgcggtgctggagttctctcaagtggaagcccaggacgatgaaaacctcata ggggcacgcagaaaagacttagttaccacggccaagtacgaagccttcgagatagcgttc ttcttaccctacgtaaacacgcagtttcccgagctcgacaatcggctagccactcctgat agcgttcggcgctcgcctctcggcccttgctcctgcgtactacggcttcttccagtcacc tcggcccggatcgggaagtgtcaagcgggcgctcccccatctccgccgctattaccactg aacccggaccccctacccaggtccagggccagccgccatgacgaacgccgctgtgattgg aaccaggctgcatgctgctgtggcaattgcttgtgttgtaatggccttttacgtcctgtt tataaaatgaattccaaagcacccaagtcatcaactgccaaccaaggggacggggatgaa gaacctgttggagacctgaacccagtgtag >gi568815597r:108964526_109175066|GENSCAN_predicted_peptide_7|247_aa MAAAAAGARGVGAGERAPGKLPYKNPTHLAQQQEPWSRLNSTPTITSMRRDAYYFDPEIP KDDLDFRLAALYNHHTGTFKNKSEILLNQKTTQDTYRTKIQFPGEFLTPPTPPITFLANI RHWINPKKESIHSIQGSIEICLIMPETLSQSSQGRVMTIPYQPMPAKSPVICAGGQDRCS KAVGYPRGTRDLEGPPLDAYSIQGQHIISPLDLAKLNQVARQQSHFAMMHGGTGFTRIDS SSPEVKG >gi568815597r:108964526_109175066|GENSCAN_predicted_CDS_7|744_bp atggcagccgccgccgctggtgcgcgcggcgtgggcgcgggagagcgcgcgcccgggaaa ttgccatataagaacccaacacaccttgctcagcagcaggaaccctggagtcggctcaac tcaacccccacaattacttccatgaggcgggatgcctactattttgatcccgagatacca aaggatgacctggacttccgcttagcagccttgtacaaccaccacactgggacattcaag aacaaaagtgagatactgttaaaccagaaaaccacgcaggatacctatagaaccaagatc caattccctggagaatttttaacccctcccactccacccatcactttcctggctaacatc agacactggatcaaccctaaaaaggagtccatccacagcatccaaggatccatagagatc tgcctgatcatgccggaaacgctctcccagtcttcccaggggagagtcatgaccattcca taccagcccatgccggccaagtccccagtcatctgcgcgggcggccaagatcgttgcagc aaggctgtgggctacccccgaggcacccgtgacctggagggaccacctctagatgcctac tcgattcaaggacaacacatcatttctccgcttgatctggccaagctgaaccaggtggca agacaacaatctcactttgccatgatgcacggcgggaccggattcacccgaattgactcc agttctccagaggtgaaaggctaa >gi568815597r:108964526_109175066|GENSCAN_predicted_peptide_8|266_aa MLPQAKEIKDGWLSPEARAEAGARFSLVASEGTSPANTLILGFWHPELCDNKFLLFKATQ SVALCYSSPERRMQAPLRFSGMKYIHTVVLPSAPPSSLSTELSYKIEMLYSSNIFSSLKA SAVPQTQYSEYHYEYTACDSTGSRWRVAVPHTPGLCTSLPDPIKGTECSFSCNAGEFLDM KDQSCKPCAEGRYSLGTGIRFDEWDELPHGFASLSANMELDDSAAESTGNCTSDEELEAQ QGYDLLFVIRFTAGEIEIRTQASTAP >gi568815597r:108964526_109175066|GENSCAN_predicted_CDS_8|801_bp atgctgccacaagccaaggaaatcaaggatggctggctgtcccccgaagccagggcagaa gcaggggccagattctcccttgtggcttccgaaggaaccagccctgccaacaccttgatt ttgggcttctggcatccagaactgtgcgacaacaaatttctattgttcaaagccacccag tctgtggcactttgttacagcagccctgaaagacgaatgcaggcacctctacggttcagt ggcatgaagtacattcacactgttgtgctaccatcggcaccaccatcgtcattatccaca gaactgtcttacaaaattgaaatgctgtactcatcaaacatttttagcagcttgaaagcc agtgctgtgccccagacacagtattctgagtaccactatgagtacacggcgtgtgacagc acgggttccaggtggagggtcgccgtgccgcataccccgggcctgtgcaccagcctgcct gaccccatcaagggcaccgagtgctccttctcctgcaacgccggggagtttctggatatg aaggaccagtcatgtaagccatgcgctgagggccgctactccctcggcacaggcattcgg tttgatgagtgggatgagctgccccatggctttgccagcctctcagccaacatggagctg gatgacagtgctgctgagtccaccgggaactgtacttcagacgaggaactggaagctcag caaggttatgacctgctctttgttatccggtttacagccggcgagattgaaattcgaacc caagcttccactgctccataa >gi568815597r:108964526_109175066|GENSCAN_predicted_peptide_9|165_aa MTRSKWVPRGDYIASNTDECTATLMYAVNLKQSGTVNFEYYYPDSSIIFEFFVQNDQCQP NADDSRWMKTTEKGWEFHSVELNRGNNVLYWRTTAFSVWTKVPKPVLVRNIAITGVAYTS ECFPCKPGTYADKQGSSFCKLCPANSYSNKGETSCHQCDPDKYSX >gi568815597r:108964526_109175066|GENSCAN_predicted_CDS_9|495_bp atgaccaggtccaagtgggttccccggggcgactacatcgcctccaacacggacgaatgc acagccacactgatgtacgccgtcaacctgaagcaatctggcaccgttaacttcgaatac tactatccagactccagcatcatctttgagtttttcgttcagaatgaccagtgccagccc aatgcagatgactccaggtggatgaagaccacagagaaaggatgggaattccacagtgtg gagctaaatcgaggcaataatgtcctctattggagaaccacagccttctcagtatggacc aaagtacccaagcctgtgctggtgagaaacattgccataacaggggtggcctacacttca gaatgcttcccctgcaaacctggcacgtatgcagacaagcagggctcctctttctgcaaa ctttgcccagccaactcttattcaaataaaggagaaacttcttgccaccagtgtgaccct gacaaatactcagnn