GENSCAN 1.0 Date run: 5-Nov-116 Time: 21:53:21

Sequence gi568815586f:68709206_68939846 : 230641 bp : 41.17% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    800    888   89  1  2  111  111    37 0.913   6.15
 1.02 Intr +   4525   4603   79  1  1   80   27    68 0.852  -1.47
 1.03 Intr +   6422   6535  114  1  0  138   78    97 0.876  13.42
 1.04 Term +  10136  10279  144  1  0   95   36    78 0.446   0.23
 1.05 PlyA +  10458  10463    6                               1.05

 2.00 Prom +  11222  11261   40                              -5.55
 2.01 Init +  12624  12781  158  2  2   43   87   183 0.918  13.13
 2.02 Intr +  12899  12947   49  2  1   46   94    66 0.976   0.76
 2.03 Intr +  21905  22055  151  1  1   91   98    69 0.998   7.01
 2.04 Intr +  24247  24407  161  2  2   63   77   228 0.995  17.99
 2.05 Intr +  25503  25628  126  2  0   46  100    99 0.991   6.86
 2.06 Intr +  26026  26139  114  0  0  105   99    94 0.999  11.92
 2.07 Intr +  32608  32775  168  0  0   97  119    33 0.918   6.52
 2.08 Term +  33150  33257  108  2  0   75   29    64 0.886  -3.27
 2.09 PlyA +  33304  33309    6                               1.05

 3.00 Prom +  33773  33812   40                              -7.85
 3.01 Init +  37173  37574  402  2  0   88  105   411 0.917  37.47
 3.02 Intr +  38725  38839  115  0  1  103   19    50 0.533  -1.30
 3.03 Intr +  41277  41488  212  1  2    4   99   112 0.477   1.61
 3.04 Intr +  42827  42985  159  1  0   70  111   183 0.973  18.06
 3.05 Intr +  43058  43123   66  1  0   61  116    59 0.684   4.18
 3.06 Intr +  68395  68586  192  0  0   76   97    74 0.530   5.87
 3.07 Intr +  83883  83949   67  2  1   77   61    55 0.036  -0.74
 3.08 Intr +  90015  90170  156  1  0   10   99   121 0.248   4.46
 3.09 Intr +  95732  95881  150  0  0   42   24   134 0.041   1.61
 3.10 Intr +  95887  96083  197  2  2   48   56   143 0.301   5.31
 3.11 Term +  98971  99240  270  0  0   30   55   305 0.255  15.90
 3.12 PlyA + 100183 100188    6                               1.05

 4.00 Prom + 108054 108093   40                              -4.05
 4.01 Init + 111133 111169   37  0  1   53  110    18 0.502   0.79
 4.02 Intr + 115158 115225   68  1  2  104   84    73 0.871   6.21
 4.03 Intr + 115350 115446   97  2  1   51   76    82 0.844   1.96
 4.04 Intr + 119566 119726  161  2  2   52   52   117 0.839   3.29
 4.05 Intr + 126624 126779  156  2  0  109   52   234 0.997  21.19
 4.06 Intr + 127467 127544   78  2  0   47   87    93 0.928   3.93
 4.07 Term + 130069 130644  576  0  0   61   33   377 0.529  22.98
 4.08 PlyA + 132964 132969    6                               1.05

 5.10 PlyA - 133532 133527    6                               1.05
 5.09 Term - 146158 146138   21  1  0  102   41     8 0.271  -5.17
 5.08 Intr - 147474 147262  213  0  0  130   64    52 0.794   4.99
 5.07 Intr - 149866 149718  149  2  2  124  101    32 0.956   7.13
 5.06 Intr - 160326 160120  207  1  0   41  113    96 0.827   5.53
 5.05 Intr - 161194 161010  185  0  2   72  121   130 0.995  13.21
 5.04 Intr - 162751 162579  173  1  2   50   98   195 0.819  14.52
 5.03 Intr - 176684 176587   98  0  2   78   94    54 0.245   3.81
 5.02 Intr - 184276 184066  211  1  1   95   37   107 0.332   3.86
 5.01 Init - 187399 187295  105  1  0   80   72    47 0.207   2.57
 5.00 Prom - 201600 201561   40                              -4.85

 6.00 Prom + 203341 203380   40                              -5.65
 6.01 Init + 204401 204844  444  1  0   25   75   231 0.887  11.56
 6.02 Intr + 208032 208115   84  2  0  127   50    19 0.419   1.10
 6.03 Term + 209292 209462  171  2  0   92   43    71 0.548  -0.16
 6.04 PlyA + 211088 211093    6                               1.05

 7.04 PlyA - 211925 211920    6                               1.05
 7.03 Term - 213370 213247  124  0  1    2   42   104 0.082  -5.92
 7.02 Intr - 218507 218018  490  0  1   18   53   224 0.321   3.04
 7.01 Init - 223632 223473  160  0  1   76   97   157 0.726  13.46

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:68709206_68939846|GENSCAN_predicted_peptide_1|141_aa
LVVDWLESIAKDEIGEFSDNIEFYAKSVYWENTLHTLKQRQLTSYVGSVRPLVTELDPDA
PIRQKMPLDDLDREDEVRLLKYLFTLIRAGMTEEAQRLCKRCGQAWRAATLEGWKLYHDP
NVNGGILVDFILTVEDKGISL

>gi568815586f:68709206_68939846|GENSCAN_predicted_CDS_1|426_bp
ctggtggtagattggttagagagtattgccaaagatgaaattggagaattttctgataat
attgagttttatgcaaaatcagtatattgggaaaatactctgcataccttaaaacaacgg
cagctgacttcttacgttggaagtgttcgtccgcttgtcactgaattggaccctgatgct
cccataagacagaaaatgccccttgatgatctggatagagaagatgaagttagattactc
aaatatctctttactctaatccgtgctggaatgacagaagaggcacaacgactctgtaaa
cgctgtggtcaagcatggagagctgcaacacttgaaggctggaaactgtaccatgaccct
aatgttaatggaggtattttagtagattttattctgacagttgaggacaaaggcatttcg
ctctaa

>gi568815586f:68709206_68939846|GENSCAN_predicted_peptide_2|344_aa
MGKKLLPVCDTWEDTVWAYFRVMVDSLVEQEIQTSVATLDETEELPREYLGANWTLEKVF
EELQATDKKLLIREKHTNLIAFYTCHLPQDLAVAQYALFLESVTEFEQRHHCLELAKEAA
SKKHEAAKEVFVKIPQDSIAEIYNQCEEQGMESPLPAEDDNAIREHLCIRAYLEAHETFN
EWFKHMNSVPQKPALIPQPTFTEKVAHEHKEKKYEMDFGIWKGHLDALTADVKEKMYNVL
LFVDGGWMVDVREDAKEDHERTHQMVLLRKLCLPMLCFLLHTILHSTGQYQECLQLADMV
SSERHKLYLVFSKEELRKLLQKLRESSLMLLDQGLDPLGYEIQL

>gi568815586f:68709206_68939846|GENSCAN_predicted_CDS_2|1035_bp
atggggaaaaagctgcttcctgtctgtgacacctgggaagacacagtttgggcctacttc
cgggtgatggtggacagtctggtagaacaggagatccagacatcagtagcaactctggat
gaaactgaagaactccctagagaatatctgggagcaaactggacgttagaaaaggttttt
gaggaacttcaagctactgacaaaaagcttttaataagagagaaacatacaaatcttata
gcattttatacctgtcatttgcctcaagacctagctgttgcccagtatgcattatttttg
gaaagtgttacagaatttgaacagcgccaccattgcctggagttggctaaagaagcagca
tcaaaaaagcacgaagctgcaaaagaagtatttgtgaaaattcctcaggattctatagca
gaaatctataatcagtgcgaggaacaaggaatggaaagtccacttcctgctgaagatgat
aatgctatccgagaacatttgtgcatcagagcttatttggaagcccatgaaacctttaat
gagtggtttaagcatatgaattcagttccacaaaaacctgctttgatacctcaaccaact
tttactgagaaagtggctcatgaacacaaagaaaagaaatatgaaatggattttggtatt
tggaaagggcatttggatgccctaactgctgatgtgaaggagaaaatgtataacgtcttg
ttgtttgttgatggagggtggatggtggatgttagagaggatgccaaagaagaccatgaa
agaacacatcaaatggtcttactgagaaagctttgtctgccaatgttgtgttttctgctt
catacgatattgcacagtactggtcagtatcaggaatgcctacagttagcagatatggta
tcctctgagcgccacaaactgtacctggtattttctaaggaagagctaaggaagttgctg
cagaagctcagagagtcctctctaatgctcctagaccagggacttgacccattagggtat
gaaattcagttatag

>gi568815586f:68709206_68939846|GENSCAN_predicted_peptide_3|661_aa
MALLVDRVRGHWRIAAGLLFNLLVSICIVFLNKWIYVYHGFPNMSLTLVHFVVTWLGLYI
CQKLDIFAPKSLPPSRLLLLALSFCGFVVFTNLSLQNNTIGTYQLAKAMTTPVIIAIQTF
CYQKTFSTRIQLTLIPITLGVILNSYYDVKFNFLGMVFAALGVLVTSLYQVVGSAKRARR
KGHRESGHVHGDHSALRRALEGWWQEPGCAALRDSGGLRAPAVPGSPVAKTRGCLKRLQK
ERQWVGAKQHELQVNSMQLLYYQAPMSSAMLLVAVPFFEPVFGEGGIFGPWSVSALFSEH
YVRSILTFIVIITIKGGKREDAQPKWDSLNFPTLRCQRIRGSHAAYLESARNGVHPKNDH
HRIYELLLDHAINKSSSPVPLKCHLWSRPIRAGKAAVLFASANEARKRGLPSYNHKELNL
ADNLNKLESRFFPRSLRAQAGPHLDLGFMTPGAEEPVQPLADDPGRVDAAVAPSLLACIY
LRCLKPCDPSGPRGCLITLLGCLLSSGDPGWPEGKKIRGRPIEDGFLLLLLGLALPRIRF
LASVYSASHCARDLTGRAGLLADLGLLTTPLLGARTEAPRRAWLLLGPVWPCVSERWSKK
PSPRGGRDPSDRDPAAFAARSTVPPRISAYERPVPWPGEWNDPRGPGRRASARPVKETGE
S

>gi568815586f:68709206_68939846|GENSCAN_predicted_CDS_3|1986_bp
atggcattgctggtggaccgagtgcggggccactggcgaatcgccgccgggctcctgttc
aacctgctggtgtccatctgcattgtgttcctcaacaaatggatttatgtgtaccacggc
ttccccaacatgagcctgaccctggtgcacttcgtggtcacctggctgggcttgtatatc
tgccagaagctggacatctttgcccccaaaagtctgccgccctccaggctcctcctcctg
gccctcagcttctgtggctttgtggtcttcactaacctttctctgcagaacaacaccata
ggcacctatcagctggccaaggccatgaccacgccggtgatcatagccatccagaccttc
tgctaccagaaaaccttctccaccagaatccagctcacgctgattcctataactttaggt
gtaatcctaaattcttattacgatgtgaagtttaatttccttggaatggtgtttgctgct
cttggtgttttagttacatccctttatcaagtggttggttctgcaaagagggctcgcagg
aagggccacagggagtctggacatgtccatggagatcatagtgcccttaggagagcatta
gaggggtggtggcaggagccaggctgtgctgcattgagggacagtggaggcctgagagcc
cctgctgttccaggcagtcctgtggcgaagacacgcggctgcctgaagaggctgcaaaag
gaaaggcagtgggtaggagccaaacagcatgaattacaagtgaactcaatgcagctgctg
tactaccaggctccgatgtcatctgccatgttgctggttgctgtgcccttctttgagcca
gtgtttggagaaggaggaatatttggtccctggtcagtttctgctttgttttcagagcac
tatgtccgatccattctcacatttatcgtcataatcaccataaaaggaggtaagcgtgaa
gacgcacaacccaagtgggacagtctgaatttccccaccctcaggtgtcagagaatcaga
ggaagtcatgctgcttatctagaatctgctagaaatggagttcacccaaagaatgaccat
cacagaatttatgaactactcctagatcatgcaattaataagtcctctagcccagtacca
ctcaagtgccacctatggtcacgaccaattagagctggtaaggcagctgtactgtttgct
tcagccaatgaggcaagaaaaaggggactaccgtcctacaaccacaaggaactgaatttg
gctgacaatttgaataagctagaaagcaggttcttccctagatccttaagagcccaagct
ggcccacaccttgatttaggctttatgacacctggagcagaggaaccagtgcaacccttg
gcggacgacccaggacgcgttgacgcggcagtcgcgccatctcttcttgcctgcatttat
ctgcgctgtttgaagccgtgtgacccatcaggacccagaggctgtctgatcacattattg
ggctgtctgctgtcttcgggagatcctggttggcctgaagggaagaagatcagaggaaga
cctatcgaggatgggtttctcctgcttctcctcgggttggcgctgcccaggatccgcttc
ctggcctcggtgtactcggcttcccactgtgccagggacttgactggaagggcgggcctg
ctagcggacttggggctgctgaccacaccgttgctgggggcgcgcaccgaggcaccgcgg
cgagcttggctgcttctggggcctgtgtggccctgtgtgtcggaaagatggagcaagaag
ccgagcccgaggggcggccgcgacccctctgaccgagatcctgctgctttcgcagccagg
agcaccgtccctccccggattagtgcgtacgagcgcccagtgccctggcccggagagtgg
aatgatccccgaggcccagggcgtcgtgcttccgcgcgccccgtgaaggaaactggggag
tcttga

>gi568815586f:68709206_68939846|GENSCAN_predicted_peptide_4|390_aa
MIYRNLVVVNQQESSDSGTSVSENRCHLEGGSDQKDLVQELQEEKPSSSHLVSRPSTSSR
RRAISETEENSDELSGERQRKRHKSDSISLSFDESLALCVIREICCERSSSSESTGTPSN
PDLDAGVSEHSGDWLDQDSVSDQFSVEFEVESLDSEDYSLSEEGQELSDEDDEVYQVTVY
QAGESDTDSFEEDPEISLADYWKCTSCNEMNPPLPSHCNRCWALRENWLPEDKGKDKGEI
SEKAKLENSTQAEEGFDVPDCKKTIVNDSRESCVEENDDKITQASQSQESEDYSQPSTSS
SIIYSSQEDVKEFEREETQDKEESVESSLPLNAIEPCVICQGRPKNGCIVHGKTGHLMAC
FTCAKKLKKRNKPCPVCRQPIQMIVLTYFP

>gi568815586f:68709206_68939846|GENSCAN_predicted_CDS_4|1173_bp
atgatctacaggaacttggtagtagtcaatcagcaggaatcatcggactcaggtacatct
gtgagtgagaacaggtgtcaccttgaaggtgggagtgatcaaaaggaccttgtacaagag
cttcaggaagagaaaccttcatcttcacatttggtttctagaccatctacctcatctaga
aggagagcaattagtgagacagaagaaaattcagatgaattatctggtgaacgacaaaga
aaacgccacaaatctgatagtatttccctttcctttgatgaaagcctggctctgtgtgta
ataagggagatatgttgtgaaagaagcagtagcagtgaatctacagggacgccatcgaat
ccggatcttgatgctggtgtaagtgaacattcaggtgattggttggatcaggattcagtt
tcagatcagtttagtgtagaatttgaagttgaatctctcgactcagaagattatagcctt
agtgaagaaggacaagaactctcagatgaagatgatgaggtatatcaagttactgtgtat
caggcaggggagagtgatacagattcatttgaagaagatcctgaaatttccttagctgac
tattggaaatgcacttcatgcaatgaaatgaatcccccccttccatcacattgcaacaga
tgttgggcccttcgtgagaattggcttcctgaagataaagggaaagataaaggggaaatc
tctgagaaagccaaactggaaaactcaacacaagctgaagagggctttgatgttcctgat
tgtaaaaaaactatagtgaatgattccagagagtcatgtgttgaggaaaatgatgataaa
attacacaagcttcacaatcacaagaaagtgaagactattctcagccatcaacttctagt
agcattatttatagcagccaagaagatgtgaaagagtttgaaagggaagaaacccaagac
aaagaagagagtgtggaatctagtttgccccttaatgccattgaaccttgtgtgatttgt
caaggtcgacctaaaaatggttgcattgtccatggcaaaacaggacatcttatggcctgc
tttacatgtgcaaagaagctaaagaaaaggaataagccctgcccagtatgtagacaacca
attcaaatgattgtgctaacttatttcccctag

>gi568815586f:68709206_68939846|GENSCAN_predicted_peptide_5|453_aa
MAPHRLTMDFSINTGKQFHLIPGPKVTRTPSELPQVPLSINKVRNSQPTTQNMTVKLHWD
PVADLGQARGHHICTGSQLCEFRNRLGVHVDWPKPETLQKSQCMPGRNLWVLVVGRFPKE
HRIGIPEFKYVANMHGDETVGRELLLHLIDYLVTSDGKDPEITNLINSTRIHIMPSMNPD
GFEAVKKPDCYYSIGRENYNQYDLNRNFPDAFEYNNVSRQPETVAVMKWLKTETFVLSAN
LHGGALVASYPFDNGVQVSLKVLVHLLNAATGALYSRSLTPDDDVFQYLAHTYASRNPNM
KKGDECKNKMNFPNGVTNGYSWYPLQGVKGQVFDQNGNPLPNVIVEVQDRKHICPYRTNK
YGEYYLLLLPGSYIINVTVPGHDPHITKVIIPEKSQNFSALKKDILLPFQGQLDSIPVSN
PSCPMIPLYRNLPDHSAATKPSLFLFLDTSTWM

>gi568815586f:68709206_68939846|GENSCAN_predicted_CDS_5|1362_bp
atggcacctcatagactaactatggacttttccatcaatactggaaaacagtttcatcta
atccccggaccaaaggtgactcggactccaagtgagcttcctcaggttcccctgtccata
aacaaagtcaggaattcccagcccactacacagaatatgacagttaagcttcattgggat
cctgtggctgacctggggcaggctagagggcatcacatctgtacaggctctcagctttgc
gagttcagaaacagactgggtgttcatgtggactggcccaaacctgaaactctacagaaa
tcgcagtgcatgccaggtagaaacctgtgggttcttgttgtggggcggtttccaaaggaa
cacagaattgggattccagagttcaaatacgtggcaaatatgcatggagatgagactgtt
gggcgggagctgctgctccatctgattgactatctcgtaaccagtgatggcaaagaccct
gaaatcacaaatctgatcaatagtacccggatacacatcatgccttccatgaacccagat
ggatttgaagccgtcaaaaagcctgactgttattacagcatcggaagggaaaattataac
cagtatgacttgaatcgaaatttccccgatgcttttgaatataataatgtctcaaggcag
cctgaaactgtggcagtcatgaagtggctgaaaacagagacgtttgtcctctctgcaaac
ctccatggtggtgccctcgtggccagttacccatttgataatggtgttcaagtcagttta
aaggtcttggttcatcttttaaatgcagcaactggggcattatactcccgaagcttaacg
cctgatgatgatgtttttcaatatcttgcacatacctatgcttcaagaaatcccaacatg
aagaaaggagacgagtgtaaaaacaaaatgaactttcctaatggtgttacaaatggatac
tcttggtatccactccaaggtgtaaagggtcaagtttttgatcagaatggaaatccatta
cccaatgtaattgtggaagtccaagacagaaaacatatctgcccctatagaaccaacaaa
tatggagagtattatctccttctcttgcctgggtcttatataataaatgttacagtccct
ggacatgatccacacatcacaaaggtgattattccggagaaatcccagaacttcagtgct
cttaaaaaggatattctacttccattccaagggcaattggattctatcccagtatcaaat
ccttcatgcccaatgattcctctatacagaaatttgccagaccactcagctgcaacaaag
cctagtttgttcttatttttagacacatccacctggatgtga

>gi568815586f:68709206_68939846|GENSCAN_predicted_peptide_6|232_aa
MGTQGWGLKLLGNPREPSTFTCRGHILGLFLQRFLVNVADFRLLTAGSGTFIPIKSHAET
QCKKLIHLSICFPEYDGNNSHEDAGNYIKSQFLDLSMQQDIKRIYSHMTCATDTQNIKFV
FDAATQILLSKKTSRTAASSNPHYSIPQVSAVRPLSYSRTQLMTSAFIGLSTPSGQFCPS
NIRTLHTLSRAQNLSIILDFPLLLTRLPPPIQSNLIQSNPIQSNPTQPNPTH

>gi568815586f:68709206_68939846|GENSCAN_predicted_CDS_6|699_bp
atggggactcaaggatggggactcaaactccttggcaaccccagagagcctagcacattc
acctgcagagggcacatcctcggtttattcttgcaaaggttcctggtcaatgtggcagac
ttcaggctactcactgctggtagtggaacctttattcctataaaatcccatgcggaaacc
cagtgtaaaaagttaatccatctcagcatttgttttccagagtatgatggtaacaactcc
catgaggatgcggggaattacatcaagagccagttccttgacctcagtatgcaacaagat
atcaaaagaatttacagtcacatgacctgtgctacagatacacagaatatcaaatttgtg
tttgatgcagctacacagatattattatcaaagaaaacctcaaggactgcggcctcttca
aatcctcactattccattcctcaggtctcagccgtgagacctctgtcatattctcgcaca
cagctaatgacatctgcgtttattggtttgtcaactccttcagggcagttttgtccatct
aacataagaacacttcatacactcagtcgagcacaaaacctaagcatcatccttgatttt
cctcttctgctcacacgccttcctcctccaatccaatccaatctaatccaatccaatcca
atccaatccaatccaacccaacccaacccaacccattag

>gi568815586f:68709206_68939846|GENSCAN_predicted_peptide_7|257_aa
MDFPCLWLGLLLPLVAALDFNYHRQEGMEAFLKTVAQNYSSVTHLHSIGKSVKDLNVRPK
TIKTLEENLGITIQDIGTGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQ
PPKWEKIFATYSSDKGLISRIYNELKQIYKKKTDNPIEKWVKDMNRHFSKEDIYAAKKHM
KKCSPSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRVITHLPGVEDNYLDSGDTKAVMI
VPTTPVFSLSQNSGEAN

>gi568815586f:68709206_68939846|GENSCAN_predicted_CDS_7|774_bp
atggacttcccgtgcctctggctagggctgttgctgcctttggtagctgcgctggatttc
aactaccaccgccaggaagggatggaagcgtttttgaagactgttgcccaaaactacagt
tctgtcactcacttacacagtattgggaaatctgtgaaagacttaaacgttagacctaaa
accataaaaaccctagaagaaaacctaggcattaccattcaggacataggcacgggcaag
gacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggat
ctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaa
cctccaaaatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccaga
atctacaatgaactcaaacaaatttacaagaaaaaaacagacaaccccatcgagaagtgg
gtgaaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatg
aaaaaatgctcaccatcactggccatcagagaaatgcaaatcaaaaccacaatgagatac
catctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacagagttatcacg
cacttacccggagttgaagataactaccttgacagtggggatacaaaggcagtaatgata
gtgcctactaccccagtctttagtctatcacagaattcaggagaagccaattaa