GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:58:19

Sequence gi568815579f:8151077_8362106 : 211030 bp : 51.28% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Intr +  16261  16361  101  1  2   50   94    87 0.080   5.75
1.02  Intr +  40525  40602   78  2  0  126   91   -25 0.281   1.62
1.03  Term +  58419  58720  302  1  2   83   52   141 0.491   5.73
1.04  PlyA +  59644  59649    6                               1.05

2.00  Prom +  59927  59966   40                              -6.30
2.01  Sngl +  63806  64039  234  1  0   74   43   233 0.841  12.80
2.02  PlyA +  70420  70425    6                               1.05

3.00  Prom +  82431  82470   40                              -3.21
3.01  Init +  89066  89083   18  1  0   90   97     6 0.472   1.69
3.02  Intr +  89410  89487   78  0  0   74  100     9 0.499   0.94
3.03  Intr +  99968 100173  206  1  2  113  105   213 0.885  24.02
3.04  Intr + 103423 103540  118  1  1   73   78    90 0.990   7.47
3.05  Intr + 104531 104649  119  1  2  100  115   174 0.925  21.07
3.06  Intr + 104746 104803   58  1  1   98   94    85 0.993   9.48
3.07  Intr + 105542 105634   93  1  0   92  100    69 0.975   9.16
3.08  Intr + 105873 106001  129  2  0   72   64   247 0.999  22.10
3.09  Intr + 106803 106909  107  2  2  120   81   145 0.987  16.51
3.10  Intr + 110612 110768  157  2  1   88   80   310 0.999  30.73
3.11  Term + 110854 111033  180  0  0  136   42   128 0.920  10.93
3.12  PlyA + 111836 111841    6                               1.05

4.03  PlyA - 118588 118583    6                               1.05
4.02  Term - 128617 128363  255  1  0   14   48   217 0.163   6.22
4.01  Init - 142417 142400   18  1  0  111   58    40 0.001   1.35
4.00  Prom - 145478 145439   40                              -3.11

5.06  PlyA - 148199 148194    6                              -0.45
5.05  Term - 151529 151387  143  0  2  132   55   128 0.999  12.30
5.04  Intr - 151904 151701  204  0  0   76   99   182 0.850  17.80
5.03  Intr - 153012 152779  234  1  0   66   56   242 0.950  16.99
5.02  Intr - 154080 153955  126  1  0   84   73   161 0.999  15.36
5.01  Init - 157214 157073  142  2  1   80  109   194 0.713  18.98
5.00  Prom - 157547 157508   40                              -2.61

6.19  PlyA - 157621 157616    6                               1.05
6.18  Term - 160519 160429   91  0  1   95   47    72 0.879   1.29
6.17  Intr - 165569 165420  150  1  0  121   78   160 0.947  18.09
6.16  Intr - 169830 169781   50  0  2  132   89   105 0.762  12.97
6.15  Intr - 170337 170232  106  2  1   61   45   149 0.554   8.62
6.14  Intr - 171047 170847  201  1  0   37   72   171 0.701   9.32
6.13  Intr - 171734 171585  150  1  0   50  100    38 0.651   0.99
6.12  Intr - 171846 171767   80  0  2  100   63    67 0.944   4.24
6.11  Intr - 173471 173373   99  2  0   95   99   170 0.999  19.61
6.10  Intr - 173754 173554  201  0  0   93   64   292 0.999  27.30
6.09  Intr - 174020 173875  146  0  2   82   96   241 0.990  24.81
6.08  Intr - 178889 178796   94  2  1   91   95    -7 0.411   0.34
6.07  Intr - 182154 181921  234  0  0  103   59   306 0.291  27.62
6.06  Intr - 182732 182648   85  1  1  124   69    52 0.999   7.22
6.05  Intr - 183040 182834  207  0  0   70   83   348 0.521  31.22
6.04  Intr - 183343 183244  100  2  1   81   44   135 0.999   8.07
6.03  Intr - 184716 183424 1293  1  0  110  100  1527 0.979 144.17
6.02  Intr - 186780 186719   62  2  2  105  117    84 0.485  11.87
6.01  Init - 189469 189426   44  1  2   67  103    18 0.277   1.33

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  23012  23070   59  1  2   40  106    59 0.817   3.63
S.002 Intr + 122955 123073  119  0  2   81   39    97 0.935   4.71
S.003 Intr + 137818 137936  119  1  2   81   39    97 0.891   4.71

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579f:8151077_8362106|GENSCAN_predicted_peptide_1|160_aa
XKLQQEVTEQQNRDLIQVSVEGAAARPCGEVKCAAPPIENNQEWPGMVALAYNLSTLGSP
GKRAPRAANTAPNATLTAVPGPASAAMAAPARPPLAAGLQPPSPRRGLSPSGLPIMAVSP
SYPRSPTRAGAQRVPEDAAVKRAGASRGDPGLVTRGNPDV

>gi568815579f:8151077_8362106|GENSCAN_predicted_CDS_1|483_bp
nncaaattgcaacaggaggtcacagagcaacagaaccgagatctgatccaagtgtctgtg
gagggtgcagctgccagaccctgtggcgaggtcaaatgtgcagcccctccgatagaaaac
aaccaggaatggccaggcatggtggctcttgcctataatctcagcactttgggaagccca
ggtaagcgagcaccccgcgccgcgaacaccgccccgaacgccaccctcacggcggtcccc
ggccccgcctccgcggccatggcggccccagcccggcccccactcgcagctgggctccag
cccccatcaccccgccggggactgtccccctcggggcttcccatcatggccgtctccccg
agctatccgcgctcccccacgagggccggggcgcaacgggtgcccgaggacgcggctgtg
aaacgggctggagcgagtaggggcgatcctggcctcgtgacgcgggggaatccggatgtg
tga

>gi568815579f:8151077_8362106|GENSCAN_predicted_peptide_2|77_aa
MKGSGEEVGGEEEEEREEDEGEEREEGEEEKKEEEGEEREEEEEKKEEEGEDEEEEATAA
ASGQAARAGWRSQDVLR

>gi568815579f:8151077_8362106|GENSCAN_predicted_CDS_2|234_bp
atgaagggcagtggggaggaggtgggaggggaggaggaagaggaaagggaggaagacgag
ggggaggagagggaggagggggaggaggagaaaaaggaagaggagggggaggagagggag
gaggaggaggagaaaaaggaagaggagggggaggacgaggaggaagaagcaacggcagca
gccagtgggcaagcagccagagctggatggaggagccaggacgttctcagatag

>gi568815579f:8151077_8362106|GENSCAN_predicted_peptide_3|420_aa
MVQVSLHVTCSRLSLWPQESGDPRQRARLQGRTSQLTGTCVHRMLSSFNEWFWQDRFWLP
PNVTWTELEDRDGRVYPHPQDLLAALPLALVLLAMRLAFERFIGLPLSRWLGVRDQTRRQ
VKPNATLEKHFLTEGHRPKEPQLSLLAAQCGLTLQQTQRWFRRRRNQDRPQLTKKFCEAS
WRFLFYLSSFVGGLSVLYHTLKPSLYWWYLLELGFYLSLLIRLPFDVKRKDFKEQVIHHF
VAVILMTFSYSANLLRIGSLVLLLHDSSDYLLEACKMVNYMQYQQVCDALFLIFSFVFFY
TRLVLFPTQILYTTYYESISNRGPFFGYYFFNGLLMLLQLLHVFWSCLILRMLYSFMKKG
QMEKDIRSDVEESDSSEEAAAAQEPLQLKNGAAGGPRPAPTDGPRSRVAGRLTNRHTTAT

>gi568815579f:8151077_8362106|GENSCAN_predicted_CDS_3|1263_bp
atggtacaagtgagcttgcatgtcacgtgctcccgcctgagcctctggccgcaggaatca
ggtgaccccaggcagcgggccaggctgcaggggcggacctcccagctaactggaacctgt
gtccacagaatgctgtccagtttcaacgagtggttttggcaggacaggttctggttacca
cccaatgtcacgtggacagagctagaagaccgggatggccgtgtctacccccacccccag
gacttgttggcagccctgcccctggcgctggtcctcctggccatgcgccttgcctttgag
agattcattggcctgcccctgagccggtggctgggtgtgagggatcagaccaggaggcaa
gtgaagcccaacgccacgctggagaaacacttcctcacggaagggcacaggcccaaggag
ccccagctgtctctcctggccgcccagtgtggcctcacgctgcagcagacccagcgatgg
ttccggagacgccggaaccaggatcgaccccagctgaccaagaagttctgtgaggccagc
tggaggtttctcttctacctgtcctccttcgtgggcggcctctcggtcctgtaccacact
ctgaagccatccctgtactggtggtacctcttggagctgggtttctacctctcactgcta
atcaggctgccctttgatgtcaagcgcaaggatttcaaggagcaggtgatacaccacttc
gtggcggtcatcctgatgaccttctcctacagtgccaacctgctgcgcattggctctctg
gtgctgctgttacacgattcctctgactacctgctggaggcctgtaagatggtcaactac
atgcagtatcagcaagtgtgcgacgctctcttcctcatcttctcctttgtcttcttctac
acccgactggtcctctttcccacccagatcctctacaccacatactacgagtccatcagc
aacaggggccccttcttcggctactacttcttcaacgggcttctgatgttgctgcagctg
ctgcacgtgttctggtcttgcctcattctgcgcatgctctatagcttcatgaagaagggc
cagatggagaaggacattcgtagtgatgtagaagaatcagactccagtgaggaggcggcg
gcggcccaggaacctctgcagctaaagaacggggcagctggagggcccaggccagccccc
actgatggccctcggagccgggtggccgggcgtctgaccaacaggcacacaacagccaca
tag

>gi568815579f:8151077_8362106|GENSCAN_predicted_peptide_4|90_aa
MARPRLVCSYTPEPERPRTHQKEETTNTSEHQKEQTVDTLPLRTVTFTTRVCGFILEVSE
TKTPPILDTLLHRVLIQSAWCPYKKKKYGH

>gi568815579f:8151077_8362106|GENSCAN_predicted_CDS_4|273_bp
atggccagaccccgactggtctgtagctacactcctgagccggaaagaccacgaacccac
cagaaggaggaaactacgaacacatccgaacatcagaaggaacaaactgtggacacgctg
cctttaagaactgtaacattcaccacgagggtctgtggcttcattcttgaagtcagtgag
accaagaccccaccaattctggatacactgctacatcgggtcctcatccaatctgcctgg
tgtccctacaagaagaagaaatatggacactaa

>gi568815579f:8151077_8362106|GENSCAN_predicted_peptide_5|282_aa
MSGGWMAQVGAWRTGALGLALLLLLGLGLGLEAAASPLSTPTSAQAAGPSSGSCPPTKFQ
CRTSGLCVPLTWRCDRDLDCSDGSDEEECRIEPCTQKGQCPPPPGLPCPCTGVSDCSGGT
DKKLRNCSRLACLAGELRCTLSDDCIPLTWRCDGHPDCPDSSDELGCGTNEILPEGDATT
MGPPVTLESVTSLRNATTMGPPVTLESVPSVGNATSSSAGDQSGSPTAYGVIAAAAVLSA
SLVTATLLLLSWLRAQERLRPLGLLVAMKESLLLSEQKTSLP

>gi568815579f:8151077_8362106|GENSCAN_predicted_CDS_5|849_bp
atgagcggcggttggatggcgcaggttggagcgtggcgaacaggggctctgggcctggcg
ctgctgctgctgctcggcctcggactaggcctggaggccgccgcgagcccgctttccacc
ccgacctctgcccaggccgcaggccccagctcaggctcgtgcccacccaccaagttccag
tgccgcaccagtggcttatgcgtgcccctcacctggcgctgcgacagggacttggactgc
agcgatggcagcgatgaggaggagtgcaggattgagccatgtacccagaaagggcaatgc
ccaccgccccctggcctcccctgcccctgcaccggcgtcagtgactgctctgggggaact
gacaagaaactgcgcaactgcagccgcctggcctgcctagcaggcgagctccgttgcacg
ctgagcgatgactgcattccactcacgtggcgctgcgacggccacccagactgtcccgac
tccagcgacgagctcggctgtggaaccaatgagatcctcccggaaggggatgccacaacc
atggggccccctgtgaccctggagagtgtcacctctctcaggaatgccacaaccatgggg
ccccctgtgaccctggagagtgtcccctctgtcgggaatgccacatcctcctctgccgga
gaccagtctggaagcccaactgcctatggggttattgcagctgctgcggtgctcagtgca
agcctggtcaccgccaccctcctccttttgtcctggctccgagcccaggagcgcctccgc
ccactggggttactggtggccatgaaggagtccctgctgctgtcagaacagaagacctcg
ctgccctga

>gi568815579f:8151077_8362106|GENSCAN_predicted_peptide_6|1130_aa
MKWVWQMPVSGGLDRVPLTAAAGNMAKFALNQNLPDLGGPRLCPVPAAGGARSPSSPYSV
ETPYGFHLDLDFLKYIEELERGPAARRAPGPPTSRRPRAPRPGLAGARSPGAWTSSESLA
SDDGGAPGILSQGAPSGLLMQPLSPRAPVRNPRVEHTLRETSRRLELAQTHERAPSPGRG
VPRSPRGSGRSSPAPNLAPASPGPAQLQLVREQMAAALRRLRELEDQARTLPELQEQVRA
LRAEKARLLAGRAQPEPDGEAETRPDKLAQLRRLTERLATSERGGRARASPRADSPDGLA
AGRSEGALQVLDGEVGSLDGTPQTREVAAEAVPETREAGAQAVPETREAGVEAAPETVEA
DAWVTEALLGLPAAAERELELLRASLEHQRGVSELLRGRLRELEEAREAAEEAAAGARAQ
LREATTQTPWSCAEKAAQTESPAEAPSLTQESSPGSMDGDRAVAPAGILKSIMKKRDGTP
GAQPSSGPKSLQFVGVLNGEYESSSSEDASDSDGDSENGGAEPPGSSSGSGDDSGGGSDS
GTPGPPSGGDIRDPEPEAEAEPQQVAQGRCELSPRLREACVALQRQLSRPRGVASDGGAV
RLVAQEWFRVSSQRRSQAEPVARMLEGVRRLGPELLAHVVNLADGNGNTALHYSVSHGNL
AIASLLLDTGQSQGRPNPCCKAGPLCTGYPFSPLHLGMCFSYLPESGACEVNRQNRAGYS
ALMLAALTSVRQEEEDMAVVQRLFCMGDVNAKASQTGQTALMLAISHGRQDMVATLLACG
ADVNAQDADGATALMCASEYGRLDTVRLLLTQPGCDPAILDNEGTSALAIALEAEQDEVA
ALLHAHLSSGQPDTQSESPPGSQTATPGEGECGDNGENPQVHCGEISSSVTSAFGAQKGP
GSPAQRLTLAEEKGNFSWGDCGRKGLTLAPPRSIKGVSSLHPPASSQAQRNLRASRSDSK
RVSTSPSRTGPFTLRMMDRLVSSMNSTRTYGSPPEWRKNSPSGGDVTGCALQYRGRKMAS
ATRLIQRLRNWASGHDLQGKLQLRYQEISKRTQPPPKLPVGPSHKLSNNYYCTRDGRRES
VPPSIIMSSQKALVSGKPAESSAVAATEKKAVTPAPPIKRWELSSDQPYL

>gi568815579f:8151077_8362106|GENSCAN_predicted_CDS_6|3393_bp
atgaagtgggtttggcagatgcctgtcagtggaggactggacagggtgcctctgacagct
gctgcaggaaacatggccaagtttgccctgaatcagaacctgcccgacctgggcggcccc
cgcctgtgcccggtccccgccgccgggggcgcacgcagcccgagctcgccctactcggtg
gagacgccctacggcttccacctggacctggacttcctcaagtacatagaggagctggag
cgtggccccgctgcccgccgcgccccgggacccccgacctcgcgccgtccccgcgcgccc
cggcccggcctcgcgggcgcacgtagcccaggcgcctggacatccagcgagtccctggcc
agtgacgacggtggagcaccgggcatactctcccagggcgcgccctcggggctcctgatg
cagccgctgtcgccgcgcgcgcccgtgcgcaacccgcgcgtcgagcacacgctccgggag
accagccggcggctggagctggcgcagacacacgagcgcgcgcccagccccggccgcggg
gtcccgcgcagcccacgcgggtccggccgcagcagccccgcccctaaccttgcccctgct
tcgcccggccctgcccaactgcagctggtgcgcgagcagatggccgcggcgctgcggcgc
ctgcgcgagctcgaggaccaggcgcgaacgctgcccgagctgcaggagcaggtgcgcgcg
ctgcgcgccgagaaggcgcggctgctggccgggcgcgcgcagcccgagccggacggggag
gctgagacgcgcccggacaagctcgcccagctgcggcggctcaccgagcgcctggccacc
tccgagcgcggcggccgtgccagggccagcccccgggctgacagcccagacggcctggct
gcagggcgcagcgagggcgcgctccaggtcctcgacggggaggtcgggagtctcgatggg
acgccccagacccgggaggtggccgccgaggccgtgcccgagacccgagaagcgggtgcc
caggccgtgccggagacccgggaggccggcgtggaggctgcccccgagaccgtggaggcg
gacgcgtgggtgaccgaggcgctgctggggctgcctgcggccgccgagcgcgagctagag
ctgctgcgcgccagtctggagcaccagcgcggggtgagtgagcttctgcggggccggttg
cgggagctggaggaagcccgcgaggctgcggaggaggcagcggcgggggcccgggcccag
ctacgcgaggccaccacccagaccccgtggagctgtgccgaaaaggccgcgcagaccgag
tccccggcagaggcgccctccttgactcaggagagctcgcccggatccatggacggagac
agggccgtggcgcccgcgggcatcctcaaatccatcatgaagaagagagacggcacacct
ggtgcccaacccagctccggacccaagagcctgcagtttgttggggtcctcaacggagag
tacgagagctcctccagcgaggacgccagcgacagcgatggcgacagcgagaacggtggc
gccgagcccccgggtagctcctcgggctccggggatgacagcggcgggggatccgactcg
ggcacccctggccctcccagcggcggggacatccgggaccctgagcccgaggcggaggca
gagcctcagcaggtggcacaggggaggtgcgagctgagcccgcgtctgagggaggcgtgc
gtagcgctgcagcggcagctgagccggccccgcggagtagccagcgacggcggcgcagtg
cgcctcgtggcccaggagtggtttcgagtgtccagccagcggcgctctcaggcggagccc
gtggccaggatgctggaaggggtgaggcgcctgggacccgaactgctggcgcacgtggtg
aacctggcggatggcaacgggaacacggccctgcactacagtgtgtcccacgggaacctg
gccatcgcaagcctgctcctggatacgggtcagagccaagggaggcctaatccttgctgc
aaagcaggtccactgtgcactggttaccccttcagcccattgcacctaggaatgtgcttt
tcctacctgcctgagtctggggcctgcgaggtcaaccgccagaaccgagccggctactcg
gccctcatgctggctgcactcacctctgtgaggcaggaagaggaggacatggctgtggtc
cagagactcttctgcatgggtgatgtcaatgccaaggccagtcagacggggcagacagcc
ctcatgctggccatcagccatggccgacaggacatggtggcaaccctactggcgtgtggg
gctgatgtgaatgcgcaggatgcggatggggccacagcgctgatgtgtgccagtgagtat
gggcgcctggacaccgtgcggctgctgctcacccagccaggctgtgaccctgccatcctg
gacaatgagggcaccagtgccctggccatcgccctggaggctgagcaggatgaggtggcc
gctctgctacatgcccacctgagctcgggccagcccgacacccagagcgagtcaccccct
ggctcccagacagccacacctggtgaaggagaatgcggtgacaatggagagaacccccag
gttcactgtggggagatctcctcgtcagtcacctcagcctttggcgcacagaagggtcca
gggtcccctgctcagaggctaacactggccgaagagaaaggcaatttcagttggggtgac
tgtggcaggaaggggctcactctggccccaccaaggtctatcaaaggggtgtcctctttg
cacccaccagcgagcagccaagctcagcgcaacctccgggcttctcgctctgactccaaa
agggtgagcacgtcgccctcgcgcacggggccttttacattgcggatgatggatcggctc
gtgtcgtccatgaattccacgcgcacctacgggtccccaccggagtggaggaagaacagc
cccagcggtggagacgtcaccggctgcgcccttcagtatcgcggacggaagatggcgtcc
gccacccgtctcatccagcggctgcggaactgggcgtccgggcatgacctgcaggggaag
ctgcagctacgctaccaggagatctccaagcgaactcagcctcctcccaagctccctgtg
ggtcctagccacaagctctccaacaattactattgcactcgcgatggccgccgggaatct
gtgcccccttccatcatcatgtcgtcgcagaaggcgctggtgtcaggcaagccagcagag
agctctgctgtagctgccactgagaagaaggcggtgactccagctcctcccataaagagg
tgggagctgtcctcggaccagccttacctgtga