GENSCAN 1.0 Date run: 2-Nov-116 Time: 21:49:29 Sequence gi568815575r:23887924_24106693 : 218770 bp : 42.79% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5836 5884 49 0 1 86 58 54 0.325 1.36 1.02 Intr + 20028 20297 270 1 0 49 1 184 0.292 2.39 1.03 Intr + 22360 22495 136 2 1 60 97 88 0.575 5.61 1.04 Intr + 28294 28405 112 1 1 83 61 66 0.848 2.86 1.05 Intr + 39316 39447 132 0 0 98 80 98 0.959 9.92 1.06 Intr + 47273 47502 230 1 2 115 66 155 0.851 11.74 1.07 Intr + 50624 50777 154 2 1 41 93 113 0.831 6.25 1.08 Intr + 54478 54579 102 0 0 10 82 122 0.673 3.25 1.09 Intr + 57930 58029 100 2 1 95 64 43 0.322 1.36 1.10 Term + 78843 79063 221 1 2 102 45 74 0.001 0.72 1.11 PlyA + 80313 80318 6 1.05 2.03 PlyA - 80527 80522 6 1.05 2.02 Term - 81607 81244 364 0 1 44 41 199 0.079 3.85 2.01 Init - 86894 86530 365 2 2 -1 53 192 0.075 3.37 2.00 Prom - 93675 93636 40 -4.65 3.03 PlyA - 95821 95816 6 1.05 3.02 Term - 101107 99998 1110 1 0 113 39 870 0.737 75.66 3.01 Init - 118770 118066 705 0 0 38 96 558 0.788 46.58 3.00 Prom - 124730 124691 40 -6.15 4.00 Prom + 138157 138196 40 -4.35 4.01 Init + 138513 138628 116 2 2 28 53 147 0.240 4.93 4.02 Intr + 142779 142914 136 0 1 29 52 151 0.249 5.25 4.03 Term + 146688 146804 117 2 0 101 48 131 0.612 7.96 4.04 PlyA + 147037 147042 6 1.05 5.00 Prom + 152416 152455 40 -4.05 5.01 Init + 155662 155755 94 0 1 83 69 65 0.613 4.69 5.02 Term + 161677 162269 593 2 2 -27 36 336 0.477 10.60 5.03 PlyA + 162782 162787 6 1.05 6.00 Prom + 165034 165073 40 -7.15 6.01 Init + 165211 165304 94 0 1 83 69 65 0.533 4.69 6.02 Intr + 169498 169625 128 2 2 87 121 136 0.999 16.28 6.03 Intr + 169710 169831 122 2 2 32 60 112 0.731 1.17 6.04 Intr + 172165 172259 95 1 2 53 67 95 0.731 2.59 6.05 Intr + 174493 174651 159 2 0 46 116 157 0.848 13.34 6.06 Intr + 176278 176412 135 2 0 77 111 91 0.995 9.92 6.07 Intr + 178075 178169 95 2 2 75 93 103 0.999 8.26 6.08 Intr + 180041 180185 145 1 1 96 87 83 0.999 7.93 6.09 Intr + 183635 183804 170 0 2 55 94 136 0.997 9.64 6.10 Intr + 185168 185340 173 1 2 100 115 108 0.932 12.52 6.11 Term + 188799 188862 64 0 1 77 49 60 0.646 -2.62 6.12 PlyA + 189523 189528 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 44612 44343 270 2 0 28 44 242 0.895 8.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:23887924_24106693|GENSCAN_predicted_peptide_1|501_aa MGFLHVGQAGLELLISGGSAPAPAVARRQESAGSVWTVRGGARREAPGVAAALGNCLCSG DWAPSGVYWDFCEQRLPRGGAWRATGGFESFAAGSRRLAAWIYLTKDYFIGGKMNRSSNV PRKGILKSGTRSLQKVRRVHFANARNARSLLSFRGETFPPFIVFKIFLHTDGHGYKYFSG KNVLMPSSKAVDDACKLMGERKFHRIIMEDERIFPKSKVTDIMDVVTMQDYVQYRSFFDE APAFSGGRNNSWRKLNLENIPRTMLMYDIVHYSESGVISNRLRNEMKFLLQRPVTQEIHK HQLRIVSEIRGPYLTVQPLYRPYKQQNQVKFLGRRSKQAQMKVEKMRKVYLAKEKNTSEV TAKNSMVVEAGLKDSGFLSLKERQPEQQPDLLNCNALKMLAVRVTPSMKFRRDFSGESGQ QGKRLQNTGAWCKLTVDLPFWGLEEGGPLLTAPLGSAPLGTLCGGFNPSFSFHTALAEVL HEVPAPAANFCLNSQAFPYIL >gi568815575r:23887924_24106693|GENSCAN_predicted_CDS_1|1506_bp atggggtttctccatgttggccaggctggtctcgaactcctgatctcaggaggctccgcc cctgcgccggcggttgctaggcggcaagaaagcgccggctctgtgtggacggtacgcgga ggcgcgaggagggaggcgcctggcgttgccgcggcgctgggaaactgtctttgcagcggc gactgggcgcccagcggcgtgtactgggatttctgtgagcagaggctgccgagagggggc gcttggagggccaccggcggcttcgaaagcttcgcagccggaagccgccggttggcggcc tggatttaccttacgaaagattacttcattggagggaaaatgaatcgttcctcaaatgta ccacgtaaaggtattctgaaatcaggtacaagatccttacaaaaagttcgcagagtacat ttcgcaaatgcacgaaatgcaagatcattactatcatttagaggtgaaacgtttccacct ttcatcgtgtttaaaatttttcttcatactgatggccatggttacaagtattttagtgga aaaaatgtattaatgccgtcaagtaaggcagtggatgatgcttgcaaactaatgggggaa aggaaatttcaccgtataattatggaagatgaacgtattttcccgaagtccaaagtaact gatataatggatgttgtcaccatgcaagattatgtacaatatcgcagttttttcgatgag gcccctgcattttctggcggcagaaataacagctggcgcaaattaaatcttgaaaatatt cccaggacaatgctaatgtatgacatagttcattattcagagtctggagtgatctcaaac cgtctacgaaatgaaatgaagtttctgttacagagaccagtaacgcaagagatccataag caccagctacggattgtttctgaaattaggggtccatacttaactgtccaacctctatat cggccctacaaacagcaaaatcaagttaaatttctgggtcgtcgatccaagcaagctcaa atgaaagttgaaaaaatgagaaaagtttatttggctaaagaaaaaaatacttctgaggtg actgcaaagaactccatggtggtagaagcaggattgaaggattccggattcctgtcactg aaggagagacagccagagcagcagcctgacttgctgaactgtaatgcactcaaaatgctg gccgttagggttacaccctccatgaaatttagaagagacttctctggagagtctggccaa caagggaaaagacttcaaaatactggtgcatggtgcaagctgacagtggatctgccattc tggggtctggaggagggtggccctcttctcacagctccactgggcagtgccccactgggg actctgtgtgggggcttcaacccctcattttccttccacactgccttagcagaggttctc catgaggtccctgcccctgcagcaaacttctgcctgaacagccaggcatttccatacatc ctctga >gi568815575r:23887924_24106693|GENSCAN_predicted_peptide_2|242_aa MCTRITHGCCGNTRSDSGVQDKARESAFPTGSQVRPLELLGCTAGVSGILQSCCISHWLC RKSVSPSPISPTPASTTARKTMGDPDEAETHFPWWPVGRSWARGRRWLKIPPLSLGRPLD GSALATIIKIDYIAALIVALKNRVRLLKKTHRTRNDKAYKSTLCYKERTHYSKSMKVKLY TKQTTPKGAWRSQVQAPIVLSIGPGQMSPGSGATNVCTEHFRTRETKMMSLRRWFVFCRS AV >gi568815575r:23887924_24106693|GENSCAN_predicted_CDS_2|729_bp atgtgcacacgaatcacccacggatgttgtggaaatactcgttctgattcaggggtccag gataaggcacgagagtctgcgtttccaacaggctctcaggtgcggcctttggagctgctg ggctgcacagcaggggtctctgggatcctgcagagctgttgcatttcccactggctttgc agaaagtctgtgtcaccaagtcctatatcacccacacctgccagcaccactgctagaaag acaatgggggacccagacgaggcagaaactcatttcccctggtggcctgtgggaaggagc tgggctcgaggcaggaggtggcttaagattcctcctctgtcccttggacggcccttggat ggaagtgcccttgcaacaattataaaaatagactatatagctgcactaatagtggcactt aaaaacagggttcgattgctaaagaagactcataggacccgaaatgataaagcttacaag tctacgctttgttacaaggaaaggacccactatagcaagagcatgaaagtcaagctatac accaagcagacgactccaaaaggggcctggagaagccaggtgcaggctcccattgtcctc tccatagggccaggccagatgtctcctggatccggggccactaatgtgtgcacagaacat ttcagaaccagggagactaaaatgatgtctttgcggaggtggtttgtattttgcaggtct gccgtataa >gi568815575r:23887924_24106693|GENSCAN_predicted_peptide_3|604_aa MAGDVEGFCSSIHDTSVSAGFRALYEEGLLLDVTLVIEDHQFQAHKALLATQSDYFRIMF TADMRERDQDKIHLKGLTATGFSHVLQFMYYGTIELSMNTVHEILQAAMYVQLIEVVKFC CSFLLAKICLENCAEIMRLLDDFGVNIEGVREKLDTFLLDNFVPLMSRPDFLSYLSFEKL MSYLDNDHLSRFPEIELYEAVQSWLRHDRRRWRHTDTIIQNIRFCLMTPTSVFEKVKTSE FYRYSRQLRYEVDQALNYFQNVHQQPLLDMKSSRIRSAKPQTTVFRGMIGHSMVNSKILL LKKPRVWWELEGPQVPLRPDCLAIVNNFVFLLGGEELGPDGEFHASSKVFRYDPRQNSWL QMADMSVPRSEFAVGVIGKFIYAVAGRTRDETFYSTERYDITNDKWEFVDPYPVNKYGHE GTVLNNKLFITGGITSSSTSKQVCVFDPSKEGTIEQRTRRTQVVTNCWENKSKMNYARCF HKMISYNGKLYVFGGVCVILRASFESQGCPSTEVYNPETDQWTILASMPIGRSGHGVTVL DKQIMVLGGLCYNGHYSDSILTFDPDENKWKEDEYPRMPCKLDGLQVCNLHFPDYVLDEV RRCN >gi568815575r:23887924_24106693|GENSCAN_predicted_CDS_3|1815_bp atggcaggggacgtggaaggattctgttcctccatccacgacaccagtgtctctgctggg ttcagagcactgtatgaggagggattgcttcttgatgtcactctggttattgaagatcat cagttccaggcccataaagcactcttggccacccagagtgattacttcagaattatgttt actgcagacatgagggaacgagatcaggacaaaattcatttaaaaggtctaacagctacc ggtttcagccatgtcctgcaatttatgtactatggaactatagagctgagtatgaatacc gttcatgagattcttcaggctgccatgtatgttcaacttatagaagtggtgaagttctgc tgctcttttctcttagcgaagatctgcctagaaaattgtgcagaaattatgagactctta gatgatttcggcgtaaacatcgagggagtcagggagaagttagacacctttctgctagac aactttgtgccactcatgtctaggcctgactttctgtcctatctgagctttgagaagctc atgtcttacttggataatgatcatctgagcaggttcccagagatagagctgtacgaggct gtgcagtcttggctgcggcatgatagaagacgctggagacataccgataccatcattcag aatatccggttttgcttgatgaccccaaccagcgtttttgagaaggttaagacatcagaa ttttatagatactcccgacagctccgttacgaagttgaccaagcattgaattactttcag aatgttcaccagcagcctttgttggatatgaagtcaagccgcatccgttctgcaaaaccg caaactacagtatttcgaggaatgattggacatagcatggttaacagtaaaatacttctc ttaaagaaaccaagagtctggtgggagctagaaggcccacaagtacctctgcgacctgac tgccttgctatcgtcaataattttgtgttcctgttaggcggggaagagctgggcccggat ggtgaattccatgcttcttccaaagtattcaggtatgacccgagacagaactcctggctg cagatggcagatatgtctgtaccacgctctgaatttgctgtaggtgttattgggaagttt atttacgccgtagcaggcagaaccagagatgagactttctattcaactgagagatatgac atcaccaacgataaatgggaatttgtggatccttatccagttaacaaatatggacatgag gggacagtgctcaataacaaattgtttatcaccggtggaatcacctcatcttccacctcc aagcaagtgtgcgtgtttgaccccagcaaagaagggaccatagaacaacggaccaggaga actcaagtggttaccaactgttgggagaataagagcaagatgaattacgcgagatgcttt cacaagatgatttcttacaatggcaagctttatgtcttcggtggtgtctgtgtgatcttg agggcctctttcgaatctcagggatgcccttctacagaagtatacaacccagagactgat cagtggaccatcttggcatccatgccgattggtagaagtggccatggtgtgactgtgctg gacaaacaaataatggttcttggaggcctttgttataatggtcattacagcgattccatc ctcacttttgatccggatgaaaacaagtggaaggaagatgagtaccctcggatgccctgc aagctggatggtttacaagtatgcaacctgcattttccggactatgtactggatgaggtc aggcgttgcaactaa >gi568815575r:23887924_24106693|GENSCAN_predicted_peptide_4|122_aa MASLSGKSVDDGNTQLEARGSTINLKLGHPLPIADPKLSFSYEIIKGLSVQRGLGFVAQQ CEGVIREKIQDCQGDREGVASEIRMTAYRVEEANRSPQCTRWSPTAPAPDGHVAAAGLNE TL >gi568815575r:23887924_24106693|GENSCAN_predicted_CDS_4|369_bp atggcatctctttctgggaaaagtgtagatgacggaaatacacagttggaagcccgtgga tctacaattaaccttaaattagggcatccattgcccatagcagatcctaaactgagcttc agttatgagattatcaaggggctgagtgtacagcgaggactgggctttgtggcgcagcaa tgtgagggagtcatccgggagaagatacaggactgtcaaggagacagagaaggagtagcc agtgagataagaatgacagcttatagagtggaggaggcaaaccggtcccctcaatgtacc agatggtcacctacagcaccagctccagatggccacgtggctgcagctgggctcaatgaa actctgtga >gi568815575r:23887924_24106693|GENSCAN_predicted_peptide_5|228_aa MDGAESHYSQQSNAGAENQIPHVTTYKWELSGALQPTTALWEPLSGLAKAGAHSLSLQGG VEGEARAGTGAACGACGPAGVPGGRGLGGPCTRSSQPALLAPGNEGLSTRASSCGGCTGS PSSASPPALLSISHGALAAFPQDRARDLQPAMPEPPTHSMGSCAAPASPRSTTPCSTAPS PIDHPRAEECERTAQDWQAAPPAAPVPDPLGGASWAPESGGDVESLYV >gi568815575r:23887924_24106693|GENSCAN_predicted_CDS_5|687_bp atggatggagccgaaagccattattctcagcaaagtaacgcaggagcagaaaaccaaata ccgcatgttaccacttataagtgggagctgagtggagcccttcagcccaccactgcactg tgggagcccctttctgggctggccaaggctggagcccactccctcagcttgcaaggaggt gtggagggagaggcccgagcgggaaccggggctgcatgcggcgcttgcgggccagctgga gttccgggtgggcgtgggcttggcgggccctgcactcggagcagtcagccagccctgctg gccccgggcaatgagggacttagcacccgggccagcagctgcggagggtgtactgggtcc cccagcagtgccagcccgccggcgctgctctcgatttctcacggagccttagctgccttc ccgcaggacagggctcgggacctgcagcccgccatgcctgagcctcccacccactccatg ggctcctgtgcggccccagcctccccgaggagcaccaccccctgctccacagcgcccagt cccatcgaccacccaagggctgaggaatgcgagcgcacggcgcaggactggcaggcagct ccacctgcagccccggtgccggatccactgggtggagccagctgggctcctgagtctggt ggggacgtggagagtctttatgtctag >gi568815575r:23887924_24106693|GENSCAN_predicted_peptide_6|459_aa MDGAESHYSQQSNAGAENQIPHVTTYKWELSGTIGHVAHGKSTVVKAISGVHTVRFKNEL ERNITIKLGYANAKIYKLDDPSCPRPECYRSCGSSTPDEFPTDIPGTKGNFKLVRHVSFV DCPGHDILMATMLNGAAVMDAALLLIAGNESCPQPQTSEHLAAIEIMKLKHILILQNKID LVKESQAKEQYEQILAFVQGTVAEGAPIIPISAQLKYNIEVVCEYIVKKIPVPPRDFTSE PRLIVIRSFDVNKPGCEVDDLKGGVAGGSILKGVLKVGQEIEVRPGIVSKDSEGKLMCKP IFSKIVSLFAEHNDLQYAAPGGLIGVGTKIDPTLCRADRMVGQVLGAVGALPEIFTELEI SYFLLRRLLGVRTEGDKKAAKVQKLSKNEVLMVNIGSLSTGGRVSAVKADLGKIVLTNPV CTEVGEKIALSRRVEKHWRLIGWGQIRRGVTIKPTVDDD >gi568815575r:23887924_24106693|GENSCAN_predicted_CDS_6|1380_bp atggatggagccgaaagccattattctcagcaaagtaacgcaggagcagaaaaccaaata ccgcatgttaccacttataagtgggagctgagtggtacaattggtcatgtagctcatggg aaatccacagtcgtcaaagctatttctggagttcatactgtcaggttcaaaaatgaacta gaaagaaatattacaatcaagcttggatatgctaatgctaagatttataagcttgatgac ccaagttgccctcggccagaatgttatagatcttgtgggagcagtacacctgacgagttt cctacggacattccagggaccaaagggaacttcaaattagtcagacatgtttcctttgtt gactgtcctggccacgatattttgatggctactatgctgaacggtgcagcagtgatggat gcagctcttctgttgatagctggtaatgaatcttgccctcagcctcagacatcggaacac ctggctgctatagagatcatgaaactgaagcatattttgattctacaaaataaaattgat ttggtaaaagaaagtcaggctaaagaacaatacgagcagatccttgcatttgtccaaggt acagtagcagagggagctcccattattccaatttcagctcagctgaaatacaatattgaa gttgtttgtgagtacatagtaaagaaaattccagtacccccaagagactttacttcagag ccccggcttattgttattagatcttttgatgtcaacaaacctggctgtgaagttgatgac cttaagggaggtgtagctggtggtagtatcctaaaaggagtattaaaggtgggccaggag atagaagtaagacctggtattgtttccaaagatagtgaaggaaaactcatgtgtaaacca atcttttccaaaattgtatcactttttgcggagcataatgatctgcaatatgctgctcca ggcggtcttattggagttggaacaaaaattgaccccactttgtgccgggctgacagaatg gtggggcaagtacttggtgcagtcggagctttacctgagatattcacagaattggaaatt tcctatttcctgcttagacggcttctaggtgtacgcactgaaggagacaagaaagcagca aaggttcaaaagctgtctaagaatgaagtgctcatggtgaacataggatccctgtcaaca ggagggagagttagtgctgtcaaggccgatttgggtaaaattgttttgaccaatccagtg tgcacagaggtaggagaaaaaattgcccttagccgaagagttgaaaaacactggcgttta attggttggggtcagataagaagaggagtgacaatcaagccaacagtagatgatgactga