GENSCAN 1.0	Date run: 2-Nov-116	Time: 21:14:23

Sequence gi568815578r:14225560_14427506 : 201947 bp : 36.43% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    242    237    6                              -0.45
 1.03 Term -    764    634  131  0  2   50   43    92 0.313  -1.74
 1.02 Intr -   1319   1141  179  1  2   42   68   206 0.379  12.64
 1.01 Init -  12346  12213  134  1  2   78   47    64 0.282   0.96
 1.00 Prom -  12470  12431   40                              -3.65

 2.00 Prom +  14157  14196   40                              -1.45
 2.01 Init +  18976  19038   63  0  0   66   41    69 0.154   1.20
 2.02 Intr +  34110  34242  133  2  1   43   76    77 0.072   1.30
 2.03 Intr +  36830  37082  253  0  1   23   35   190 0.551   2.77
 2.04 Intr +  38604  38702   99  0  0   77   87    85 0.636   5.61
 2.05 Term +  46475  46994  520  2  1   -5   43   269 0.367   5.88
 2.06 PlyA +  47222  47227    6                               1.05

 3.00 Prom +  47390  47429   40                              -6.15
 3.01 Init +  47654  48917 1264  1  1   35   40   599 0.325  42.37
 3.02 Intr +  49069  49500  432  2  0   67   86   138 0.630   4.19
 3.03 Term +  49556  50265  710  0  2   79   47   147 0.783   2.38
 3.04 PlyA +  51072  51077    6                               1.05

 4.00 Prom +  64578  64617   40                              -0.85
 4.01 Init +  67969  68024   56  0  2   50   79    49 0.345   1.01
 4.02 Term +  72022  72214  193  1  1   -6   41   179 0.141  -0.29
 4.03 PlyA +  73444  73449    6                               1.05

 5.03 PlyA -  73721  73716    6                               1.05
 5.02 Term -  77860  77741  120  1  0   55   42   117 0.654   1.29
 5.01 Init -  86767  86651  117  1  0   82   60    72 0.496   4.05
 5.00 Prom -  99337  99298   40                              -5.35

 6.02 PlyA -  99460  99455    6                               1.05
 6.01 Sngl - 101947  99998 1950  1  0   75   44  1195 0.987 107.46
 6.00 Prom - 108893 108854   40                              -4.25

 7.05 PlyA - 109195 109190    6                               1.05
 7.04 Term - 126209 126064  146  0  2   65   42   114 0.562   1.59
 7.03 Intr - 152434 152306  129  2  0   73   88    31 0.102   1.35
 7.02 Intr - 165519 165454   66  1  0   92   92    34 0.217   2.16
 7.01 Init - 183988 183940   49  1  1   79   92    56 0.044   6.27

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  14232  14477  246  2  0   34   36   184 0.804   2.63

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:14225560_14427506|GENSCAN_predicted_peptide_1|147_aa
MEYYAAIKKDEFMSFVGTWMKLETIILSKLSQGQKNQTPHVLTHSPWVVDGTGRRGAGGG
AHRGGLAAQEPTEWVGGSGMAGCRSRALPRGKAAKARREIEHSAAFNPYKQLRKSYVYKQ
LPVPIPLLRATPLLRHHRPYGNTEVLC
>gi568815578r:14225560_14427506|GENSCAN_predicted_CDS_1|444_bp
atggaatactatgcagccataaaaaaggatgagttcatgtcctttgtggggacatggatg
aagctggaaaccatcattctcagcaaattatcgcaaggacaaaaaaaccaaacaccgcac
gttctcactcatagcccttgggtggtcgatgggactgggcgccgtggagcagggggcggc
gctcatcggggaggcttggccgcacaggagcccacggagtgggtgggaggctcaggcatg
gcgggctgcaggtcccgagccctgccccgcgggaaggcagctaaggcccggcgagaaatc
gagcacagcgccgcattcaatccttataaacagttgcgaaaatcctatgtatacaaacag
ctaccagtgccaatacctctgctaagggccacacccttactgcgccatcacagaccctat
ggaaacactgaggtcctctgctaa
>gi568815578r:14225560_14427506|GENSCAN_predicted_peptide_2|355_aa
MKIHSSPWVDLYRTKALTQELGAAGLMGMQTRRKRVIYGVISATVELMEYTMESHSFLTV
AEQCLDKKFKRRIRRKDNYCSFGNNEFEVHLEDLPGSVLTKVRQRALLVDHPITAINTDV
IVETMKVNVITWKEQVVLEEKKDGGRTLGRQEVAIAPICTTGRTRNTCPQDQRSFSEIQV
NIFDGENGTKLENTLQGIIRENFPNLVRQANIQIQEIQRMPQRYSSRRATPRNIIVRFTK
VEMKEKMLKAAREKGRVTHKGKPIRLTADLSAETLQARRQWRAIFNILKEKNFQPRISYP
AKLSFISEGEIKYFTDKQMLRDFVTTRPALKELLKEALNMERNNRYQPLQKHAKL
>gi568815578r:14225560_14427506|GENSCAN_predicted_CDS_2|1068_bp
atgaagatccatagcagcccatgggtggatctatacagaacaaaggccctaacccaagaa
ctaggagctgctggcctaatggggatgcagacaagaagaaagagagttatttatggtgtg
ataagtgcaactgtagaattaatggagtatactatggaatcacattcttttctaacagtt
gctgaacagtgtcttgataagaaattcaaaaggagaataaggagaaaagataattactgt
tcttttggaaataatgaatttgaagttcatttggaagatctacctggaagtgtcttgact
aaagttcggcagagggctctcttagttgaccatccgattacagctatcaacacagatgtg
atagtggaaaccatgaaagtgaatgtaattacttggaaagagcaagtagttctagaagag
aagaaagatggaggcagaactttgggtagacaagaagttgctatagcaccaatttgcact
actggacgtaccaggaatacatgtccacaggaccagagaagcttctctgagatccaagta
aatatatttgatggggagaatggaaccaagttggaaaacactctgcagggtattatccga
gagaacttccccaatctagtaaggcaggccaacattcagattcaggaaatacagagaatg
ccacaaagatactcctcgagaagagcaactccaagaaacataattgtgagattcaccaaa
gttgaaatgaaggaaaaaatgttaaaggcagccagagagaaaggtcgggttacccacaaa
gggaagcccatcagactaacagctgatctctcggcagaaactctacaagccagaagacag
tggagggcaatattcaacattcttaaagaaaagaattttcaacccagaatttcatatcca
gccaaactaagcttcataagtgaaggagaaataaaatactttacagacaagcaaatgctg
agagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcactaaacatg
gaaaggaacaaccggtaccagccactgcaaaaacatgccaaattgtaa
>gi568815578r:14225560_14427506|GENSCAN_predicted_peptide_3|801_aa
MYSKIDHIVGCKALLSKFKRTEIITNCLSDHSAIKLELRIKKLTQNCSTTWKLNNLLLND
YWVHNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTS
QLKELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINEFRSWFFERINTIDRPLAR
LIKKKREKNQIDAIKNDKGDITTDPTETQTTIREYYKHLYANKLENLEEMDKFLDTYTLP
RLNQEEVESLNRPITGSGIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSI
EEEGILPNSFYEASIILIPKLGRDIAKKENFRPISLMNIDAKILNKILAHRIQQHIKKLI
HHDQVGFIPGMQGWFDIRKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPFMLKTLNK
LVLEVLVRAIRQEKEIKCIQLGKEEVKLSLFADDMIVYVENPIVSAQNLLKLISNFSKVS
GYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKSIKYLGIQLTRDVKDLFKDNYKPAL
NEIKEDTNKWKNIPCSWIGRINIVKELEKTTLNFIWNQKRARIAKSILSQTNKAGGIMLP
DFKLYYKATVTQTAWYWYPNRDIDQWNRTEPSEITSHICKYLIFDKPEKNKQWGKDSLFN
KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKD
FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFTTYSSDKGLISRI
YNELKQIYKKKTTPSTSGRRT
>gi568815578r:14225560_14427506|GENSCAN_predicted_CDS_3|2406_bp
atgtattccaaaattgaccacatagttggatgtaaagctctcctcagcaaatttaaaaga
acagaaattataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggatt
aagaaactcactcaaaactgctcaactacatggaaactgaacaacctgctcctgaatgac
tactgggtacataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaac
aaagacacaacataccagaatctctgggacacattcaaagcagtgtgtagagggaaattt
atagcactaaatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatca
caattaaaagaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaa
ataactaaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatt
aatgaattcaggagctggttttttgaaaggatcaacacaattgatagaccgctagcaaga
ctcataaagaagaaaagagagaagaatcaaatagatgcaataaaaaatgataaaggggat
atcaccaccgatcccacagaaacacaaactaccatcagagaatactacaaacacctctac
gcaaataaactagaaaatctagaagaaatggataaattcctcgacacatacaccctccca
agactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctggaattgtg
gcaataatcaatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaa
ttctaccagaggtacaaagaggaactggtaccattccttctgaaactattccaatcaata
gaagaagagggaatcctccctaactcattttatgaggccagcatcatcctgataccaaag
ctgggcagagacatagccaaaaaagagaattttagaccaatatccttgatgaacattgat
gcaaaaatcctcaataaaatactggcacaccgaatccagcagcacatcaaaaagcttatc
caccatgatcaagtgggcttcatccctgggatgcaaggctggttcgatatacgcaaatca
ataaatgtaatccagcatataaacagaacgaaagacaaaaaccacatgattatctcaata
gatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaa
ttagtgttggaagttctggtcagggcaatcaggcaggagaaggaaataaagtgtattcaa
ttaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatatgtagaa
aaccccattgtctcagcccaaaatctccttaagctgataagcaatttcagcaaagtctca
ggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaataacagacaaaca
gagagccaaatcatgagtgaactcccattcacaattgcttcaaagagtataaaataccta
ggaatccaacttacaagggatgtgaaggacctcttcaaggataactacaaaccagcgctc
aatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatggataggaaga
atcaatatcgtgaaagaattggaaaaaactactttaaacttcatatggaaccaaaaaaga
gcccgcatcgccaagtcaatcctaagccaaacgaacaaagctggaggcatcatgctacct
gacttcaaactatactacaaggctacagtaacccaaacggcatggtactggtacccaaac
agagatatagaccaatggaacagaacagagccctcagaaataacgtcgcatatctgcaag
tatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaat
aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt
acaccttatacaaaaattaattcaagatggattaaagacttaaacgttagacctaaaacc
ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac
ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta
attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacagacagcct
acagaatgggagaaaattttcacaacctactcatctgacaaagggctaatatccagaatc
tacaatgaactcaaacaaatttacaagaaaaaaacaaccccatcaacaagtgggcgaagg
acatga
>gi568815578r:14225560_14427506|GENSCAN_predicted_peptide_4|82_aa
MLGFMEKPKLLLDVRAGECSRKAERGEEVAEEKLEASRGWFLRFMERSHFHNIKAQGKAA
SVVVETAASYQEDLAKTTDEGH
>gi568815578r:14225560_14427506|GENSCAN_predicted_CDS_4|249_bp
atgcttggcttcatggagaaaccaaaactgttgctagatgtgcgggctggggagtgttct
agaaaggcagagagaggtgaggaagttgcagaagaaaagttggaagctagcagaggttgg
ttcctgaggtttatggaaagaagccatttccataacataaaagcacaaggtaaagcagcc
agtgtcgttgtagaaactgctgcaagttatcaagaagatctagctaagaccactgatgaa
ggccactaa
>gi568815578r:14225560_14427506|GENSCAN_predicted_peptide_5|78_aa
MQLRIDQIPSAASLVSSFSDLSDDTSHPEIVTEQVWGGGREDLRDVNTVDVALLPPQKTS
SLGRGGCLQGLYGQRQAD
>gi568815578r:14225560_14427506|GENSCAN_predicted_CDS_5|237_bp
atgcaattacgcattgatcaaattcccagtgcagcctcactagtttcaagtttctcagac
ttgtctgatgatacaagtcacccagagattgtaactgagcaggtctggggtggagggaga
gaggatctaagggatgtgaatactgtggatgtggcccttctaccacctcagaaaacaagc
tctctggggcgtggtggatgcttgcaaggactctacggccagcgtcaggcagattaa
>gi568815578r:14225560_14427506|GENSCAN_predicted_peptide_6|649_aa
MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDA
TTLYLQNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTIT
YDSLSKIPYLEELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEEL
RLDDNRISTISSPSLQGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAA
PVNLPGTNLRKLYLQDNHINRVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQL
ILRNNPWYCGCKMKWVRDWLQSLPVKVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGI
VSTIQITTAIPNTVYPAQGQWPAPVTKQPDIKNPKLTKDHQTTGSPSRKTITITVKSVTS
DTIHISWKLALPMTALRLSWLKLGHSPAFGSITETIVTGERSEYLVTALEPDSPYKVCMV
PMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKEPYKNPNLPLAAIIGGAVALV
TIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNSILEIRETSFQMLPIS
NEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS
>gi568815578r:14225560_14427506|GENSCAN_predicted_CDS_6|1950_bp
atgatcagcgcagcctggagcatcttcctcatcgggactaaaattgggctgttccttcaa
gtagcacctctatcagttatggctaaatcctgtccatctgtgtgtcgctgcgatgcgggt
ttcatttactgtaatgatcgctttctgacatccattccaacaggaataccagaggatgct
acaactctctaccttcagaacaaccaaataaataatgctgggattccttcagatttgaaa
aacttgctgaaagtagaaagaatatacctataccacaacagtttagatgaatttcctacc
aacctcccaaagtatgtaaaagagttacatttgcaagaaaataacataaggactatcact
tatgattcactttcaaaaattccctatctggaagaattacatttagatgacaactctgtc
tctgcagttagcatagaagagggagcattccgagacagcaactatctccgactgcttttc
ctgtcccgtaatcaccttagcacaattccctggggtttgcccaggactatagaagaacta
cgcttggatgataatcgcatatccactatttcatcaccatctcttcaaggtctcactagt
ctaaaacgcctggttctagatggaaacctgttgaacaatcatggtttaggtgacaaagtt
ttcttcaacctagttaatttgacagagctgtccctggtgcggaattccctgactgctgca
ccagtaaaccttccaggcacaaacctgaggaagctttatcttcaagataaccacatcaat
cgggtgcccccaaatgctttttcttatctaaggcagctctatcgactggatatgtccaat
aataacctaagtaatttacctcagggtatctttgatgatttggacaatataacacaactg
attcttcgcaacaatccctggtattgcgggtgcaagatgaaatgggtacgtgactggtta
caatcactacctgtgaaggtcaacgtgcgtgggctcatgtgccaagccccagaaaaggtt
cgtgggatggctattaaggatctcaatgcagaactgtttgattgtaaggacagtgggatt
gtaagcaccattcagataaccactgcaatacccaacacagtgtatcctgcccaaggacag
tggccagctccagtgaccaaacagccagatattaagaaccccaagctcactaaggatcac
caaaccacagggagtccctcaagaaaaacaattacaattactgtgaagtctgtcacctct
gataccattcatatctcttggaaacttgctctacctatgactgctttgagactcagctgg
cttaaactgggccatagcccggcatttggatctataacagaaacaattgtaacaggggaa
cgcagtgagtacttggtcacagccctggagcctgattcaccctataaagtatgcatggtt
cccatggaaaccagcaacctctacctatttgatgaaactcctgtttgtattgagactgaa
actgcaccccttcgaatgtacaaccctacaaccaccctcaatcgagagcaagagaaagaa
ccttacaaaaaccccaatttacctttggctgccatcattggtggggctgtggccctggtt
accattgcccttcttgctttagtgtgttggtatgttcataggaatggatcgctcttctca
aggaactgtgcatatagcaaagggaggagaagaaaggatgactatgcagaagctggcact
aagaaggacaactctatcctggaaatcagggaaacttcttttcagatgttaccaataagc
aatgaacccatctcgaaggaggagtttgtaatacacaccatatttcctcctaatggaatg
aatctgtacaaaaacaatcacagtgaaagcagtagtaaccgaagctacagagacagtggt
attccagactcagatcactcacactcatga
>gi568815578r:14225560_14427506|GENSCAN_predicted_peptide_7|129_aa
MDFNPIIDGYLAWMTLGLLKIITQSSTIVELITLPPTVCTACTLMNAHHGLMSNTTGKQR
NSITWSPDFWSITELLVYRLEELKVLARAVRQEKEIKGIQIGKEEDKVSLFADDMILCLE
KHKHSTKNY
>gi568815578r:14225560_14427506|GENSCAN_predicted_CDS_7|390_bp
atggatttcaatcccatcatcgatggttacttggcctggatgactctgggtcttttaaaa
atcatcacacagtcttctacaattgttgaactaattacactcccaccaacagtatgtaca
gcctgcactcttatgaatgcacatcatggactgatgtccaatactactggaaaacaaaga
aattctattacttggagccccgacttctggtcaatcacagagctgttggtttacagactc
gaagaactgaaagtcctagctagagcagtaagacaagagaaagaaataaagggcatccaa
attggaaaggaagaagacaaagtttccttatttgcagatgatatgatcttatgtttggaa
aaacataaacactccaccaaaaactattag