GENSCAN 1.0 Date run: 8-Nov-116 Time: 06:12:53 Sequence gi568815590r:17199865_17473242 : 273378 bp : 39.05% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 10068 10194 127 1 1 93 75 104 0.434 8.42 1.02 Term + 20392 20518 127 1 1 79 38 92 0.126 0.07 1.03 PlyA + 20640 20645 6 1.05 2.04 PlyA - 21543 21538 6 1.05 2.03 Term - 30984 30856 129 0 0 82 44 130 0.954 5.20 2.02 Intr - 34996 34852 145 0 1 78 78 173 0.967 14.66 2.01 Init - 37502 37348 155 2 2 62 66 128 0.619 7.50 2.00 Prom - 37654 37615 40 -3.75 3.00 Prom + 38775 38814 40 -5.25 3.01 Init + 42126 42317 192 2 0 77 72 66 0.269 2.91 3.02 Intr + 46326 46538 213 2 0 91 78 218 0.512 19.09 3.03 Intr + 46788 46909 122 2 2 76 16 116 0.372 1.57 3.04 Intr + 46936 47083 148 1 1 19 8 174 0.263 1.82 3.05 Intr + 47215 47544 330 0 0 50 39 264 0.215 12.60 3.06 Intr + 61820 62107 288 1 0 29 105 142 0.280 6.52 3.07 Intr + 68410 68508 99 0 0 85 48 73 0.501 2.39 3.08 Intr + 68992 69092 101 0 2 40 83 73 0.558 -0.01 3.09 Intr + 74869 75103 235 1 1 126 23 181 0.597 12.17 3.10 Intr + 76533 76603 71 2 2 67 92 93 0.995 4.66 3.11 Intr + 80164 80291 128 1 2 113 55 103 0.752 8.90 3.12 Intr + 80375 80433 59 0 2 15 113 19 0.596 -5.22 3.13 Intr + 80511 80579 69 2 0 75 94 56 0.615 3.36 3.14 Intr + 84609 84752 144 2 0 80 45 124 0.518 6.86 3.15 Term + 86483 86563 81 1 0 64 43 84 0.480 -1.79 3.16 PlyA + 88808 88813 6 1.05 4.08 PlyA - 89345 89340 6 1.05 4.07 Term - 94045 93444 602 2 2 83 43 413 0.889 30.00 4.06 Intr - 100360 100069 292 1 1 73 77 182 0.425 11.38 4.05 Intr - 102416 102290 127 1 1 98 63 220 0.999 20.26 4.04 Intr - 106093 105893 201 0 0 91 108 85 0.994 8.28 4.03 Intr - 109462 109413 50 1 2 87 131 32 0.999 3.96 4.02 Intr - 111772 111647 126 1 0 113 101 139 0.999 17.66 4.01 Init - 113514 113428 87 0 0 59 91 63 0.561 4.39 4.00 Prom - 114428 114389 40 -7.65 5.04 PlyA - 115546 115541 6 1.05 5.03 Term - 117542 117414 129 2 0 51 48 176 0.865 7.10 5.02 Intr - 118530 118403 128 1 2 82 28 103 0.535 3.18 5.01 Init - 125537 125411 127 2 1 48 51 118 0.003 4.47 5.00 Prom - 126723 126684 40 -7.45 6.16 PlyA - 127419 127414 6 1.05 6.15 Term - 130594 130383 212 2 2 63 45 173 0.970 6.97 6.14 Intr - 131418 131286 133 0 1 79 92 133 0.314 12.10 6.13 Intr - 141633 141499 135 0 0 98 106 244 0.968 27.04 6.12 Intr - 149217 149089 129 0 0 85 113 158 0.999 17.97 6.11 Intr - 150918 150790 129 0 0 56 98 39 0.645 1.67 6.10 Intr - 161410 161253 158 2 2 100 70 148 0.950 13.01 6.09 Intr - 171335 171173 163 2 1 40 91 178 0.988 11.93 6.08 Intr - 173376 173254 123 0 0 78 59 99 0.907 5.86 6.07 Intr - 180705 180618 88 2 1 65 53 44 0.003 -2.35 6.06 Intr - 184040 183863 178 0 1 -5 27 175 0.002 0.16 6.05 Intr - 187393 187204 190 1 1 -18 107 159 0.070 5.44 6.04 Intr - 197456 197103 354 2 0 35 93 191 0.278 8.86 6.03 Intr - 198192 198116 77 1 2 65 69 54 0.319 -0.48 6.02 Intr - 213782 213640 143 1 2 102 80 60 0.007 5.68 6.01 Init - 232151 232012 140 2 2 67 51 116 0.073 5.46 6.00 Prom - 244516 244477 40 -5.05 7.04 PlyA - 245341 245336 6 1.05 7.03 Term - 264052 263879 174 1 0 25 43 530 0.815 38.98 7.02 Intr - 270842 270556 287 0 2 63 75 128 0.016 5.14 7.01 Init - 273279 273231 49 0 1 54 82 51 0.087 2.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:17199865_17473242|GENSCAN_predicted_peptide_1|84_aa XAFRSPVFRHGTDKNGFSLGFSKNMRQVFGDEKKYWLLPIFSRYQCCRWMEEASHRKVPP VSCAYVPLAGNAEYELISSLQAIA >gi568815590r:17199865_17473242|GENSCAN_predicted_CDS_1|255_bp naggcattcagaagtccagtatttcgacatggaacagataagaatggattcagcttgggt ttcagtaaaaacatgcgacaagtttttggtgatgagaagaagtactggttgctacccatt ttttcaaggtatcaatgctgtagatggatggaagaggcttcccacaggaaggtgccacca gtcagttgtgcctatgtccctttggctggaaatgcagaatatgaattgattagttctctc caagccattgcttaa >gi568815590r:17199865_17473242|GENSCAN_predicted_peptide_2|142_aa MYAQDSIELLTTSGIQFKKHEEEGIETQYFAELLMTSGVVLCEGVKWLSFHSGYDFGYLI KILTNSNLPEEELDFFEILRLFFPVIYDVKYLMKSCKNLKMFFEDHIDDAKYCGHLYGLG SGSSYVQNGTGNAYEEEANKQS >gi568815590r:17199865_17473242|GENSCAN_predicted_CDS_2|429_bp atgtatgcccaggactctatagagctactaacaacatctggtatccagtttaaaaaacat gaggaggaaggaattgaaacccagtactttgcagaacttcttatgacttctggagtggtc ctctgtgaaggggtcaaatggttgtcatttcatagcggttacgactttggctacttaatc aaaatcctaaccaactctaacttgcctgaagaagaacttgacttctttgagatccttcga ttgttttttcctgtcatttatgatgtgaagtacctcatgaagagctgcaaaaatctcaaa atgttctttgaagatcatattgatgatgccaaatattgtggtcatttgtatggccttggt tctggttcatcctatgtacagaatggcacagggaatgcatatgaagaggaagccaacaag cagtcatga >gi568815590r:17199865_17473242|GENSCAN_predicted_peptide_3|759_aa MWEPTLVSMSFRGTVHSHRYTYSLTLGQLSHTISPNVHSLGMWEEIRGPGRKSTYTRERA HSTQSQKPRRDTKGLLITGESISVLQFPRETRGRHRGIQAIDNSLPWQSASAKVLITLTG ATLTPKHCRVKQYEEAAADARSTCVAELALLLLLLAIETALGGGGGGGGGSGGGSGEGKG EQELRRTVLRRRFSHVLTGALDDDATRRGGGERPRGRGTQVTAARNSGSAGRTGLEKTRS PALGPRTSHPAPLSLENRRAEPLGEAGQSLPGPPARGPEEDELAFSPDQERLLLRGWVPR WPHQPPAAEAAPDRVPPELTLQVTGRCLSTGGKSRVCLGSNKYLEQCPSLMGELPKGISQ QCGKAGKRFLPMVSVEYISYNMELLNRHSDLVFSLAADRAVSRVEDGSLASPLCLWLSSG IFLPSVLMMLLFPQEKPVISVYPPIRHHLMDKQGVYVTSPLVNNFTMHSDLGKIIQSLLD EFWKNPPVLAPTSTAFPYLYSNPSGMSPYASQGFPFLPPYPPQEANRSITSLSVADTVSS STTSHTTAKPAAPSFGVLSNLPLPIPTVDASIPVGITSQNGFGYKMPDVPDAFPELSELS VSQLTDMNEQEEVLLEQFLTLPQLKQIITDKDDLVKSIEELARKNLLLEPSLEAKRQTVL DKYELLTQMKSTFEKKMQRQHELSESCSASALQARLKVAAHEAEEESDNIAEDFLEGKME IDDFLSSFMEKRTICHCRRAKEEKLQQAIAMHSQFHAPL >gi568815590r:17199865_17473242|GENSCAN_predicted_CDS_3|2280_bp atgtgggaaccaaccctggtcagcatgtcgttccgtggcacagtgcactcacacagatac acctactcactcacactgggacaacttagccacaccatttcacctaacgtgcacagcctt gggatgtgggaggaaatcagaggacccgggagaaaatccacatatacacgagaacgtgca cactccactcagtcccagaagccccgaagagacaccaaaggccttctcattactggagaa agcatctccgtactgcagtttcccagagaaacgagaggacggcaccgaggtattcaagcc atagataactctcttccctggcaaagcgccagtgcgaaggtgctgataacactgactgga gctacactgactcccaagcactgccgcgtgaaacaatacgaagaggctgcagcggatgca agatctacttgtgtcgctgagctcgcgctcctcctcctcctcctcgccatagagacagca ctcggcggcggtggcggtggcggtggcggtagcggcggcggcagcggcgaagggaaaggc gagcaggagctgcgccgcaccgtgctgcgccgtcgcttttcgcacgtcctgacgggggcg ctagatgatgacgcgacacgcagagggggcggagagcgcccccgggggcggggcacgcaa gtgacggcggcgcggaactcggggagcgcaggcaggacaggcttagagaagacgcggtcc ccagcgcttgggccacggacgtcccaccccgctcctctgtcgctggagaaccgccgggcc gagccactgggagaagcaggccagagccttccagggcctccggcccgtggacccgaggag gatgagctggctttttcccctgaccaagagcgcctcctcctccgcggctgggtcccccgg tggcctcaccagcctccagcagcagaagcagcgcctgatcgagtccctccggaactcaca ctccaggtgactggtcgctgcctctccaccggaggaaaaagtagggtttgccttggctct aataagtacctggagcagtgcccatctctaatgggggagctcccaaaggggatatcccaa cagtgtgggaaggctggcaaacggttcctgcccatggtttctgtggaatatatctcctac aacatggagctgctgaataggcactctgatttggtgttttctttggctgcagacagggca gtttccagggttgaggatggtagtcttgcctctcccctttgtctctggctgtcctcagga atatttctcccttcagtcctcatgatgcttctgtttcctcaggaaaaaccagtgatcagt gtttatccaccaatacgacatcacttaatggataaacaaggagtgtatgttacctctcca ttagtaaacaattttacaatgcactcagatcttggaaaaattattcagagtctgttggat gagttttggaagaatcctccagttttagctcctacttcaacagcatttccttatctatac agtaacccaagtgggatgtctccttatgcttctcagggttttccatttcttcctccatat cctccacaagaagcaaacaggagtatcacttctttatctgttgctgacactgtttcttct tcaacaacaagtcataccacagccaagcctgccgctccttcatttggtgtcctttcaaat ctgccattacccattcccacagtggatgcttcaataccggttggtatcacaagccaaaat ggttttgggtacaagatgccagatgtccctgatgcatttccagaactctcagaactaagt gtgtcacaactcacagatatgaatgaacaagaggaggtattactagaacagtttctgact ttgcctcaactaaaacaaattattaccgacaaagatgacttagtaaaaagtattgaggaa ctagcaagaaaaaatctccttttggagcccagcttggaagccaaaagacaaactgtttta gataagtatgaattacttacacagatgaagtccactttcgaaaagaagatgcaaaggcag catgaacttagtgagagctgtagtgcaagtgcccttcaggcaagattgaaagtagctgca catgaagctgaggaagaatctgataatattgcagaagacttcttggagggaaagatggaa atagatgattttctcagtagcttcatggaaaagagaacaatttgccactgtagaagagcc aaggaagagaaacttcagcaggcgatagcaatgcacagccaatttcatgctccactatag >gi568815590r:17199865_17473242|GENSCAN_predicted_peptide_4|494_aa MSDFLWGLENSGWLRHIKAIMDAGIFIAKAVSEEGASVLVHCSDGWDRTAQVCSVASLLL DPHYRTLKGFMVLIEKDWISFGHKFNHRYGNLDGDPKEISPVIDQFIECVWQLMEQFPCA FEFNERFLIHIQHHIYSCQFGNFLCNSQKERRELKFWSGMYNRFEKGMQPRQSVTDYLMA VKEETQQLEEELEALEERLEKIQKVQLNCTKVKSKQSEPSKHSGFSTSDNSIANTPQDYS GNMKSFPSRSPSQGDEDSALILTQDNLKSSDPDLSANSDQESGVEDLSCRSPSGGRHLPA GVDRQLIQESSGWHFAGAPLGQSFQRKEQAAIFAVLQPPLVIPRQTGCGVDPQQTPADLQ KRGLLERQLTESNSININKNDDHAKIPSEGHQQQRTKVDKSMKMRKNQCKKAENSKNQKA SSPPKDHNSSPAREQNWTENEFDELTEVGFRRWVINSLELKEHVLTQCKEAKNLDKRLEE LLTRIISLEKNINK >gi568815590r:17199865_17473242|GENSCAN_predicted_CDS_4|1485_bp atgagtgatttcctgtggggtctggagaactctggctggttaaggcacattaaagccata atggatgcaggaatcttcattgcaaaggcagtgtcagaggaaggggcaagtgtgcttgtt cactgttctgatggctgggacaggaccgctcaggtgtgctcggtggcaagcctgctgctg gaccctcactaccggactctgaagggcttcatggtattaattgaaaaggactggatttcc tttggtcataagtttaatcaccgatatggcaatctagatggtgacccaaaagaaatctct ccagttattgaccagttcattgagtgtgtttggcagttaatggaacaatttccctgtgcc tttgagttcaatgagaggtttttgattcacattcaacatcacatttattcctgccagttt ggaaacttcctatgtaacagccaaaaggagagacgagaactcaagttttggagtggaatg tataaccgctttgaaaaggggatgcagccccgacagtcagttacagattacctaatggca gtgaaggaagaaactcagcagctagaggaagaactagaggccctggaagaaaggctggaa aaaattcaaaaggtccagttaaattgcactaaggtgaagagtaagcaaagtgagcccagc aagcactcagggttttctacctcagacaacagcatagccaacactccccaggattacagt gggaatatgaaatcatttccatcccggagcccttcacaaggcgatgaagattctgctctg attctaacccaagacaatctgaaaagttcagatccagatctgtcagccaacagtgaccaa gagtccggggtggaggatttgagctgtcggtctccaagtggtggcagacacctcccagca ggggttgacagacagctcatacaggagagctctggctggcactttgcaggtgcccctctg ggacaaagcttccagaggaaggagcaggcagcaatctttgctgttctgcagcctccgctg gtgatacccaggcaaacaggatgtggagtagacccccagcaaactccagcagacctgcag aagaggggcttgttagaaagacaactaacagaaagcaatagcatcaacatcaacaaaaat gacgaccatgcaaaaattccatccgaaggtcaccaacaacaaagaacaaaggtagataaa tccatgaagatgaggaaaaaccagtgcaaaaaggctgaaaattcaaaaaaccagaaagcc tcttctcctccaaaggatcacaactcctcgccagcaagggaacaaaactggactgagaat gagtttgatgaattgacagaagtaggcttcagaaggtgggtaataaactccttagagcta aaggagcatgttctaactcaatgcaaggaagctaagaaccttgataaaaggttagaagaa ttgctaactagaataatcagtttagagaagaacataaataaatga >gi568815590r:17199865_17473242|GENSCAN_predicted_peptide_5|127_aa MIPDLFFHAKVLRPDGLKSSFGVERMVGGRALGVVVGGTIVVAIIMNAVHWCAVNLSSAQ LFLWTPQNCGSAAPKQHLVYVCWVSDTGIQRDALRCSEDSMPGPTNIEDKGLADVTLAKS PIAKLGS >gi568815590r:17199865_17473242|GENSCAN_predicted_CDS_5|384_bp atgatacccgatctgttcttccacgcaaaggttttgaggcctgatggattgaaatcttca ttcggtgtggaaaggatggttggaggcagagcacttggagttgtcgtgggaggcaccatc gtggtggccattatcatgaacgctgttcactggtgtgctgtaaacttgtcttcagcccag ctttttctgtggactccacagaactgtggaagcgctgctcctaagcagcatttggtgtac gtctgctgggtgtctgacactggcatccaacgtgatgcactcaggtgttcagaagattcc atgccggggcctacaaacatcgaagacaaaggactcgcagatgttacattggccaaatcc cccatagcaaagcttggcagctaa >gi568815590r:17199865_17473242|GENSCAN_predicted_peptide_6|783_aa MVSLKVPENNTLGMDVNLLTEGPSGTGKDAVSPAGVFIWQNLKWSVRRPHPKQGPLCVSE NSGRIVLRRLPLLRESEQPSPRQPQAEARRGGGAGIQEPSGFKRTSVVACRNPMFCRLTE GRSRNDVKSLGLESGTPRDHVILYPAMAELVPEVQDKVPLTFPSVFFKHKESLSIASTAA NVLGSPEASTPQSHPRPTVYYLGITAHYSGPELFCQQMMNAARTGFFSSIRQVSFCPKIL VEEETDAGTSEYLGYLGYRNTWDIFTCIDAGTSEYLGRMISDIYHMRLVTALGMKGGTCL AEVGLGSCMVPPKLQEAFEPFDLKHAGAHFRAPPRESLDHRENRVFRGFAPPDKRNEQAG SSSAVVSVFYVCGMAQYSSSSSSVAQGSRKVENVRLVDRVSPKKAALGTLYLTATHVIFV ENSPDPRKETWILHSQISTIEKQATTATGCPLLIRCKNFQIIQLIIPQERDCHDVYISLI RLARPVKYEELYCFSFNPMLDKEEREQGWVLIDLSEEYTRMGLPNHYWQLSDVNRDYRFS ILLTKTVEVSKGGIITEDFIRGLDFSCGFSTPTNPSSRVIKVCDSYPTELYVPKSATAHI IVGSSKFRSRRRFPVLSYYYKDNHASICRSSQPLSGFSARCLEDEQMLQAIRKANPGSDF VYVVDTRPKLNAMANRAAGKGYENEDNYSNIKFQFIGIENIHVMRNSLQKMLEVTPRGIC RCSLPLPALRTGKRQRMQKPWPCALAVHALSALLGKALESWSWEPGKWAVLMGNAGSGMK RDK >gi568815590r:17199865_17473242|GENSCAN_predicted_CDS_6|2352_bp atggtcagtctgaaagttccagaaaacaacactcttggcatggatgttaatcttctcact gaagggcccagtggcactggcaaggatgcggtctccccagcaggagttttcatatggcag aatctgaagtggagcgtgaggaggccacacccaaaacaaggccctctttgtgtctcggag aacagtggccgtatagtgctccgccggctgcccctgttaagagagagcgagcagccctcg ccccggcaaccccaggcagaggcgcgtcgcggcggcggcgcaggtattcaagagccctcg ggctttaagcgaacgtcggtggtggcctgcagaaaccccatgttttgcagactcacggag ggcaggtccagaaatgatgtcaagagccttggcctggaatcagggaccccaagggaccac gtgatactctatcccgctatggctgagctagtacctgaggtgcaagacaaagtgccctta acttttccctctgtttttttcaagcacaaggagtctctctccatagcctccacagctgca aatgtgctgggctcccctgaagccagcacgcctcagtctcacccaaggcccactgtgtac tacctgggtatcactgctcattattcagggcccgagctcttttgtcagcagatgatgaat gccgctaggactgggttcttctcttccattcggcaggtttccttctgtcccaagatattg gtagaggaagaaacagatgctggtacatcagaatacctgggatatctgggatatcggaat acctgggatatcttcacatgcatagatgctggtacatcagaatacctgggaagaatgatc agtgatatctatcatatgagactggtgactgctttgggaatgaaaggaggaacctgttta gccgaagtggggcttggttcctgcatggttcctccaaagctgcaggaggcctttgaacct ttcgatttgaaacatgctggtgcacacttcagagctccgcccagagaatccctcgaccac cgggagaaccgagtctttcggggatttgcccctccagacaagagaaatgagcaagcaggg agctcatcagctgttgttagtgtattttacgtgtgtggtatggcccaatacagttcgtct tcttccagtgtggctcagggaagccgaaaggttgaaaatgtccgcttggtagatcgagtg tctcctaaaaaagcagctctaggtactttgtatttgacggctacccatgtcatattcgtg gaaaattcacctgacccaagaaaagaaacatggattcttcacagtcagatttccaccatt gagaaacaggcaacaaccgctaccggatgccctctgctgattcgctgcaagaactttcag ataatacagctcatcatacctcaggaaagagattgccacgacgtgtacatctccctgata cgccttgcaaggccagtgaaatatgaggagttatactgcttttcattcaaccccatgctg gataaagaagaaagagagcaaggctgggtgctgatcgatcttagtgaagaatacacgcgg atgggcctccctaatcattactggcagctcagcgatgtgaatagagactacagattttct attctgctaacaaagacagtggaggtcagcaagggaggcatcatcacagaagactttata agaggcttggatttcagttgtgggttttcaacccctacaaacccatcttccagagtgata aaggtctgtgactcttatcctactgaactgtacgttcccaaatcggccacggcacacatc atagtggggagttccaaattccggagtagacggcgatttcctgtcctttcttactattat aaagataaccacgcctccatctgccggagcagccagcccctgtccggcttcagtgcccgg tgcctagaggacgagcagatgctccaggccattaggaaagccaatccaggaagtgacttc gtttatgtcgttgacacccggcctaaacttaatgcaatggcaaatcgtgctgcagggaaa ggctatgagaatgaagacaattattccaatatcaagtttcagtttatcgggatagagaac atccatgtcatgaggaacagtctgcagaaaatgctggaagtgaccccaagaggaatctgt cgctgtagtctaccacttccggctctgcgaactgggaaaagacagaggatgcagaaacca tggccttgtgccttggctgtccatgcactgtctgcgcttctggggaaagccctggagagc tggagctgggagcccgggaagtgggctgtcttgatgggaaatgcaggctctggaatgaag agggacaaataa >gi568815590r:17199865_17473242|GENSCAN_predicted_peptide_7|169_aa MQASSPASHNALKFSKLLVCEDVLWIWRTTQTGRFHSGCEQHLVTAVAEPICDVYCRTHY AYFLRTCTWDSIMWRTQSMAKLMCHNECKSKHLVIQETNNSTITSKTTSCSERGQQSEAL SQKRKEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEKLV >gi568815590r:17199865_17473242|GENSCAN_predicted_CDS_7|510_bp atgcaagctagctcaccagcatcccacaacgccctcaaattcagcaaattacttgtgtgt gaagatgttctctggatttggcgaaccacgcaaacaggtcgttttcacagtggttgcgaa caacatctcgtaacagcagttgctgaacctatatgtgatgtttactgtcggacacattat gcatattttctccgcacatgtacatgggattctatcatgtggcgtacccaaagtatggcc aagctcatgtgccataatgaatgcaaatctaagcatttggtcatccaggaaactaacaac tccacaattaccagcaagactacatcctgttccgagcgtgggcaacaaagtgaggctctg tctcaaaaaagaaaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaggaggaagaagaagaagaa gaggaagaagaagaagaaaagctagtttaa