GENSCAN 1.0 Date run: 8-Nov-116 Time: 11:29:59 Sequence gi568815587f:59961161_60169676 : 208516 bp : 38.01% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4407 4553 147 2 0 78 57 68 0.298 2.21 1.02 Intr + 5907 6004 98 2 2 71 115 5 0.710 -0.61 1.03 Intr + 10200 10283 84 0 0 51 110 74 0.824 3.92 1.04 Intr + 12055 12208 154 1 1 97 74 57 0.200 4.35 1.05 Intr + 16281 16422 142 2 1 46 72 29 0.025 -3.89 1.06 Intr + 22176 22354 179 1 2 84 55 97 0.075 4.72 1.07 Intr + 27148 27245 98 0 2 18 115 76 0.164 1.19 1.08 Intr + 34659 34774 116 0 2 108 84 37 0.551 4.47 1.09 Term + 43935 44065 131 1 2 79 47 168 0.802 9.16 1.10 PlyA + 44270 44275 6 1.05 2.07 PlyA - 44872 44867 6 1.05 2.06 Term - 58562 58261 302 0 2 36 45 243 0.054 9.20 2.05 Intr - 58856 58684 173 1 2 48 31 115 0.019 0.46 2.04 Intr - 89577 88203 1375 1 1 67 53 430 0.150 24.35 2.03 Intr - 90038 89939 100 2 1 60 89 -28 0.159 -6.74 2.02 Intr - 93915 93614 302 1 2 1 78 232 0.263 9.03 2.01 Init - 95942 95837 106 2 1 67 92 71 0.754 5.83 2.00 Prom - 98993 98954 40 -4.15 3.00 Prom + 99528 99567 40 -11.24 3.01 Init + 100001 100156 156 1 0 69 98 105 0.995 9.69 3.02 Intr + 101308 101445 138 0 0 136 59 108 0.883 12.44 3.03 Intr + 103102 103158 57 0 0 75 106 41 0.803 2.76 3.04 Intr + 105791 105952 162 1 0 95 48 85 0.238 4.45 3.05 Intr + 107695 107777 83 0 2 69 72 39 0.009 -2.08 3.06 Term + 116818 117259 442 1 1 24 41 376 0.006 20.24 3.07 PlyA + 117659 117664 6 1.05 4.00 Prom + 127470 127509 40 -6.25 4.01 Init + 127606 127661 56 0 2 77 96 31 0.961 3.61 4.02 Intr + 128532 128661 130 0 1 86 84 63 0.859 5.48 4.03 Intr + 132240 132398 159 2 0 72 40 96 0.820 2.46 4.04 Intr + 132804 132902 99 2 0 74 71 68 0.836 3.09 4.05 Term + 134398 134496 99 0 0 98 48 47 0.845 -1.15 4.06 PlyA + 135260 135265 6 1.05 5.03 PlyA - 136652 136647 6 1.05 5.02 Term - 141751 141341 411 1 0 65 48 193 0.796 7.26 5.01 Init - 148096 148085 12 1 0 66 108 0 0.207 -0.04 5.00 Prom - 151631 151592 40 -5.75 6.00 Prom + 153653 153692 40 -3.55 6.01 Init + 157833 158072 240 2 0 46 52 155 0.222 5.62 6.02 Intr + 160873 160928 56 0 2 92 113 18 0.661 1.56 6.03 Intr + 163960 164054 95 1 2 102 36 65 0.711 1.39 6.04 Term + 164180 164334 155 0 2 58 55 100 0.389 0.80 6.05 PlyA + 164524 164529 6 1.05 7.00 Prom + 168203 168242 40 -3.35 7.01 Sngl + 173105 173611 507 1 0 50 39 293 0.171 16.39 7.02 PlyA + 174030 174035 6 1.05 8.06 PlyA - 174086 174081 6 1.05 8.05 Term - 183406 183318 89 2 2 94 47 73 0.432 0.64 8.04 Intr - 183907 183840 68 0 2 56 86 54 0.237 -0.37 8.03 Intr - 193404 192456 949 1 1 45 53 239 0.001 5.35 8.02 Intr - 200346 200184 163 0 1 66 84 164 0.719 12.43 8.01 Init - 207961 207908 54 1 0 96 69 59 0.438 6.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 58689 58261 429 0 0 58 45 269 0.910 15.63 S.002 Sngl + 116834 117259 426 1 0 70 41 380 0.991 27.54 S.003 Sngl - 124726 124373 354 1 0 44 41 221 0.951 8.80 S.004 Sngl + 193821 194501 681 2 0 42 28 301 0.919 15.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:59961161_60169676|GENSCAN_predicted_peptide_1|382_aa VKLRRTPLLNDLQPLQNELSLGIGCPVNMVEVDFFGFLYLLTFCGIRVSEHGVGILIESL IVYEPTNFDFNLHIPVSCYVQSQREEALPQSLEDVKGLGPGRFPEQQKSRPSTDWMMPSH VGKGTLLYLAYLFKCLSPLETPLQTHREAMFNQIAGHPVAQDFTFLGFTGQKLSHELFFS NLETYTFMTKYFGKAKLKQIGVFKAETTVFVECTESWLLVEIRGAPLVENLQPKHNELSL GNGCPVTRMTNDIFEFTYALEFSGIMKYECPWGILVESFVKYQPAFLNFRTYIPVRCALE RKPSSFEQNVIQNNTDKSKKIVQTQYYQAGNWGIEECYLGIPEEDIVITGDDNSLCVIAH EDLPVGQYVEREDGEIEDPDPA >gi568815587f:59961161_60169676|GENSCAN_predicted_CDS_1|1149_bp gtcaaacttagacgaacgccgctgttaaatgacctgcaacctttgcaaaacgaactctct ctaggaattggctgtcctgtaaacatggtcgaggtggatttctttgggttcctgtatctt ctaactttttgtggcatcagagtgagcgaacatggagtaggtattctcattgaaagttta attgtatatgaaccaacgaattttgacttcaatttacacatcccagtatcatgctatgtg caaagtcagagggaggaagctctgcctcagagtttagaagatgtaaagggcctggggcca ggaagatttccagaacagcaaaaatccagaccctcaacagattggatgatgcccagccac gttgggaaaggcactttgctttacttagcttacctattcaaatgcttatctcctctggaa acacccttgcaaacacaccgagaagcaatgtttaaccaaatagctgggcaccccgtggcc caggattttaccttcctaggttttactggacaaaaattatctcatgaacttttcttcagc aacctggagacatacacattcatgacaaaatactttgggaaagcaaaattaaagcagatt ggggttttcaaagctgaaaccacagtgtttgtagagtgtactgaatcttggcttctggtt gaaattagaggagcacctcttgtggaaaacctgcaacctaagcataatgagctgtctcta ggaaatgggtgtcctgtaaccaggatgacgaatgatatctttgagttcacttatgctctt gagttttctggcatcatgaaatatgaatgcccatggggcattcttgttgagagtttcgtc aaatatcaaccagcgtttttgaatttcagaacttacatcccagtaagatgcgcccttgaa agaaagccttcctcatttgaacaaaatgtaatacagaataatactgataaatctaagaaa atagtccaaactcagtattatcaagcaggaaattggggtatcgaggaatgttacttaggt attccagaagaagacattgttatcacaggagatgacaactccttgtgtgttattgcccac gaagaccttccagtgggacaatatgtggagagggaagacggtgagattgaggatcctgac cctgcatag >gi568815587f:59961161_60169676|GENSCAN_predicted_peptide_2|785_aa MHFADPILQAYRDHPSSSELADKLRCSQWDLMYLRGAKEGTDEREDRDRDRSQKEIAKRK VVKIRSFPPRFIQNHCKWNTEEERDDKKMPSAKKRKKSEEMKGKSGCFQHALFEFGAVEK AMKGSYLNSKTSVLPFHLLFPDFLMIAILTGVRWYLIVVLICISLMASDVLEVLARAIRQ EKDIKGIQLGKEEVKLSLFADDMIVYLENPVDSAQNLLKLISNFSKVSGYKINVQKSQAF LYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKN IPCSWVGRINIVKMAILPKVIYRFNAIPIKLSMTFFTELEKTTLKFIWHQKRVCITKSIL SQKNKAGGIMLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITLHIYNYLIFDKPE KNKQWGKDSLFNKWCWGKWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENL GITIQDIGMGKDFMTKTPKAMATKAKIDKWDLIKLKSFCTAKETTVRVNRQPTKWEKIFT TYSSDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAI REMQIKTTMRYHLTPVRMAIIKKSGNNSYSGGDLENLCVHTLYLANLVGMWRTFVSSSGI VNAPISTLSKRTSQLSVKWTNQQDVAVWVHTAFMSCNTHAKVCSFAPEASQTTNPPGGTN NSRRAALRALILTAKICSFTPEPARPRTHQKEENAEHIRTSEGTNSGHATFKNCNTHREG PRLHS >gi568815587f:59961161_60169676|GENSCAN_predicted_CDS_2|2358_bp atgcactttgcagaccccatacttcaggcatatcgagaccacccctcatcatcagagctg gcagacaaactcaggtgcagccagtgggatttaatgtacctaagaggagcgaaggaaggg actgatgaaagagaagacagagacagagacagaagtcagaaggaaatagccaaaaggaag gtggtaaagataagatccttccctccccgttttatccagaaccattgcaaatggaatact gaagaagaaagagatgacaagaaaatgccttcagcgaagaagagaaagaaatctgaagaa atgaaaggaaagagtggatgctttcaacatgctctttttgaatttggagctgtagaaaaa gccatgaaaggaagttacttaaactctaagacttcggttctcccatttcacctgttgttt cctgactttttaatgattgccattctaactggtgtgagatggtatctcattgtggttttg atttgcatttctctgatggccagtgatgtgttggaagttctggccagggcaatcaggcag gagaaggacataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgca gatgacatgattgtatatctagaaaaccctgttgactcagcccaaaatctccttaagctg ataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattc ttatacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcacaatt gcttcaaagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttc aaggagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaac attccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggta atttatagattcaatgccatccccatcaagttatcaatgactttcttcacagaattggaa aaaactactttaaagttcatatggcaccaaaaaagagtttgcatcaccaagtcaatccta agccaaaagaacaaagctggaggcatcatgctacctgacttcaaactatactacaaggct acagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacaga acagagccctcagaaataacgctgcatatctacaactatctgatctttgacaaacctgag aaaaacaagcaatggggaaaggattccctatttaataaatggtgctggggaaagtggcta gccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattca agatggattaaagacttaaacgttagacctaaaaccataaaaaccctagaagaaaaccta ggcattaccattcaggacataggcatgggcaaggacttcatgactaaaacaccaaaagca atggcaacaaaagccaaaattgacaaatgggatctcattaaactaaagagcttctgcaca gcaaaagaaactaccgtcagagtgaacaggcaacctacaaaatgggagaaaattttcaca acctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttac aagaaaaaaacaaacaaccccatcaaaaagtgggcgaaggacatgaacagacacttctca aaagaagacatttatgcagccaaaaaacacatgaagaaatgctcatcatcactggccatc agagaaatgcaaatcaaaaccactatgagatatcatctcacaccagttagaatggcaatc attaaaaagtcaggaaacaacagctactctggtggggacttggagaacctttgtgtccac actctgtatctagctaatctagtgggcatgtggagaacttttgtgtctagctcagggatt gtaaacgcaccaatcagcaccctgtcaaaacggaccagtcagctctctgtaaaatggacc aatcagcaggatgtggctgtttgggtccacactgcctttatgagctgtaacactcatgcg aaggtctgcagcttcgctcccgaagccagccagaccacgaacccaccaggaggaacaaac aactccagacgcgctgccttaagagctctaatactcaccgcgaagatctgcagcttcact cctgagccagcgagaccacgaacccaccagaaggaagaaaacgccgaacacatccgaaca tcagaaggaacaaactccggacacgccacctttaagaactgtaacactcaccgcgagggt ccgcggcttcattcttga >gi568815587f:59961161_60169676|GENSCAN_predicted_peptide_3|345_aa MASHEVDNAELGSASAHGTPGSEAGPEELNTSVYQPIDGSPDYQKAKLQVLGAIQILNAA MILALGVFLGSLQYPYHFQKHFFFFTFYTGYPIWGAVFFCSSGTLSVVAGIKPTRTWIQN SFGMNIASATIALVGTAFLSLNIAVNIQSLRSCHSSSESPDLCNYMGSISNPPTPRQALV CDVPRLASKCSHCSVPTYGLGEGAMLHFNNNNNNNNNNNNNNNKETTIFTWPEAAAGYDT KGILDQSQQAYQEACEITKKEMQPTDPIRLGMALNFSVFYYELLNSPEKSHSLVKAAFDE ALAELDTLSEESYKDRLLMQLLRDNLTLWTLDTQGDETEARGRKS >gi568815587f:59961161_60169676|GENSCAN_predicted_CDS_3|1038_bp atggcctcccacgaagttgataatgcagagctggggtcagcctctgcccatggtacccca ggcagtgaggcgggaccagaagagctgaatacttctgtctaccagcccatagatggatca ccagattatcagaaagcaaaattacaagttcttggggccatccagatcctgaatgcagca atgattctggctttgggtgtctttctgggttccttgcaatacccataccacttccaaaag cacttctttttcttcaccttctacacaggctacccgatttggggtgctgtgtttttctgt agttcaggaaccttgtctgttgtagcagggataaaacccacaagaacatggatacagaac agttttggaatgaacattgccagtgctacaattgcactagtggggactgcttttctctca ctaaatatagcagttaatatccagtcattaaggagttgtcactcttcatcagagtcaccg gacctatgcaattacatgggctccatatcaaatccccccaccccacgacaggccctggtg tgtgatgttccccgccttgcgtccaagtgttctcattgttcagttcccacctatggcctg ggggagggagcaatgctccatttcaacaacaacaacaacaacaacaacaacaacaacaac aacaacaacaaagagactactatttttacttggcctgaggctgctgctggttatgacaca aaagggatcctagatcagtcacaacaagcataccaagaagcttgtgaaatcaccaaaaag gaaatgcaaccaacagatcctatcagattgggtatggctctaaacttctctgtcttctat tatgagcttctgaactccccagagaaatctcattcacttgtaaaggcagcttttgatgaa gcccttgctgaacttgatacattaagtgaagagtcatacaaagatcgcctgctaatgcag ttactgagagacaacttgacactgtggacattggatactcaaggagatgaaactgaagca agaggaagaaaaagttaa >gi568815587f:59961161_60169676|GENSCAN_predicted_peptide_4|180_aa MDTESNRRANLALPQEPSSVPAFEVLEISPQEVSSGRLLKSASSPPLHTWLTVLKKEQEF LGVRGSLGANTASSIAGGTGITILIINLKKSLAYIHIHSCQKFFETKCFMASFSTEIVVM MLFLTILGLGSAVSLTICGAGEELKGNKVPEDRVYEELNIYSATYSELEDPGEMSPPIDL >gi568815587f:59961161_60169676|GENSCAN_predicted_CDS_4|543_bp atggacacagaaagtaataggagagcaaatcttgctctcccacaggagccttccagtgtg cctgcatttgaagtcttggaaatatctccccaggaagtatcttcaggcagactattgaag tcggcctcatccccaccactgcatacatggctgacagttttgaaaaaagagcaggagttc ctgggggtgagaggaagcctgggagcaaacactgccagcagcatagctgggggaacggga attaccatcctgatcatcaacctgaagaagagcttggcctatatccacatccacagttgc cagaaattttttgagaccaagtgctttatggcttccttttccactgaaattgtagtgatg atgctgtttctcaccattctgggacttggtagtgctgtgtcactcacaatctgtggagct ggggaagaactcaaaggaaacaaggttccagaggatcgtgtttatgaagaattaaacata tattcagctacttacagtgagttggaagacccaggggaaatgtctcctcccattgattta taa >gi568815587f:59961161_60169676|GENSCAN_predicted_peptide_5|140_aa MEIQVIKASIIGQYCIARERKEFTHPVGRLSCLRQKLYNGTTETVTSWSSNHTERNPFSK FPKLRTVWTHPESHRDWTAPTGLYWICGHRAFAKLPDESAGSCVIGTIKPSFFLLPIRTG ELLSFPVYASREKKSIAIEN >gi568815587f:59961161_60169676|GENSCAN_predicted_CDS_5|423_bp atggaaattcaagtcataaaagcctcaattattggacaatattgcatagctagagaaaga aaagaattcactcaccccgtaggacgacttagttgtctaagacagaaactgtataatggt accacagaaacagtcacttcgtggagttcaaatcacacagagagaaatccatttagtaaa ttcccaaagttgcgaaccgtttggacccatccagagtcccaccgggactggacagccccc actggattatactggatatgtgggcatagagcttttgccaaattacctgacgagtcggca ggtagttgtgttattggcactattaaaccatctttcttcttactgcccataaggacaggt gaactcctgagcttccctgtctatgcttcccgcgaaaagaaaagcatagctatagaaaat tga >gi568815587f:59961161_60169676|GENSCAN_predicted_peptide_6|181_aa MKENLLGELTYMITRAACKLRSQEASPGPKISEVGKLIMRFQSLAEGLRAPGKSLVLSLR IQKLKNLESDVRGQEVSNKRMGPPSCRKTNMAPTDSTLRVFEQPSPLAMQPGKKPVSGIL HSSVAQSGSEETQHIDFCDQTCGDFSPPKRQAIISGVNTSWLSFNSVETSSKLEIISDSM V >gi568815587f:59961161_60169676|GENSCAN_predicted_CDS_6|546_bp atgaaggaaaatttattaggagaattgacttacatgatcacaagggcagcctgcaagctg aggagccaggaagccagtccgggtcccaaaatctcagaagtagggaagctgataatgcga tttcagtctttggccgaaggcctgcgagcccctggcaaatcactagtgttaagtctaaga atccaaaaactgaagaacttggagtctgatgttcgagggcaggaagtatccaacaaaaga atgggaccacctagttgcaggaaaacaaacatggctcccactgattctacattacgggtc tttgagcagcccagtcccctagccatgcagccaggcaagaaacctgtgtctggcatactc cattcatccgtcgctcagtcagggtctgaggagacacaacatatagacttctgcgaccaa acgtgtggggatttctccccaccaaaaaggcaagcaatcatttctggagtgaacaccagc tggctgtccttcaattctgttgagacatcatctaaactggagataatatcagattccatg gtttga >gi568815587f:59961161_60169676|GENSCAN_predicted_peptide_7|168_aa MVLGLQVHRSQELRLYRNAWMSRQRGVAGLEPSWRTSARAMWKGNVGYEPPHRVPTGALP NGAVRRRPPSSRLQNGRCTDSLHCMPGKAADTQRQPMKAARRVAIHCKATEAELLKAMGA HFLHQHDLDVRHRVKGDHFGTLRFNVYAVGFQTCMRPAALLFWTVSPM >gi568815587f:59961161_60169676|GENSCAN_predicted_CDS_7|507_bp atggtgttgggtctgcaagtgcacagaagtcaagaattgagattgtatagaaatgcctgg atgtccaggcagaggggtgttgcagggctggagccatcatggagaacctctgctagggca atgtggaagggaaatgtggggtatgagcccccacacagagtccccactggagcactgcct aatggagctgtgagaagacggccaccatcctccagactccagaatggtagatgcactgac agcttgcactgtatgcctggaaaagctgcagacactcaacgccagcccatgaaagcagcc aggagggtggccatacactgcaaagccacagaggcagagcttctcaaggccatgggagcc cacttcttgcatcagcatgacctagatgtgagacatagagtcaaaggagatcattttgga actttaaggtttaatgtctatgctgttggatttcagacttgcatgaggcctgcagccctt ttgttttggacagtttctcccatgtag >gi568815587f:59961161_60169676|GENSCAN_predicted_peptide_8|440_aa MGGGGQVSCEHGEGTEVPQYLDIRGRCSQGKDHKLSLERDHLTTKPVGGLDLALTATEGF EAFSEKAAPGPREKMAILPKVIDRVDAIPIKLPMPFFTELEKTTLKFIWNQKRARIAKSI LSQKNKPGGIMLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKP EKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNIRPKTIKTLEEN LGITIQGIGMGKDFMSKTPKAMATEAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIF TTYSSDKGLISRIYNELKQICKKKTNNPIKKWAKDMNRHFSKEDIYAAKRHMKKCSPSLA IREMQIKTTMRYHLTPVRMAIIKKSGNNRKQEEKKEASTYFKCLIQKLSYEEYLLMDKHS FVFLFYKSCEYSDVDGNMFA >gi568815587f:59961161_60169676|GENSCAN_predicted_CDS_8|1323_bp atgggtggtggtggacaggtctcctgtgagcatggagaaggcactgaagtacctcaatac ctggacatcagaggcagatgcagtcaagggaaggatcataaactttctttggaaagagac catcttactaccaagccagtcgggggcttggatcttgccctaacagcaactgaaggattt gaggctttttctgaaaaggcagccccaggtcctagggagaaaatggccatactgcccaag gtaattgatagagtcgatgccatccccatcaagctaccaatgcctttcttcacagaactg gaaaaaactactttaaagttcatatggaaccaaaaaagagcccgaattgccaagtcaatc ctaagccaaaagaacaaacctggaggcatcatgctacctgacttcaaactatactacaag gctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaac agaacggagccctcagaaataacgccgcatatctacaactatctgatctttgacaaacct gagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactgg ctagccatatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaat tcaagatggattaaagacttaaacattagacctaaaaccataaaaaccttagaagaaaac ctaggcattaccattcagggcataggcatgggcaaggacttcatgtctaaaacaccaaaa gcaatggcaacagaagccaaaattgacaaatgggatctcattaaactaaagagcttctgc acagcaaaagaaactaccatcagagtgaacaggcaacctacaaaatgggagaaaattttc acgacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatt tgcaagaaaaaaacaaacaaccccatcaaaaagtgggcaaaggacatgaacagacacttc tcaaaagaagacatttatgcagccaaaagacacatgaaaaaatgctcaccatcactggcc atcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagttagaatggcg atcattaaaaagtcaggaaacaacaggaaacaggaagagaagaaagaagcatccacttat tttaagtgtttaattcagaaattgtcctatgaagaatatctcctgatggataaacattct tttgtcttccttttctacaaatcctgtgagtactcggatgtagatggaaacatgtttgcc tga