GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:56:14 Sequence gi568815581r:9798098_10005180 : 207083 bp : 47.22% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 2410 2405 6 1.05 1.05 Term - 4547 4289 259 1 1 82 42 162 0.648 6.02 1.04 Intr - 9504 9393 112 1 1 72 111 47 0.917 4.84 1.03 Intr - 10885 10733 153 2 0 81 80 91 0.954 7.64 1.02 Intr - 12521 12474 48 0 0 87 94 21 0.508 1.25 1.01 Init - 23974 23665 310 1 1 92 105 381 0.985 35.68 1.00 Prom - 29100 29061 40 -6.96 2.00 Prom + 29506 29545 40 -2.56 2.01 Init + 29887 29919 33 0 0 55 96 29 0.304 0.34 2.02 Intr + 35710 35797 88 0 1 85 107 30 0.769 4.14 2.03 Intr + 44398 44519 122 2 2 82 92 89 0.737 8.91 2.04 Intr + 45530 45712 183 1 0 97 37 59 0.500 1.68 2.05 Intr + 56398 56504 107 0 2 101 109 66 0.273 9.01 2.06 Intr + 59326 59479 154 1 1 60 74 202 0.899 16.07 2.07 Intr + 61845 62004 160 2 1 138 84 166 0.990 20.76 2.08 Intr + 65664 65859 196 1 1 110 75 41 0.612 3.47 2.09 Intr + 72404 72649 246 2 0 90 44 97 0.072 2.07 2.10 Intr + 82281 82419 139 0 1 102 103 129 0.999 16.27 2.11 Intr + 89835 89876 42 2 0 101 64 83 0.979 5.64 2.12 Term + 91273 91608 336 0 0 104 46 341 0.992 26.07 2.13 PlyA + 92645 92650 6 1.05 3.05 PlyA - 93444 93439 6 1.05 3.04 Term - 100107 99998 110 1 2 50 44 134 0.945 3.87 3.03 Intr - 100312 100286 27 2 0 97 90 27 0.759 1.89 3.02 Intr - 103003 102892 112 1 1 86 23 161 0.960 9.35 3.01 Init - 107083 106703 381 1 0 56 61 981 0.822 89.07 3.00 Prom - 112129 112090 40 -5.56 4.12 PlyA - 112440 112435 6 1.05 4.11 Term - 119244 119131 114 0 0 117 37 213 0.984 17.67 4.10 Intr - 120002 119904 99 2 0 87 75 183 0.891 17.21 4.09 Intr - 121608 121529 80 1 2 75 99 131 0.912 12.17 4.08 Intr - 127502 127379 124 2 1 64 119 219 0.953 22.76 4.07 Intr - 128672 128544 129 2 0 84 100 355 0.956 37.19 4.06 Intr - 136147 136069 79 0 1 111 86 63 0.987 7.95 4.05 Intr - 142103 142029 75 1 0 116 96 86 0.997 10.83 4.04 Intr - 145139 145024 116 2 2 117 67 175 0.998 17.55 4.03 Intr - 148886 148797 90 2 0 113 94 144 0.828 17.69 4.02 Intr - 161231 161105 127 1 1 78 98 97 0.563 10.38 4.01 Init - 163771 163617 155 1 2 99 53 60 0.920 3.06 4.00 Prom - 164146 164107 40 -2.16 5.07 PlyA - 164494 164489 6 1.05 5.06 Term - 167408 167340 69 2 0 97 54 21 0.495 -2.46 5.05 Intr - 171665 171580 86 0 2 97 110 51 0.439 7.74 5.04 Intr - 181366 181278 89 0 2 121 77 -9 0.297 0.91 5.03 Intr - 188018 187952 67 0 1 90 72 74 0.719 3.96 5.02 Intr - 189903 189825 79 0 1 33 52 99 0.615 -0.08 5.01 Init - 190448 190401 48 2 0 84 66 70 0.688 5.35 5.00 Prom - 194992 194953 40 -2.46 6.00 Prom + 195123 195162 40 -10.84 6.01 Sngl + 195299 196687 1389 1 0 35 48 461 0.702 32.88 6.02 PlyA + 197122 197127 6 1.05 7.02 PlyA - 198038 198033 6 1.05 7.01 Term - 200367 200250 118 2 1 103 38 102 0.423 4.71 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 54763 54566 198 1 0 -57 41 300 0.805 8.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:9798098_10005180|GENSCAN_predicted_peptide_1|293_aa MDRAKQQQALLLLPVCLALTFSLTAVVSSHWCEGTRRVVKPLCQDQPGGQHCIHFKRDNS SNGRMDNNSQAVLYIWELGDDKFIQRGFHVGLWQSCEESLNGEDEKCRSFRSVVPAEEQG VLWLSIGGEVLDIVLILTSAILLGSRVSCRSPGFHWLRVDALVAIFMVLAGLLGMVAHMM YTTIFQITVNLGPEDWKPQTWDYGWSYCLAWGSFALCLAVSVSAMSRFTAARLEFTEKQQ AQNGSRHSQHSFLEPEASESIWKTGAAPCPAEQAFRNVSGHLPPGAPGKVSIC >gi568815581r:9798098_10005180|GENSCAN_predicted_CDS_1|882_bp atggacagggccaagcagcagcaggcgctgctcctcctccctgtctgcctcgccctcacc ttctccctcaccgccgtggtcagcagccactggtgtgaggggacccgacgggtggtgaag ccactgtgccaggaccagccgggagggcagcactgcattcacttcaaacgggacaacagc agcaatggcaggatggacaacaatagccaggctgtcctgtacatttgggagctgggtgat gacaagttcattcagcgggggttccatgtggggctctggcagtcctgcgaggagagcctc aacggtgaagatgaaaagtgtaggagtttccggagtgtagtgccagctgaagaacaaggt gttttgtggctgtccatcgggggcgaggtcctggatatcgttctgatactgacaagcgcc atcctcctgggctccagagtgagttgtcgcagccctgggttccactggctcagggtggat gccttggtagccatcttcatggtgctggcagggcttctaggcatggtggcccacatgatg tacacaaccatttttcaaatcactgtgaaccttggaccagaagattggaagcctcagacc tgggactatggctggtcatattgccttgcctggggttctttcgccctctgcctggctgtg tcggtctcggccatgagcaggttcacggcagcccgcctggaattcaccgagaagcagcag gcacagaacggcagtcggcactctcaacacagcttcctggaacccgaggcttcggagagc atttggaaaacaggagctgctccttgccctgctgaacaagccttcaggaatgtttctgga cacctcccaccaggcgccccaggcaaggtgtccatatgctag >gi568815581r:9798098_10005180|GENSCAN_predicted_peptide_2|601_aa MAVFLLRSMGRVTGSLLEETTRKWAQYKQACLRDLLKEPSESSGRAYRHCLAQGTWQTIE NATDIWQDDSECSENHSFKQNNPSAPYFLVGLLLFSLRTSCLSNICSLSFKVQCKRLHKA EEQVFGDQTDIYGHTLVSFIPMVDRYALLSTLQLMYTVGYSFSLISLFLALTLLLFLRKL HCTRNYIHMNLFASFILRTLAVLVKDVVFYNSYSKRPDNENGWMSYLSEMSTSCRSVQVL LHYFVGANYLWLLVEGLYLHTLLEPTVLPERRLWPRYLLLGWDPVQASAPALANTISSIS LKPIFCHCKPHVLIILAASPSIWWVLDKHIQCTLMAYILDPERFLLTRLSVQMTFGTLEI QMAFGTVEIQMAFRTVEITAKHLLPGQLTAELPLVYIELMELMAEPSLTLKFTAMWNSSL TYPILLFFSRLAKSTLVLIPLLGVHEILFSFITDDQVEGFAKLIRLFIQLTLSSFHGFLV ALQYGFANGEVKAELRKYWVRFLLARHSGCRACVLGKDFRFLGKCPKKLSEGDGAEKLRK LQPSLNSGRLLHLAMRGLGELGAQPQQDHARWPRGSSLSECSEGDVTMANTMEEILEESE I >gi568815581r:9798098_10005180|GENSCAN_predicted_CDS_2|1806_bp atggctgtcttcctccttcgttccatgggcagggttacaggatccctccttgaggaaacg actcggaagtgggctcagtacaaacaggcatgtctgagagacttactcaaggaaccttct gagagctcaggaagggcctacagacactgcttggctcaggggacttggcagacgatagag aacgccacggatatttggcaggatgactccgaatgctccgagaaccacagcttcaagcaa aacaatccttctgcaccatactttctcgttggactcttgctcttctccctgaggacctcc tgtttgtccaacatctgctcactttccttcaaagtgcaatgcaagaggctgcacaaagcg gaggaacaggtgtttggggatcagacagatatctacggccacaccttggtttctttcatt ccgatggtggatcgttatgccttgctgtcaaccttgcagctgatgtacaccgtgggatac tccttctctcttatctccctcttcctggctctcaccctcctcttgtttcttcgaaaactc cactgcacgcgcaactacatccacatgaacttgtttgcttctttcatcctgagaaccctg gctgtactggtgaaggacgtcgtcttctacaactcttactccaagaggcctgacaatgag aatgggtggatgtcctacctgtcagagatgtccacctcctgccgctcagtccaggttctc ttgcattactttgtgggtgccaattacttatggctgctggttgaaggcctctacctccac acgctgctggagcccacagtgcttcctgagaggcggctgtggcccagatacctgctgttg ggttgggatcctgtgcaggcttctgctccagccctagccaacaccatttcaagtatctct ttaaaacccattttctgccactgtaagccccatgttctgatcatcctggcagccagccca agcatatggtgggttcttgataaacatattcagtgcacactcatggcctacattctggac cctgaaaggttcttgctaaccagactttcagttcagatgacctttgggactttggagatt cagatggcctttgggacagtggagattcagatggcctttaggacagtggagatcacagct aagcatttactacctggccagctaactgctgaattgcccctggtttacattgaactgatg gaactgatggccgagcccagcctgaccttgaagttcacagctatgtggaattcgtcactt acttacccaattctgctctttttttcaagattggcaaaatcaacactggtcctcattcct ttattgggcgttcatgagatcctcttctctttcatcactgatgatcaagttgaaggattt gcaaaacttatacgacttttcattcagttgacactgagctcctttcatgggttcctggtg gccttgcagtatggttttgccaatggagaggtgaaggctgagctgcggaaatactgggtc cgcttcttgctagcccgccactcaggctgcagagcctgtgtcctggggaaggacttccgg ttcctaggaaaatgtcccaagaagctctcggaaggagatggcgctgagaagcttcggaag ctgcagccctcacttaacagtgggcggctcctacatctagccatgcgaggtcttggggag ctgggcgcccagccccaacaggaccatgcacgctggccccggggcagcagcctgtccgag tgcagtgagggggatgtcaccatggccaacaccatggaggagattctggaagagagtgag atctag >gi568815581r:9798098_10005180|GENSCAN_predicted_peptide_3|209_aa MGNSKSGALSKEILEELQLNTKFSEEELCSWYQSFLKDCPTGRITQQQFQSIYAKFFPDT DPKAYAQHVFRSFDSNLDGTLDFKEYVIALHMTTAGKTNQKLEWAFSLYDVDGNGTISKN EVLEIVMAIFKMITPEDVKLLPDDENTPEKRAEKIWKYFGKNDDGSSKGHGKGDKLTEKE FIEGTLANKEILRLIQFEPQKVKEKMKNA >gi568815581r:9798098_10005180|GENSCAN_predicted_CDS_3|630_bp atggggaacagcaaaagtggggccctgtccaaggagatcctggaggagctgcagctgaac accaagttctcggaggaggagctgtgctcctggtaccagtccttcctgaaggactgtccc accggccgcatcacccagcagcagttccagagcatctacgccaagttcttccccgacacc gaccccaaggcctacgcccagcatgtgttccgcagcttcgattccaacctcgacggcacc ctggacttcaaggagtacgtcatcgccctgcacatgaccaccgcgggcaagaccaaccag aagctggagtgggccttctccctctacgacgtggacggtaacgggaccatcagcaagaat gaagtgctggagatcgtcatggctattttcaaaatgatcactcccgaggacgtgaagctc cttccagacgatgaaaacacgccggaaaagcgagccgagaagatctggaagtactttgga aagaatgatgatgggagctccaagggtcatgggaaaggagataaacttacagagaaagaa ttcattgaggggacactggccaataaggaaattctgcgactgatccagtttgagcctcaa aaagtgaaggaaaagatgaagaacgcctga >gi568815581r:9798098_10005180|GENSCAN_predicted_peptide_4|395_aa MAEDRGKSYAGLDSNFHSSVYYSLVTLDSFTSASPNFMICKMELMPFNLLGLEPEWGEKM SNMENSFDDVSCLSPQNLGSSSPSKKQSKENTITINCVTFPHPDTMPEQQLLKPTEWSYC DYFWADKKDPQGNGTVAGFELLLQKQLKGKQMQKEMSEFIRERIKIEEDYAKNLAKLSQN SLASQEEGSLGEAWAQVKKSLADEAEVHLKFSAKLHSEVEKPLMNFRENFKKDMKKCDHH IADLRKQLASRYASVEKARKALTERQRDLEMKTQQLEIKLSNKTEEDIKKARRKSTQAGD DLMRCVDLYNQAQSKWFEEMVTTTLELERLEVERVEMIRQHLCQYTQLRHETDMFNQSTV EPVDQLLRKVDPAKDRELWVREHKTGNIRPVDMEI >gi568815581r:9798098_10005180|GENSCAN_predicted_CDS_4|1188_bp atggcggaagatagaggaaagagttatgcaggcttggattctaatttccactcctctgtt tactatagccttgtgaccctggacagttttacttctgcaagccccaatttcatgatctgc aaaatggagttgatgccatttaacttgctggggttggaacctgaatggggggaaaaaatg tccaatatggagaacagctttgacgatgtttcttgcctctctccccagaacctgggatcc tcatcgccaagcaaaaagcagagcaaggaaaacaccatcacaataaactgtgtgacgttc cctcacccagacacgatgccggaacagcagctgctgaaaccaaccgagtggagctactgc gactacttctgggctgataagaaggacccccaaggcaacggcaccgtggctgggtttgaa ctactgctccagaaacagctgaagggcaaacaaatgcagaaggaaatgtcagaattcatc cgggaaaggataaagattgaagaagactatgcgaagaacttagctaagctctctcagaac tccttggcttcacaggaggaaggctccttgggagaggcgtgggcccaggtgaagaagagc ctggcggacgaagcagaagttcacctcaagttctctgccaagcttcacagcgaggtggag aagcccctgatgaacttccgtgagaacttcaagaaagacatgaagaagtgcgaccaccac attgccgaccttcgcaagcagctcgccagccgctatgcctcggtggagaaggcccggaaa gccctcacagagcggcagagagacctggagatgaagacccagcagctggagatcaagctg agcaacaagacagaggaggacatcaagaaggcgcggagaaagtccacacaggctggagac gacctcatgcgctgtgtggatctctacaaccaggcccagtccaaatggtttgaagagatg gtgaccaccacattggagctagagcggctggaggtggagagggtagagatgatccggcag cacctgtgccagtacacgcagctgcggcatgaaacagacatgttcaaccaaagcacagtc gagcccgtggatcagctgcttcgaaaagtggacccggccaaagacagggagctgtgggtc agagagcacaagacgggcaacatccgccctgtggacatggagatctag >gi568815581r:9798098_10005180|GENSCAN_predicted_peptide_5|145_aa MAVLEGIEKPEMEETLGAAESRHKPRKVDEPQQEATGGQRLVGQVPYLRGRTSRGFDPQG DKVSRCIKSLSLSSAALCAGPLKGAEKGGEMGGEVNGYHASGTPAHPPETAHMSVRKSTG DSQVLVCDVPPPVSIWSHCSSPTYE >gi568815581r:9798098_10005180|GENSCAN_predicted_CDS_5|438_bp atggctgtactggaaggcattgaaaaaccagagatggaggaaactctgggggcagcagag agccggcacaagccaagaaaagtggacgagccccagcaggaggccactgggggccagagg cttgtaggccaggtgccgtatttgagagggcgaaccagccgtggcttcgaccctcaagga gataaggtgtcaaggtgcatcaagagcctctctctgtccagcgctgcactgtgtgctggg cccctaaagggggctgagaaaggtggggagatgggtggagaagtgaatggataccacgca tcagggaccccagcgcaccctccagagactgcccacatgagtgtccgaaaatccaccggt gattcccaggtcctggtgtgtgatgttcccccaccggtgtccatatggtcccattgttca tctcccacttatgagtga >gi568815581r:9798098_10005180|GENSCAN_predicted_peptide_6|462_aa MSSFEKCLFMSFAHFLMDAEKAFDKIQQPFMLETLNKLGIDGTYFKIIRAIYDKPTANIM LNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFAD DMIVYLENPIVSAQNLLRLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMNELPLTIA SKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVI YRFNAIPIKLPMPFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLPDFKLYYKAT VTKTAWYWYQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKHWGKDSLFNKWCWENWLA ICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFRSKTPKAM ATKAKIDKWDLIKLKSFCTAKDYHQSEQATYKMGENFHNLLT >gi568815581r:9798098_10005180|GENSCAN_predicted_CDS_6|1389_bp atgtcttcttttgagaagtgtctgttcatgtcctttgcccactttttgatggatgcagaa aaggcctttgacaaaattcaacaacccttcatgctagaaaccctcaataaattaggtatt gatgggacgtatttcaaaataataagagctatctatgacaaacccacagccaatatcatg ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccactcctattcaacatagtgttggaagttctggccagggcaattaggcaggag aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagac gacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaggctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta tacaccaataacagacaaacagagagccaaattatgaatgaactcccactcacaattgct tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaaggaaataaaagaggatacaaacaaatggaagaacatt ccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatt tacagattcaatgccatccccatcaagctaccaatgcctttcttcacagaattggaaaaa actactttaaagttcatatggaaccaaaaaagagcccacatcgccaagtcaatcctaagc caaaagaacaaagctggaggcatcacgctacctgacttcaaactatactataaggctaca gtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaaca gagccctcagaaataacgccgcatatctacaactacctgatctttgacaaacctgagaaa aacaagcattggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaaga tggattaaagacttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggc attaccattcaggacataggcatgggcaaggacttcaggtctaaaacaccaaaagcaatg gcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagca aaagactaccatcagagtgaacaggcaacctacaaaatgggagaaaattttcacaaccta ctcacctga >gi568815581r:9798098_10005180|GENSCAN_predicted_peptide_7|39_aa XLAFSSSAAIWQKWPLIVEEDHPKSCSIQLLAGPVAVAK >gi568815581r:9798098_10005180|GENSCAN_predicted_CDS_7|120_bp nntcttgccttttcttcttcagcagccatttggcagaagtggccactcatcgtggaagag gaccaccccaagtcctgttctatacaactcttggcaggtccagtagcagtagccaagtag