GENSCAN 1.0 Date run: 2-Nov-116 Time: 21:10:54 Sequence gi568815578r:5447664_5704412 : 256749 bp : 44.29% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5152 5298 147 0 0 48 53 160 0.753 6.59 1.02 Intr + 5485 5602 118 0 1 49 78 81 0.823 3.24 1.03 Term + 5863 6026 164 2 2 -5 45 178 0.579 2.30 1.04 PlyA + 6489 6494 6 1.05 2.00 Prom + 9804 9843 40 -4.06 2.01 Init + 18233 18290 58 1 1 48 48 80 0.209 1.47 2.02 Intr + 21490 21624 135 2 0 47 87 45 0.148 0.84 2.03 Intr + 22299 22480 182 1 2 -23 85 157 0.541 3.89 2.04 Term + 26040 26276 237 2 0 108 47 170 0.662 11.07 2.05 PlyA + 26509 26514 6 -0.45 3.00 Prom + 28359 28398 40 -4.46 3.01 Init + 28413 29028 616 2 1 71 113 100 0.367 6.30 3.02 Intr + 29637 29742 106 1 1 52 93 25 0.097 -1.33 3.03 Intr + 37386 37517 132 0 0 66 68 104 0.874 5.96 3.04 Term + 38114 38234 121 2 1 115 49 76 0.853 4.35 3.05 PlyA + 40946 40951 6 1.05 4.06 PlyA - 41878 41873 6 1.05 4.05 Term - 54131 53895 237 2 0 108 47 153 0.211 9.37 4.04 Intr - 58449 58327 123 0 0 54 53 81 0.758 1.98 4.03 Intr - 60665 60598 68 0 2 92 77 57 0.913 3.62 4.02 Intr - 62880 62766 115 0 1 100 70 66 0.886 6.02 4.01 Init - 68125 67484 642 1 0 86 -33 257 0.375 8.73 4.00 Prom - 68908 68869 40 -2.26 5.00 Prom + 69287 69326 40 -9.85 5.01 Sngl + 69439 70725 1287 0 0 58 55 362 0.395 25.88 5.02 PlyA + 71356 71361 6 1.05 6.00 Prom + 72725 72764 40 0.54 6.01 Sngl + 85424 85567 144 1 0 74 52 151 0.608 4.01 6.02 PlyA + 85715 85720 6 1.05 7.23 PlyA - 85902 85897 6 1.05 7.22 Term - 100187 99998 190 1 1 84 29 171 0.934 7.72 7.21 Intr - 110442 110282 161 0 2 96 36 31 0.711 -2.51 7.20 Intr - 111156 111021 136 2 1 77 119 67 0.963 9.27 7.19 Intr - 112413 112277 137 0 2 109 82 -9 0.899 -0.03 7.18 Intr - 117415 117354 62 2 2 49 115 6 0.671 -2.15 7.17 Intr - 119109 119070 40 0 1 109 95 35 0.812 4.10 7.16 Intr - 119897 119820 78 2 0 113 62 55 0.963 5.15 7.15 Intr - 122576 122484 93 2 0 68 102 28 0.618 2.36 7.14 Intr - 126306 126252 55 2 1 108 103 2 0.866 2.68 7.13 Intr - 127882 127750 133 2 1 37 100 59 0.887 1.70 7.12 Intr - 128315 128153 163 2 1 78 94 45 0.964 3.65 7.11 Intr - 130948 130717 232 0 1 68 81 161 0.784 11.18 7.10 Intr - 145748 145683 66 1 0 111 42 52 0.073 0.92 7.09 Intr - 151158 151062 97 1 1 114 113 45 0.859 8.67 7.08 Intr - 154448 154322 127 2 1 78 57 86 0.834 4.75 7.07 Intr - 163296 163179 118 2 1 68 78 27 0.125 0.17 7.06 Intr - 163618 163498 121 2 1 14 58 158 0.069 4.95 7.05 Intr - 187729 187616 114 2 0 81 97 -4 0.047 0.22 7.04 Intr - 205789 205676 114 2 0 60 81 64 0.032 3.32 7.03 Intr - 210141 210057 85 0 1 73 85 4 0.050 -2.01 7.02 Intr - 222007 221941 67 0 1 96 101 32 0.138 4.21 7.01 Init - 240716 240583 134 2 2 96 7 115 0.421 3.81 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 228313 228376 64 0 1 102 94 33 0.840 6.62 S.002 Term - 246766 246699 68 2 2 76 47 77 0.802 0.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578r:5447664_5704412|GENSCAN_predicted_peptide_1|142_aa MPKPGRPVWEVRCASAWPPVVWDVRSAAARPPIVWEVRSAAARPPIVWEEVRSAAARPPR LGSEERLCPAAHRLGSEERCCPATHRLGVRCASARLPVVWEVRSAAARPPIVWDVRSASA RLPRLGSVPNSSEETATIKNGP >gi568815578r:5447664_5704412|GENSCAN_predicted_CDS_1|429_bp atgcctaagcccggccgccccgtctgggaagtgaggtgcgcctctgcctggccgcctgtc gtctgggatgtgaggagcgccgctgcccggccacccatcgtctgggaagtgaggagcgct gctgcccggccacccatcgtctgggaagaagtgaggagcgccgctgcccggccaccccgt ctgggaagtgaggagcgcctctgcccggccgcccatcgtctgggaagtgaggagcgctgc tgcccggccacccatcgtctgggagtgaggtgcgcctctgcccggctgcccgtcgtctgg gaagtgaggagcgctgctgcccggccacccatcgtctgggatgtgaggagcgcctctgcc cggctgccccgtctgggaagtgtacccaacagctccgaagagacagcgaccatcaagaac gggccatga >gi568815578r:5447664_5704412|GENSCAN_predicted_peptide_2|203_aa MGPTVDDCYRVTLEEFRGRWSQELQTPRKCSHSAKSLQESTPGRLLKHMSTPGKKTSSIT LNAEAPVAISTSDHQHQWPSAPVIISTSGHQHHGHQHQWVISTMAISTSGPHGRGSHMGL LPGYLWLTLETKTKTPPPFSSTTQISTGKDKGLNPQLLKMDPGHMGWSDTPAQLSAGEEA QKRFRGLKDILLPCPYEQAISAP >gi568815578r:5447664_5704412|GENSCAN_predicted_CDS_2|612_bp atgggacctactgtcgatgactgctacagagttactctagaagaattcagaggaagatgg tctcaggagctccaaaccccaaggaagtgcagccattctgctaaatcactccaggaatca acccctgggcgtcttctgaagcacatgtctacaccggggaagaaaacctcctccataaca ctaaatgctgaagcaccagtggccatcagcaccagtgatcatcagcaccagtggccatca gcaccagtgatcatcagcaccagtggccatcagcaccatggccatcagcaccagtgggtc atcagtaccatggccatcagcaccagtggcccgcatgggaggggttctcacatggggctt ctgcctggctacctgtggctaacactggagacgaagaccaagacaccaccgcctttctcc agtaccactcaaatcagcacaggcaaggacaaaggcctcaatccacaactgctgaagatg gaccctggccacatgggatggtcagacacgcctgcccagctatctgcaggcgaagaggct cagaagaggtttaggggcctgaaggacatcttgcttccatgtccatatgagcaggctatt tctgctccatga >gi568815578r:5447664_5704412|GENSCAN_predicted_peptide_3|324_aa MGEDFMTKTPKAMATKAKIDKWHLIKLKSFCTAKETTIRMNRQPTEWEKIFAIYPSDKGL ISRICKELKQIYKKKSNNSINKWAKDMNRHFSKEDIYATKRHMKECSSSLAIREMQIKTT MRYYLTPVRMAIIKKSGNNRCWKGYGEIGTLLHCWWDCKLVQPLWKTVWRFLKDLELEIP FDPAIPLLGIYPKDYKSCCYKDTCTPYPGEAPWPSTNSQVKHKQPSPMAPSALKEEVALT LHIQHIWAITLDGNIRLGPTVVITGVDQVAESSVTMETPASSANRDLDLPSPLSWILLSV VQAFSGALEFPLTFSPASCICCSH >gi568815578r:5447664_5704412|GENSCAN_predicted_CDS_3|975_bp atgggcgaggacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatggcatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagaatg aacaggcaacctacagaatgggagaaaatttttgcaatctacccatctgacaaagggcta atatccagaatctgcaaagaacttaaacaaatttacaagaaaaaatcaaacaactccatc aataagtgggcaaaggatatgaacagacacttctcgaaagaagacatttatgcaaccaaa agacacatgaaagaatgttcatcatcactggccatcagagaaatgcaaatcaaaaccacc atgagatactatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagg tgctggaaaggatatggagaaataggaacacttttacactgttggtgggactgtaaacta gttcaaccattgtggaagacagtgtggcgattcctcaaggatctagaactagaaatacca tttgatccagcgatcccattactgggtatatacccaaaggattataaatcatgctgctat aaagacacatgcacaccttaccctggtgaagctccctggccaagcacaaactcccaggtc aagcacaaacagccttcaccaatggcaccatcagccctcaaagaggaggtggcactaaca cttcacatccagcacatctgggctatcactcttgatggcaacattagattggggcccact gtggtcataacaggggtggaccaggtggcagagtcttcagtcaccatggaaaccccggca tccagtgctaacagggacctcgacctcccttctccactctcctggatcctgctcagtgtg gtgcaggccttctctggggccctggagttcccactgaccttcagccccgccagttgcatc tgctgttcccactga >gi568815578r:5447664_5704412|GENSCAN_predicted_peptide_4|394_aa MAILPKVIYRFNAIPIKLPVTFFTELGKTTLRFIWNQKRACIGKSVLSQKNKAGGITLPD FKLYYKATVTKTAWYWYQNRDIDQWNRTESSEIMLHIYNHLIFDKPDKNKKWGKDSLFNK WCWENWLAICRKLKLDPFLTSHTKINSRWIKDLNVRPKTIKTLEENLGNTIQAIGMGKDF MTKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRLLGRPPALFTASSSVLKQLALEGILI LDSRALLGFLYEARHSHSNSPNHDAQNATSKKNIRDGYDKIYRQEQVLARMEEKTLITAG GNVKWCSHFRKQIGGQWLTLETKTKTPQPFSSTSQISTDKDKGLNPQLLKMDPGHMGWSD TPAQLSAGEEAQKRFRGLKDILLPCPYEQAISAP >gi568815578r:5447664_5704412|GENSCAN_predicted_CDS_4|1185_bp atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccagtg actttcttcacagaattgggaaaaactactttaaggttcatatggaaccaaaaaagagcc tgcattggcaagtcagtcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagaccaatggaacagaacagagtcctcagaaataatgctgcatatctacaaccat ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca tctcatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata aaaaccctagaagaaaacctaggcaataccattcaggccataggcatgggcaaggacttc atgactaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagactgctaggcaggcctcct gcgctcttcactgcctctagcagcgtcttgaaacagctggctctggaaggcatcctgatt ctggattcaagagctttgcttggatttctgtatgaagctcggcacagtcacagcaactct ccaaatcatgacgctcaaaatgccacatctaaaaagaacatccgggatggctatgataaa atctacagacaagaacaagtgttagcaaggatggaggaaaaaaccctcatcactgctggt gggaatgtgaaatggtgcagccactttagaaaacagattggcggccagtggctaacactg gagacgaagaccaagacaccacagcctttctccagtacctctcaaatcagcacagacaag gacaaaggcctcaatccacaactgctgaagatggaccctggccacatgggatggtcagac acgcctgcccagctatctgcaggcgaagaggctcagaagaggtttaggggcctgaaggac atcttgcttccatgtccatatgagcaggctatttctgctccatga >gi568815578r:5447664_5704412|GENSCAN_predicted_peptide_5|428_aa MIVYLENPIVSAQNLLKLISNFNKVSGYKINVQKSQAFVYTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRDVKDFFKENYKPLLNEIKEDTNKWKKIPCSWVGRINIVKMAILPKVIY RFNAIPIKLPMTFFTKLEKTTLKFIWNQKRAHIAKTILSQKNKAGSIALPDFKLYWKATV TKTAWYWYQNRDIDQWNRIEPSEIIPHIYNHLIFDKPDKNKKWGKDSLFNKWCWENWLAI CRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQAMGMGKDFMTETPKAMA TKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFTIYPSDKGLISRIYNELKQINKK KSNNPINKWAKDMNRRFSKEDIYAANRHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIK KSGNNRCW >gi568815578r:5447664_5704412|GENSCAN_predicted_CDS_5|1287_bp atgattgtatatttagaaaaccccatcgtctcagcccaaaatctccttaagctgataagc aacttcaacaaagtctcaggatacaaaatcaatgtgcaaaaatctcaagcattcgtatac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggacgtgaaggacttctttaaggag aactacaaaccactgctcaacgaaataaaagaggacacaaacaaatggaagaaaattcca tgctcatgggtaggaagaatcaatattgtgaaaatggccatactgcccaaggtaatttat agattcaatgccatccccatcaagctaccaatgactttcttcacaaaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcccacattgccaagacaatcctaagccaa aagaacaaagctggaagcatcgcgctacctgacttcaaactatactggaaggccacagta accaaaacagcgtggtattggtaccaaaacagagatatagatcaatggaacagaatagag ccctcagaaataataccacatatctacaaccatctgatctttgacaaacctgacaaaaac aagaaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagccata tgtagaaagctgaaactggatcccttccttacaccttatacaaaaattaattcaagatgg attaaagacttaaatgttagacctaaaaccataaaaaccctagaagaaaacctaggtaat actattcaggccatgggcatgggcaaggacttcatgactgaaacaccaaaagcaatggca acaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaa gaaactaccatcagagtgaacaggcaacctacagaatgggagaaaatttttacaatctac ccatctgacaaagggctaatatccagaatctacaatgaactcaaacaaattaacaagaaa aaatcaaacaatcccatcaataagtgggcaaaggatatgaacagacgcttctcaaaagaa gacatttatgcagccaacagacacatgaaaaaatgctcatcatcactggccatcagagaa atgcaaatcaaaaccacaatgagataccatctcacaccagttagaatggcgatcattaaa aagtcaggaaacaacaggtgctggtga >gi568815578r:5447664_5704412|GENSCAN_predicted_peptide_6|47_aa MQNPCQPKIPDWFSNRQKDIKDGKYSQDLANGLDNNLCENLEQLKKI >gi568815578r:5447664_5704412|GENSCAN_predicted_CDS_6|144_bp atgcagaatccatgccagcccaagatcccagactggttctcaaacagacagaaagatata aaggacggaaaatatagccaggacctggccaatggtctggataacaacctctgtgagaac ctggagcaactgaagaagatttga >gi568815578r:5447664_5704412|GENSCAN_predicted_peptide_7|840_aa MVGKKSTCYRSNREQPKLDLGGMEIPNSQSHKVALLVTVLQSSFKRILSGWALQLEAPIE AVGSAEMENIHHRGDFLVTFPSSSRSSFVQTGQFSGRDIDKDPKLSPVGRGWGFEWAIEL CMAVKEDVRQEVGSHIGLLPDVAMAFVNCRGTDGSVAVRMTRGHSHCHLGFGGFGGPQYK PMTPGQLSNVRAPGSAEKGSGDTGDARPPSAAPPGGSAGEARTAGARYLCPRSSLSGGAA ATRTCGLANPEEEGPSAKCGENGSAERTDLGGNKYNQERIQIEYVEVLFADFFREVFAIC GSCDALGNWNPQNAVALLPENDTGESMLWKATIVLSRGVSVQYRYFKGVKLTLEGLEEDD DDRVSPTVLHKMSNSLEISLISDNEFKCRHSQPECGYGLQPDRWTEYSIQTMEPDNLELI FDFFEEDLSEHVVQGDALPGHVGTACLLSSTIAESGKSAGILTLPIMSRNSRKTIGKVRV DYIIIKPLPGYSCDMKSSFSKYWKPRIPLDVGHRGAGNSTTTAQLAKVQENTIASLRNAA SHGAAFVEFDVHLSKDFVPVVYHDLTCCLTMKKKFDADPVELFEIPVKELTFDQLQLLKL THVTALKSKDRKESVVQEENSFSENQPFPSLKMDGMWDGNLSTYFDMNLFLDIILKTVLE NSGKRRIVFSSFDADICTMVRQKQNKYPILFLTQGKSEIYPELMDLRSRTTPIAMSFAQF ENLLGINVHTEDLLRNPSYIQEAKAKGLVIFCWGDDTNDPENRRKLKELGVNGLIYDRIY DWMPEQPNIFQVEQLERLKQELPELKSCLCPTVSRFVPSSLCGESDIHVDANGIDNVENA >gi568815578r:5447664_5704412|GENSCAN_predicted_CDS_7|2523_bp atggtgggcaagaagagcacctgctacaggtctaacagggagcagcctaaactggaccta gggggcatggagatccctaacagccaaagccataaagttgcactgctggtcacagtgctc caatcttcattcaagaggattctctcaggctgggctctgcagctggaggcccccatagag gcggtgggctccgctgagatggaaaacattcatcatcgcggtgacttccttgtaaccttt ccaagttcttcacgttcctcatttgttcagactggacagttctctgggagggacatagac aaggacccaaaactaagtcctgtgggcaggggctggggttttgaatgggctatagagcta tgcatggcagtcaaagaggatgttcgacaggaagtaggaagccatatagggttacttcct gatgttgccatggcatttgtaaactgtcgtggcactgatgggagtgtagcagtgaggatg accagaggtcactctcattgccatcttggttttggcggctttggtggaccacagtacaag cctatgaccccagggcagctctccaacgtgcgggcgccggggtcggctgagaagggcagc ggggacacgggggatgcccggccgccctcggccgcgccgcctggggggagcgctggcgag gcacggacggcgggcgcccggtacctctgcccgcggtcctcgctctcgggcggggcggcg gcgacgcggacctgcggactagcgaacccggaggaggaaggaccctctgctaaatgtggc gagaatggtagtgcagagaggactgaccttgggggaaacaagtacaaccaggagaggata caaatagagtatgtggaggtgctgtttgcagatttctttagagaagtttttgcgatatgt ggaagctgtgatgctttgggaaactggaatcctcaaaatgctgtggctcttcttccagag aatgacacaggtgaaagcatgctatggaaagcaaccattgtactcagtagaggagtatca gttcagtatcgctacttcaaaggggtgaagctgacactagaaggcctggaggaagatgac gatgatagggtatctcccactgtactccacaaaatgtccaatagcttggagatatcctta ataagcgacaatgagttcaagtgcaggcattcacagccggagtgtggttatggcttgcag cctgatcgttggacagagtacagcatacagacgatggaaccagataacctggaactaatc tttgattttttcgaagaagatctcagtgagcacgtagttcagggtgatgcccttcctgga catgtgggtacagcttgtctcttatcatccaccattgctgagagtggaaagagtgctgga attcttactcttcccatcatgagcagaaattcccggaaaacaataggcaaagtgagagtt gactatataattattaagccattaccaggatacagttgtgacatgaaatcttcattttcc aagtattggaagccaagaataccattggatgttggccatcgaggtgcaggaaactctaca acaactgcccagctggctaaagttcaagaaaatactattgcttctttaagaaatgctgct agtcatggtgcagcctttgtagaatttgacgtacacctttcaaaggactttgtgcccgtg gtatatcatgatcttacctgttgtttgactatgaaaaagaaatttgatgctgatccagtt gaattatttgaaattccagtaaaagaattaacatttgaccaactccagttgttaaagctc actcatgtgactgcactgaaatctaaggatcggaaagaatctgtggttcaggaggaaaat tccttttcagaaaatcagccatttccttctcttaagatggatggaatgtgggatggtaac ttatcaacatattttgacatgaatctgtttttggatataattttaaaaactgttttagaa aattctgggaagaggagaatagtgttttcttcatttgatgcagatatttgcacaatggtt cggcaaaagcagaacaaatatccgatactatttttaactcaaggaaaatctgagatttat cctgaactcatggacctcagatctcggacaacccccattgcaatgagctttgcacagttt gaaaatctactggggataaatgtacatactgaagacttgctcagaaacccatcctatatt caagaggcaaaagctaagggactagtcatattctgctggggtgatgataccaatgatcct gaaaacagaaggaaattgaaggaacttggagttaatggtctaatttatgataggatatat gattggatgcctgaacaaccaaatatattccaagtggagcaattggaacgcctgaagcag gaattgccagagcttaagagctgtttgtgtcccactgttagccgctttgttccctcatct ttgtgtggggagtctgatatccatgtggatgccaacggcattgataacgtggagaatgct tag