GENSCAN 1.0 Date run: 3-Nov-116 Time: 06:21:44 Sequence gi568815595r:122810146_123061263 : 251118 bp : 44.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6594 6699 106 2 1 103 45 68 0.047 3.79 1.02 Intr + 16747 16920 174 2 0 88 91 22 0.156 2.41 1.03 Intr + 23172 23301 130 1 1 113 94 -35 0.007 -0.65 1.04 Intr + 35618 35726 109 2 1 92 108 28 0.350 5.49 1.05 Intr + 46162 46229 68 0 2 132 100 35 0.446 6.80 1.06 Intr + 55009 55075 67 1 1 50 64 89 0.283 1.61 1.07 Intr + 64477 64560 84 0 0 76 72 41 0.169 1.32 1.08 Term + 81133 81186 54 0 0 107 48 94 0.302 4.86 1.09 PlyA + 81990 81995 6 1.05 2.04 PlyA - 83081 83076 6 1.05 2.03 Term - 88461 88405 57 0 0 91 41 79 0.486 1.19 2.02 Intr - 93501 93365 137 1 2 80 -23 114 0.464 -0.21 2.01 Init - 95410 95251 160 1 1 40 72 113 0.562 4.89 2.00 Prom - 97144 97105 40 -3.96 3.23 PlyA - 98599 98594 6 1.05 3.22 Term - 100156 99998 159 1 0 129 43 200 0.995 17.54 3.21 Intr - 100900 100695 206 2 2 132 81 122 0.983 14.82 3.20 Intr - 101390 101346 45 0 0 97 98 50 0.877 5.28 3.19 Intr - 101924 101775 150 0 0 63 94 86 0.724 6.83 3.18 Intr - 102916 102698 219 2 0 18 91 296 0.729 21.17 3.17 Intr - 103279 103054 226 1 1 73 32 288 0.733 19.26 3.16 Intr - 103536 103389 148 2 1 88 65 141 0.974 12.04 3.15 Intr - 103856 103713 144 1 0 83 89 176 0.981 16.60 3.14 Intr - 105476 105295 182 2 2 87 91 126 0.991 11.47 3.13 Intr - 105745 105628 118 0 1 52 39 155 0.986 7.47 3.12 Intr - 111977 111770 208 0 1 90 86 383 0.930 36.44 3.11 Intr - 112302 112095 208 0 1 35 105 397 0.997 34.75 3.10 Intr - 113607 113472 136 2 1 105 92 142 0.998 16.87 3.09 Intr - 116532 116247 286 1 1 104 67 485 0.989 44.40 3.08 Intr - 117858 117645 214 0 1 98 91 320 0.995 31.59 3.07 Intr - 118470 118372 99 0 0 104 80 125 0.983 13.61 3.06 Intr - 118913 118851 63 2 0 114 94 19 0.900 4.01 3.05 Intr - 120522 120423 100 2 1 61 -37 113 0.387 -3.79 3.04 Intr - 123109 123021 89 1 2 83 121 43 0.566 5.87 3.03 Intr - 129325 129280 46 0 1 88 96 -5 0.506 -1.29 3.02 Intr - 133390 133291 100 2 1 77 92 110 0.816 9.47 3.01 Init - 138514 138361 154 1 1 74 116 252 0.999 24.74 3.00 Prom - 138939 138900 40 -3.26 4.00 Prom + 141191 141230 40 -6.16 4.01 Init + 152189 152386 198 1 0 92 66 128 0.217 9.90 4.02 Intr + 154090 154153 64 0 1 120 61 0 0.036 -1.21 4.03 Intr + 171215 171307 93 0 0 50 91 50 0.278 1.44 4.04 Intr + 178554 178725 172 1 1 103 107 -3 0.386 2.10 4.05 Intr + 183656 183765 110 2 2 117 30 89 0.573 5.93 4.06 Intr + 187535 187596 62 0 2 131 66 6 0.379 1.15 4.07 Term + 189527 189658 132 1 0 121 48 146 0.989 11.99 4.08 PlyA + 189770 189775 6 1.05 5.00 Prom + 190313 190352 40 -3.96 5.01 Init + 205767 205933 167 2 2 95 27 79 0.552 1.64 5.02 Intr + 208655 208777 123 2 0 74 65 50 0.524 1.00 5.03 Intr + 210757 210877 121 1 1 100 103 7 0.885 3.90 5.04 Intr + 212276 212433 158 1 2 24 32 165 0.425 3.31 5.05 Intr + 213335 213389 55 2 1 62 87 8 0.428 -2.92 5.06 Intr + 213963 214061 99 2 0 64 61 82 0.823 3.41 5.07 Term + 217000 217845 846 0 0 148 47 213 0.690 16.08 5.08 PlyA + 218706 218711 6 1.05 6.00 Prom + 221156 221195 40 -3.76 6.01 Init + 230707 230789 83 0 2 80 4 101 0.130 1.34 6.02 Term + 243586 243709 124 1 1 85 32 208 0.989 12.76 6.03 PlyA + 244333 244338 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595r:122810146_123061263|GENSCAN_predicted_peptide_1|263_aa GDDLDSIVNSQFISTVVTLHPQEEAEQGEQFCLHGDERATATAIASMLSYLGGACAFLVG PLVVPAPNGTSPLLAAESSRAHIKDRIEAVLYAEFGVVCLIFSATLAYFPPRPPLPPSVA AASQRLSYRRSVCRLLSNFRFLMIALAYAIPLGVFAGWSGVLDLILTPAHVSQVDAGWIG FWSIVGGCVVGIAMARQLKEEKVEYGLKEENGFVKNLNIQGVHVQDCYMGILHDNEVWDM NDPITQSQVTLDASDSFEEGIKK >gi568815595r:122810146_123061263|GENSCAN_predicted_CDS_1|792_bp ggagatgaccttgacagtatagtcaacagccagttcatcagcacagtggtcacattgcac ccacaggaagaagcagagcagggagaacagttctgtcttcatggagatgaaagggccaca gccacagctattgcatcaatgctcagttatcttgggggagcatgtgcatttttagttgga ccacttgttgttccagctcccaatgggacatcacctcttcttgctgcagagagcagcagg gcgcatattaaagatcgcatagaggctgtgttatatgcagaatttggagttgtctgctta atattttctgcaacactagcttatttcccaccccgacctcctcttcctcccagtgttgct gcagctagccagcggctgagttatcggagaagcgtttgtagattattaagcaattttcga tttttgatgattgctttagcatatgccataccacttggtgtatttgctggctggtctgga gttctggacttaattttaacaccagcgcatgtcagccaagtagatgctggctggattgga ttttggtccatagttggaggctgtgttgttggaatagctatggcaagacagctcaaggaa gaaaaagtagaatatggattaaaagaagagaatggatttgtgaaaaatcttaatattcaa ggagtacatgtgcaggattgttacatgggtatattgcatgacaatgaggtttgggacatg aatgatcccatcacccagtcccaggtgacacttgatgcctctgactcctttgaagagggc atcaagaagtga >gi568815595r:122810146_123061263|GENSCAN_predicted_peptide_2|117_aa MCIQKKGDLTQEGGEEGLKPGSSPSGQPIQIGIGGWYAPRRIQEKMELIGYVAGTCEYVF LGQPTGRALPLRLAAFQKGIFQKRRDIPPFLPNDVSVTWRKEEFDTFVFAFLGHIQK >gi568815595r:122810146_123061263|GENSCAN_predicted_CDS_2|354_bp atgtgcatccagaagaaaggggacctaacacaggaagggggtgaagaggggctgaagcca ggaagcagtcctagtgggcaaccaattcagattggtattggaggatggtatgcaccccgc aggatccaggagaaaatggaactgataggttatgtggcaggaacctgtgaatatgtcttc cttggacagccaactggacgagccttgcctctgagacttgcagcattccaaaaggggatc ttccagaagaggagagacatcccgccatttctaccaaacgatgtgtcagtcacctggaga aaagaggagtttgacacatttgtgtttgctttcctgggccacatccagaaataa >gi568815595r:122810146_123061263|GENSCAN_predicted_peptide_3|1099_aa MVLAGPLAVSLLLPSLTLLVSHLSSSQDVSSEPSSEQQLCALSKHPTVAFEDLQPWVSNF TYPGARDFSQLALDPSGNQLIVGARNYLFRLSLANVSLLQEDTGDVFHQNKRINQERGKH AIRKAGEERSPLFNGIKADPFGVLLCSETRGPFAHAVLAEPPTATEWASSEDTRRSCQSK GKTEEECQNYVRVLIVAGRKVFMCGTNAFSPMCTSRQVGNLSRTIEKINGVARCPYDPRH NSTAVISSQGELYAATVIDFSGRDPAIYRSLGSGPPLRTAQYNSKWLNEPNFVAAYDIGL FAYFFLRENAVEHDCGRTVYSRVARVCKNDVGGRFLLEDTWTTFMKARLNCSRPGEVPFY YNELQSAFHLPEQDLIYGVFTTNVNSIAASAVCAFNLSAISQAFNGPFRYQENPRAAWLP IANPIPNFQCGTLPETGPNENLTERSLQDAQRLFLMSEAVQPVTPEPCVTQDSVRFSHLV VDLVQAKDTLYHVLYIGTESGTILKALSTASRSLHGCYLEELHVLPPGRREPLRSLRILH SARALFVGLRDGVLRVPLERCAAYRSQGACLGARDPYCGWDGKQQRCSTLEDSSNMSLWT QNITACPVRNVTRDGGFGPWSPWQPCEHLDGDNSGSCLCRARSCDSPRPRCGGLDCLGPA IHIANCSRNGAWTPWSSWALCSTSCGIGFQVRQRSCSNPAPRHGGRICVGKSREERFCNE NTPCPVPIFWASWGSWSKCSSNCGGGMQSRRRACENGNSCLGCGVEFKTCNPEGCPEVRR NTPWTPWLPVNVTQGGARQEQRFRFTCRAPLADPHGLQFGRRRTETRTCPADGSGSCDTD ALVEVLLRSGSTSPHTVSGGWAAWGPWSSCSRDCELGFRVRKRTCTNPEPRNGGLPCVGD AAEYQDCNPQACPEGWSPWSEWSKCTDDGAQSRSRHCEELLPGSSACAGNSSQSRPCPYS EIPVILPASSMEEATDCAGFNLIHLVATGISCFLGSGLLTLAVYLSCQHCQRQSQESTLV HPATPNHLHYKGGGTPKNEKYTPMEFKTLNKNNLIPDDRANFYPLQQTNVYTTTYYPSPL NKHSFRPEASPGQRCFPNS >gi568815595r:122810146_123061263|GENSCAN_predicted_CDS_3|3300_bp atggtgcttgcaggccccctggctgtctcgctgttgctgcccagcctcacactgctggtg tcccacctctccagctcccaggatgtctccagtgagcccagcagtgagcagcagctgtgc gcccttagcaagcaccccaccgtggcctttgaagacctgcagccgtgggtctctaacttc acctaccctggagcccgggatttctcccagctggctttggacccctccgggaaccagctc atcgtgggagccaggaactacctcttcagactcagccttgccaatgtctctcttcttcag gaagatactggagatgtgttccaccaaaataagcgaataaatcaagaaagagggaaacat gcgattcggaaagcaggagaggaaagaagccccctcttcaatggaatcaaggctgacccc tttggtgtgctcctctgctcagagactcgaggaccctttgctcatgcggtcctcgcagag ccgcccaccgccacagagtgggcctccagtgaggacacgcgccgctcctgccaaagcaaa gggaagactgaggaggagtgtcagaactacgtgcgagtcctgatcgtcgccggccggaag gtgttcatgtgtggaaccaatgccttttcccccatgtgcaccagcagacaggtggggaac ctcagccggactattgagaagatcaatggtgtggcccgctgcccctatgacccacgccac aactccacagctgtcatctcctcccagggggagctctatgcagccacggtcatcgacttc tcaggtcgggaccctgccatctaccgcagcctgggcagtgggccaccgcttcgcactgcc caatataactccaagtggcttaatgagccaaacttcgtggcagcctatgatattgggctg tttgcatacttcttcctgcgggagaacgcagtggagcacgactgtggacgcaccgtgtac tctcgcgtggcccgcgtgtgcaagaatgacgtggggggccgattcctgctggaggacaca tggaccacattcatgaaggcccggctcaactgctcccgcccgggcgaggtccccttctac tataacgagctgcagagtgccttccacttgccggagcaggacctcatctatggagttttc acaaccaacgtaaacagcatcgcggcttctgctgtctgcgccttcaacctcagtgctatc tcccaggctttcaatggcccatttcgctaccaggagaaccccagggctgcctggctcccc atagccaaccccatccccaatttccagtgtggcaccctgcctgagaccggtcccaacgag aacctgacggagcgcagcctgcaggacgcgcagcgcctcttcctgatgagcgaggccgtg cagccggtgacacccgagccctgtgtcacccaggacagcgtgcgcttctcacacctcgtg gtggacctggtgcaggctaaagacacgctctaccatgtactctacattggcaccgagtcg ggcaccatcctgaaggcgctgtccacggcgagccgcagcctccacggctgctacctggag gagctgcacgtgctgccccccgggcgccgcgagcccctgcgcagcctgcgcatcctgcac agcgcccgcgcgctcttcgtggggctgagagacggcgtcctgcgggtcccactggagagg tgcgccgcctaccgcagccagggggcatgcctgggggcccgggacccgtactgtggctgg gacgggaagcagcaacgttgcagcacactcgaggacagctccaacatgagcctctggacc cagaacatcaccgcctgtcctgtgcggaatgtgacacgggatgggggcttcggcccatgg tcaccatggcaaccatgtgagcacttggatggggacaactcaggctcttgcctgtgtcga gctcgatcctgtgattcccctcgaccccgctgtgggggccttgactgcctggggccagcc atccacatcgccaactgctccaggaatggggcgtggaccccgtggtcatcgtgggcgctg tgcagcacgtcctgtggcatcggcttccaggtccgccagcgaagttgcagcaaccctgct ccccgccacgggggccgcatctgcgtgggcaagagccgggaggaacggttctgtaatgag aacacgccttgcccggtgcccatcttctgggcttcctggggctcctggagcaagtgcagc agcaactgtggagggggcatgcagtcgcggcgtcgggcctgcgagaacggcaactcctgc ctgggctgcggcgtggagttcaagacgtgcaaccccgagggctgccccgaagtgcggcgc aacaccccctggacgccgtggctgcccgtgaacgtgacgcagggcggggcacggcaggag cagcggttccgcttcacctgccgcgcgccccttgcagacccgcacggcctgcagttcggc aggagaaggaccgagacgaggacctgtcccgcggacggctccggctcctgcgacaccgac gccctggtggaggtcctcctgcgcagcgggagcacctccccgcacacggtgagcgggggc tgggccgcctggggcccgtggtcgtcctgctcccgggactgcgagctgggcttccgcgtc cgcaagagaacgtgcactaacccggagccccgcaacgggggcctgccctgcgtgggcgat gctgccgagtaccaggactgcaacccccaggcttgcccagaaggctggtcgccctggtct gagtggagtaagtgcactgacgacggagcccagagccgaagccggcactgtgaggagctc ctcccagggtccagcgcctgtgctggaaacagcagccagagccgcccctgcccctacagc gagattcccgtcatcctgccagcctccagcatggaggaggccaccgactgtgcagggttc aatctcatccacttggtggccacgggcatctcctgcttcttgggctctgggctcctgacc ctagcagtgtacctgtcttgccagcactgccagcgtcagtcccaggagtccacactggtc catcctgccacccccaaccatttgcactacaagggcggaggcaccccgaagaatgaaaag tacacacccatggaattcaagaccctgaacaagaataacttgatccctgatgacagagcc aacttctacccattgcagcagaccaatgtgtacacgactacttactacccaagccccctg aacaaacacagcttccggcccgaggcctcacctggacaacggtgcttccccaacagctga >gi568815595r:122810146_123061263|GENSCAN_predicted_peptide_4|276_aa MATWSQGRRSYCWRNIGKPDMQARRVVEREEQRCGNDRHRKGEQGLSREKEDGHKMERGD CGKNFQATHCRVLAPVPQWIPTPDGMAGLLHVPQGKDVTPALVHVWPGEFDSFLGKPTGL GVGPGKLCWENGQKNQLISPHLGSHAREGYHTSISLVLLETGSSSKPETHHLALICCCCQ QSAPPAEYSWGTTLEEAFHEIVSGIAPPFPSPAKMLHLCIMPVPCFTQKGKVKGGAVMQT GPMEREAGSMKTASIWFHADTWSQLDILQHNLWSGA >gi568815595r:122810146_123061263|GENSCAN_predicted_CDS_4|831_bp atggccacatggtctcagggaagaaggagctactgctggaggaacatcgggaagcccgac atgcaggcccggcgggtggtggagagggaggagcagagatgtggaaatgacaggcacagg aaaggagaacagggtctgtcccgtgagaaagaggacggccacaaaatggaaaggggagat tgcgggaagaacttccaggcaacccattgtcgagtcctggcacctgtacctcagtggatc ccaaccccagatgggatggcagggctgctccatgttccacaagggaaagacgtgacccca gccttggttcatgtgtggccaggcgaatttgacagcttcctgggaaaaccaacaggcctt ggggtggggcctggaaagctctgctgggagaatggccagaaaaaccaattaatctcgcct catctgggctctcatgcccgtgagggctaccacacctcaatcagccttgtcctcctcgag actgggagctcctcgaagccagagacacatcacctagctctaatctgctgctgctgtcag cagtcagctccccctgctgaatactcttggggaacaactttggaagaagcatttcacgaa atagtcagcggcatcgcccctccttttcccagcccagcaaagatgctgcatctctgcata atgcctgtgccctgtttcactcagaagggaaaagtcaagggaggagcagtcatgcagaca ggaccaatggagcgggaggctggctccatgaagactgccagcatctggttccacgctgat acttggtcccaactggacatcctgcagcacaatctctggagtggggcctaa >gi568815595r:122810146_123061263|GENSCAN_predicted_peptide_5|522_aa MRCLGLKTVHLWLRQIKLCADTGTHRPQKIRFGETQVHWREEPAHAHRIGSQEDEGLIFH TGKVAPKFFQLAGEAVIGIRHNLDLLSCKACWNLGSWAQQESEESLTLEPGELLKGSMCP KAPGLCRNITLSLGDIHNNEDSVMAGFSTYSIAFKAFMYSGKARWNPCPSISSQRVEENQ QTNCQEESSRAPDAQISRSLALHLFTPQSSMKVVNWLAGNELLTQIIGYKRVTKTKHGSV MAWVPGPHRRCSPDAPTTALAGPSATRTQAGWRRERSSDEGSQAEVAAGGAEARGPPVLP PAASPTWLQLARARTPTSLPRYPPELLKFASCSPNPRRGSARGSRDSPPRGGCRCPVPRG SRLPGGAERRELTRWAGGRSGNGAAAQERPPALSAPVQSPPGPAPAALGLPAAAGCAADP PGSAELGSGTNPRAPTSSRPGARREQGGGDRGSRRREGAVGEELQAKGEKEGSGSEKEEG AGADLPRRALHPPASRRAEEREGAPCIPRLLGRLPPRACRVW >gi568815595r:122810146_123061263|GENSCAN_predicted_CDS_5|1569_bp atgcgatgtctgggcctcaagactgttcatctatggcttcggcagattaaactgtgtgct gacacagggactcataggcctcagaagatcaggtttggtgagacccaggtccactggagg gaagaacctgctcatgctcacagaatagggtcccaggaggatgagggccttatattccat actgggaaagtagcacccaaattcttccaactggctggagaagctgtaataggaatacgt cataacttggacctgctcagctgcaaggcatgctggaatcttggcagctgggcccagcag gaaagtgaggaaagtctcactctggaacctggagagctgttgaaagggagcatgtgcccc aaggcacctggtctatgtagaaacattaccctcagccttggggacattcataacaatgag gacagtgtcatggcgggattcagcacatacagcattgcctttaaagctttcatgtactca gggaaagcccgatggaatccttgtccttctatttcaagtcagagagtggaggagaatcag cagaccaactgccaagaagagagctcaagagctccagatgcccagatctccaggagcctg gctttgcatctatttacaccacagtcatcaatgaaagtggtgaactggttggctggtaat gagttgctaactcagatcattggatataagagggtgaccaagactaagcacgggagtgtt atggcctgggtcccgggtccccatcgccgctgcagcccagatgcgccgaccacagccctg gcagggcccagtgcaacccggacccaagctggctggcggcgggaacgcagcagcgatgag ggctctcaagccgaggtggcagcgggaggcgcggaagccaggggaccccctgtcctcccg cccgcagcctcacctacctggcttcaacttgccagagcccggacccctacctctctaccc cgatacccacccgagttgctgaagtttgcaagttgcagccccaacccgcgcaggggaagc gcccggggctcccgggactcacctcccagaggcgggtgcaggtgcccggttccccgcggc tcccgactcccgggcggcgcggagcggcgggagctaaccagatgggctggcggccggtct gggaatggggcggccgcgcaggaacggcctccagctctcagcgctccggtgcagtcccca cccgggcccgcgcccgctgcgctcgggctcccagcagccgctggatgcgctgccgacccg cccggctcggcggagctgggctctggcaccaacccccgcgctccaactagctcccgaccc ggcgctcggagagagcagggaggaggagaccgcgggagcagaaggagggagggggcggtg ggggaggagttgcaagcaaagggagagaaggaggggagcgggagcgaaaaagaggagggg gctggagcagacctgccgcggcgggcgctgcacccgccagcctcgaggcgcgcggaggag cgcgagggggcgccctgcatcccgcggctgctcggccgactcccgccccgggcctgccga gtgtggtga >gi568815595r:122810146_123061263|GENSCAN_predicted_peptide_6|68_aa MKKKKDKEEVEKKRKRKGKERKEEEDTDMWFEKKLLFSSSPVVIVIDIVVTIIIIVNIII IITNPYRA >gi568815595r:122810146_123061263|GENSCAN_predicted_CDS_6|207_bp atgaagaagaagaaggataaggaggaagtggaaaagaaaagaaaaaggaaaggaaaggaa agaaaagaagaggaggacacagacatgtggttcgagaagaagttactcttttcctcctcc cctgtggtcattgtcattgacattgttgtcaccatcatcatcatcgtcaacatcattatc atcataactaacccatacagagcctaa