GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:51:28

Sequence gi568815581f:30732348_30994910 : 262563 bp : 43.69% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 Intr -  52096  51980  117  2  0   98   38   114 0.827   7.84
 1.04 Intr -  53684  53572  113  1  2   81   95    62 0.955   6.22
 1.03 Intr -  65051  64964   88  0  1   69   98   108 0.540   8.93
 1.02 Intr -  71761  71554  208  1  1  100   81   170 0.983  16.15
 1.01 Init -  92292  92176  117  0  0   96   52   135 0.748  10.90
 1.00 Prom -  95433  95394   40                             -10.35

 2.00 Prom +  97170  97209   40                              -7.26
 2.01 Init +  98622  98753  132  2  0   37   63   236 0.531  16.14
 2.02 Intr + 101801 103701 1901  1  2   44   82   552 0.547  37.53
 2.03 Intr + 104859 104967  109  0  1   84   86    22 0.978   1.89
 2.04 Intr + 108270 108434  165  2  0   81  100    51 0.943   5.76
 2.05 Intr + 122821 122980  160  0  1    4  115   111 0.027   4.96
 2.06 Intr + 128083 128265  183  2  0   31   71   104 0.642   2.76
 2.07 Intr + 136901 137043  143  0  2   57   96   110 0.955   8.77
 2.08 Intr + 137149 137299  151  0  1   99  108   -30 0.642  -0.16
 2.09 Intr + 144027 144203  177  1  0   57  116    25 0.623   2.09
 2.10 Intr + 147076 147140   65  2  2   66  119    41 0.635   3.44
 2.11 Term + 160947 161807  861  2  0   67   41   161 0.106   2.03
 2.12 PlyA + 161839 161844    6                               1.05

 3.05 PlyA - 162389 162384    6                               1.05
 3.04 Term - 167259 166822  438  0  0   49   39   224 0.262   8.88
 3.03 Intr - 172090 171719  372  1  0  -12   71   178 0.124   1.36
 3.02 Intr - 174273 174128  146  1  2   89   62    92 0.775   6.70
 3.01 Init - 177825 177777   49  0  1   86   58    20 0.750  -2.09
 3.00 Prom - 178461 178422   40                              -3.76

 4.00 Prom + 189227 189266   40                              -4.66
 4.01 Init + 189668 189761   94  1  1   95  110   226 0.999  26.04
 4.02 Intr + 190593 190723  131  1  2  119   60   138 0.980  14.51
 4.03 Intr + 194480 194571   92  1  2   76   76   117 0.503   8.09
 4.04 Intr + 199542 199621   80  0  2   93  103    39 0.631   5.09
 4.05 Intr + 201838 201950  113  2  2   64  110    79 0.948   7.80
 4.06 Intr + 212560 212706  147  0  0  136  100   134 0.995  19.83
 4.07 Intr + 216940 217023   84  0  0   99   83    43 0.876   4.92
 4.08 Intr + 220941 221003   63  2  0   95   77    37 0.717   2.21
 4.09 Intr + 221856 221919   64  2  1   91   40    37 0.469  -2.51
 4.10 Intr + 222087 222208  122  1  2   58   99   103 0.677   8.61
 4.11 Intr + 223894 224122  229  0  1  139   46   288 0.757  27.14
 4.12 Term + 225438 225739  302  1  2  120   47    85 0.495   2.98
 4.13 PlyA + 226822 226827    6                               1.05

 5.00 Prom + 228841 228880   40                              -4.26
 5.01 Init + 238727 239098  372  1  0   91   53   405 0.919  32.26
 5.02 Intr + 252231 252413  183  2  0   22  102   218 0.504  16.58
 5.03 Intr + 255597 255759  163  2  1   30   84   122 0.553   5.55

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  63990  63813  178  2  1   58   95    69 0.815   4.52

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:30732348_30994910|GENSCAN_predicted_peptide_1|215_aa
MELEPELLLQEARENVEAAQSYRRELGHRLEGLREARRQIKESASQTRDVLKQHFNDLKG
TLGKLLDERLVTLLQEVDTIEQETIKPLDDCQKLIEHGVNTAEDLVREGEIAMLGGVGEE
NEKLWSFTKKASHIQLDRVETVGQPDRRDSIGVCAEKQDGYDSLQRDQAVCISTNGAVFV
NGKEMTNQLPAVTSGSTVTFDIEAVTLGTTSNNEX

>gi568815581f:30732348_30994910|GENSCAN_predicted_CDS_1|645_bp
atggagctggagcctgagctgctgttgcaggaggcccgcgagaacgtggaggcagcgcag
agctaccggcgggagctgggtcaccggcttgaggggctgcgtgaggcgcggaggcagatc
aaagaaagtgcatcacagacaagggatgttctcaaacagcattttaatgatttaaaggga
acccttggaaagctcctggatgagcgattggtgacccttttgcaagaggtggacaccatt
gaacaggagaccattaaaccactagatgactgccagaagctcatagaacacggagtcaac
actgcagaggacttagtccgagaaggtgaaatcgccatgcttggtggtgtgggagaagag
aatgagaaactgtggagctttaccaaaaaggcctcgcacattcagttggacagagttgaa
actgtgggacagccagacagaagagatagcataggagtgtgtgcagaaaaacaggatgga
tatgactctctgcagcgggatcaagctgtgtgcattagtacaaatggtgcagtttttgtc
aatggaaaagaaatgacaaatcagttacccgcagttacttctgggtccactgtcacgttt
gacattgaagccgtgactctaggaaccaccagtaataatgaagnn

>gi568815581f:30732348_30994910|GENSCAN_predicted_peptide_2|1348_aa
MRLGKDFHTNKRVCEEIAIIPSKKPRNKIAGYVTHLIKRIQRGPPCKKRKKDDDTSTCKT
ITKYLSPLGKTRDRVFAPPKPSNILDYFRKTSPTNEKTQLGKECKIKSPESVPVDSNKDC TTPLEMFSNVEFKKKRKRVNLSHQLNNIKTENEAPIEISSDDSKEDYSLNNDFVESSTSV LRYKKQVEVLAENIQDTKSQPNTMTSLQNSKKVNPKQGTTKNDFKKLRKRKCRDVVDLSE SLPLAEELNLLKKDGKDTKQMENTTSHANSRDNVTEAAQLNDSIITVSYEEFLKSHKENK VEEIPDSTMSICVPSETVDEIVKSGYISESENSEISQQVRFKTVTVLAQVHPIPPKKTGK IPRIFLKQKQFEMENSLSDPENEQTVQKRKSNVVIQEEELELAVLEAGSSEAVKPKCTLE ERQQFMKAFRQPASDALKNGVKKSSDKQKDLNEKCLYEVGRDDNSKKIMENSGIQMVSKN GNLQLHTDKGSFLKEKNKKLKKKNKKTLDTGAIPGKNREGNTQKKETTFFLKEKQYQNRM SLRQRKTEFFKSSTLFNNESLVYEDIANDDLLKVSSLCNNNKLSRKTSIPVKDIKLTQSK AESEASLLNVSTPKSTRRSGRISSTPTTETIRGIDSDDVQDNSQLKASTQKAANLSEKHS LYTAELITVPFDSESPIRMKFTRISTPKKSKKKSNKRSEKSEATDGGFTSQIRKASNTSK NISKAKQLIEKAKALHISRSKVTEEIAIPLRRSSRHQTLPERKKLSETEDCDVQCKAKRD FLMSGLPDLLKRQIAKKAAALDVYNAVSTSFQRVVHVQQKDDAELEADVSHKETKRKLVE AENSKSKRKKPNEYSKNLEKTNRKSEELSKRNNSSGIKLDSSKDFSGGIDFKGSSDDEEE SRLCNTVLITGPTGVGKTAAVYACAQELGFKIFEVNASSQRSGRQILSQLKEATQSHQVD KQGVNSQKPCFFNSYYIGKSPKKISSPKKVVTSPRKVPPPSPKSSGPKRALPPKTLANYF KVSPKPKNNEEIGMLLENNKDPTFSLMFDGCFEEIKFSTPSLHKITMKEEWHKFIQLLTE FQMRNVDFLYSNLEFILPLPVDTIPETKNFCGPSVTVDASAATKSMNCLARKHSEREQPL KKSQKKKQKKTLVILDDSDLFDTDLDFPDQSISLSSVSSSSNAEESKTGDEESKARDKGN NPETKKSIPCPPKTTAGKKCSALVSHCLNSLSEFMDNMSFLDALLTDVREQNKYGRNDFS WTNGKVTSGLCDEFSLESNDGWTSQSSGELKAAAEALSFTKCSSAISKALETLNSCKKLG RDPTNDLTFYVSQKRNNVYFSQSAANLE >gi568815581f:30732348_30994910|GENSCAN_predicted_CDS_2|4047_bp atgcgcctgggcaaagacttccacacgaacaagcgcgtgtgcgaggaaatcgccattatt cccagcaagaagccccgcaacaagatagcaggctatgtcacgcatctgataaagcggatt cagagggggccaccatgcaaaaagcgaaagaaagatgatgacacatctacctgcaaaaca attacaaaatatttatcaccactagggaagactagagacagggtttttgctccaccaaaa cctagtaatattctggattattttagaaagacttcacccacaaatgagaagacacaatta gggaaagagtgcaagataaagtcacctgaatcagtacctgttgacagcaacaaagactgt acgacacctttggaaatgttctcaaatgtagagtttaagaagaaaagaaagagggttaat ttatctcatcaactaaataatattaaaactgaaaatgaagctccaattgaaattagtagc gacgatagcaaagaagactatagtttaaataatgattttgtggaaagtagtacttctgtt ttacgttacaagaaacaagtagaggtacttgcagaaaacattcaagatacaaaaagtcaa 
ccaaatactatgacctccctgcaaaattctaaaaaagtaaatcctaaacaagggaccaca aaaaatgacttcaaaaagttgagaaaaaggaaatgcagagatgtagtagatctatctgaa agcttacccttggcagaggaactaaatttgcttaaaaaagatggtaaagatactaaacag atggagaatactacaagccatgcaaactctagagataacgtaactgaagcagcccagtta aatgatagtataataactgtctcatatgaggaatttttaaaaagtcacaaggaaaataaa gtggaagagataccagactctacaatgtcaatttgtgttccttctgaaactgtcgacgaa atagtcaaaagtggttatataagtgaatcagaaaactccgaaatttcccagcaggtacgc tttaagacagttactgttcttgcacaggttcaccctattccgcccaaaaagacagggaaa ataccccgaattttcttgaaacaaaagcaatttgaaatggaaaatagtttatctgatcct gagaatgaacagacagttcagaaaagaaaatctaatgttgttatacaggaggaagaatta gaattggctgttttggaagctggaagttctgaagctgtgaaaccaaaatgcactctagaa gaaagacagcaatttatgaaagcatttaggcagccagcatcagatgcacttaaaaatgga gttaaaaagtcttctgataagcagaaagaccttaatgaaaaatgtctatatgaagtagga agagatgataattctaaaaaaatcatggaaaattctggtatccaaatggtttcaaaaaat ggcaatttacagttacacactgataaaggaagttttctgaaggagaaaaataaaaagcta aagaagaagaataagaaaacattagatactggggctattccaggcaaaaacagagaggga aacactcaaaagaaagaaacaacctttttcttaaaagagaaacaatatcaaaatagaatg agtttaagacaaaggaaaacagagtttttcaaaagcagcactttatttaacaatgaaagt cttgtttatgaagatatagcaaatgatgaccttctaaaggtttcctctctgtgtaacaat aataaattgtcaagaaaaaccagcataccagttaaagatattaagcttacacagtctaaa gctgaatctgaagccagcttgctaaatgtttccacgcccaagtcaactagaagatctgga agaattagcagcacacctactacagaaaccattagaggtattgattctgacgatgtacaa gataatagtcaactaaaggcttccactcaaaaagcagccaacttatcggaaaagcacagc ttatatacagcagaattaataacagtaccctttgattcagagagccctattagaatgaaa ttcaccagaattagtactcccaaaaaatctaagaaaaaatctaacaaaagatctgagaaa tctgaagcaactgatggaggttttacttctcagattagaaaggcaagcaatacttcaaaa aacatatcaaaagcaaaacaattgattgaaaaagcaaaagctttacacatcagtaggtca aaggtgactgaagaaatagcgatacccttaaggcgctcctctagacatcagacacttcct gaaaggaagaaattgtcagaaacagaagattgtgatgttcaatgtaaagcaaagcgtgac ttcctaatgagtggtttgccagatttgttgaaacggcaaattgcaaagaaagctgctgcg ctggatgtgtacaatgcagtgagtaccagtttccagagagtcgtacatgtgcaacaaaag gatgatgcagaactggaggctgatgtcagccataaagaaaccaaaaggaaactcgtagaa 
gcagaaaattctaagtcaaaaagaaagaaaccaaatgagtattcaaaaaatctggagaag accaataggaagtcagaagaacttagcaaaagaaacaactcttctgggataaagctagat tcttccaaagatttctcgggtggcatagactttaaaggcagttcagatgatgaagaagag agtcgtctttgcaatactgtccttataacagggccaacaggagtgggaaaaactgctgca gtgtatgcttgtgcccaggagcttggatttaagatatttgaagtgaatgcctcttcccag cgcagtggtagacaaattctatctcagttgaaggaagctactcagtcccatcaagtagac aaacaaggtgtaaactcacaaaaaccctgtttttttaatagctactacataggcaagtca ccaaaaaaaataagctcccctaagaaagttgttacatcaccaagaaaagttcctccacca tcaccaaaaagtagtggaccaaagcgagcacttcctcccaaaaccttggcaaattatttt aaagtatctcccaaacctaaaaataatgaagaaataggaatgcttctggaaaataataaa gacccaacatttagtttaatgtttgatggctgctttgaagaaatcaagttcagtactcct tccctgcacaaaatcacaatgaaggaagaatggcataaattcatccagcttcttacagaa ttccaaatgcggaatgtagattttttatatagtaatcttgagtttattctaccattacca gttgataccattccagaaactaaaaacttttgtggcccatcagtaactgtggatgccagt gcagcaacaaaaagtatgaattgtcttgctaggaaacactctgaaagagaacagccattg aaaaagtcccagaaaaagaaacaaaagaaaacattggtaatattagatgatagtgatcta tttgacactgacttggactttcctgatcaatctattagcctgtcctctgtatcatcttcc tcaaatgcagaagaaagcaaaaccggagacgaagaaagcaaagccagagacaaaggaaac aatccagagacaaagaaatctattccttgtcctcctaaaacaactgcaggaaaaaaatgt tctgcccttgtttctcattgtttaaattctctctctgagttcatggataacatgtccttc ttagatgcacttttaactgatgtaagggaacaaaacaaatacggtagaaatgactttagt tggacaaatggaaaggttacaagtggactttgtgatgagtttagtcttgagagtaatgat ggatggacttctcaaagctctggagaattaaaggcagctgcagaagctctcagctttact aaatgttcttctgctatttcaaaagcattggaaaccttgaattcttgcaagaaattagga agagatccaaccaacgatcttactttttatgtttcacaaaagcgcaataatgtatacttt agtcagtcagcagctaatttagagtaa >gi568815581f:30732348_30994910|GENSCAN_predicted_peptide_3|334_aa MGFHHVGQAGLELLTSGWAPTGAPQSKCLGETQGVSTKEALYPLQTKTGLGIAFLEIRPS GSQESKITPNVTFCDENAKEPENALDKLFSSEQQASILHVLNTASTKELEAFRLLRGRRS INIVEHRENFGPFQNLESLMNVPLFKYKSTVQVCNSILCPKTGREKRKSPENRFLRKLLK PDIERERLKISSIISKMPKADFYVLEKTGLSIQNSSLFPILLHFHIMEAMLYALLNKTFA QDGQHQVLSMNRNAVGKHFELMIGDSRTSGKELVKQFLFDSILKADPRVFFPSDKIVHYR QMFLSTELQRVEELYDSLLQAIAFYELAVFDSQP 
>gi568815581f:30732348_30994910|GENSCAN_predicted_CDS_3|1005_bp atggggtttcaccatgttggccaagctggtctcgaacttctgacctcaggctgggcccca acaggagctcctcaatctaagtgtctaggagaaacccagggagtctccacgaaggaggca ctttacccactacagactaaaactggcctcggaatcgccttcctggagattcggcccagc ggctctcaggagtcgaaaattactcccaatgttactttttgtgatgaaaatgcaaaggag cccgaaaatgcacttgacaagctcttctcttcagaacagcaggcttccatcttgcatgtg ttgaatacagcatctactaaagaacttgaagctttccgattgcttcgtggaagaaggtcc atcaatatcgtagagcacagagaaaactttgggccatttcagaatttagagagtttaatg aatgtgcccttgtttaagtataaaagtacagttcaagtttgtaactccatactttgtcca aagactggacgggaaaaaagaaagtcaccggaaaaccggttcctgagaaagctcctcaaa ccagacatagaaagagaaagacttaagatttcctcgatcatttcaaagatgcctaaagca gatttctatgttctggaaaaaacaggactttccattcagaactcatctctgtttccaata ctgttacattttcatatcatggaagccatgctgtatgccttattaaataaaacttttgcc caggatgggcagcatcaggtgctgagcatgaatcgaaatgcagtggggaagcattttgaa ctgatgattggtgactcccggactagtggaaaagagctagtgaagcagtttctcttcgat tctatactgaaggcggatcctcgggtgttcttcccatcagataaaatagttcactacaga cagatgtttttatctactgaactacaaagagtagaagagctttatgattcattattacaa gctattgccttctatgaattagcagtgtttgactctcagccttag >gi568815581f:30732348_30994910|GENSCAN_predicted_peptide_4|506_aa MGDRERNKKRLLELLRAPDTGNAHCADCGAADPDWASYKLGIFICLNCCGVHRNFPDISR VKSVRLDFWDDSIVEFMIHNGNLRVKAKFEARVPAFYYIPQANDCLVLKEQWIRAKYERR EFMADGETISLPGNREGFLWKRGRDNSQFLRRKFVLLAREGLLKYFTKEQGKSPKAVISI KDLNATFQTEKIGHPHGLQITYRRDGHTRNLFVYHESGKEIVDWFNALRAARLQYLKMAF PELPESELVPFLTRNYLKQGFMEKTGPKPEKEEGASSKLRQQGEAGKGYETDLVICGKKF PLLQQKEPFKKRWFALDCHERRLLYYKNPLDAFEQGQVFLGNKEQGYEAYEDLPKGIRGN RWKAGLTIVTPERRFVLTCPSEKEQQEWLESLRGVLSSPLTPLNRLRQHWHGHLPQPSSF PLQLHQQRVAAAAGDPLTEELAATEHLELLVGRSLHLGPGCPPSVPRSQQPFLAVNSART EAVALSISSLGLPRNPPRGSEDLVHR >gi568815581f:30732348_30994910|GENSCAN_predicted_CDS_4|1521_bp atgggcgatcgcgagcgcaacaagaagcggctgctggagctgctgcgggcgccggacaca ggcaacgcgcactgcgccgactgcggggcggcagatcccgactgggcctcttacaagctg gggatcttcatctgtctcaactgctgcggcgtccaccgtaacttccctgacatcagcaga gttaaatctgtgcgacttgacttctgggacgacagtattgtggagtttatgatccacaat 
ggaaacctccgtgtgaaggccaagttcgaagccagagtcccagctttctactacatcccc caggccaacgactgcctggtcttaaaggaacaatggattcgagctaagtatgagagacgg gaatttatggctgatggggaaaccatctcgctcccaggtaaccgagaaggattcctgtgg aagcgaggaagggacaactcacagtttctgagaaggaagtttgtacttctggcaagagaa ggcctcctgaagtacttcacaaaggaacagggtaaaagccccaaagctgtcatcagcatt aaggacttgaatgccaccttccagacagagaagatagggcacccccatgggctgcagatc acctacaggagagatggccacaccaggaacctgtttgtgtatcatgaaagtgggaaggag atagtggactggttcaatgccctccgtgcagcccgtctgcagtacctaaaaatggccttt cctgaactcccagagtctgagctcgtgccattcctcaccaggaactacctcaaacaaggc ttcatggaaaagactgggccaaagcctgaaaaagaggaaggtgcctcttccaagttgagg cagcagggtgaagctggcaaagggtatgaaactgacctggtcatctgtggtaagaagttc cctcttttgcagcagaaagaacctttcaagaaaaggtggttcgccctggattgccatgag cggaggctgctctattacaagaacccactggatgccttcgagcagggccaggtttttctt gggaacaaggagcagggatatgaagcctacgaagacctgcccaagggcatccgaggaaat cgctggaaagccggactcaccattgtcaccccagagcggagatttgtcctcacttgcccc agtgagaaggaacagcaggaatggctggaaagtttgcggggtgtcctgtccagccccttg acgcccctcaaccggcttagacagcattggcatggccacctccctcagccctcttcattt cccttgcagctgcatcaacagagagtggccgcagcagcaggtgacccattaactgaggaa ctggctgccactgaacacctggaactccttgtgggaagaagtttgcacctcggccctggc tgcccaccatcagtgccccgcagtcagcagccattcctggcagtgaactctgccaggact gaagctgtggctttatccatcagctccctgggccttccccgcaacccacctcggggatct gaggatctggtgcatagatga >gi568815581f:30732348_30994910|GENSCAN_predicted_peptide_5|240_aa MAGLGLGSAVPVWLAEDDLGCIICQGLLDWPATLPCGHSFCRHCLEALWGARDARRWACP TCRQGAAQQPHLRKNTLLQDLADKYRRAAREIQAGSDPAHCPCPGSSSLSSAAARPRRRP ELQRFYRTQDLNFAILKVAVEKSITEVAQELTELVEHLVDIVRSLQNQRPLSESGPDNEL SILGKAFSSGVDLSMASPKLVTSDTAAGKIRDILHDLEEIQEKLQESVTWKEAPEAQMQX >gi568815581f:30732348_30994910|GENSCAN_predicted_CDS_5|720_bp atggcgggcctgggcctgggctccgccgttcccgtgtggctggccgaggacgacctcggc tgcatcatctgccaggggctgctggactggcccgccacgctgccctgcggccacagcttc tgccgccactgcctggaggccctgtggggcgcccgcgacgcccgccgctgggcctgcccc acttgccgccagggcgccgcgcagcagccgcacctgcggaagaacacgctactgcaggac ctggccgacaagtaccgccgcgccgcacgcgagatacaggcgggctccgaccctgcccac 
tgcccctgcccgggctccagttccctctccagcgcggccgcgaggccccggcgccgcccg gaactgcagcggttttatagaacccaggacctgaactttgctattttgaaggtggcagta gagaagagcatcacagaagttgctcaggagctgacagagctggtggaacatcttgtagac attgtcagaagcctgcagaatcagaggcccctatcagaatctggaccagacaacgaactg agcatcctgggcaaggctttttcttctggggtggatctttccatggcttctccaaagctg gtgacttccgacacagctgcagggaaaatcagagatattctccatgacctagaagaaatt caggaaaaattacaagaaagcgtcacctggaaagaggctcctgaagcacaaatgcaggnn