GENSCAN 1.0	Date run: 4-Nov-116	Time: 03:58:22

Sequence gi568815592r:131849267_132051172 : 201906 bp : 38.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    724    840  117  2  0   61  110   114 0.975  10.42
 1.02 Intr +   1876   2001  126  2  0   89   77    27 0.582   1.43
 1.03 Intr +   9402   9481   80  1  2  116   77    38 0.931   3.85
 1.04 Intr +  11121  11240  120  2  0   84   81    74 0.966   6.07
 1.05 Intr +  12329  12438  110  1  2  100   71    23 0.758  -0.04
 1.06 Intr +  15240  15305   66  0  0   89   66    61 0.496   1.10
 1.07 Intr +  15600  15672   73  0  1   30  100    62 0.717   0.09
 1.08 Intr +  18752  18860  109  1  1   59   44   115 0.961   3.14
 1.09 Intr +  20092  20223  132  2  0   83  103    45 0.967   5.20
 1.10 Intr +  22804  22835   32  2  2  104   65    64 0.999   2.63
 1.11 Intr +  26510  26597   88  1  1   97  103    83 0.985   9.32
 1.12 Intr +  27726  27895  170  1  2   48  123   115 0.981   9.74
 1.13 Intr +  29276  29327   52  1  1   62  116    72 0.939   4.96
 1.14 Intr +  31482  31805  324  1  0   -5   87   205 0.564   5.92
 1.15 Intr +  34428  34508   81  1  0   95   94    27 0.851   2.69
 1.16 Intr +  35665  35797  133  2  1   76   55   227 0.993  16.98
 1.17 Intr +  37296  37458  163  0  1   60  103   107 0.914   8.46
 1.18 Term +  41075  41245  171  1  0   80   40   101 0.922   1.34
 1.19 PlyA +  41484  41489    6                               1.05

 2.06 PlyA -  42094  42089    6                               1.05
 2.05 Term -  43708  43574  135  1  0  101   42    56 0.322  -0.66
 2.04 Intr -  45511  45427   85  0  1   97   76    32 0.378   1.80
 2.03 Intr -  46011  45886  126  2  0   91   46   100 0.295   4.97
 2.02 Intr -  54921  54803  119  0  2   -2  109   119 0.138   3.34
 2.01 Init -  58562  58440  123  2  0   35   92   166 0.847  11.92
 2.00 Prom -  65085  65046   40                              -3.05

 3.04 PlyA -  66236  66231    6                               1.05
 3.03 Term -  69903  69485  419  1  2   42   45   174 0.001   2.95
 3.02 Intr -  80348  80284   65  1  2  104   69    30 0.002   0.14
 3.01 Init -  86533  86466   68  1  2   88   71    87 0.738   7.60
 3.00 Prom -  98918  98879   40                              -4.25

 4.05 PlyA -  98935  98930    6                               1.05
 4.04 Term - 100294  99998  297  1  0   99   48   293 0.962  20.68
 4.03 Intr - 100894 100683  212  2  2   76   83   173 0.996  13.31
 4.02 Intr - 101400 101026  375  1  0   42  100   280 0.497  18.66
 4.01 Init - 101891 101504  388  2  1   76  111   460 0.507  42.11
 4.00 Prom - 102145 102106   40                              -1.75

 5.05 PlyA - 103275 103270    6                               1.05
 5.04 Term - 110187 110071  117  0  0   74   54   111 0.872   3.86
 5.03 Intr - 110390 110299   92  0  2  112   37    52 0.637   1.29
 5.02 Intr - 111315 111201  115  0  1   70   92    23 0.678   0.00
 5.01 Init - 114650 114576   75  2  0   53   55    89 0.514   3.24
 5.00 Prom - 119555 119516   40                              -2.25

 6.00 Prom + 122112 122151   40                              -6.45
 6.01 Init + 127263 127408  146  2  2   60   68    80 0.125   2.84
 6.02 Intr + 149823 149880   58  0  1   51  115    26 0.084  -0.43
 6.03 Term + 158727 158954  228  2  0   64   54   184 0.771   8.25
 6.04 PlyA + 158960 158965    6                               1.05

 7.03 PlyA - 159256 159251    6                               1.05
 7.02 Term - 160121 160063   59  0  2   49   48    67 0.022  -4.33
 7.01 Init - 171661 171535  127  1  1   90   99    65 0.941   8.17
 7.00 Prom - 179062 179023   40                              -3.55

 8.03 PlyA - 180960 180955    6                               1.05
 8.02 Term - 191414 191312  103  1  1   47   44   179 0.739   6.17
 8.01 Init - 201260 201223   38  2  2   81   68    44 0.396   1.24

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  78220  78316   97  0  1   58  101    91 0.901   6.92
S.002 Term +  80145  80308  164  1  2  104   45    76 0.930   2.02

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:131849267_132051172|GENSCAN_predicted_peptide_1|715_aa
XKSCKGRCFERTFGNCRCDAACVELGNCCLDYQETCIEPEHIWTCNKFRCGEKRLTRSLC
ACSDDCKDKGDCCINYSSVCQEKCGTYTKNMRPVYPTKTFPNHYSIVTGLYPESHGIIDN
KMYDPKMNASFSLKSKEKFNPEWYKGEPIWVTAKYQGLKSGTFFWPGSDVEINGIFPDIY
KMYNGSVPFEERILAVLQWLQLPKDERPHFYTLYLEEPDSSGHSYGPVSSEVIKALQRVD
GMVGMLMDGLKELNLHRCLNLILISDHGMEQGSCKKYIYLNKYLGDVKNIKVIYGPAARL
RPSDVPDKYYSFNYEGIARNLSALFVGYGPGFKHGIEADTFENIEVYNLMCDLLNLTPAP
NNGTHGSLNHLLKNPVYTPKHPKEVHPLVQCPFTRNPRDNLGCSCNPSILPIEDFQTQFN
LTVAEGGPHCLAKPLKLSNIIARTIYRKTRAQTRERPESCPPPTPEELGLERSEGKYPRI
RAAFPLTHGPLGALVEGGMVKEMVLEREKGEDEILILRGIEILNWAETESDWQELNKNSS
GIYSEALLTTNIVPMYQSFQVIWRYFHDTLLRKYAEERNGVNVVSGPVFDFDYDGRCDSL
ENLRQKRRVIRNQEILIPTHFFIVLTSCKDTSQTPLHCENLDTLAFILPHRTDNSESCVH
GKHDSSWVEELLMLHRARITDVEHITGLSFYQQRKEPVSDILKLKTHLPTFSQED

>gi568815592r:131849267_132051172|GENSCAN_predicted_CDS_1|2148_bp
nttaaaagttgcaaaggtcgctgtttcgagagaacatttgggaactgtcgctgtgatgct
gcctgtgttgagcttggaaactgctgtttagattaccaggagacgtgcatagaaccagaa
catatatggacttgcaacaaattcaggtgtggtgagaaaaggttgaccagaagcctctgt
gcctgttcagatgactgcaaggacaagggcgactgctgcatcaactacagttctgtgtgt
caagaaaaatgtggaacatatactaaaaacatgagaccggtatatccaacaaaaactttc
cccaatcactacagcattgtcaccggattgtatccagaatctcatggcataatcgacaat
aaaatgtatgatcccaaaatgaatgcttccttttcacttaaaagtaaagagaaatttaat
cctgagtggtacaaaggagaaccaatttgggtcacagctaagtatcaaggcctcaagtct
ggcacatttttctggccaggatcagatgtggaaattaacggaattttcccagacatctat
aaaatgtataatggttcagtaccatttgaagaaaggattttagctgttcttcagtggcta
cagcttcctaaagatgaaagaccacacttttacactctgtatttagaagaaccagattct
tcaggtcattcatatggaccagtcagcagtgaagtcatcaaagccttgcagagggttgat
ggtatggttggtatgctgatggatggtctgaaagagctgaacttgcacagatgcctgaac
ctcatccttatttcagatcatggcatggaacaaggcagttgtaagaaatacatatatctg
aataaatatttgggggatgttaaaaatattaaagttatctatggacctgcagctcgattg
agaccctctgatgtcccagataaatactattcatttaactatgaaggcattgcccgaaat
ctttctgccctctttgttggctatggacctggattcaagcatggcattgaggctgacacc
tttgaaaacattgaagtctataacttaatgtgtgatttactgaatttgacaccggctcct
aataacggaactcatggaagtcttaaccaccttctaaagaatcctgtttatacgccaaag
catcccaaagaagtgcaccccctggtacagtgccccttcacaagaaaccccagagataac
cttggctgctcatgtaacccttcgattttgccgattgaggattttcaaacacagttcaat
ctgactgtggcagaagggggccctcactgtttggcaaagccattaaagctcagtaacatc
attgcacgtaccatctataggaagacacgagcccaaaccagggagaggccagaatcctgc
ccaccaccaacaccagaggagttaggccttgaaaggagtgagggcaaatatcccagaata
agagcagccttccccctcacccatggccccctgggtgctctggtagaaggaggaatggtg
aaggagatggtgctggaaagagagaagggggaagatgagattttgatcttacgtggaatt
gaaattttaaactgggctgaaactgaaagtgactggcaagaactaaataaaaattcaagt
ggaatatattctgaagctttgcttactacaaatatagtgccaatgtaccagagttttcaa
gttatatggcgctactttcatgacaccctactgcgaaagtatgctgaagaaagaaatggt
gtcaatgtcgtcagtggtcctgtgtttgactttgattatgatggacgttgtgattcctta
gagaatctgaggcaaaaaagaagagtcatccgtaaccaagaaattttgattccaactcac
ttctttattgtgctaacaagctgtaaagatacatctcagacgcctttgcactgtgaaaac
ctagacaccttagctttcattttgcctcacaggactgataacagcgagagctgtgtgcat
gggaagcatgactcctcatgggttgaagaattgttaatgttacacagagcacggatcaca
gatgttgagcacatcactggactcagcttctatcaacaaagaaaagagccagtttcagac
attttaaagttgaaaacacatttgccaacctttagccaagaagactga

>gi568815592r:131849267_132051172|GENSCAN_predicted_peptide_2|195_aa
MGEQQSHFEKEHAEWKDKTIATENGNMMGKLVKQGKGKQGQGASKAIGQCQSSAAKPRRS
GKESVREPWARVPGALGVAASITAVVNCQQPSDSVSCLQRSTIAHDPTHPPHTKPQRKAA
RVLNSQQAYQSLYSMQLLPLLEMSSCLLLPKLIYFQLGAEKVSNLEMSINAGEKKALRES
CSLLPKDQKRVAKQG

>gi568815592r:131849267_132051172|GENSCAN_predicted_CDS_2|588_bp
atgggggagcagcaaagtcactttgaaaaggagcatgcagaatggaaagataaaacaatc
gctacagaaaatggtaacatgatgggaaaactggtgaaacaaggaaagggaaaacaagga
cagggggcttccaaggcaatcgggcagtgtcagtcttcagccgctaagccaagaagatct
gggaaggagtcagtcagagagccttgggccagagttccaggggctctgggagtggctgcc
agcattacagctgtggttaactgtcaacaaccctcggactcagtttcatgtctgcagaga
agcaccatcgcccatgatcctacccatcctccacacacaaagccacaaaggaaagcagcc
agagttctgaacagccaacaggcctaccagtccttgtactctatgcagctacttcccctg
ctggaaatgagcagctgccttcttctccctaagctcatatatttccaacttggagctgaa
aaagtcagcaacctggaaatgtcaataaatgcaggggaaaaaaaagccctaagagagtcc
tgctctctcctgccaaaggaccagaaaagggtagccaagcaagggtaa

>gi568815592r:131849267_132051172|GENSCAN_predicted_peptide_3|183_aa
MADFMAQAEKMQDKPAKTCGTSKNQAKRVAPTGDISSFLRQREEMLAQGPGVEVGVAGEL
ACSVPTKALSSMTARRGRGADYTPMHWQGKEGKTYLCRLVPAKTREEFPGPREAAVWGGG
GWAGVWPRGSLAGALHQSDTICQHRSYGVASRAPKTALQAGLARLGPQKGQQTKGCSGRI
SPV

>gi568815592r:131849267_132051172|GENSCAN_predicted_CDS_3|552_bp
atggctgatttcatggctcaggcagagaaaatgcaagataaacctgcaaaaacttgtggc
accagcaagaaccaagctaaaagagtagcccctactggggacatatcatcattcttgagg
cagagggaagaaatgttagcacaagggccaggtgttgaggtgggcgttgcgggggagctg
gcttgctctgtgcccaccaaggctctgtcttcaatgacagccagacggggcaggggggca
gactacactcccatgcactggcagggcaaggaaggcaaaacctacctgtgcagacttgtg
ccagcaaagacacgtgaggagtttcctgggcctagggaagctgcagtgtggggagggggt
ggatgggctggtgtgtggcctaggggcagccttgctggagctctccaccagtcagatact
atctgccagcacagaagctatggcgtggcctccagggcacccaagactgccctgcaagca
ggtctggccaggctgggaccccagaaaggccagcagaccaaggggtgctcaggtcgaatc
agccctgtctga

>gi568815592r:131849267_132051172|GENSCAN_predicted_peptide_4|423_aa
MGPVRVAFVVLLALCSRVSAGSPRCGRRLPGRDSGPAGEGVRADRAPLTALSSLQPAVGQ
NCSGPCRCPDEPAPRCPAGVSLVLDGCGCCRVCAKQLGELCTERDPCDPHKGLFCHFGSP
ANRKIGVCTAAASPPPAAQIPTRIPDALDVRVPQCLTSASPTPLFPSSSPAKDGAPCIFG
GTVYRSGESFQSSCKYQCTCLDGAVGCMPLCSMDVRLPSPDCPFPRRVKLPGKCCEEWVC
DEPKDQTVVGPALAAYRLEDTFGPDPTMIRANCLVQTTEWSACSKTCGMGISTRVTNDNA
SCRLEKQSRLCMVRPCEADLEENIKKGKKCIRTPKISKPIKFELSGCTSMKTYRAKFCGV
CTDGRCCTPHRTTTLPVEFKCPDGEVMKKNMMFIKTCACHYNCPGDNDIFESLYYRKMYG
DMA

>gi568815592r:131849267_132051172|GENSCAN_predicted_CDS_4|1272_bp
atgggccccgtccgcgtcgccttcgtggtcctcctcgccctctgcagccgggtaagcgcc
gggagcccccgctgcggccggcggctgccagggagggactcggggccggccggggagggc
gtgcgcgccgaccgagcgccgctgaccgccctgtcctccctgcagccggccgtcggccag
aactgcagcgggccgtgccggtgcccggacgagccggcgccgcgctgcccggcgggcgtg
agcctcgtgctggacggctgcggctgctgccgcgtctgcgccaagcagctgggcgagctg
tgcaccgagcgcgacccatgcgacccgcacaagggcctattctgtcacttcggctccccg
gccaaccgcaagatcggcgtgtgcaccgctgctgccagcccgccccctgcagcccagatc
ccaactcgcatccctgacgctctggatgtgagagtgccccaatgcctgacctctgcatcc
cccacccctctcttcccttcctcttctccagccaaagatggtgctccctgcatcttcggt
ggtacggtgtaccgcagcggagagtccttccagagcagctgcaagtaccagtgcacgtgc
ctggacggggcggtgggctgcatgcccctgtgcagcatggacgttcgtctgcccagccct
gactgccccttcccgaggagggtcaagctgcccgggaaatgctgcgaggagtgggtgtgt
gacgagcccaaggaccaaaccgtggttgggcctgccctcgcggcttaccgactggaagac
acgtttggcccagacccaactatgattagagccaactgcctggtccagaccacagagtgg
agcgcctgttccaagacctgtgggatgggcatctccacccgggttaccaatgacaacgcc
tcctgcaggctagagaagcagagccgcctgtgcatggtcaggccttgcgaagctgacctg
gaagagaacattaagaagggcaaaaagtgcatccgtactcccaaaatctccaagcctatc
aagtttgagctttctggctgcaccagcatgaagacataccgagctaaattctgtggagta
tgtaccgacggccgatgctgcaccccccacagaaccaccaccctgccggtggagttcaag
tgccctgacggcgaggtcatgaagaagaacatgatgttcatcaagacctgtgcctgccat
tacaactgtcccggagacaatgacatctttgaatcgctgtactacaggaagatgtacgga
gacatggcatga

>gi568815592r:131849267_132051172|GENSCAN_predicted_peptide_5|132_aa
MNSEPSFRLGSRQQATINNVTGEEYNLYSPLWQFDDKNKNKTEKNRILKTESQFINHSQV
QNNGLAWWNIGLPETVQQEYFPTTSNARKNKEVQGLRILLVLGAYGRQNGFQMAVSAQHK
VIFPSWKVGYDA

>gi568815592r:131849267_132051172|GENSCAN_predicted_CDS_5|399_bp
atgaactcagaaccgagtttccggttgggatccagacagcaggcaaccatcaacaatgtg
acaggagaagaatataacttatactcccccctgtggcaatttgatgacaaaaacaaaaac
aaaacagaaaaaaacaggatcctgaagacagaatcacaattcataaatcattcacaagtt
caaaataacggcctagcttggtggaatattgggctgccagaaacagtacaacaggagtat
ttccctaccaccagcaatgccaggaaaaataaggaagtccaaggtctaagaattctcctc
gtccttggcgcttacggtagacaaaatggcttccagatggccgtttctgctcaacataag
gtcattttcccttcttggaaagttggatatgatgcatga

>gi568815592r:131849267_132051172|GENSCAN_predicted_peptide_6|143_aa
MNTYRWLEKLGKIFLDCAIVVCPDKDLGDPESFKPSSSMMPSTIETCLLCIIFKKIVTEV
IHRENYSKDKITHDEPVIRPHPSAKGMPVSSIVYIDTEEVDDEHIDEHIAVSATRWSGSH
QPNPVVSTTDSFNSALFKEMRLE
>gi568815592r:131849267_132051172|GENSCAN_predicted_CDS_6|432_bp
atgaatacctatagatggcttgagaaactcggaaaaatatttttggactgtgcaatagtg
gtttgtccagataaggaccttggcgatccggagtcatttaaaccatctagcagcatgatg
ccaagcacaatagaaacatgtttgctatgtatcatctttaagaaaattgtgactgaagta
attcatagggaaaactactccaaggacaaaatcacacatgatgagccagtcatacggcca
cacccaagtgcaaagggaatgccagtgtcatctattgtttacatagacacagaagaagta
gatgatgaacatattgatgaacacattgctgtttcagccacaaggtggtctggaagtcac
cagccaaatcctgttgtttcaacaacagatagttttaacagtgctttatttaaagagatg
agactagaatga

>gi568815592r:131849267_132051172|GENSCAN_predicted_peptide_7|61_aa
MKEPRTVGREQVGVSFQNFAICEKDESSKNLYRLTKGAGRWPGKSKSMAIASGKGHPIEH
G

>gi568815592r:131849267_132051172|GENSCAN_predicted_CDS_7|186_bp
atgaaagagccaagaactgtgggcagggaacaagtaggtgtcagttttcagaattttgcc
atttgtgaaaaagatgagtcttctaaaaatctttacagactgacgaaaggagcagggaga
tggccaggaaagtccaagagcatggcaatagcatctggcaaaggtcatcccatagagcat
gggtga

>gi568815592r:131849267_132051172|GENSCAN_predicted_peptide_8|46_aa
MDSRMLISLDEHRHTDQEFNHSDCRTVTIVTDTLAIMTNPVRTGKE

>gi568815592r:131849267_132051172|GENSCAN_predicted_CDS_8|141_bp
atggactctcgaatgcttatcagcttggatgaacacagacatacagaccaagaatttaac
cacagtgactgccgaacagtgacaatcgtgactgacactttggcaataatgactaatcca
gtacggactggcaaggaatag