GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:58:26 Sequence gi568815577f:43069100_43272277 : 203178 bp : 50.93% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5241 5631 391 0 1 67 116 186 0.256 14.03 1.02 Intr + 7647 7719 73 2 1 57 71 81 0.608 2.28 1.03 Intr + 10562 10790 229 0 1 59 64 202 0.777 11.83 1.04 Term + 12449 12623 175 2 1 77 46 64 0.249 -1.67 1.05 PlyA + 14765 14770 6 1.05 2.15 PlyA - 17229 17224 6 1.05 2.14 Term - 18972 18849 124 2 1 49 38 157 0.703 4.66 2.13 Intr - 22064 22034 31 0 1 89 105 19 0.528 0.89 2.12 Intr - 24150 24068 83 2 2 93 28 122 0.564 5.98 2.11 Intr - 25464 25372 93 2 0 60 81 123 0.988 7.88 2.10 Intr - 25689 25556 134 0 2 87 84 234 0.999 22.34 2.09 Intr - 26437 26339 99 1 0 81 97 174 0.991 17.91 2.08 Intr - 26644 26595 50 2 2 118 78 51 0.981 5.50 2.07 Intr - 32333 32267 67 2 1 113 93 11 0.705 2.58 2.06 Intr - 38046 37926 121 2 1 -18 41 154 0.518 0.60 2.05 Intr - 38538 38352 187 1 1 42 89 222 0.755 16.45 2.04 Intr - 40382 40167 216 0 0 113 77 4 0.339 0.28 2.03 Intr - 45579 45538 42 1 0 90 59 64 0.453 1.91 2.02 Intr - 45855 45740 116 2 2 91 70 -4 0.579 -1.81 2.01 Init - 51882 50756 1127 0 2 60 53 320 0.636 19.37 2.00 Prom - 53140 53101 40 -4.66 3.00 Prom + 67927 67966 40 -5.26 3.01 Init + 71224 71360 137 0 2 69 84 116 0.936 8.91 3.02 Intr + 73088 73217 130 2 1 67 41 75 0.842 1.40 3.03 Term + 74734 74814 81 0 0 96 33 96 0.776 2.59 3.04 PlyA + 76565 76570 6 1.05 4.00 Prom + 83544 83583 40 -5.66 4.01 Init + 88361 88428 68 1 2 69 86 68 0.396 5.25 4.02 Intr + 91672 91877 206 1 2 109 75 67 0.671 6.34 4.03 Intr + 96807 96913 107 1 2 90 65 53 0.315 3.13 4.04 Intr + 99911 100189 279 1 0 52 94 480 0.579 42.67 4.05 Intr + 101418 101540 123 2 0 61 99 333 0.800 32.48 4.06 Term + 102972 103181 210 2 0 125 47 337 0.999 30.59 4.07 PlyA + 103682 103687 6 1.05 5.03 PlyA - 104729 104724 6 -3.74 5.02 Term - 105700 105522 179 2 2 27 42 194 0.554 6.55 5.01 Init - 111777 111681 97 0 1 104 106 29 0.303 6.87 5.00 Prom - 115109 115070 40 -5.46 6.00 Prom + 116338 116377 40 -6.76 6.01 Init + 116662 116699 38 0 2 59 94 36 0.398 0.78 6.02 Intr + 117559 117663 105 1 0 101 64 73 0.614 5.53 6.03 Intr + 122912 123903 992 2 2 34 80 412 0.813 25.50 6.04 Intr + 125479 125537 59 2 2 64 78 82 0.713 3.40 6.05 Term + 127021 127104 84 0 0 91 46 47 0.678 -1.55 6.06 PlyA + 128426 128431 6 1.05 7.00 Prom + 132573 132612 40 -2.46 7.01 Init + 193368 193822 455 2 2 65 81 481 0.239 38.44 7.02 Intr + 194575 194690 116 1 2 38 78 9 0.332 -5.01 7.03 Intr + 194746 194829 84 2 0 85 86 56 0.084 4.89 7.04 Intr + 200459 200637 179 0 2 56 94 51 0.073 2.14 7.05 Term + 201824 202117 294 1 0 86 50 96 0.105 0.91 7.06 PlyA + 202988 202993 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 85131 84655 477 0 0 71 43 191 0.839 7.00 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815577f:43069100_43272277|GENSCAN_predicted_peptide_1|289_aa XLGAKAASNDPGGWCFQGLGTTDLSENPKALDLLFPRKMHPLSEVRVPRFSFYPNLVRSR RRERSSTFLRGSRESPTGRDSPSGTGPVWPGARMLRRRIFQKAVDPRPRQTQAAQRLAFG PSPRPGALSRKRKTRTVPISSGQRNVLRLRSVPGSEYNTYAESAHVMSGQRINRYPTGGR AEGRGLGLHHPLVAGRLGPQNWRRLEEEEEEEEGKEEEEEGKEEEEEGKEERIFLTLKEG HDLTIPLQKRKTSVQEQGAELGETRRRDSWAHSYPWDFVVLPRRQSAPV >gi568815577f:43069100_43272277|GENSCAN_predicted_CDS_1|870_bp nnactgggggccaaagcggcttcaaacgaccctggagggtggtgctttcaaggtcttggg accacggacttgtctgaaaatccgaaggcgttggatcttctcttccccagaaaaatgcac ccgctttcagaggttcgtgtcccacgcttttctttctatcccaaccttgtaagaagccgc cgccgtgagcggagcagcaccttcctccgcgggtcgcgggagtcacctacgggaagggac tctccgtctggcacaggccctgtctggcctggggcgcgcatgctccgccgccgaatcttc cagaaagccgtggacccgaggccccggcagacgcaggcggcccagcgccttgcttttggc ccctcgcctcgccctggagccctctctcgcaagcgaaagacacggacggtgcccatcagc tctgggcagaggaacgtgctgcgcctccgcagcgtccccggctcagagtataacacgtat gcagaaagtgcacatgtcatgagtggacagaggataaaccgctaccccactggggggcga gcagaggggagaggcctgggcctgcaccatcccctggtggccggaaggctgggccctcag aactggaggagattggaggaggaggaggaggaagaggaagggaaggaggaggaagaggaa gggaaggaggaggaagaggaagggaaggaggagaggatcttcctaacactgaaggagggg cacgacctgaccatccctttacagaagagaaagacgagtgtgcaggaacagggagcagag ctgggtgagaccaggcggcgggactcctgggcccactcttacccatgggactttgtggtc ctcccacgacgccaatcagccccagtttga >gi568815577f:43069100_43272277|GENSCAN_predicted_peptide_2|829_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSCIGRINIV KMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRTHIAKTILSQKNKAGGITLP DFKLYYKATVTKTAWYWYQNRDIDKWNRTEPSETIPHIYNHLIFDKPDKNKKWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKD FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRI YKELKQTYKKKTNNPIKKWAKDMNRHFSKEDIYAANRHMKKCLSSLAIREMQIKTIMRYH LTPVRMAIIKKSGNNSQINQHSYFLSRQIIFKNSDLQILGMSNLSNNKTPVARTATLSAN RMVPTHTEGTSQLRASCQLPTLQQSLFPDRVLRLHFLDMPTCWTGICLVTSLASPLRKEH KSQDPSSLCQKEKNELKAESCSTSPDWVALPRRGGEERRRVRVTSPEGVGRVGGVGSSVD GSGGGGWEMAEYLASIFGTEKDNRRLPEAASSRAESCLEHPGLLDVDTFVPTFGCECPVE QALTILIQNIYRNPQNSAQTADGSHCAVSDVEMQEHYDEFFEEVFTEMEEKYGEVEEMNV CDNLGDHLVGNVYVKFRREEDAEKAVIDLNNRWFNGQPIHAELSPVTDFREACCRQYEMG ECTRGGFCNFMHLKPISRELRRELYGRRRKKHRSRSRSRERRSRSRDRGRGGGGGGGGVW VYDAVEVGRCMIHTAKYLKAKPGLKDNCKVIIVVKKQHKASIQASPPRP >gi568815577f:43069100_43272277|GENSCAN_predicted_CDS_2|2490_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagagatgtgaaggacctcttcaaggagaactacaaaccactgctcaaagaaataaaa gaggacacaaacaaatggaagaacattccatgctcatgcataggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctacca atgactttcttcacagaattggaaaaaactactttaaagtttatatggaaccaaaaaaga acccacattgccaagacaatcctaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagacaaatggaatagaacagagccctcagaaacaataccacacatctacaac catctgatcttcgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcgagatggattaaagacttaaatgttagacctaaaacc ataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagagactaccatcagagtgaacaggcaacct acagaatgggagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatc tacaaagaactcaaacaaacttacaagaaaaaaacaaacaaccccatcaaaaagtgggca aaggatatgaacagacacttctcaaaagaagacatttatgcagccaacagacacatgaaa aaatgcttatcatcactggccatcagagaaatgcaaatcaaaaccataatgagataccat ctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacagccagatcaatcag cactcctacttcctgagccgccaaattatctttaaaaactctgatctgcaaattcttggg atgtctaatttgagtaataataaaactccagtcgcccgcacagccacactgtcagccaat cggatggtgcccacccacactgagggtacttcacagctgagagcctcttgccagcttcca actctccagcagtctttgttcccagaccgtgtcctgaggctgcactttctggacatgccc acctgctggactggcatctgtcttgtcacttctctggcctcacctctgcgaaaggaacac aaaagtcaggaccccagttcactctgtcaaaaggaaaagaatgagctgaaagctgagtca tgcagcacctcgcctgactgggtagcgctgccacgtcggggcggggaagagcgtcgtcgc gtccgggtgacgtctcccgagggcgtcggcagggtcggcggcgtcggcagcagtgtcgac ggcagcggcggcggcgggtgggaaatggcggagtatctggcctccatcttcggcaccgag aaagacaatcgccgactcccggaagccgcctcgagccgcgctgaaagttgcctggagcat ccgggccttttggacgtggacacgtttgttccaactttcggctgtgaatgtcctgtcgag caagctctgaccatcttgattcaaaacatctatcgtaatccccaaaacagtgcacagacg gctgacggctcacactgtgccgtgagcgatgtggagatgcaggaacactatgatgagttt tttgaggaggtttttacagaaatggaggagaagtatggggaagtagaggagatgaacgtc tgtgacaacctgggagaccacctggtggggaacgtgtacgtcaagtttcgccgtgaggaa gatgcggaaaaggctgtgattgacttgaataaccgttggtttaatggacagccgatccac gccgagctgtcacccgtgacggacttcagagaagcctgctgccgtcagtatgagatggga gaatgcacacgaggcggcttctgcaacttcatgcatttgaagcccatttccagagagctg cggcgggagctgtatggccgccgtcgcaagaagcatagatcaagatcccgatcccgggag cgtcgttctcggtctagagaccgtggtcgtggcggtggcggtggcggtggtggagtgtgg gtctatgatgcggttgaggtgggaagatgtatgatacatacagcaaagtacctcaaagct aaacctggtttaaaagacaattgcaaggtgatcatcgtggtcaagaagcaacacaaagcc agcatccaggccagccctccccggccataa >gi568815577f:43069100_43272277|GENSCAN_predicted_peptide_3|115_aa MIYPIFGMSVSDGGGSGGGEITESMSAKHTLLGAPPPTAQLKDKRPQGDSRKGDGVTGQD PSYEWELDQELSVLSHLLPVLGPLSETLQAMELTILPLAAVVLFRGTVIAFISIL >gi568815577f:43069100_43272277|GENSCAN_predicted_CDS_3|348_bp atgatctacccaatatttgggatgtctgtgagtgacggcggtggtagtggtggtggtgaa atcacggaatcaatgtctgcaaagcacacgctcttgggagcacctcctcccaccgcacag ctgaaagacaagcgaccacagggagacagcagaaagggagatggagtcaccggccaggac cccagctatgagtgggagctggaccaggagttgagtgtcctgagccacctcctccctgtg ctgggtccactctcagaaacactccaggccatggagctgaccatcctgcctctggcagct gtggtgctgtttaggggaacggtcattgccttcatctccattttatag >gi568815577f:43069100_43272277|GENSCAN_predicted_peptide_4|330_aa MPTTLVRPEITEESEHPGDAACPSPMQPTLGPAQPLGRRGWSPCAVPSAAHPAERPQAAG LHTCDRPEAPAAGPTRLQFPSRQPMALRPLPDSGAGSLRSPLQARPILWQSLDVPRRPLS RQALQRVGGLGLGSTLRCPEAPLTPASLQVPVVPKLNMDVTIQHPWFKRTLGPFYPSRLF DQFFGEGLFEYDLLPFLSSTISPYYRQSLFRTVLDSGISEVRSDRDKFVIFLDVKHFSPE DLTVKVQDDFVEIHGKHNERQDDHGYISREFHRRYRLPSNVDQSALSCSLSADGMLTFCG PKIQTGLDATHAERAIPVSREEKPTSAPSS >gi568815577f:43069100_43272277|GENSCAN_predicted_CDS_4|993_bp atgcccaccacgctcgtgaggccggagatcaccgaggaatccgagcacccgggggacgct gcatgcccgagccccatgcagcccacactgggtcctgcccagcccctgggccggagggga tggtccccctgcgccgtccccagcgctgcccaccctgcagaacgtcctcaggcggccggg ctccacacctgcgacaggcccgaggccccggctgcaggccccactcggctgcagttcccg tcgaggcagccaatggccttgaggcctctcccagactcaggcgctggctcactcagaagc cccctgcaggcccggccaatcctgtggcagagcctcgacgtcccacggcggcctctgagc cgccaggccctacagcgtgtgggagggctcggccttggctccacactgcgctgcccagag gccccgctgactcctgccagcctccaggtccccgtggtaccaaagctgaacatggacgtg accatccagcacccctggttcaagcgcaccctggggcccttctaccccagccggctgttc gaccagtttttcggcgagggcctttttgagtatgacctgctgcccttcctgtcgtccacc atcagcccctactaccgccagtccctcttccgcaccgtgctggactccggcatctctgag gttcgatccgaccgggacaagttcgtcatcttcctcgatgtgaagcacttctccccggag gacctcaccgtgaaggtgcaggacgactttgtggagatccacggaaagcacaacgagcgc caggacgaccacggctacatttcccgtgagttccaccgccgctaccgcctgccgtccaac gtggaccagtcggccctctcttgctccctgtctgccgatggcatgctgaccttctgtggc cccaagatccagactggcctggatgccacccacgccgagcgagccatccccgtgtcgcgg gaggagaagcccacctcggctccctcgtcctaa >gi568815577f:43069100_43272277|GENSCAN_predicted_peptide_5|91_aa MGSPKIQSLLSSSNHGLPWLTLDICHQGPTNVVLAEPVLLEDPTGGAEEIRQPTSQGKFC LGWPWCSVRLSKEPDRGSGEPPTGLADPAKA >gi568815577f:43069100_43272277|GENSCAN_predicted_CDS_5|276_bp atgggcagccccaaaatacaaagtctcctatccagttccaaccacggcctgccctggctg acactggacatttgccaccaggggcccacaaacgtggtccttgctgagcctgtgctgctg gaagacccaactgggggagcagaggagatcagacagcccaccagccaggggaagttctgc ctgggctggccctggtgctccgtgcgactctccaaggagcctgatcgaggatctggtgaa ccacccacgggccttgcagaccctgccaaggcctag >gi568815577f:43069100_43272277|GENSCAN_predicted_peptide_6|425_aa MQPAKEDSSAQVRVLRPHSQVPCEEEVEREERGLGPTLTDLECGPWARPLSPDTQVYQYS SFGPKWTPAAPLTQSLHQPRISTKPEFPPSQGFHQSRTCNNPELPPTQSFRQSRASTNPE LPPTQSFRQPRASTNPELPPIQSLHQPRASANPELPPIQSFRQPRASANPELPPTQSFRQ SRASIQSFCQPRASTNPIFCQPRTSANPELLPTQSFHQSRSSASPELLPTQSFRQPQASA NPERAPTQSFHQPRASANLELLPTQSFCPELPSNQSFHQPRAFANPELLPRASASPELPS TQSSHQPIASANLQLPPTQSFHQPRACTNPELLPMQSFHQARASTNHELPPTTSFHQPTA CSSVSRKGGAALWVAFVPVGKEDVLADLDLHAGSRTPEDALQVTSRHVESLLESGIHSSE LEEKG >gi568815577f:43069100_43272277|GENSCAN_predicted_CDS_6|1278_bp atgcaacctgcaaaagaggacagcagtgcccaagtcagggtcctgcggccccactcacag gtgccctgtgaggaggaggtggagcgagaggagaggggactgggcccgacactcactgat cttgaatgcggcccatgggccagacctctctcaccagacactcaggtgtaccagtacagc agctttggacctaagtggactccagcagctccattaacccagagcttgcaccaacccaga atttccaccaagccagagtttccaccaagccagggcttccaccaatctagaacttgcaac aatccagagcttccgccaacccagagcttccgtcaatccagagcttccaccaacccagag cttccaccaacccagagcttccgccaacccagagcttccaccaacccagagcttcctcca atccagagcttgcatcaacccagagcttccgccaacccagagcttccgccaatccagagc ttccgccaacccagagcttccgccaatccagagcttccgccaacccagagcttccgccaa tccagagcttccatccagagcttctgccaacccagagcttccaccaatccgatcttctgc cagcccagaacttccgccaacccagagcttctgccaacccagagcttccaccaatccaga tcttctgctagcccagagcttctgccaacccagagcttccgccagccccaagcttctgcc aacccagagcgtgcaccaacccagagcttccaccaacccagggcttctgccaatctagag cttctgccaacccagagcttctgcccagagcttccgtcaaaccagagcttccaccaaccc agagcttttgccaacccagagcttctgcccagagcttctgccagcccagagcttccatca acccagagctcgcaccaacctatagcttctgccaacctacagcttccaccaacccagagc ttccatcaacccagagcttgcaccaacccagagcttctgccaatgcaaagcttccaccaa gctagagcttccaccaaccacgagcttccaccaaccacgagcttccaccaacccacagct tgcagctcagtctccagaaaggggggagctgctctgtgggtggcattcgttccagtgggg aaagaggacgttcttgcggaccttgacctccatgccggcagccggaccccggaggatgct ctacaagtcaccagcaggcatgtggagtcactgttggagtcaggcatccactccagtgaa ctggaggagaagggctag >gi568815577f:43069100_43272277|GENSCAN_predicted_peptide_7|375_aa MVMVLVGVKVVVVVMVVVMVGVKVVVVGMVMVLVGVKVVVVMVVVVVVVMVMVVVGVKVV VVVMVVVVGVKVVVVVVGVKVVVVVMVVVVVGVKVVVVVIVMVVVGVKVVVVMVVILVVV MVVMVVIVRVMLVVMGTVTVVLVVVLVMGWECISASASVKWDESCNPYPRAVVRIKGTMV TPVKERGGSCGCCDLGELETSYADEHLGPGQLDQATPTECCKGACAWQSCTLLAALQCRP TVQDSAPPETVPPGGGILRSLLGFLPPSASLSGSDGPQHSFHGNPMNACRAGHLEIGVQS HVQPAAGSTTRKAERHNGHSNNHNNKSGRAEGSPQCARRRAELDIVSLTAGACPAEEDVE GGQLCSLQTPAGTPA >gi568815577f:43069100_43272277|GENSCAN_predicted_CDS_7|1128_bp atggtgatggtgctggtgggagtgaaggtagtggtggtggtgatggtggtggtgatggtg ggagtgaaggtagtggtggtggggatggtgatggtgctggtgggagtgaaggtagtggtg gtgatggtggtggtggtggtggtggtgatggtgatggtggtggtgggagtgaaggtggtg gtggtggtgatggtggtggtggtgggagtgaaagtggtggtggtggtggtgggagtgaag gtagtggtggtggtgatggtggtggtggtggtgggagtgaaggtggtggtggtggtgatc gtgatggtggtggtgggagtgaaggtggtggtggtgatggtggtgatactggttgtagtg atggtagtgatggtggtgatcgtgagggtgatgctggtggtgatgggaacagtgacagtt gtattggtggtagtattggtgatggggtgggaatgcatctcagcttctgcatctgtaaaa tgggatgagagttgcaatccttacccgagagctgtggtgaggatcaaagggacaatggtc acacctgtaaaagagagaggaggatcctgcggatgctgtgatctcggagagttggagacc tcttatgctgatgagcacttgggccctggccagctggaccaggctacccccacagagtgc tgcaagggggcatgtgcctggcagagctgcaccctgctggctgccctccagtgcagaccc accgtgcaagactccgcccctccagagactgtgccccctggtggaggcatcctcaggagc ctcctgggcttccttcctccatctgcctccttgagtggaagcgacgggccccagcacagt ttccacggcaaccccatgaatgcgtgcagggccgggcatctggaaataggtgtccaaagt cacgtgcaaccagcagcaggctcaacaacaaggaaggcagagcgtcacaatggccacagc aataaccacaataacaagagtggcagagctgagggcagtccccagtgtgccaggcgccga gctgagctggacatcgtctcactcacggcaggagcctgtcctgcggaggaggatgtggag ggcggccagttgtgtagcttacaaactccagccgggactccagcctga