GENSCAN 1.0    Date run: 8-Nov-116    Time: 04:01:49

Sequence gi568815581f:16317669_16536886 : 219218 bp : 48.91% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    107    240  134  0  2  102  100   203 0.995  23.26
 1.02 Term +   8132   8230   99  1  0   94   48    81 0.979   2.83
 1.03 PlyA +  10135  10140    6                               1.05

 2.06 PlyA -  13749  13744    6                               1.05
 2.05 Term -  25273  25149  125  2  2   78   49   178 0.954  11.45
 2.04 Intr -  27043  26929  115  1  1   58  109    80 0.994   7.12
 2.03 Intr -  31017  30948   70  2  1   58   86    65 0.729   2.48
 2.02 Intr -  32361  32263   99  2  0   59   89    52 0.712   1.63
 2.01 Init -  35768  35359  410  2  2   94   70   552 0.984  50.03
 2.00 Prom -  41535  41496   40                              -3.86

 3.00 Prom +  56960  56999   40                              -4.66
 3.01 Init +  61414  61446   33  0  0  107   77    75 0.141   6.23
 3.02 Term +  64234  64929  696  0  0   67   31  1342 0.168 119.95
 3.03 PlyA +  65050  65055    6                               1.05

 4.04 PlyA -  65814  65809    6                               1.05
 4.03 Term -  73945  73827  119  2  2  105   47   168 0.962  13.10
 4.02 Intr -  87661  87611   51  2  0  108   63    20 0.287   0.38
 4.01 Init -  89425  89347   79  1  1   58   89    38 0.404   0.78
 4.00 Prom -  91924  91885   40                              -8.66

 5.00 Prom +  93006  93045   40                              -4.96
 5.01 Init +  96791  96794    4  1  1   97   68     0 0.276  -0.94
 5.02 Intr +  98104  98173   70  2  1  103   60    43 0.694   1.24
 5.03 Intr + 100033 100200  168  1  0   62   94   174 0.700  14.46
 5.04 Intr + 102447 102580  134  0  2   53   74   233 0.999  18.69
 5.05 Intr + 104931 105221  291  1  0  117  109   323 0.997  34.41
 5.06 Intr + 105801 106099  299  1  2   93   83   323 0.992  28.89
 5.07 Intr + 108431 108601  171  1  0   30  105   226 0.969  18.64
 5.08 Intr + 109054 109217  164  0  2   81   41   202 0.014  13.57
 5.09 Intr + 111149 111314  166  2  1   62  100   209 0.982  19.46
 5.10 Intr + 114116 114182   67  1  1   68   71    99 0.911   4.68
 5.11 Intr + 114298 114632  335  2  2   92   50   681 0.810  59.99
 5.12 Intr + 115906 116030  125  0  2  106  109   150 0.999  18.28
 5.13 Intr + 118563 118648   86  0  2   66   55   122 0.724   6.16
 5.14 Intr + 121659 121746   88  1  1   69  106    45 0.603   3.43
 5.15 Intr + 122517 122585   69  0  0   65   99    59 0.377   3.00
 5.16 Term + 123406 123412    7  1  1   97   54     0 0.337  -5.16
 5.17 PlyA + 123510 123515    6                               1.05

 6.04 PlyA - 123904 123899    6                              -0.45
 6.03 Term - 126463 125920  544  0  1  111   44   704 0.996  62.04
 6.02 Intr - 130292 130177  116  2  2   93  105   129 0.868  14.35
 6.01 Init - 130603 130601    3  1  0   91  101     0 0.846   1.60
 6.00 Prom - 135086 135047   40                              -4.86

 7.00 Prom + 136418 136457   40                              -5.66
 7.01 Init + 137465 137475   11  1  2   41   77    15 0.037  -4.89
 7.02 Term + 138293 138872  580  2  1   63   35   486 0.259  34.76
 7.03 PlyA + 141223 141228    6                               1.05

 8.04 PlyA - 143090 143085    6                              -0.45
 8.03 Term - 144718 144449  270  1  0  135   51   293 0.510  25.78
 8.02 Intr - 145939 145846   94  0  1   69   43    75 0.842   1.07
 8.01 Init - 146998 146937   62  1  2   57  100    64 0.919   5.22
 8.00 Prom - 154404 154365   40                              -5.26

 9.05 PlyA - 156774 156769    6                               1.05
 9.04 Term - 166927 166825  103  0  1   38   48   109 0.145  -0.35
 9.03 Intr - 170386 170316   71  1  2   40   53   105 0.231   0.18
 9.02 Intr - 173807 173775   33  2  0  119   60    15 0.439   0.22
 9.01 Init - 174322 174077  246  1  0   95   99   367 0.999  34.10
 9.00 Prom - 202047 202008   40                              -1.96

10.02 PlyA - 204472 204467    6                               1.05
10.01 Term - 218252 217974  279  2  0   51   49   192 0.688   7.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  64240  64929  690  0  0   82   31  1336 0.821 123.50
S.002 Term + 109054 109221  168  0  0   81   51   196 0.980  13.08
S.003 Init + 109820 109879   60  1  0   46  109   129 0.980  10.54
S.004 Term + 119121 119221  101  1  2   71   45    71 0.885  -0.61

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:16317669_16536886|GENSCAN_predicted_peptide_1|77_aa
XCSVLTLQSVNVLRKYISLLDLPLSLLHTQDVLFVLNSKEVAQAKKAMSCHRSQLLWFRR
LYIIFSRYMRINSLSFL

>gi568815581f:16317669_16536886|GENSCAN_predicted_CDS_1|234_bp
nggtgctctgtgctcacgcttcagtctgtgaatgtgctgcgcaagtacatctcccttctg
gatctgcccttgtctctgcttcatacgcaggatgtcctcttcgtgctcaacagcaaagaa
gtggcacaggccaagaaagccatgtcctgccaccgcagccagctcctctggttccgccgc
ctctacattatcttctcccggtacatgagaatcaactcactgagcttcctctga

>gi568815581f:16317669_16536886|GENSCAN_predicted_peptide_2|272_aa
MRRSRSSAAAKLRGQKRSGASAAPAASAAAALAPSATRTRRSASQAGSKSQAVEKPPSEK
PRLRRSSPRAQEEGPGEPPPPELALLPPPPPPPPTPATPTSSASNLDLGEQRERWETFQK
RQKLTSEGAAKLLLDTFEYQGLVKHTGGCHCGAVRFEVWASADLHIFDCNCSICKKKQNR
HFIVPASRFKLLKGAEHITTYTFNTHKAQHTFCKRCGVQSFYTPRSNPGGFGIAPHCLDE
GTVRSMVTEEFNGSDWEKAMKEHKTIKNMSKE

>gi568815581f:16317669_16536886|GENSCAN_predicted_CDS_2|819_bp
atgcggcgatcgaggagctctgcggccgccaagctgcgcgggcagaagcggtccggggcc
tccgcggcccccgcggcctccgcggccgctgccttggcacccagcgccacccgcacacgg
cgctccgctagccaggccgggagcaagagccaggcggtggagaagccgccgtcggagaag
ccgcggctgaggcgctcgtcgccgcgggcccaggaggagggcccgggggagccgccgccg
cctgagctggcgttgctcccgccaccgccgccgccgccgccgactcccgcgaccccgacg
tcctcggcgtccaacctggacctgggcgagcagcgggagcgctgggagacgttccagaag
cggcagaagcttacctccgagggtgccgccaagctcctgctagacacctttgaataccag
ggcctggtgaagcacacaggaggctgccactgtggagcagttcgttttgaagtttgggcc
tcagcagacttgcatatatttgactgcaattgcagcatttgcaagaagaagcagaataga
cacttcattgttccagcttctcgcttcaagctcctgaagggagctgagcacataacgact
tacacgttcaatactcacaaagcccagcataccttctgtaagagatgtggcgttcagagc
ttctatactccacgatcaaaccccggaggcttcggaattgccccccactgcctggatgag
ggcactgtgcggagtatggtcactgaggaattcaatggcagcgattgggagaaggccatg
aaagagcacaagaccatcaagaacatgtctaaagagtga

>gi568815581f:16317669_16536886|GENSCAN_predicted_peptide_3|242_aa
MGSSLASLGLVVKMQIFVKTLTGKTITLEVEPSDTIENVKAKIQDKEGIPPDQQRLIFAG
KQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLEVEPSDTIENVKAKIQ
DKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITL
EVEPSDTIENVKAKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRG
GC

>gi568815581f:16317669_16536886|GENSCAN_predicted_CDS_3|729_bp
atgggctccagcctcgcctctctgggcctggtggtcaaaatgcagatcttcgtgaaaacc
cttaccggcaagaccatcacccttgaggtggagcccagtgacaccatcgaaaatgtgaag
gccaagatccaggataaggaaggcattccccccgaccagcagaggctcatctttgcaggc
aagcagctggaagatggccgtactctttctgactacaacatccagaaggagtcgaccctg
cacctggtcctgcgtctgagaggtggtatgcagatcttcgtgaagaccctgaccggcaag
accatcaccctggaagtggagcccagtgacaccatcgaaaatgtgaaggccaagatccag
gataaagaaggcatccctcccgaccagcagaggctcatctttgcaggcaagcagctggaa
gatggccgcactctttctgactacaacatccagaaggagtcgaccctgcacctggtcctg
cgtctgagaggtggtatgcagatcttcgtgaagaccctgaccggcaagaccatcactctg
gaggtggagcccagtgacaccatcgaaaatgtgaaggccaagatccaagataaagaaggc
atcccccccgaccagcagaggctcatctttgcaggcaagcagctggaagatggccgcact
ctttctgactacaacatccagaaagagtcgaccctgcacctggtcctgcgcctgaggggt
ggctgttaa

>gi568815581f:16317669_16536886|GENSCAN_predicted_peptide_4|82_aa
MQADGGTTAFPSGLEGGGRPGSALWSVYKTGEKGESKKLERNRSPAEERAAMLALHLPET
LGQFTFQSHIVTAEVEAQRDAS

>gi568815581f:16317669_16536886|GENSCAN_predicted_CDS_4|249_bp
atgcaggcggacggcgggaccacagcatttccctccggcttggagggcggaggccgaccc
gggagcgcgctctggagcgtttataagacaggagaaaagggagaaagcaaaaagttggaa
agaaacagaagtcctgcagaagagcgggctgccatgctggctttgcatcttccagagacg
cttggccagttcaccttccagagccacatcgtcacggctgaagtagaagcccagagagat
gccagttga

>gi568815581f:16317669_16536886|GENSCAN_predicted_peptide_5|747_aa
MAPATEKLRDPSSRHALASACGVSRLETLDGGQEDGSEADRGKLDFGSGLPPMESQFQGE
DRKFAPQIRVNLNYRKGTGASQPDPNRFDRDRLFNAVSRGVPEDLAGLPEYLSKTSKYLT
DSEYTEGSTGKTCLMKAVLNLKDGVNACILPLLQIDRDSGNPQPLVNAQCTDDYYRGHSA
LHIAIEKRSLQCVKLLVENGANVHARACGRFFQKGQGTCFYFGELPLSLAACTKQWDVVS
YLLENPHQPASLQATDSQGNTVLHALVMISDNSAENIALVTSMYDGLLQAGARLCPTVQL
EDIRNLQDLTPLKLAAKEGKIEIFRHILQREFSGLSHLSRKFTEWCYGPVRVSLYDLASV
DSCEENSVLEIIAFHCKSPHRHRMVVLEPLNKLLQAKWDLLIPKFFLNFLCNLIYMFIFT
AVAYHQPTLKKARALFQALLTVVSQVLCFLAIEWYLPLLVSALVLGWLNLLYYTRGFQHT
GIYSVMIQKVILRDLLRFLLIYLVFLFGFAVALVSLSQEAWRPEAPTGPNATESVQPMEG
QEDEGNGAQYRGILEASLELFKFTIGMGELAFQEQLHFRGMVLLLLLAYVLLTYILLLNM
LIALMSETVNSVATDSWSIWKLQKAISVLEMENGYWWCRKKQRAGVMLTVGTKPDGSPDE
RWCFSPANVDKNHMQPTLHNASHYLEGRTAQNGGPHLPEPGVCEGRDPVPRPTVGVCKPE
RTGLQIREESASCLAAEYWSQEPAMRF

>gi568815581f:16317669_16536886|GENSCAN_predicted_CDS_5|2244_bp
atggcccctgctactgagaagctccgggatcccagcagccgccacgccctggcctcagcc
tgcggggtaagtcggttggagacattagatggaggccaagaagatggctctgaggcggac
agaggaaagctggattttgggagcgggctgcctcccatggagtcacagttccagggcgag
gaccggaaattcgcccctcagataagagtcaacctcaactaccgaaagggaacaggtgcc
agtcagccggatccaaaccgatttgaccgagatcggctcttcaatgcggtctcccggggt
gtccccgaggatctggctggacttccagagtacctgagcaagaccagcaagtacctcacc
gactcggaatacacagagggctccacaggtaagacgtgcctgatgaaggctgtgctgaac
cttaaggacggagtcaatgcctgcattctgccactgctgcagatcgacagggactctggc
aatcctcagcccctggtaaatgcccagtgcacagatgactattaccgaggccacagcgct
ctgcacatcgccattgagaagaggagtctgcagtgtgtgaagctcctggtggagaatggg
gccaatgtgcatgcccgggcctgcggccgcttcttccagaagggccaagggacttgcttt
tatttcggtgagctacccctctctttggccgcttgcaccaagcagtgggatgtggtaagc
tacctcctggagaacccacaccagcccgccagcctgcaggccactgactcccagggcaac
acagtcctgcatgccctagtgatgatctcggacaactcagctgagaacattgcactggtg
accagcatgtatgatgggctcctccaagctggggcccgcctctgccctaccgtgcagctt
gaggacatccgcaacctgcaggatctcacgcctctgaagctggccgccaaggagggcaag
atcgagattttcaggcacatcctgcagcgggagttttcaggactgagccacctttcccga
aagttcaccgagtggtgctatgggcctgtccgggtgtcgctgtatgacctggcttctgtg
gacagctgtgaggagaactcagtgctggagatcattgcctttcattgcaagagcccgcac
cgacaccgaatggtcgttttggagcccctgaacaaactgctgcaggcgaaatgggatctg
ctcatccccaagttcttcttaaacttcctgtgtaatctgatctacatgttcatcttcacc
gctgttgcctaccatcagcctaccctgaagaaggcaagggccctgttccaggccctgctc
acagtggtgtcccaggtgctgtgtttcctggccatcgagtggtacctgcccctgcttgtg
tctgcgctggtgctgggctggctgaacctgctttactatacacgtggcttccagcacaca
ggcatctacagtgtcatgatccagaaggtcatcctgcgggacctgctgcgcttccttctg
atctacttagtcttccttttcggcttcgctgtagccctggtgagcctgagccaggaggct
tggcgccccgaagctcctacaggccccaatgccacagagtcagtgcagcccatggaggga
caggaggacgagggcaacggggcccagtacaggggtatcctggaagcctccttggagctc
ttcaaattcaccatcggcatgggcgagctggccttccaggagcagctgcacttccgcggc
atggtgctgctgctgctgctggcctacgtgctgctcacctacatcctgctgctcaacatg
ctcatcgccctcatgagcgagaccgtcaacagtgtcgccactgacagctggagcatctgg
aagctgcagaaagccatctctgtcctggagatggagaatggctattggtggtgcaggaag
aagcagcgggcaggtgtgatgctgaccgttggcactaagccagatggcagccccgatgag
cgctggtgcttcagtcctgctaatgtggacaaaaaccacatgcagcccacgctacacaat
gcctcccactacctggaaggccgcactgctcagaatggagggccacatctgccagagcct
ggagtctgcgaaggccgggacccggttccccggcccacagtgggggtgtgcaaacccgag
agaactgggttgcaaattcgtgaagaatcagcatcatgtttggcagctgagtattggagc
caggagcctgccatgaggttttga

>gi568815581f:16317669_16536886|GENSCAN_predicted_peptide_6|220_aa
MEGDALGAMEKLCRQLTYHLSPHSQWRRHRGLVKRKPQACLKAVLAGSPPDNTVDLSGIP
LTSRDLERVTSYLQRCGEQVDSVELGFTGLTDDMVLQLLPALSTLPRLTTLALNGNRLTR
AVLRDLTDILKDPSKFPNVTWIDLGNNVDIFSLPQPFLLSLRKRSPKQGHLPTILELGEG
PGSGEEVREGTVGQEDPGGGPVAPAEDHHEGKETVAAAQT

>gi568815581f:16317669_16536886|GENSCAN_predicted_CDS_6|663_bp
atggaaggcgatgcactgggcgccatggagaagctgtgccggcagctgacataccacctc
agcccccactcccagtggaggcggcaccgggggctggtgaaaaggaagccacaggcctgc
ctcaaggctgtcctggccggaagccccccagacaacacagtggacctgtcgggaatccca
ctgacctcccgagacctggagcgggtgaccagctacctacagcgctgtggggagcaggta
gacagcgtggagctgggcttcacaggcctcacggacgacatggtcctgcagctgctgcca
gcactcagcaccctgccccgcctcaccacactggcactcaatggcaaccggttgacccgg
gccgtgctgcgcgacctcactgacatccttaaggatcccagcaagttccccaatgtcacg
tggattgacctgggcaacaacgtggacatcttctccttgccccagcccttcctgctcagc
ctgcgcaagcgctccccaaagcagggccacctacccaccatcctggagctgggtgagggc
ccaggcagtggggaggaggtccgggaagggacagtaggccaggaggaccctggagggggc
cctgtggcacctgccgaagaccaccatgagggcaaggagactgtagctgcagctcagacg
tga

>gi568815581f:16317669_16536886|GENSCAN_predicted_peptide_7|196_aa
MLARTPPGLPPAENRTRQHWPRSGAVRRWGTGGGRGGEGGRGRGRGGGGRGGGGGGVAVV
GGGGRGGRGGGRGGVGGGRGGRRRKRRRKRRKKRRKEEKKEEDQEEEEEEEEEEEKEEEE
EEEEKKEEEEEEKEEEEEEKEEEEEEKKEKEEEEEEEEEQKKEEEEEEEEEEKEKEEEEE
EEEEEEAGLDTETHIP

>gi568815581f:16317669_16536886|GENSCAN_predicted_CDS_7|591_bp
atgttggccagaacccctccagggctccctcctgcagagaacaggactaggcagcactgg
cccagaagtggagcagtaagaaggtggggaacggggggagggagaggaggagaaggagga
agaggaagagggagaggaggaggaggaaggggagggggaggaggaggagtagcagtagta
ggaggaggaggaagaggagggagaggaggaggaagaggaggggtaggaggaggaagagga
ggaaggaggaggaagaggaggaggaagaggaggaagaagaggagaaaggaggagaagaag
gaagaggatcaggaagaggaggaggaggaagaggaggaggaagagaaggaagaggaggag
gaggaagaggagaagaaggaagaggaggaggaagagaaggaagaggaggaggaagagaag
gaagaggaggaggaagagaagaaggaaaaggaggaagaggaggaggaggaagaggagcag
aagaaggaagaggaggaggaagaggaagaagaggagaaggagaaagaagaggaggaggaa
gaggaagaggaggaggaagctggacttgacactgagacacacataccctaa

>gi568815581f:16317669_16536886|GENSCAN_predicted_peptide_8|141_aa
MAFDTFSIGSVVQEIENPCRRALDPSHLGWEPFSGLCWGSCGSPQNEDSLQWDLGMESTS
LDDVLYRYASFRNLVDPITHDLIISLARYIHCPKPVGWLWSEPARSRGVRWVPFWAEQLY
AQGGLADRPKEDKPVLQAPRA

>gi568815581f:16317669_16536886|GENSCAN_predicted_CDS_8|426_bp
atggcttttgacacattctcaattgggtctgtggtccaggagatcgagaacccctgcaga
agggcactggaccccagccacctgggctgggagcctttctctgggctgtgctggggcagc
tgtgggagcccccagaatgaggacagcctgcagtgggacctgggcatggagtcgacctcg
ctagacgacgttctgtatcgctacgccagcttccggaacctggtggaccccatcacacac
gacctcatcatcagcctggcacgctacatccactgtcccaagccggtagggtggctgtgg
tccgagccagccaggtcccggggtgtgcggtgggtgcctttctgggcagagcagctgtat
gcccaaggaggactggcagacaggcccaaggaggacaagcccgtcctccaggcaccaaga
gcttga

>gi568815581f:16317669_16536886|GENSCAN_predicted_peptide_9|150_aa
MGTRQTKGSLAERASPGAAPGPRRERPDFWASLLLRAGDKAGRAGAGMPPYHRRVGMVQE
LLRMVRQGRREEAGTLLQHLRQAEKEAGLRGQSTELSACVMEQQKKASVSLGYTQLSLPS
FASGKVGEWGHENWKWLVAVKKGARLDLIV

>gi568815581f:16317669_16536886|GENSCAN_predicted_CDS_9|453_bp
atgggcacccggcagaccaagggcagcctggcggagagagccagccccggcgccgcgccg
ggcccccgacgcgaacggccggacttctgggcgtcgctgctgctgcgcgccggggacaag
gcggggcgcgcgggcgcggggatgcccccctaccaccggcgagtcggcatggtccaggag
ctgctgcggatggtgcgccagggccggcgggaggaggcggggacgctgctgcagcacctg
cgccaggcagagaaggaggctggtctccggggacagtcgactgagctgtcggcctgcgtc
atggagcagcagaaaaaggcgtctgtttccctgggatacacccagctgagcctcccatcc
tttgccagcgggaaggtgggggagtggggccatgagaactggaagtggttggtggcggtc
aagaagggtgcccggctcgatttgattgtctga

>gi568815581f:16317669_16536886|GENSCAN_predicted_peptide_10|92_aa
VSERTAPRPRKRRREEPSFRLERDSGTLGGPRNAVFLGNVVHSLRCAGAVALLHTRTAED
PYLRERARTLPGYEKAWRMLRQALPVSVAVLW

>gi568815581f:16317669_16536886|GENSCAN_predicted_CDS_10|279_bp
gtttcagagcggacagctccacggcccagaaagagacgcagggaggagcccagcttccgc
ctcgaaagggactctgggacactgggcgggcccaggaacgcagtgttcttgggaaatgtg
gtccacagcctccgctgcgcaggcgccgtggccctgctccacacgcgcacggctgaagac
ccctatctccgagagcgcgcacgcacccttccggggtacgagaaggcgtggcggatgctg
aggcaggccctaccggtctcggtggcggtgttgtggtga