GENSCAN 1.0 Date run: 7-Nov-116 Time: 19:54:19 Sequence gi568815597f:8224330_8444010 : 219681 bp : 47.60% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1025 1277 253 0 1 70 84 109 0.050 5.19 1.02 Intr + 28095 28289 195 0 0 38 93 111 0.001 5.13 1.03 Intr + 67788 67880 93 0 0 56 53 83 0.120 0.68 1.04 Term + 70053 70131 79 0 1 129 47 37 0.230 0.94 1.05 PlyA + 70663 70668 6 1.05 2.03 PlyA - 72077 72072 6 1.05 2.02 Term - 84471 84457 15 0 0 153 44 4 0.646 0.74 2.01 Init - 93436 93314 123 1 0 46 101 161 0.930 11.40 2.00 Prom - 95367 95328 40 -5.06 3.00 Prom + 96461 96500 40 -7.26 3.01 Init + 100001 100397 397 1 1 84 97 599 0.964 55.27 3.02 Intr + 101489 101713 225 0 0 84 109 466 0.999 46.26 3.03 Intr + 105880 106607 728 2 2 96 92 866 0.275 78.95 3.04 Intr + 111108 111285 178 2 1 68 86 307 0.430 27.99 3.05 Intr + 113487 113663 177 1 0 118 80 234 0.910 25.49 3.06 Intr + 115164 115369 206 1 2 99 115 272 0.987 29.92 3.07 Term + 119211 119684 474 2 0 27 44 441 0.897 28.49 3.08 PlyA + 119815 119820 6 1.05 4.00 Prom + 120356 120395 40 -4.26 4.01 Init + 121339 121433 95 0 2 80 80 143 0.860 10.56 4.02 Intr + 122003 122091 89 2 2 45 75 24 0.204 -3.59 4.03 Intr + 123961 124045 85 2 1 110 107 8 0.155 3.78 4.04 Intr + 125882 125962 81 2 0 46 75 72 0.137 0.45 4.05 Intr + 126996 127334 339 0 0 56 -2 194 0.145 1.79 4.06 Intr + 128281 128410 130 1 1 34 77 155 0.836 9.70 4.07 Term + 128474 128623 150 1 0 -4 36 158 0.717 -0.69 4.08 PlyA + 128901 128906 6 -0.45 5.16 PlyA - 130099 130094 6 1.05 5.15 Term - 130791 130758 34 2 1 105 38 42 0.887 -2.04 5.14 Intr - 131270 131090 181 0 1 49 94 117 0.581 7.33 5.13 Intr - 131917 131771 147 2 0 91 103 89 0.994 10.91 5.12 Intr - 134587 133867 721 1 1 121 100 935 0.819 89.01 5.11 Intr - 135657 135435 223 2 1 61 68 423 0.999 35.73 5.10 Intr - 137161 135783 1379 1 2 97 73 555 0.818 42.59 5.09 Intr - 137547 137434 114 0 0 108 94 119 0.999 15.14 5.08 Intr - 138515 138354 162 2 0 28 77 232 0.969 16.27 5.07 Intr - 139926 139727 200 1 2 43 109 207 0.949 17.37 5.06 Intr - 140509 140417 93 2 0 50 100 158 0.839 13.14 5.05 Intr - 141645 141483 163 0 1 115 65 129 0.974 12.85 5.04 Intr - 198478 198398 81 1 0 82 79 55 0.482 3.83 5.03 Intr - 199436 199266 171 2 0 90 -5 94 0.330 0.44 5.02 Intr - 200575 200459 117 1 0 100 89 -5 0.487 1.46 5.01 Init - 202497 202417 81 0 0 74 86 44 0.575 3.77 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 29233 29185 49 1 1 86 58 39 0.890 -0.19 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:8224330_8444010|GENSCAN_predicted_peptide_1|206_aa XDQYPSWNTTSIQDTKTTVPGTSNVQCANVCLPVYGEEEIPSGYQAQREDEPTQGHFWLL RKDEEFPEIINSLIQQIFRAPVMCSFVDHRSLLQLLKSAGSGSSSQGQQVMNEHGSSPVT LHLQTQPASRILPVNHSLLTPEPVPQFPQSAGQRILKLSSVTVQGMKRFMNPLVILAQEP GPNGNGAFYEVSLFLSWKVGSDVVSK >gi568815597f:8224330_8444010|GENSCAN_predicted_CDS_1|621_bp ngggatcaatatccctcttggaacaccacctctatccaggacaccaagacgacagttcca gggaccagcaatgtccagtgtgctaatgtgtgtcttcctgtctatggagaagaggagatt ccaagtggttaccaggcccaaagagaagatgaaccaactcagggccatttttggcttctc agaaaagatgaagagttcccagaaataattaattcactcattcaacaaatattcagagca cctgtaatgtgcagctttgtggaccataggtccctgttgcaactactcaagtctgcaggc agcgggagcagcagccagggacagcaggtaatgaatgagcatggcagttctccagtaaca ttgcatctgcaaacacagccggcatcccggattttgccagtgaaccatagtttgctgact cctgaaccagtgcctcagtttccccaaagcgctggtcagcggatactgaaattatcttct gtgactgtgcaaggcatgaaacgtttcatgaacccgctggtcatccttgcacaggagcca gggccaaatggaaatggagccttttatgaagtttccctctttttaagctggaaagttggc agcgacgtcgtttcgaaatga >gi568815597f:8224330_8444010|GENSCAN_predicted_peptide_2|45_aa MCAVGQPGLLEASRLWARSCPRHWGCRKNKAKFPPLGELCNVGFG >gi568815597f:8224330_8444010|GENSCAN_predicted_CDS_2|138_bp atgtgcgcagtggggcagccgggcctgctggaggcgtcgcgactctgggcccggagctgt ccaaggcactggggatgccgcaagaacaaggcaaagttcccgcccctgggggaactttgt aatgtaggatttgggtaa >gi568815597f:8224330_8444010|GENSCAN_predicted_peptide_3|794_aa MIPAASSTPPGDALFPSVAPQDFWRSQVTGYSGSVTRHLSHRANNFKRHPKRRKCIRPSP PPPPNTPCPLELVDFGDLHPQRSFRELLFNGCILFGIEFSYAMETAYVTPVLLQMGLPDQ LYSLVWFISPILGALLGLSLLLNGRDIGIALADVTGNHKWGLLLTVCGVVLMDFSADSAD NPSHAYMMDVCSPADQDRGLNIHALLAGLGGGFGYVVGGIHWDKTGFGRALGGQLRVIYL FTAVTLSVTTVLTLVSIPERPLRPPSEKRAAMKSPSLPLPPSPPVLPEEGPGDSLPSHTA TNFSSPISPPSPLTPKYGSFISRDSSLTGISEFASSFGTANIDSVLIDCFTGGHDSYLAI PGSVPRPPISVSFPRAPDGFYRQDRGLLEGREGALTSGCDGDILRVGSLDTSKPRSSGIL KRPQTLAIPDAAGGGGPETSRRRNVTFSQQVANILLNGVKYESELTGSSERAEQPLSVGR LCSTICNMPKALRTLCVNHFLGELPAKPPRWLSFEGMLLFYTDFMGEVVFQGDPKAPHTS EAYQKYNSGVTMGCWGMCIYAFSAAFYSAILEKLEEFLSVRTLYFIAYLAFGLGTGLATL SRNLYVVLSLCITYGILFSTLCTLPYSLLCDYYQSKKSWILFIVVISPNTNIVKLLAELP VHVGAEKCEAAVCGPLGPPGLAGHTELGHPVLAAACLADTFLPLGQFAGSSADGTRRGMG VDISLLSCQYFLAQILVSLVLGPLTSAVGSANGVMYFSSLVSFLGCLYSSLFVIYEIPPS DAADEEHRPLLLNV >gi568815597f:8224330_8444010|GENSCAN_predicted_CDS_3|2385_bp atgatccccgcagccagcagcaccccgccgggagatgccctcttccccagcgtggcccca caggacttctggaggtcccaggtcacgggctactcggggtccgtgacacgacacctcagt caccgggccaacaacttcaaacgacaccccaagaggaggaagtgcattcgtccctcccca cccccgccccccaacaccccgtgcccgcttgagctggtggacttcggggacctgcacccc cagaggtccttccgggagctgcttttcaacggctgcattctctttggcatcgagttcagc tacgccatggagacggcgtacgtgaccccggtgctcctgcagatgggcctgcccgaccag ctctacagcctggtgtggttcatcagccccatcctcggggcactgctgggcctctcgctc ttgctgaatggccgggacattggcatcgccctggctgacgtgaccgggaaccacaagtgg ggcctgctgctgaccgtgtgcggtgtggtgctgatggactttagcgccgactcggcggac aaccccagccacgcctacatgatggacgtgtgcagccccgcagaccaggaccgaggcctg aacatccacgccctcctggcaggtctcggaggaggctttggatacgtggtcggcggaatc cactgggataaaacgggcttcgggagggccctggggggacagctccgagtcatttacctc ttcactgcggtcaccctgagcgtcaccaccgtcctgaccctggtcagcatccctgagagg ccgctgcggccgccgagtgagaagcgggcagccatgaagagccccagcctcccgctgccc ccgtccccacccgtcctgccagaggaaggccctggcgacagcctcccgtcgcacacggcc accaacttctccagccccatctcgccgcccagccccctcacgcccaagtacggcagcttc atcagcagggacagctccctgacgggcatcagcgagttcgcctcatcctttggcacggcc aacatagacagcgtcctcattgactgcttcacgggcggccacgacagctacctggccatc cctggcagcgtccccaggccgcccatcagcgtcagcttcccccgggcccccgacggcttc taccgccaggaccgtggacttctggagggcagagagggtgccctgacctccggctgtgac ggggacattctgagggtgggctccttggacacctctaagccgaggtcatcagggattctg aagagacctcagaccttggccatcccggacgcagccggaggagggggtcccgaaaccagc aggagaaggaatgtgaccttcagtcagcaggtggccaatatcctgctcaacggcgtgaag tatgagagcgagctgacgggctccagcgagcgcgcggagcagcctctgtccgtggggcgc ctctgctccaccatctgcaacatgcccaaggcgctacgcaccctctgcgtcaaccacttc ctgggtgagctcccggccaagcctccccggtggctctcattcgaggggatgttgctcttc tacacagacttcatgggcgaggtggtgtttcagggggaccccaaggccccgcacacatca gaggcgtatcagaagtacaacagcggcgtgaccatgggctgctggggcatgtgtatctac gccttcagtgctgccttctactcagctatcctggagaagctggaggagttcctcagcgtc cgcaccctctacttcatcgcctatctcgccttcggcctggggaccgggcttgccaccctc tccaggaacctctacgtggtcctgtcgctctgcataacctacgggattttattttccacc ctgtgcaccttgccttactcgctgctctgcgattactatcagagtaagaagtcatggatt cttttcattgtcgtcatctctccgaacacaaatattgttaaactcttggctgaactccct gtgcatgttggcgctgaaaaatgtgaggccgctgtgtgtgggccgctcgggcctcctggg ctcgcaggacacaccgagctcggtcaccccgtgctggccgcggcgtgtctcgctgacacg tttcttcctctgggtcagtttgcagggtccagtgcggacggcacccggcggggcatgggc gtggacatctctctgctgagctgccagtacttcctggctcagattctggtctccctggtc ctggggcccctgacctcggccgtgggcagtgccaacggggtgatgtacttctccagcctc gtgtccttcctgggctgcctgtactcctccctgtttgtcatttatgaaattcctcccagc gacgctgcagacgaggagcaccggcccctcctgctgaacgtctga >gi568815597f:8224330_8444010|GENSCAN_predicted_peptide_4|322_aa MCLLRAGVAGPGSARAVPTASQCPFVDLAKLSWGAVNTGLSPSPLLVFHDTYPGSPDPKE SGCWANPGPFPGEPGQGKVAFPGEPGTGTRPYQSNRMAKCAAASLGWLIWQILLRRRPVP VKPCSFLRGTVPETHQDSAWIPASPSHPSAILSPKTWTESHLGLLWPACGPIPVPWSATL LSRTPAVRLKGRHRVRHSGLCMAKLSQVLSEAVPGACCSPSLYHVADPACGVKLNLEDTE QDTDTHRGAKPNPEQREENPKETEWLGFKDTMKMEESQRFALGYGEKSKANKNAFGRLSS TDLRGLDSSLEKWMPKEEIKKV >gi568815597f:8224330_8444010|GENSCAN_predicted_CDS_4|969_bp atgtgcctgctgcgtgcaggcgtcgctgggcctgggtctgcccgggctgtgcccactgcc tcccagtgcccctttgtggatctggcaaagctctcctggggggctgtaaacacgggcctg agcccatcccctcttctagtcttccatgacacctaccctggctccccagaccctaaggaa tcaggctgttgggctaaccctggaccattccctggggaaccgggacagggtaaggtggca ttccctggagaaccgggaaccgggacccgcccgtatcagtccaaccggatggccaaatgt gcggctgcatcccttggctggctcatctggcagatcctccttaggaggagacctgtacct gttaaaccctgcagcttcctaaggggcacagtcccggagacccaccaagacagcgcctgg attccagcctctcccagccaccccagtgccatcctctcacccaagacctggactgagagc cacctggggctcctgtggccagcctgcgggcccatccctgtgccctggtcagccacgctg ctgagcaggacccctgctgttcgcctcaagggccgtcatcgtgtgcgccactcgggtctt tgcatggcgaagctttctcaggtcctgagcgaggcagtgcctggtgcctgctgctcccca agtctttaccacgtggcggaccctgcctgtggagtgaagctcaacctcgaggacaccgaa caagatacggacacacacagaggagccaaaccaaaccccgaacaaagggaggaaaatccg aaggaaaccgagtggttgggcttcaaagacaccatgaagatggaagaatctcagcgcttc gccttgggatatggggagaaaagcaaagctaacaagaatgccttcggacggctttcaagc acagacctcagaggactcgactcttctttggaaaaatggatgccaaaggaggagataaag aaggtttaa >gi568815597f:8224330_8444010|GENSCAN_predicted_peptide_5|1288_aa MPYPDSGVETEVPQCGPVGITGHLAGKFPECSLCLFLADFCWKPCWSPPGRLTLFLVTSS KTSASRRRPQPRAPRAPRRAAPPRSHAEPPAAASALRFVEPEPSDRRRPLAAPDPGRLPV AAGKRFVKGLRQYGKNFFRIRKELLPNKETGELITFYYYWKKTPEAASSRAHRRHRRQAV FRRIKTRTASTPVNTPSRPPSSEFLDLSSASEDDFDSEDSEQELKGYACRHCFTTTSKDW HHGGRENILLCTDCRIHFKKYGELPPIEKPVDPPPFMFKPVKEEDDGLSGKHSMRTRRSR GSMSTLRSGRKKQPASPDGRTSPINEDIRSSGRNSPSAASTSSNDSKAETVKKSAKKVKE EASSPLKSNKRQREKVASDTEEADRTSSKKTKTQEISRPNSPSEGEGESSDSRSVNDEGS SDPKDIDQDNRSTSPSIPSPQDNESDSDSSAQQQMLQAQPPALQAPTGVTPAPSSAPPGT PQLPTPGPTPSATAVPPQGSPTASQAPNQPQAPTAPVPHTHIQQAPALHPQRPPSPHPPP HPSPHPPLQPLTGSAGQPSAPSHAQPPLHGQGPPGPHSLQAGPLLQHPGPPQPFGLPPQA SQGQAPLGTSPAAAYPHTSLQLPASQSALQSQQPPREQPLPPAPLAMPHIKPPPTTPIPQ LPAPQAHKHPPHLSGPSPFSMNANLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQPLP SSPAQPPGLTQSQNLPPPPASHPPTGLHQVAPQPPFAQHPFVPGGPPPITPPTCPSTSTP PAGPGTSAQPPCSGAAASGGSIAGGSSCPLPTVQIKEEALDDAEEPESPPPPPRSPSPEP TVVDTPSHASQSARFYKHLDRGYNSCARTDLYFMPLAGSKLAKKREEAIEKAKREAEQKA REEREREKEKEKEREREREREREAERAAKASSSAHEGRLSDPQLSGPGHMRPSFEPPPTT IAAVPPYIGPDTPALRTLSEYARPHVMSPTNRNHPFYMPLNPTDPLLAYHMPGLYNVDPT IRERELREREIREREIRERELRERMKPGFEVKPPELDPLHPAANPMEHFARHSALTIPPT AGPHPFASFHPGLNPLERERLALAGPQLRPEMSYPDRLAAERIHAERMASLTSDPLARLQ MFNVTPHHHQHSHIHSHLHLHQQDPLHQGSAGPVHPLVDPLTAGPHLARFPYPPGTLPNP LLGQPPHEHEMLRHPVFGTPYPRDLPGAIPPPMSAAHQLQAMHAQSAELQRLAMEQQWLH GHPHMHGGHLPSQEDYYSRLKKEGDKQL >gi568815597f:8224330_8444010|GENSCAN_predicted_CDS_5|3867_bp atgccttacccagattctggagttgagacagaggtcccccagtgtgggccagttggcatt actggacatctggctggaaagttcccagaatgttccctgtgccttttcctggctgacttc tgctggaagccctgctggagcccacctggcaggctcacactcttccttgtcacttcctct aagacatcagcaagcagacggcggccccagccccgcgccccccgggccccgcgccgcgcg gccccgccgcggtcacacgccgagccacccgcggccgcctccgccctgcgctttgtggag ccggagcccagtgacaggcgacggcctcttgccgccccggaccccgggcgcttacctgtg gctgctgggaaacgcttcgttaagggactcaggcagtacgggaagaacttcttcagaatt agaaaggagctgcttcccaataaggaaacaggggagctgatcaccttctattactattgg aagaagacccccgaagcagccagctcccgagcccatcgtaggcaccgcaggcaggccgtg ttcaggaggattaagactcgcaccgcgtccacacccgtcaacacaccctccagacccccg tccagtgaattcttggacctaagttcagccagtgaagatgacttcgacagtgaggacagt gagcaggagctgaaggggtacgcctgccgccactgcttcaccaccacctccaaagattgg caccacggaggccgggagaacatcctgctttgcaccgactgtcgcatccacttcaagaaa tacggtgagctcccgcccattgagaagcccgtggacccgccaccgtttatgttcaaaccc gtcaaggaagaggatgatgggctcagtgggaagcatagcatgaggacacggcggagtcgg ggctcgatgtcgacactacgcagtggtcggaagaagcagccagccagccctgatggtcgc acctcacccatcaatgaagacatccgctccagcggccggaactcccccagcgctgccagt acctccagcaatgacagtaaagcagagacagtgaagaagtcggccaagaaggtgaaggag gaagcctcttcccctcttaagagtaacaaacgccagcgggagaaggtggcctctgatacg gaggaggctgacaggaccagctccaagaagacaaaaacgcaggagatcagcaggcccaac tcgccatctgaaggtgagggagagagttcagacagtcgcagcgtcaacgatgagggtagc agtgaccccaaagacatcgaccaggacaatcgcagcacgtccccgagcatccccagcccc caggacaatgagagtgactcggactcgtcagcccagcagcagatgctgcaggcccagccc ccagccttgcaggctcccactggggtcaccccagctccctcctcagctcctccagggacc cctcagctgcccacgccagggcccacgccctctgccactgcagttcccccacagggctcc cccacggcctcccaggcccctaaccagccacaggctcccacagcgcctgttccccacacc cacatccaacaggcaccggccttgcacccccagcggccgccctcaccgcatcccccgccg catccctcgccacatcccccgctgcagcctctgactgggtcggcgggccagccttctgca ccctctcatgcccagcccccactgcacggtcagggcccacccggccctcacagcctgcag gctgggcccctgctgcagcacccaggccccccacagccctttggcctccctccccaggcc tcccaaggccaggcccctctggggacctccccagcagcagcgtaccctcacacctccctg cagctgccagcctctcagtcagcgctgcagtcccaacagcctccacgggagcagcccctg ccaccagcgcccttggccatgccccacatcaagcccccgcctaccactcccatcccccag ctgccggcgccacaggcccacaagcaccctccccacctctcggggccctcacccttctcc atgaatgccaacctgcctccccctccagccctgaagcccctgagctccctgtccacacat caccccccgtcggctcaccccccacccctgcaactcatgcctcagagccagccattgccc tcctcgcccgcccagccccccgggctgacccagagccagaacctgcccccgccccctgcc tcccacccccctacaggcctccaccaggtggccccccaacccccgtttgctcagcacccc tttgtccctggaggccctcctcccatcacccctccgacctgcccctccacctctacccca ccggcgggacctggcacctcggcccagccaccctgctctggtgcggcggcttcaggaggc agcatagcgggggggtcgtcctgcccactccccaccgtccagatcaaggaggaggctctg gacgacgctgaggagcctgagagcccccctcccccaccaaggagcccgtccccggagccc actgtggtggacacccccagtcacgccagccagtcagctaggttctacaaacacctggac cggggctacaactcgtgtgcccggacagacctgtacttcatgcctctggccgggtccaag ctggccaagaagagggaggaggccattgagaaggccaagcgcgaggctgagcagaaagcc cgagaggagcgagagcgggagaaggagaaggagaaggagcgggagcgggagcgagagcgg gagcgcgaggcagagcgggcggctaaggcgtccagctcagcgcatgaaggtcgcctcagt gacccacagctcagtggtcctggccacatgcggccatccttcgagccaccaccaaccacc attgctgctgtgcccccctacatcgggcccgacacacctgcccttcggactctgagcgag tacgcccggccccacgtcatgtcgcccaccaaccgcaaccaccccttctacatgcccctt aaccccacggaccccctgctggcctaccacatgcctggcctctacaacgtcgaccccacc atccgcgagcgggagctccgggagcgggagatccgagagcgggagatccgagagcgggag ctgcgggagaggatgaagccgggcttcgaggtgaagcccccagagctggaccccctgcac ccagccgccaaccccatggagcactttgcccggcacagcgccctcaccatccccccgacc gccgggccccacccttttgcttctttccacccgggcctgaaccccttggagagggagaga ctggccctggcgggcccccagctgcggcccgagatgagctaccctgacagactggcagcc gagcgtatccacgcagagcgcatggcatcgctgaccagcgatcccctggcccgactgcag atgttcaacgtgactccgcaccatcaccagcactctcacattcactcccacctccacctc caccagcaggaccccctccaccaaggttcagcaggccccgttcacccgctggtcgacccc ctgactgccggtccccacctggctcgcttcccctacccgcctggcactctccccaaccct ctgcttggacagcccccacacgagcacgagatgcttcgccacccagttttcggcaccccc tacccccgtgacctgcctggggccatcccaccccccatgtcagcagcccaccagctgcag gccatgcatgcccagtcggccgagctgcagagactggccatggagcagcagtggctgcat ggacacccccacatgcatggtggccacctaccaagtcaggaagattattacagtcgactg aagaaagaaggtgacaagcagttataa