GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:37:47 Sequence gi568815596r:119340120_119624226 : 284107 bp : 44.41% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 6688 6727 40 -2.96 1.01 Init + 6914 6956 43 1 1 35 110 29 0.278 0.38 1.02 Intr + 18723 18914 192 1 0 -26 82 134 0.031 0.86 1.03 Intr + 26819 26941 123 0 0 36 92 105 0.150 6.26 1.04 Intr + 27688 27812 125 2 2 52 22 31 0.080 -6.80 1.05 Intr + 28069 28186 118 0 1 98 85 155 0.950 16.24 1.06 Intr + 30621 30683 63 1 0 71 86 93 0.678 6.09 1.07 Term + 32126 32199 74 0 2 96 54 133 0.888 8.77 1.08 PlyA + 33708 33713 6 1.05 2.02 PlyA - 33821 33816 6 1.05 2.01 Sngl - 74723 74286 438 2 0 29 35 303 0.996 15.36 2.00 Prom - 78430 78391 40 -2.16 3.00 Prom + 82097 82136 40 -7.46 3.01 Init + 86945 87053 109 1 1 96 55 55 0.721 3.46 3.02 Intr + 91366 91590 225 2 0 82 3 169 0.596 5.76 3.03 Intr + 91686 91805 120 1 0 76 68 85 0.767 5.77 3.04 Intr + 96405 96535 131 1 2 43 77 32 0.285 -1.99 3.05 Term + 96770 97321 552 1 0 91 47 655 0.977 56.01 3.06 PlyA + 97585 97590 6 -0.45 4.21 PlyA - 98598 98593 6 1.05 4.20 Term - 100138 99998 141 1 0 101 54 132 0.999 9.03 4.19 Intr - 101480 101439 42 2 0 118 105 96 0.974 12.74 4.18 Intr - 106766 106640 127 1 1 55 111 162 0.671 15.88 4.17 Intr - 108661 108570 92 1 2 114 97 -11 0.993 1.19 4.16 Intr - 111960 111891 70 2 1 120 105 89 0.990 12.98 4.15 Intr - 113228 113168 61 0 1 126 94 8 0.842 2.99 4.14 Intr - 121881 121728 154 0 1 102 63 192 0.888 17.75 4.13 Intr - 124136 124004 133 1 1 124 52 279 0.994 28.55 4.12 Intr - 125767 125670 98 1 2 82 97 197 0.995 18.81 4.11 Intr - 133437 133334 104 1 2 92 102 127 0.836 14.39 4.10 Intr - 138799 138692 108 2 0 110 46 32 0.579 1.46 4.09 Intr - 142062 141993 70 0 1 103 94 34 0.943 4.25 4.08 Intr - 143077 142898 180 1 0 38 61 102 0.035 2.56 4.07 Intr - 148229 148150 80 0 2 58 94 66 0.042 3.47 4.06 Intr - 154429 154309 121 1 1 56 80 86 0.120 4.67 4.05 Intr - 155464 155355 110 2 2 56 94 43 0.050 1.70 4.04 Intr - 156052 155969 84 2 0 63 94 72 0.013 5.09 4.03 Intr - 175664 175454 211 2 1 72 89 29 0.421 -0.11 4.02 Intr - 180547 180419 129 1 0 85 53 80 0.779 5.09 4.01 Init - 184107 184036 72 0 0 97 109 156 0.645 17.70 4.00 Prom - 194006 193967 40 -1.86 5.00 Prom + 194906 194945 40 -3.16 5.01 Init + 200568 200612 45 2 0 73 29 64 0.185 -0.32 5.02 Intr + 204193 204391 199 0 1 67 84 190 0.371 15.42 5.03 Intr + 208966 209066 101 2 2 78 82 29 0.356 1.13 5.04 Intr + 219570 219656 87 2 0 98 121 40 0.979 8.47 5.05 Intr + 219770 219907 138 1 0 73 100 172 0.797 17.56 5.06 Term + 220492 220515 24 0 0 104 54 4 0.602 -3.18 5.07 PlyA + 221178 221183 6 1.05 6.00 Prom + 225117 225156 40 -2.86 6.01 Init + 232653 232822 170 2 2 79 77 105 0.194 7.51 6.02 Intr + 247000 247103 104 1 2 104 110 17 0.223 5.32 6.03 Intr + 261099 261258 160 1 1 96 22 80 0.180 1.25 6.04 Intr + 264553 264673 121 1 1 77 102 41 0.657 4.90 6.05 Intr + 264757 264868 112 0 1 87 86 114 0.994 11.05 6.06 Intr + 265062 265170 109 1 1 79 44 78 0.868 1.84 6.07 Intr + 268383 268470 88 0 1 106 77 17 0.833 2.37 6.08 Intr + 271534 271623 90 0 0 22 90 89 0.710 2.69 6.09 Term + 274899 274910 12 2 0 145 46 1 0.410 -0.20 6.10 PlyA + 275353 275358 6 1.05 7.00 Prom + 275692 275731 40 -4.86 7.01 Init + 279506 279651 146 1 2 78 98 92 0.743 8.79 7.02 Term + 279751 280501 751 1 1 -51 42 378 0.957 12.22 7.03 PlyA + 280760 280765 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:119340120_119624226|GENSCAN_predicted_peptide_1|245_aa MSVGDDHKAVLKPSGADVEAAPSYPEDLAKIDEAGYTKQQIFNTDKTALYWKMKPSRICI AREKSTPGFKASKDRLTVGLGRVDRASKGACQCNLGDRFLVLASSAVSLEFLQVGQDVSV PGPVVARQLAALLLPQRGPPLGAKAWPALFTAVFPDLMPAFAEFEKAAEEVRHLKTKPSD EEMLFIYGHYKQATVGDINTERPGMLDFTGKAKWDAWNELKGTSKEDAMKAYINKVEELK KKYGI >gi568815596r:119340120_119624226|GENSCAN_predicted_CDS_1|738_bp atgagtgtaggtgatgatcataaagcggtgttaaaaccatcaggtgctgatgtagaagct gcaccaagttatcctgaagatctagctaagattgatgaagctggctacactaaacaacag attttcaacacagacaaaacagccttatattggaagatgaagccatctaggatctgcata gctagggagaagtcaacgcctggttttaaagcttcaaaggacaggctgactgtggggttg gggcgagtggaccgcgcctctaaaggcgcttgccagtgcaatctgggcgatcgcttcctg gtcctcgcctcctccgctgtctccctggagttcttgcaagtcggccaggatgtctcagtg cctggcccggtggtggccaggcagttggccgcgctgcttctcccgcagaggggaccccca ctgggggcgaaggcttggcctgccctcttcactgctgtatttccagacctgatgcctgcg tttgctgagtttgagaaagctgcagaggaggttaggcaccttaagaccaagccatcggat gaggagatgctgttcatctatggccactacaaacaagcaactgtgggcgacataaataca gaacggcccgggatgttggacttcacgggcaaggccaagtgggatgcctggaatgagctg aaagggacttccaaggaagatgccatgaaagcttacatcaacaaagtagaagagctaaag aaaaaatacgggatatga >gi568815596r:119340120_119624226|GENSCAN_predicted_peptide_2|145_aa MAIIYDLKKQKDKLLRLYTESDEQKKLMKHRKTLHKAKNEDPNCVLKEWIYQHCHEHTPL NGMLIMIQAKMCHNELKIKGNCKYSTDSLQKCKKRHNITFLKISGDETSADHKAVEEFTD EFAKVIADENLMPGRVYNADEASLF >gi568815596r:119340120_119624226|GENSCAN_predicted_CDS_2|438_bp atggccattatatatgacctgaagaaacagaaggataaactgttgaggctctacactgag agtgatgaacagaagaagttaatgaaacatagaaaaacactgcataaagctaaaaatgaa gatcccaattgtgtattgaaagagtggatctatcagcattgccatgaacacacgcccctt aatggtatgctgatcatgatacaagcaaagatgtgtcacaatgaactaaaaattaaaggg aactgtaaatattcaacagattctttgcagaaatgtaagaaaagacataacattacattt ttaaagatttctggtgatgaaacatctgctgatcacaaagcagtggaggaattcactgat gagtttgccaaggtcattgctgatgaaaatctgatgccaggacgagtctataatgctgat gaagcatcattgttttag >gi568815596r:119340120_119624226|GENSCAN_predicted_peptide_3|378_aa MAWGLGEEEDLVGKGGAAGGEEREEAMGHRQGKTWREAGLSKLEQESNPQALVAVPALEE SAPQAKLSSEVRERRSDFPILSPGTRGIPPPKCRVLPAAPQGLLRPPSLTQPAAPPLRRS PGLAPAATAEQLERSRLQRGRRAQHDCRRRAETEAGCMLKVMGCRHDRFSELRTDSDLPV GLDFSPTLKITKERKAQRPLGQRQPRRSFFESFIRTLIITCVALAVVLSSVSICDGHWLL AEDRLFGLWHFCTTTNQTICFRDLGQAHVPGLAVGMGLVRSVGALAVVAAIFGLEFLMVS QLCEDKHSQCKWVMGSILLLVSFVLSSGGLLGFVILLRNQVTLIGFTLMFWCEFTASFLL FLNAISGLHINSITHPWE >gi568815596r:119340120_119624226|GENSCAN_predicted_CDS_3|1137_bp atggcgtgggggctgggagaagaggaagacctggttggtaaaggcggggcagcaggagga gaggagagggaagaggccatgggccacagacaaggcaagacctggagagaggctggactc agcaagctggaacaggaatcgaaccctcaggccctcgtcgccgtcccagccctcgaggaa tctgcgccccaggcgaagctgtcctcggaggttcgggagcgtcggagtgacttcccgatc ctttcccctgggacccgagggatccctccccccaagtgccgggtcctccccgcggctccc caggggctcctccggccgccctcgctgactcagccagccgccccgcccctgcggagaagt cccgggctggcgccggcggccacagcggagcagctggagcgatcgaggctgcagcgcggc cgccgggcgcagcatgactgccgtcggcgtgcagaaaccgaggcaggctgcatgctcaag gtcatgggatgcaggcacgacaggttttctgaactcagaactgactcagatttgccagtc ggtttggacttctcacccactctgaagatcacaaaagaaagaaaggcccagaggcctttg ggccaaaggcagccccgccggtccttctttgaatccttcatccggaccctcatcatcacg tgtgtggccctggctgtggtcctgtcctcggtctccatttgtgatgggcactggctcctg gctgaggaccgcctcttcgggctctggcacttctgcaccaccaccaaccagacgatctgc ttcagagacctgggccaggcccatgtgcccgggctggccgtgggcatgggcctggtacgc agcgtgggcgccttggccgtggtggccgccatttttggcctggagttcctcatggtgtcc cagttgtgcgaggacaaacactcacagtgcaagtgggtcatgggttccatcctcctcctg gtgtctttcgtcctctcctccggcgggctcctgggttttgtgatcctcctcaggaaccaa gtcacactcatcggcttcaccctaatgttttggtgcgaattcactgcctccttcctcctc ttcctgaacgccatcagcggccttcacatcaacagcatcacccatccctgggaatga >gi568815596r:119340120_119624226|GENSCAN_predicted_peptide_4|728_aa MRPHLSPPLQQLLLPVLLACAAHSLMAKKYKNSKYRLWHMEAFSTWWSLPCPVMHLDPEA DKPCGCTQHGSGQRGSESDRLRIREDDGGTQSEARVLGTMGGPLVQVLKSKAVEPGILMF KDRRRVSQFQERERKRESGFFHSSSSTSSTEKPEGARDYGLRQEPELLITWANLECGFVS RVFSPCSIMLEFPETMGFSKNQTGALPRLCDVLQVLWEEQDQCLQELSREQTGDLGTEQP VPEKGSPGFARAVVSGQALAAEPRRASVQASLGALPDVTQVRKPLRRAGTVHICLANQPG IVQPHGLEPAAGTLADLASPASTESVFPQFPEPGLNKEGTWQLGPVSPGDWSGCEGMWDN ISCWPSSVPGRMVEVECPRFLRMLTSRNGSLFRNCTQDGWSETFPRPNLACGVNVNDSSN EKRHSYLLKLKVMYTVGYSSSLVMLLVALGILCAFRRLHCTRNYIHMHLFVSFILRALSN FIKDAVLFSSDDVTYCDAHRAGCKLVMVLFQYCIMANYSWLLVEGLYLHTLLAISFFSER KYLQGFVAFGWGSPAIFVALWAIARHFLEDVGCWDINANASIWWIIRGPVILSILINFIL FINILRILMRKLRTQETRGNEVSHYKRLARSTLLLIPLFGIHYIVFAFSPEDAMEIQLFF ELALGSFQGLVVAVLYCFLNGEVQLEVQKKWQQWHLREFPLHPVASFSNSTKASHLEQSQ GTCRTSII >gi568815596r:119340120_119624226|GENSCAN_predicted_CDS_4|2187_bp atgcgtccccacctgtcgccgccgctgcagcagctactactgccggtgctgctcgcctgc gccgcgcactcgttgatggctaagaaatataaaaattccaagtacaggctctggcacatg gaagccttcagtacatggtggtcgctgccctgtccggtcatgcatctggatccagaagct gacaagccctgtggctgcacacaacatggctcaggccaacgtggttctgagtctgaccgc ctcagaatcagggaagatgatggtggaactcagtcagaagctagagtcctgggaaccatg gggggaccacttgtgcaagttctaaaatccaaggctgtagaacctgggattctgatgttc aaggacaggagaagggtgtcccagttccaggaaagagagagaaagagagagagtggattt ttccactcaagctcttccacttcctccactgagaagccagagggtgccagagactatggg cttcggcaagaaccagaacttttgattacttgggccaatttggaatgtggctttgtctcc agggtcttctctccctgctccattatgttggaatttccagagactatgggcttcagcaag aaccagactggagcccttccccgactatgtgacgtgctacaagtgctgtgggaagagcaa gaccagtgcctgcaggaactctccagagagcagacaggagacctgggcacggagcagcca gtgccagaaaagggctcccctggctttgcaagagctgtggtgtctggccaggccctggcg gctgaaccaaggagggccagcgtgcaggcatccctgggggccctgcctgacgtgacccaa gttaggaagcctctcagacgtgctggcaccgtccacatctgtctggccaaccagccaggc attgtccagcctcatggcctggagcctgcggctggtaccctggctgacctggccagtcca gccagcactgaatctgtttttccacaattcccagagcctggcttgaacaaggagggcact tggcagctggggccggtgtcacctggagactggtcaggttgtgaggggatgtgggacaac ataagctgctggccctcttctgtgccgggccggatggtggaggtggaatgcccgagattc ctccggatgctcaccagcagaaatggttccttgttccgaaactgcacacaggatggctgg tcagaaaccttccccaggcctaatctggcctgtggcgttaatgtgaacgactcttccaac gagaagcggcactcctacctgctgaagctgaaagtcatgtacaccgtgggctacagctcc tccctggtcatgctcctggtcgcccttggcatcctctgtgctttccggaggctccactgc actcgcaactacatccacatgcacctgttcgtgtccttcatccttcgtgccctgtccaac ttcatcaaggacgccgtgctcttctcctcagatgatgtcacctactgcgatgcccacagg gcgggctgcaagctggtcatggtgctgttccagtactgcatcatggccaactactcctgg ctgctggtggaaggcctctaccttcacacactcctcgccatctccttcttctctgaaaga aagtacctccagggatttgtggcattcggatggggttctccagccatttttgttgctttg tgggctattgccagacactttctggaagatgttgggtgctgggacatcaatgccaacgca tccatctggtggatcattcgtggtcctgtgatcctctccatcctgattaatttcatcctt ttcataaacattctaagaatcctgatgagaaaacttagaacccaagaaacaagaggaaat gaagtcagccattataagcgcctggccaggtccactctcctgctgatccccctctttggc atccactacatcgtcttcgccttctccccagaggacgctatggagatccagctgtttttt gaactagcccttggctcattccagggactggtggtggccgtcctctactgcttcctcaat ggggaggtgcagctggaggttcagaagaagtggcagcaatggcacctccgtgagttccca ctgcaccccgtggcctccttcagcaacagcaccaaggccagccacttggagcagagccag ggcacctgcaggaccagcatcatctga >gi568815596r:119340120_119624226|GENSCAN_predicted_peptide_5|197_aa MKQGEEEANQGASSQARQTAPSSQESPAASGRGPQQPGSPALGRLVAPGRRPWERVRVVM ATLRAGAGAGAGRIRPGNHLQEVYAKLVNNKVIQARPGIIHFGGYQVEKQHQQILHLVNV SNEDTRVHILPPQTKYFEINYVRKSGACPNKMPPVFQEHHLVPGLSLTVTVTFSPDEWRY YYDCIRVHCKSRPQSSA >gi568815596r:119340120_119624226|GENSCAN_predicted_CDS_5|594_bp atgaagcagggcgaggaggaggcaaatcaaggggccagcagccaggcccggcaaactgct ccgagctcccaggaatcacccgccgcctccggtcgtggcccgcagcagcccggctccccc gccctggggcgtttggtcgcgcccggccgccggccctgggagcgcgtccgcgtcgtcatg gcgacgctccgagcgggcgccggcgctggcgccggccgaatccggcccgggaaccacctc caggaggtttatgcaaaacttgtgaataataaggtcatacaggcaagacctggcataata cattttggaggctatcaagtagaaaaacaacaccaacagattctgcatctggtcaatgtt tccaatgaagacacacgtgttcatattttacccccgcaaaccaaatactttgagatcaat tatgtaagaaagtccggggcttgtcccaacaagatgccacctgtcttccaggaacaccac ctggtccctggcttgtccctcacggtcaccgttacattttctccagatgagtggcgatac tattatgactgcatccgtgttcactgtaagtccagaccccagtcttcagcttga >gi568815596r:119340120_119624226|GENSCAN_predicted_peptide_6|321_aa MGKAAMGSCSKWTMYSSQGYRGSVACRATLTTFTEEAPGDSIPKPGYFSFLNFYVISKTY VIPLQCSCPVDFEFYITLIQSHQAFAIEPTSGIIPANGKMTVTIKFTPFQYGTAQIKMQL WISQFNSQPYECVFTGTCYPNMALPLEEFERLNTLSKKVNVPPEKAMMHINFHRPPAKPK PQKVKEIEYQNLRFPVDLSNPFAVATVLNQEPGKLKIKELREVLDQGTEISKTRQMKEAL FEQKVRQDIHEEMENHLKWQVHLGKDPMSFKLKKELTEEWQKACAKYKLDRGDPILDEEF QRLKTEVSHKRVVRNQEELMA >gi568815596r:119340120_119624226|GENSCAN_predicted_CDS_6|966_bp atgggcaaggccgccatgggctcctgcagcaagtggaccatgtactcatctcaaggctat agaggctctgtagcctgcagggccacattaacaacattcacagaagaggctcctggtgac tccattccaaagccaggatactttagcttcttgaacttttatgtcataagcaaaacttat gttattcctttgcagtgcagctgccctgtagattttgagttttatatcaccttgattcag tctcatcaagcctttgctattgagccaacatcaggaataattccggctaatgggaagatg actgtgactattaagtttacaccctttcagtatgggactgcacaaataaaaatgcagtta tggatttcgcagttcaactctcaaccatacgaatgtgtcttcaccggaacatgctatccc aacatggccttaccattagaagagtttgaaaggttgaataccctttctaagaaagtaaac gttcctccagaaaaagcaatgatgcatataaattttcaccgaccgccagcgaagccgaag cctcagaaggtgaaggaaattgagtaccagaacctcagatttccagtagatttatcgaat ccatttgctgtggcaactgttttaaaccaagaaccaggaaaattgaagattaaagaatta agagaagttttggaccagggcactgaaatttcaaaaacgagacagatgaaggaggcactc tttgaacagaaagtcagacaggacattcacgaagagatggaaaatcatcttaagtggcag gtgcaccttggtaaagatcctatgtcttttaaacttaaaaaagagcttactgaagagtgg caaaaagcatgtgccaaatataagctagacagaggagatcctattttggatgaggaattt cagcgacttaaaacagaagttagccataaacgggttgttcgcaatcaagaagagctcatg gcataa >gi568815596r:119340120_119624226|GENSCAN_predicted_peptide_7|298_aa MRKNQCKKAENSKNQNTSSPPKDHNSSTAREQNWMENEFDKLTELGLRRITSLEKNINNL IELKNTARELHEEYTSIKSQIDQVEERISEIEDQLDEIKREDKIREKRMKRNEQSPQEMW DYMKRPNLRLTGVPKSDGENGTKLENTLQDIIQENFPNLARQANIQIQEIQRTPQRYSSR RATPRHIIIRFTKVEMKEKMLRVAREKGHVTHKGKPIRPTADLSAETLQDRREWGPIFNI LTEKHFQPTISYPAKLSFISEGEIKSFTDKQMLRDFVTTRPALQELLKEAINLERKIR >gi568815596r:119340120_119624226|GENSCAN_predicted_CDS_7|897_bp atgaggaaaaaccagtgcaaaaaggctgaaaattccaaaaaccagaatacctcctctcct ccaaaggatcacaattcctcgacagcaagggaacaaaactggatggagaatgagtttgac aaattgacagaattaggcctcagaagaataaccagtttagagaagaacataaataacctg atagagctgaaaaacacagcacgagaacttcatgaagaatacacaagtatcaagagccaa atcgatcaagtggaagaaaggatatcagagattgaagatcaacttgatgaaataaagcgt gaagacaagattagagaaaaaagaatgaaaaggaatgaacaaagcccccaagaaatgtgg gactatatgaaaagaccaaacctacgtttgactggtgtacctaaaagtgatggggagaat ggaaccaagttggaaaacactcttcaggatattatccaggagaacttccccaacctagca agacaggccaatattcaaattcaggaaatacagagaacaccacaaagatactcctcgaga agagcaaccccaagacacataatcatcagattcaccaaggttgaaatgaaggaaaaaatg ttaagggtagccagagagaaaggtcatgttacccacaaagggaagcccatcagaccaaca gcggacctctctgcagaaaccctacaagaccgaagagagtgggggccaatattcaacatt ctcacagaaaagcattttcaacccacaatttcatatccagccaaactaagcttcataagt gaaggagaaataaaatcctttacagacaagcaaatgctgagagattttgtcaccaccagg ccagccttacaagagctcctgaaggaagcaataaatttggaaaggaaaatccggtaa