GENSCAN 1.0 Date run: 4-Nov-116 Time: 04:08:12 Sequence gi568815586r:21665726_21873616 : 207891 bp : 37.36% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 276 406 131 2 2 58 77 106 0.630 6.17 1.02 Intr + 3846 3892 47 0 2 60 67 74 0.178 -0.37 1.03 Intr + 7682 7795 114 0 0 21 -2 166 0.489 0.40 1.04 Intr + 11570 11946 377 0 2 32 54 286 0.055 13.41 1.05 Intr + 12129 12182 54 2 0 74 94 28 0.037 0.26 1.06 Intr + 17149 17176 28 0 1 95 105 36 0.034 2.77 1.07 Term + 22512 22687 176 1 2 43 42 142 0.017 2.04 1.08 PlyA + 23013 23018 6 1.05 2.03 PlyA - 26168 26163 6 1.05 2.02 Term - 29191 28826 366 1 0 61 49 225 0.992 9.52 2.01 Init - 29658 29533 126 0 0 106 68 85 0.974 8.51 2.00 Prom - 30186 30147 40 -4.75 3.00 Prom + 33405 33444 40 -5.05 3.01 Init + 55692 55842 151 2 1 65 55 66 0.790 1.26 3.02 Term + 55911 56041 131 1 2 102 48 93 0.765 4.06 3.03 PlyA + 56162 56167 6 1.05 4.00 Prom + 58748 58787 40 -5.85 4.01 Init + 59692 59790 99 0 0 62 116 98 0.818 10.33 4.02 Intr + 83791 84591 801 0 0 75 61 247 0.179 11.42 4.03 Term + 87450 87758 309 2 0 75 45 206 0.866 9.08 4.04 PlyA + 88288 88293 6 1.05 5.03 PlyA - 88572 88567 6 1.05 5.02 Term - 100898 99998 901 1 1 107 48 821 0.718 70.83 5.01 Init - 107891 107518 374 2 2 59 110 470 0.697 42.78 5.00 Prom - 117377 117338 40 -3.75 6.02 PlyA - 117669 117664 6 1.05 6.01 Sngl - 124905 124570 336 0 0 71 44 201 0.832 9.78 6.00 Prom - 127536 127497 40 -6.45 7.03 PlyA - 127702 127697 6 1.05 7.02 Term - 128884 128557 328 0 1 67 39 184 0.952 4.60 7.01 Init - 129204 128969 236 0 2 50 71 139 0.862 6.06 7.00 Prom - 133475 133436 40 -6.85 8.20 PlyA - 134140 134135 6 1.05 8.19 Term - 135456 135319 138 0 0 85 49 132 0.993 5.98 8.18 Intr - 140335 140273 63 1 0 88 87 93 0.991 7.20 8.17 Intr - 141754 141621 134 2 2 60 79 129 0.945 8.64 8.16 Intr - 144230 144127 104 1 2 96 44 114 0.850 6.70 8.15 Intr - 146432 146324 109 0 1 102 82 -36 0.412 -4.38 8.14 Intr - 148997 148919 79 2 1 59 101 30 0.582 -0.39 8.13 Intr - 150168 150038 131 1 2 84 86 93 0.675 8.19 8.12 Intr - 152526 152421 106 0 1 86 52 5 0.079 -4.43 8.11 Intr - 163335 163233 103 2 1 112 100 37 0.384 6.56 8.10 Intr - 172445 172353 93 1 0 93 95 78 0.951 7.16 8.09 Intr - 176746 176589 158 1 2 47 97 78 0.739 2.49 8.08 Intr - 178827 178758 70 2 1 62 109 70 0.987 4.77 8.07 Intr - 179190 179042 149 0 2 83 93 47 0.952 2.81 8.06 Intr - 180107 179878 230 0 2 18 87 233 0.990 12.87 8.05 Intr - 182521 182425 97 1 1 50 71 133 0.990 6.46 8.04 Intr - 186497 186372 126 2 0 56 115 156 0.999 15.06 8.03 Intr - 186780 186643 138 0 0 66 86 89 0.894 6.24 8.02 Intr - 195330 195246 85 2 1 66 84 116 0.997 7.90 8.01 Intr - 207005 206900 106 0 1 105 55 111 0.849 7.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 11570 11970 401 0 2 32 50 301 0.818 14.99 S.002 Sngl + 22433 22687 255 1 0 84 42 237 0.932 11.57 S.003 Term - 169343 169178 166 1 1 37 43 148 0.827 1.61 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:21665726_21873616|GENSCAN_predicted_peptide_1|308_aa MNVLRIGQPPQISADSQRLQRSHVVEEDLQTKKKGNDVQKLEVRTYISGPPGPDALRLRG VKLRTFAVSVTAHEGNVDPKTEQQQDLLQRAKEQSFHNHPGRPRPHPVPIKPPETLPGRD TSSWTSRGAHWRRNTQAAGRREHIDRHQQAIDRWNNVGFGRGGQRRARLLSDLTPTGKPS SYSVSLLAPPSAESYFHSIKPCIHSPSPREIQFFGYTKARNPRAISGASGLAVNRKGLGS QGLLEEAAKKECCQQTRALQPVLHSRCLTSSGQRTKLWACYQPPRVSMQPTVAELSVGHL KSFRNEAG >gi568815586r:21665726_21873616|GENSCAN_predicted_CDS_1|927_bp atgaatgttttgcgaatcgggcagcccccccaaatctcagcagattcacagaggctccag cgcagccatgtggtggaagaagacttacagacaaaaaaaaagggaaatgacgtacagaaa ttggaagtgagaacttacatcagtggtcccccaggtcctgacgctctcagactgagagga gtgaagctgcggaccttcgcagtgagtgttacagctcatgaaggcaatgtggacccaaag accgagcagcagcaagatttattgcaaagagcgaaagaacaaagcttccacaaccaccct ggccggccacgtccccatcctgtgcctataaaaccccctgagaccctaccaggcagagac acaagcagctggacgtcaagaggagcacactggcggaggaacacacaagcagctggacgt cgagagcacatcgacaggcaccagcaggccatcgacaggtggaacaacgtggggtttgga cggggtggtcagaggagagcccggctgctgagtgacctgactccaacaggaaaaccatct tcctactctgtctcccttctggctcccccatctgctgagagctacttccactcaataaaa ccttgcattcattctccaagcccacgtgagatccaattcttcgggtacaccaaggcaaga aaccccagggcgatatctggggcatctggccttgctgtgaacagaaagggtttaggcagc cagggattgttagaagaggcagctaagaaagagtgttgtcagcagaccagagcacttcag cccgtcctgcatagcaggtgcttaacctcaagtggacagagaacaaagctgtgggcctgc taccagccccccagggttagcatgcagcccacagttgctgagctgagcgttggccatctg aaatcattcagaaatgaagccggctga >gi568815586r:21665726_21873616|GENSCAN_predicted_peptide_2|163_aa MGRILKVHTTVAVGRSDIFSHQCDEKTRLNETLFTYLVLECVADTQTAGHREEHITGGTR RWLDIERNAPTSTSRPTSHQPTGRLTRSLAGAVGGERPGHPAAQLQRKTIFLLAPSSAES YFHSVKPCTHSSSPRVNRFFRHTKARTRDTERPLSLPRGRVSN >gi568815586r:21665726_21873616|GENSCAN_predicted_CDS_2|492_bp atggggagaatactcaaggttcacacgacagtggccgtaggaagaagcgatattttttct catcaatgcgatgagaaaacaaggttgaatgaaacattattcacttacctggtgcttgag tgtgtggcagacacacaaacggctggacatcgagaggagcacatcactggaggaacacgc aggtggctggacattgagaggaacgcaccgacaagcaccagcaggccaacaagccaccaa ccgactggcagactgacacggagtttggctggggcagtcggaggagagcgcccaggccat ccagcagcccaactccagaggaaaaccatcttccttctggctccctcatctgctgagagc tacttccactcagtaaaaccttgcactcattcttcaagcccacgtgtgaaccgattcttc aggcacaccaaggcaagaacccgggatacagaaaggcctctgtccttgccacgaggcaga gtgtctaattga >gi568815586r:21665726_21873616|GENSCAN_predicted_peptide_3|93_aa MWLVSAELLSSFLPKERGEELVTRVEKKSCLHTATLKELVTLGQGTQPIQVSYNSSPLAK PKKKSVQGGRLGPRRGSREKISSRGAEVEKWRS >gi568815586r:21665726_21873616|GENSCAN_predicted_CDS_3|282_bp atgtggctagtgtcagcagagctgttatcatcctttctcccaaaagaacgaggggaagaa ttggttactagagtggaaaagaagagctgcctgcacacagccaccttgaaagaattagtg accttaggccaagggacacagccaattcaagtctcctataatagctctccattggccaaa cctaagaagaagtcagtacaaggaggtagactggggccaagaagggggagcagagagaag atatccagtagaggagcagaggtggagaagtggaggtcttga >gi568815586r:21665726_21873616|GENSCAN_predicted_peptide_4|402_aa MANFIAVLFCDVATATPTFSNHHLDQSAAVNIKNWKKLKFIWNQKRACIAKTILSKKNNA GGITLPDFKLYYKATVTKRAWYWYQNRYIDQWNRTEASEITPHIYNHLIFDKPDKNKKWG KDSLFNKWCWENWLSICRKLKLDPFLIPYTKINSRWIKDFRLKTIKTLEENLENTIQDIS MGKDFMTKTPKAMAAKAKIDKWDLIKLKSFRVAKETTIRVNRQPTEWEKNFAIYPSDKGL ISRIYRELKQIYKKKPNNPIKKWAKDMNRHFSKEDIYAANRHMKTCSSSLVIREMQIKTT NLQVLSDGEGGPLFDKPLLVFPRICKEYMNINSSVSLSFGEGIRLAGIDCDNREKKQWYS APGKVGMTPHPRQVFCGHCKMQKRTSILLKLHMKNEVKEDRE >gi568815586r:21665726_21873616|GENSCAN_predicted_CDS_4|1209_bp atggcaaacttcattgctgtcttattttgtgacgttgccacagccaccccaaccttcagc aaccaccaccttgatcagtcagcagctgtcaacatcaagaattggaaaaaactaaagttc atatggaaccaaaaaagggcctgcattgccaagacaatcctaagcaaaaagaacaacgct ggaggcatcacactacctgacttcaagctatactacaaggctacagtaaccaaaagagca tggtactggtaccaaaacagatatatagaccaatggaacagaacagaggcttcagaaata acaccacacatctacaaccatctgatctttgacaaacctgacaaaaacaagaaatgggga aaggattccctatttaataaatggtgctgggaaaactggctatccatatgtagaaagctg aaactggatcccttccttataccttatacaaaaattaattcaagatggattaaagacttt agacttaaaaccataaaaaccctagaggaaaacctagagaataccattcaggacataagc atgggcaaggacttcatgactaaaacaccaaaagcaatggcagcaaaagccaaaattgac aaatgggatctaattaaactaaagagcttccgtgtggcaaaagaaactaccatcagagtg aacaggcaacctacggaatgggagaaaaattttgcaatctatccctctgacaaagggcta atatctagaatctacagagaacttaaacaaatttacaagaaaaaaccaaacaatcccatc aaaaagtgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcagccaac agacacatgaaaacatgctcatcatcactggtcatcagagaaatgcaaatcaaaaccaca aacttacaggtcctctcagatggagagggtgggccactttttgacaagcctttacttgtg ttccctagaatctgcaaggagtacatgaatataaactctagcgtcagcttgtcatttgga gaaggtataaggcttgctggaattgattgtgacaacagagagaagaaacagtggtatagt gcccctgggaaagtgggaatgacccctcaccccagacaggttttttgtggccattgcaag atgcagaagagaacttcaattcttttgaagctccacatgaagaacgaagtgaaggaagat agagaataa >gi568815586r:21665726_21873616|GENSCAN_predicted_peptide_5|424_aa MLARKSIIPEEYVLARIAAENLRKPRIRDRLPKARFIAKSGACNLAHKNIREQGRFLQDI FTTLVDLKWRHTLVIFTMSFLCSWLLFAIMWWLVAFAHGDIYAYMEKSGMEKSGLESTVC VTNVRSFTSAFLFSIEVQVTIGFGGRMMTEECPLAITVLILQNIVGLIINAVMLGCIFMK TAQAHRRAETLIFSRHAVIAVRNGKLCFMFRVGDLRKSMIISASVRIQVVKKTTTPEGEV VPIHQLDIPVDNPIESNNIFLVAPLIICHVIDKRSPLYDISATDLANQDLEVIVILEGVV ETTGITTQARTSYIAEEIQWGHRFVSIVTEEEGVYSVDYSKFGNTVKVAAPRCSARELDE KPSILIQTLQKSELSHQNSLRKRNSMRRNNSMRRNNSIRRNNSSLMVPKVQFMTPEGNQN TSES >gi568815586r:21665726_21873616|GENSCAN_predicted_CDS_5|1275_bp atgttggccagaaagagtatcatcccggaggagtatgtgctggcgcgcatcgccgcagag aacctgcgcaagccgcgcatccgagaccgcctccccaaagcccgcttcatcgccaagagc ggggcctgcaacctggcgcataagaacatccgtgagcaaggacgctttctacaggacatc ttcaccaccttggtggacctgaaatggcgccacacgctggtcatctttaccatgtccttc ctctgcagctggctgctcttcgctatcatgtggtggctggtggcctttgcccatggggac atctatgcttacatggagaaaagtggaatggagaaaagtggtttggagtccactgtgtgt gtgactaatgtcaggtctttcacttctgcttttctcttctccattgaagttcaagttacc attgggtttggagggaggatgatgacagaggaatgccctttggccatcacggttttgatt ctccagaatattgtgggtttgatcatcaatgcagtcatgttaggctgcattttcatgaaa acagctcaggctcacagaagggcagaaactttgattttcagccgccatgctgtgattgcc gtccgaaatggcaagctgtgcttcatgttccgagtgggtgacctgaggaaaagcatgatc attagtgcctctgtgcgcatccaggtggtcaagaaaacaactacacctgaaggggaggtg gttcctattcaccaactggacattcctgttgataacccaatcgagagcaataacattttt ctggtggcccctttgatcatctgccacgtgattgacaagcgcagtcccctgtatgacatc tcagcaactgacctggccaaccaagacttggaggtcatagttattctggaaggagtggtt gaaactactggcatcaccacacaagcacgaacctcctacattgctgaggagatccaatgg ggccaccgctttgtgtccattgtgactgaggaagaaggagtgtattctgtggattactcc aaatttggcaacactgttaaagtagctgctccacggtgcagtgcccgagagctggatgag aaaccttccatccttattcagaccctccaaaagagtgaactgtctcatcaaaattctctg aggaagcgcaactccatgagaagaaacaattccatgaggaggaacaattctatccgaagg aacaattcttccctcatggtaccaaaggtgcaatttatgactccagaaggaaatcaaaac acatcggaatcatga >gi568815586r:21665726_21873616|GENSCAN_predicted_peptide_6|111_aa MGKDFMTKTPKAMAAKAKIDKWDLIKLKSFCTAKETIIRVNRQPTEWEKNFAIYPSDKGL ISRIYKELKQMYKKKTNNPIKKWAKDMNRHFSKEDIYVANKHEKKVIITSD >gi568815586r:21665726_21873616|GENSCAN_predicted_CDS_6|336_bp atgggcaaagacttcatgactaaaacaccaaaagcaatggcagcaaaagccaaaatagac aaatgggatctaattaaacttaagagcttctgcacagcaaaagaaactatcatcagagta aacaggcaacctacagaatgggagaaaaattttgcaatctatccctctgacaaagggcta atatccagaatctacaaggaacttaaacaaatgtacaagaaaaaaacaaacaaccctatc aaaaagtgggcaaaggatatgaacagacacttctcaaaagaagacatttacgtggccaac aaacatgaaaaaaaggtcatcatcactagtgattag >gi568815586r:21665726_21873616|GENSCAN_predicted_peptide_7|187_aa MKLPEKGTGSILCCPAASTGDTQANRVWSGPPPNFSRLQLRGLTVKRKTNKQKAKASTST KRMSTKKPHLQVTNIKYQRITIPHQQGNKTGRTENEFDESTEVGFRRWVITNSSELKEHV PTKCKEAKNLEKRLDEWLTRITSLENINDLMELKNTQENFVKHTQVSIAELIKRKKGYQR LKMNIMK >gi568815586r:21665726_21873616|GENSCAN_predicted_CDS_7|564_bp atgaagcttccagagaaaggaacaggcagcattctttgctgtcctgcagcctccactggt gatacccaggcaaacagggtctggagtggacctccaccaaacttcagcagactgcagctg aggggcctgactgttaaaaggaaaactaacaaacagaaagcaaaagcatcaacatcaaca aaaaggatgtccacaaagaaaccccatctgcaggtcaccaacatcaaataccaaaggatc acaattcctcaccagcaagggaacaaaactggacggacagagaatgagtttgacgaatcg acagaagtaggcttcagaaggtgggtaataacaaactcctccgagctaaaggagcatgtt ccaacaaaatgcaaggaagctaagaaccttgaaaaaaggttagacgaatggctaactaga ataaccagtttagagaacataaatgacttaatggagctgaaaaacacacaagagaacttc gtgaaacatacacaagtatcaatagctgaactgatcaagcggaagaaaggatatcagaga ttgaagatgaacataatgaaataa >gi568815586r:21665726_21873616|GENSCAN_predicted_peptide_8|739_aa XQLTMIVGQVGCGKSSLLLAILGEMQTLEGKVHWSKYKAVTDACSLQPDIDLLPFGDQTE IGERDDPFSALDIHLSDHLMQEGILKFLQDDKRTLVLVTHKLQYLTHADWIIAMKDGSVL REGTLKDIQTKDVELYEHWKTLMNRQDQELEKDMEADQTTLERKTLRRAMYSREAKAQME DEDEEEEEEEDEDDNMSTVMRLRTKMPWKTCWRYLTSGGFFLLILMIFSKLLKHSVIVAI DYWLATWTSEYSINNTGKADQTYYVAGFSILCGAGIFLCLVTSLTVEWMGLTAAKNLHHN LLNKIILGPIRFFDTTPLGLILNRFSADTNIIDQHIPPTLESLTRSTLLCLSAIGMISYA TPVFLVALLPLGVAFYFIQKYFRVASKDLQELDDSTQLPLLCHFSETAEGLTTIRAFRHE TRFKQRMLELTDTNNIAYLFLSAANRWLEVRTDYLGACIVLTASIASISGSSNSGLVGLG LLYALTVDPSQVPEHWPQEGEIKIHDLCVRYENNLKPVLKHVKAYIKPGQKVGICGRTGS GKSSLSLAFFRMVDIFDGKIVIDGIDISKLPLHTLRSRLSIILQDPILFSGSIRFNLDPE CKCTDDRLWEALEIAQLKNMVKSLPGGLDAVVTEGGENFSVGQRQLFCLARAFVRKSSIL IMDEATASIDMATENILQKVVMTAFADRTVVTIAHRVHTILTADLVIVMKRGNILEYDTP ESLLAQENGVFASFVRADM >gi568815586r:21665726_21873616|GENSCAN_predicted_CDS_8|2220_bp ngtcagttaaccatgattgtgggccaagtaggatgtgggaagtcctctcttctccttgcc atcctcggtgagatgcagacattggaaggaaaagttcactggagcaagtacaaagctgtc acagatgcctgttctcttcagccagatattgacttattaccatttggagatcaaactgaa attggagagagggatgatccattctcagccctggacattcacttgagtgatcatttaatg caggaggggattttgaaattcctgcaagatgacaaaaggacactcgttcttgtgactcac aaattacagtatctgacgcatgctgactggatcatagccatgaaagatggaagtgtccta agagaaggaactttgaaggacattcaaaccaaagatgttgagctttatgaacactggaaa acacttatgaatcggcaagatcaagaattagaaaaggatatggaagctgaccaaactact ttagagaggaaaactctccgacgggccatgtattcaagagaagccaaagcccagatggag gacgaagacgaagaggaagaagaggaggaagatgaggatgataacatgtccactgtaatg aggctcaggactaaaatgccatggaaaacctgctggcgctacctgacatctggaggattc ttcctgctcatcctgatgattttctctaagcttttgaagcattcggtcattgtagctata gactattggctggccacatggacatcggagtacagtataaacaatactggaaaagctgat cagacctactatgtggctggctttagcatactctgtggagcaggcattttcctttgcctt gttacatccctcactgtagaatggatgggtctcacagctgccaaaaatcttcaccacaac cttctcaataagataatccttggaccaataaggttttttgataccacacccctgggactg attctcaatcgcttttcagctgatactaatatcattgatcagcacatccctccaaccttg gaatctctaactcgctcaacactgctctgcctgtctgccattgggatgatttcttatgct actcctgtgttcctggttgctctcctgccccttggtgttgccttttattttatccagaaa tactttcgggttgcctctaaggacctccaggaacttgacgatagtacccagctccctctg ctctgtcacttctcagaaacagcagaaggactcaccaccattcgggcctttaggcatgaa accagatttaaacaacgtatgctggaactgacggatacaaacaacattgcctacttattt ctctcagctgccaacagatggctggaggtcaggacggattatctgggagcttgcattgtc ctcactgcatctatagcatccattagtgggtcttcgaattctggattggtaggcttgggt cttctgtatgcacttacggtagatccttctcaagttccagaacattggccacaagaaggg gagatcaagatacatgatctgtgtgtcagatatgaaaataatctgaaacctgttcttaag cacgtcaaggcttacatcaaacctggacaaaaggtgggcatatgtggtcgcactggcagt gggaaatcatcgttatctctggctttcttcagaatggttgatatatttgatggaaaaatt gtcattgatgggatagacatttccaaattaccactgcacacactacgttctagactttca atcattctgcaggatccaatactattcagtggttccattagatttaatttagatccagag tgcaaatgcacagatgacagactctgggaagccttagaaattgctcagctgaagaatatg gtcaaatctctacctggaggtctagatgcggttgtcactgaaggtggggagaattttagc gttggacagagacagctattttgccttgccagggcctttgtccgcaaaagcagcattctt attatggatgaggcaacagcttccattgacatggccacagagaatattttgcaaaaagta gtaatgacagcctttgcagaccggaccgtggtgacaatagctcatcgagtacacactatt ctgacggcagacctggttattgtgatgaagcgaggaaatattttagaatatgacactcca gaaagcctcttggctcaggaaaatggagtatttgcttcttttgttcgcgcagacatgtga