GENSCAN 1.0    Date run: 8-Nov-116    Time: 08:26:29

Sequence gi568815587r:1_187586 : 187586 bp : 41.46% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  60855  61016  162  2  0   65   85    75 0.632   4.68
 1.02 Intr +  68654  68739   86  1  2   68   63    64 0.731  -0.40
 1.03 Intr +  70388  70705  318  2  0  103   41   154 0.024   6.35
 1.04 Intr +  77682  77866  185  0  2   46   50   154 0.017   5.91
 1.05 Intr +  78253  78337   85  2  1   72  100    72 0.071   4.76
 1.06 Term +  78508  78850  343  1  1  -28   38   286 0.069   5.00
 1.07 PlyA +  79077  79082    6                               1.05

 2.00 Prom +  79611  79650   40                              -7.05
 2.01 Init +  80554  80718  165  0  0   74   72    69 0.236   3.60
 2.02 Intr +  87212  87334  123  1  0   11   40   167 0.518   4.06
 2.03 Intr +  89182  89268   87  0  0   49  101    92 0.616   5.85
 2.04 Term +  90903  91052  150  2  0   41   48   103 0.804  -1.47
 2.05 PlyA +  91067  91072    6                               1.05

 3.00 Prom +  94251  94290   40                              -6.95
 3.01 Sngl +  95389  95577  189  0  0   72   42   206 0.958   9.26
 3.02 PlyA +  95595  95600    6                               1.05

 4.04 PlyA -  95671  95666    6                               1.05
 4.03 Term -  96848  96741  108  2  0   68   41    61 0.092  -3.07
 4.02 Intr - 100931 100701  231  2  0   83   60   183 0.248  12.05
 4.01 Init - 101734 101702   33  1  0   84   82    -4 0.381  -1.47
 4.00 Prom - 105981 105942   40                              -2.55

 5.00 Prom + 111852 111891   40                              -6.95
 5.01 Init + 111973 112026   54  0  0   95   43    47 0.412   2.23
 5.02 Intr + 123361 123705  345  0  0   36   42   287 0.791  13.56
 5.03 Intr + 123792 124206  415  2  1  -10   21   434 0.306  19.65
 5.04 Intr + 124218 124503  286  1  1   56   68   236 0.595  13.88
 5.05 Intr + 124805 125033  229  2  1   61   16   194 0.645   6.45
 5.06 Intr + 125339 125520  182  1  2   91   76   163 0.702  13.14
 5.07 Intr + 125550 125692  143  0  2   50   -6   104 0.173  -3.72
 5.08 Intr + 126628 126837  210  2  0   96  109    83 0.549   9.16
 5.09 Intr + 131789 131916  128  0  2   26   98    47 0.000  -1.02
 5.10 Intr + 135088 135339  252  0  0   -8   98   153 0.072   3.41
 5.11 Intr + 144115 144241  127  0  1   38  113   118 0.706   8.63
 5.12 Term + 150713 150801   89  0  2  130   43    52 0.636   1.74
 5.13 PlyA + 151598 151603    6                               1.05

 6.06 PlyA - 154968 154963    6                               1.05
 6.05 Term - 157695 157612   84  0  0   99   38    88 0.685   1.57
 6.04 Intr - 163400 163296  105  2  0   42   80   108 0.358   4.89
 6.03 Intr - 164460 164439   22  2  1   99   98     3 0.570  -0.97
 6.02 Intr - 167884 167753  132  0  0  115   94    44 0.894   6.54
 6.01 Init - 169052 168943  110  2  2   68  119    27 0.538   3.55
 6.00 Prom - 176616 176577   40                              -3.85

 7.04 PlyA - 176938 176933    6                               1.05
 7.03 Term - 177995 177823  173  0  2   70   39    98 0.425   0.11
 7.02 Intr - 180404 180209  196  2  1   59   97   149 0.916  10.97
 7.01 Intr - 184955 184840  116  0  2  111   72    22 0.778   2.15

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init - 135323 135175  149  2  2   66  105   112 0.817  10.34

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:1_187586|GENSCAN_predicted_peptide_1|392_aa
MILHCQVIAPKIWFSSTLSDYRKPETRLGAKDAKMNETSSLPSKSLLSSGRVTHAWSQHG
ERLGDAVKQRHWEQLQTFSQTRKDMDEAGSHHPQQTNTGKENQTPHVLSRKRELNNESKH
MDTWRGTTHTRASQRDRGRRPSGQTRGYMEGNNTHQGLSGGQGVGDHQDKHVDTWRGTTH
TRASQGDRGAPRGKGGCGHSFSRLKASLKSLMALKRAADLPAQYSSSDKGQTASSSGSLT
PVYPDWETPPRSQLLTSKGTKENWTENEFDELREVGFRSMHKDQALIRSSGRKISETEYQ
LNEINQEDKIREKRMKRNEQSLQEIWDYVKRPNLRLIAVPESDGENGTKLENTLRDIIQE
NFPNLARQANIQIQKYGEHHKDTPQEKQPQDT

>gi568815587r:1_187586|GENSCAN_predicted_CDS_1|1179_bp
atgatactgcattgtcaagtcattgctccaaaaatatggtttagctcaacactgagtgac
tataggaaaccagaaaccaggctgggcgctaaagatgcaaagatgaatgagacatcatct
ctgccgtccaaaagcttactgtctagtgggagagttacacacgcctggagtcaacatggg
gagaggcttggagatgctgtgaagcaaagacactgggaacagctgcagacattttcccag
accaggaaggacatggatgaagctggaagccatcaccctcagcaaactaacacaggaaaa
gaaaaccaaacaccacatgttctcagtcgtaagagggagttgaacaatgagagcaaacac
atggatacatggaggggaacaacacacaccagggcctctcagcgggacaggggtaggaga
ccatcaggacaaacacgtggatacatggaggggaacaacacacaccagggcctctcaggg
ggacagggggtaggagaccatcaagacaaacacgtggatacatggaggggaacaacacac
accagggcctctcagggggacaggggagcacctaggggaaagggcggctgtgggcacagc
ttcagcagacttaaagcatctttgaaaagcctgatggctctgaagagagcagcagatctc
cctgcacagtattcgagctctgataagggtcagactgcctcctcaagtgggtccctgacc
cccgtgtatcctgactgggagacacctcccagatcacaactcctcaccagcaagggaaca
aaagaaaactggacagagaatgagtttgacgaattgagagaagtaggtttcagaagcatg
cacaaggatcaagcactgattcggtcaagcggaagaaagatatcagagactgaatatcaa
cttaatgaaataaatcaagaagacaagattagagaaaaaagaatgaaaagaaatgaacaa
agcctccaagaaatatgggactatgtgaaacgaccaaatctacgtttgattgctgtacct
gaaagtgatggggagaatggaaccaagttagaaaacactcttcgggatattatccaggag
aacttccctaacctagcaaggcaggccaatattcaaattcagaaatatggagaacatcac
aaagacactcctcaagaaaagcaaccccaagacacatag

>gi568815587r:1_187586|GENSCAN_predicted_peptide_2|174_aa
MDKFLDTYALPRLNQEEVESLNRPITRSEIVAVINSLPTKKQSRTRWIHSRILPEGHITV
KGHGYEQHHLHTTNDVDEEDLSDAASKGDDFALSEQSQDAHFLQPEAYGLGEGAETATGT
AHQGNDATEFDFGNFSPLIHSFGDATDFDIGLHLKGERKGSCFYYRPVGLTEFL

>gi568815587r:1_187586|GENSCAN_predicted_CDS_2|525_bp
atggacaaattcctggacacatacgccctcccaagactaaaccaggaagaagttgaatcc
ctgaatagaccaataacaaggtctgaaattgtggcagtaattaatagcctaccaaccaaa
aaacagtccaggaccagatggattcacagccgaattctaccagagggccacataactgtc
aaaggccatggctatgagcagcaccatctccacaccaccaatgacgtggatgaagaagat
ttgagcgatgcagcctccaaaggagatgactttgcactttctgaacagtctcaagatgca
cactttttacaaccagaagcctatggactgggtgagggagcagaaacagccacaggtact
gcccatcagggtaatgatgctactgagtttgactttgggaatttctcacctttaatacac
tcatttggggatgctacggactttgacattgggttgcatttaaagggggagagaaagggc
agttgcttctattatcgccctgttggactcacagagtttctttga

>gi568815587r:1_187586|GENSCAN_predicted_peptide_3|62_aa
MNQIQYGITADIHTELHKTGRELLITAPEGSGQSDALEIRECQTEITRPGELPPMVILVF
PG

>gi568815587r:1_187586|GENSCAN_predicted_CDS_3|189_bp
atgaaccagatccagtacggcatcactgcagacatacacacagagctgcataaaacagga
agagagctgctaatcacagccccagagggtagtggccaaagtgatgccttggagatccga
gaatgccagactgagatcacacggcctggggaattaccgcctatggtcattttggttttc
ccgggatag

>gi568815587r:1_187586|GENSCAN_predicted_peptide_4|123_aa
MIVRPPHPPGNPRDLVLCVPAAPATAKRGQGTAQAMASEGASLKLWQLPCGVEPVDSRKS
RIEVWEPPPRFQRMYGNAWISRQKFAAGILVKGYCGGGAGGSKYDGRQTSFGILSPKHSH
LLE

>gi568815587r:1_187586|GENSCAN_predicted_CDS_4|372_bp
atgattgtgaggcctccccatccacctggaaatcctagggacttggtgctctgtgtccca
gctgctccagccacagctaaaaggggtcaaggtacagctcaggccatggcttcagagggt
gcaagcctcaagctttggcagcttccatgtggtgttgagcctgtggattcacggaagtcc
agaatcgaggtatgggaacctccacctagatttcagagaatgtatggaaatgcctggatc
tccagacagaagtttgctgcagggatacttgtgaaaggatattgtggaggaggagctggg
ggtagtaagtatgatggaaggcagacgagttttggaatcttgtctccaaaacactctcat
ctgttagaatga

>gi568815587r:1_187586|GENSCAN_predicted_peptide_5|819_aa
MKKCRAKGKESLIKPSNLHVRVEECGRSLCGRVLLVLHRLPDPSLQPHEAQQPASHSVAS
NQRKQPAKLAAVAHERPPGGTGSVDPGRPPGATCPESPGPATPHTLGVVEPGKTSPPTME
EEPWAPQGSPCWTAQSLSALRKEQDSSSEKDGRSPNKSDKYHIRWPMSGAHDLQQAAPGP
GGAHQGHPNQDNRTVSQMLSERWYTLGPNEMQKYNLAFQVKVAHLQQGPKEVQLRGQAHK
PGASRSVTRARGSGAYQRRALPLPLGRPLNSSQTLQSSDAKEQLLWGRTAAHSQGTWLSL
AQAFSHSGVHSLDGREIDRQALRELTQVVSGTASYSGPKPSTQHGAPGHFAAPGEGGDPW
AALLPPTFWQSMVTPCPPPTHTRMLPPQPWHPPPSYWAQEPSKPRSLVNAAERAPYGPNP
WGWGPRDAFQGGLFPPNGSCHLLVEKPRTGSGNRRPRRRCPLHCTYPGPVPALIMQLFQA
HCFFLSTRPQPPSRPTMHTSSPPSPVGVLTAPHLAQTLDAALAAPPLLLPESHVKLRVGA
GAAVVPVGGRSTVPGESWAVSLLRAPVHTMNCGTFRAFHFSENSSCWGYKMECEEGLGPQ
GGACGLGGVHAPLLSPEGLDSGVGDPCSSASKATLPTPVFTVVKGSNHLAPKACASAGIS
SQSTVELRRECEQRGVAVNTDEASLTPSLDTTHLLLCGSLRLHGSGVGDPCSSASKTTLP
TPVFTVVKGSNHLAPKACASAGISSQSTPKQQSKTLSLKRKENKCKFLSLCECSQVGGEI
DNNKRALNNDGLGFNVTLTGSFSPSPEELPIALDACTII

>gi568815587r:1_187586|GENSCAN_predicted_CDS_5|2460_bp
atgaagaagtgccgagcaaaggggaaagaatcccttataaaaccatcaaatctccatgtt
cgtgtggaggaatgtggaaggtcactctgcggccgtgttctcctggtactccatcgcctt
cctgacccctccctgcagccacacgaggcccagcaacctgccagtcactcagtggcctcc
aaccagagaaaacaacctgccaagttggcagctgttgctcatgagcgtccaccaggtggg
acagggagtgttgaccctgggcggccccctggagccacctgccctgaaagcccagggccc
gcaaccccacacactttgggggtggtggaacctggtaaaacctcacctcccaccatggag
gaggagccctgggcccctcaggggagtccctgctggacagcccagtccctcagtgccctg
cgcaaggaacaggactcatcttctgagaaggatggacgcagccccaacaaatcagacaag
taccacatccggtggcccatgagtggcgctcatgatcttcagcaggcggcaccaggccct
ggcggggcgcaccagggtcaccccaaccaggataaccggaccgtcagccagatgctgagc
gagcggtggtacaccctggggcccaatgagatgcagaaatacaacctggccttccaggtg
aaggtggcccacttgcaacaaggaccgaaagaagtccagctcagaggccaagcccacaag
ccaggggctagcaggagtgtaacaagggctcgtgggagcggagcatatcagagacgggca
ctgccactgcccctggggcgtcctctgaactcctcccaaacactccagagctcggatgcc
aaggagcagcttctgtggggcagaacggctgcacacagtcagggaacctggctcagcctg
gcccaagccttctcccacagcggggtacacagcctggacggcagggaaatagaccgtcag
gcactacgggaactgacacaggtggtgtctggcactgcatcatactctggcccaaagcct
tctactcagcatggagctccaggccactttgcagcccctggtgagggaggtgacccgtgg
gcagccctgctgccgcccaccttttggcagtctatggtcacaccctgtcctcctcctaca
catactcggatgcttcctcctcaaccttggcacccacctccttcttactgggcccaggag
ccttcaaagcccaggagtctggtcaacgcagcagagcgggccccctacggccccaacccc
tggggatgggggcccagggacgccttccaaggtggcctgtttcctcccaatggatcctgc
caccttctggtggagaagccgaggacaggctcagggaaccggagaccgagaaggcgctgt
cctcttcactgcacgtaccctggaccagtgccggccctgatcatgcagctcttccaggcc
cactgcttcttcctgtccactaggccacagccgccctccaggcccactatgcacacatcc
tcccctccaagccctgtgggggtcctgaccgcacctcacctggctcagactcttgacgct
gccctggctgccccaccactgcttctgcccgagagtcacgtgaagctgagagtaggggca
ggggcagcagtggtgccagttggggggcggtccactgtgccgggggaaagctgggcagtt
tccctcctccgagcccctgtacataccatgaattgtgggaccttcagagcttttcacttt
tcggaaaatagctcctgctggggctacaagatggagtgtgaagagggccttgggccacag
ggaggcgcctgtggactagggggagttcatgcaccccttctttccccagaggggctggac
tcaggggttggggacccctgctcaagtgcatccaaagcgacccttcccacaccagtcttc
acagtggtcaagggcagcaaccacttagctcccaaggcatgtgcctcagctggcatttcg
tcacaatcaacagtggagctcaggcgggaatgtgagcaaaggggagtggctgtaaataca
gacgaagcttccctcactccctcactcgacaccactcacctcctgctgtgtggctccttg
cggctccatggctcaggggttggggacccctgctcaagtgcatccaaaacgacccttccc
acaccagtcttcacagtggtcaagggcagcaaccacttagctcccaaggcatgtgcctca
gctggcatttcgtcacaatcaacacctaagcaacagagcaagacgctgtctctgaaaagg
aaagaaaacaaatgcaagtttttatcactttgtgagtgtagccaagttggaggagaaata
gacaataataaaagagcactgaataatgacggacttggcttcaatgtcaccttaactgga
agcttctctccctctccagaagagcttccgattgcacttgatgcatgcactattatttga

>gi568815587r:1_187586|GENSCAN_predicted_peptide_6|150_aa
MMIIPHLLRDKDQRATTATSPRTRQCWGTENEGVQLWPSQSSGGDETHLPLSLWARLVPG
ASVDRGHAPGLPKLEKAARLRSEGQNSTGCVIEKQLLLGFQRNGIIVSLIYVNKLKNCLK
ENMWDDAARRPLPDAGPSTLDFLTSRTVKK

>gi568815587r:1_187586|GENSCAN_predicted_CDS_6|453_bp
atgatgattattccccaccttctaagagacaaagaccaacgagccaccacagccaccagt
cccagaacccgccaatgctggggaacggaaaatgagggagttcaactctggccctcacaa
tccagtggaggagacgaaactcatctgcctctgtccctctgggcacgcctcgtgccaggt
gcatctgtggacaggggccatgctcctgggcttccaaagttggagaaagctgccaggctc
aggtctgaaggccagaattctacaggatgtgttattgagaagcaacttttgcttggtttt
cagagaaatggaatcatcgtatcgctgatctacgtaaacaaactgaagaattgtctgaaa
gaaaatatgtgggatgatgcagcaagaagacccttaccagatgcaggcccctcaaccttg
gacttcctaacatccagaactgttaagaaataa

>gi568815587r:1_187586|GENSCAN_predicted_peptide_7|161_aa
XSYLGSHRTFSHHVRLFLAVTVSQTLLLRTWNSVRRIGQVFQDSQEGAHIRRETVSKSVC
AEPWRHQRARDPAPTNFPLKCQKQRGASTSSGQHGGRVNLVFFIDTSYTANFSNCVITCK
FQGPNLETNWHFHISPPPGDWQQTTVNLQPGSARGGASKTT

>gi568815587r:1_187586|GENSCAN_predicted_CDS_7|486_bp
ngatcctatctaggatcccataggacatttagtcatcatgtcaggctcttcttggctgtg
acagtctctcagactttacttctgaggacctggaacagtgttaggaggattggtcaggta
tttcaggacagccaggagggggcgcacatccgccgagaaactgtgagcaagagcgtctgt
gctgaaccatggcgccaccagagggcgcgcgatcccgccccaaccaacttcccgctgaag
tgccagaagcagcgaggagcttcaacttcctcagggcagcacgggggtcgtgttaatttg
gtgttcttcattgacacttcctatactgcaaacttttccaactgtgtgattacttgtaag
ttccagggaccaaaccttgaaacaaactggcacttccatatctctcccccaccaggagat
tggcagcagacaacagtcaatttacaacctggctctgcccgtggtggtgctagcaagacc
acctaa