GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:10:47 Sequence gi568815596r:151703279_151928176 : 224898 bp : 39.61% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 600 49 552 0 0 48 -24 510 0.722 30.79 1.00 Prom - 2091 2052 40 -7.75 2.12 PlyA - 2804 2799 6 -0.45 2.11 Term - 3366 3247 120 0 0 17 36 65 0.241 -8.31 2.10 Intr - 3719 3603 117 2 0 84 105 130 0.997 14.04 2.09 Intr - 6485 6378 108 2 0 56 86 153 0.578 11.46 2.08 Intr - 14242 14138 105 1 0 67 115 74 0.934 7.49 2.07 Intr - 20208 20104 105 0 0 76 95 68 0.971 5.79 2.06 Intr - 21086 20982 105 2 0 83 116 139 0.991 15.69 2.05 Intr - 21683 21579 105 2 0 77 116 115 0.997 12.69 2.04 Intr - 22282 22175 108 1 0 63 64 124 0.981 7.06 2.03 Intr - 24628 24413 216 1 0 49 100 230 0.934 18.08 2.02 Intr - 26378 26337 42 2 0 72 94 47 0.647 1.22 2.01 Init - 29878 29843 36 1 0 81 94 69 0.924 6.86 2.00 Prom - 30214 30175 40 -8.75 3.00 Prom + 37764 37803 40 -5.85 3.01 Sngl + 41085 41273 189 2 0 88 46 228 0.805 13.46 3.02 PlyA + 41329 41334 6 1.05 4.00 Prom + 41375 41414 40 -13.59 4.01 Init + 41474 41804 331 1 1 45 77 184 0.701 10.52 4.02 Term + 42634 43232 599 2 2 35 42 252 0.415 9.00 4.03 PlyA + 43313 43318 6 1.05 5.13 PlyA - 45765 45760 6 1.05 5.12 Term - 54613 54440 174 1 0 66 42 130 0.123 2.98 5.11 Intr - 60436 60385 52 0 1 113 69 25 0.022 1.09 5.10 Intr - 62911 62805 107 1 2 72 75 69 0.011 2.09 5.09 Intr - 76344 76215 130 2 1 52 101 54 0.051 2.88 5.08 Intr - 78417 78331 87 2 0 -25 92 139 0.050 1.17 5.07 Intr - 78631 78512 120 0 0 87 26 85 0.043 0.89 5.06 Intr - 82292 82224 69 1 0 69 90 72 0.124 2.88 5.05 Intr - 103694 103543 152 2 2 66 78 88 0.412 3.64 5.04 Intr - 109162 109079 84 1 0 64 107 78 0.883 6.40 5.03 Intr - 111038 110891 148 1 1 74 80 98 0.487 6.92 5.02 Intr - 111921 111861 61 1 1 89 87 22 0.279 -0.93 5.01 Init - 125459 125351 109 2 1 60 65 86 0.044 3.93 5.00 Prom - 128089 128050 40 -5.15 6.06 PlyA - 128130 128125 6 1.05 6.05 Term - 136101 135841 261 0 0 59 47 227 0.999 10.24 6.04 Intr - 138810 138625 186 0 0 78 72 184 0.984 14.76 6.03 Intr - 150265 150170 96 1 0 80 100 78 0.576 7.49 6.02 Intr - 152097 151946 152 1 2 57 86 94 0.973 5.06 6.01 Init - 156972 156906 67 0 1 60 87 42 0.539 2.59 6.00 Prom - 159371 159332 40 -3.25 7.12 PlyA - 160100 160095 6 1.05 7.11 Term - 166457 166118 340 1 1 117 47 130 0.830 4.82 7.10 Intr - 167333 167119 215 2 2 66 38 190 0.642 8.39 7.09 Intr - 167583 167564 20 1 2 79 103 0 0.311 -3.99 7.08 Intr - 169215 169139 77 2 2 74 94 76 0.480 5.04 7.07 Intr - 173278 173148 131 1 2 48 83 66 0.643 0.67 7.06 Intr - 177225 177193 33 0 0 91 101 11 0.500 0.20 7.05 Intr - 177644 177522 123 2 0 97 84 121 0.940 12.46 7.04 Intr - 180092 179973 120 2 0 67 113 161 0.999 16.27 7.03 Intr - 205234 205198 37 0 1 65 77 11 0.001 -4.95 7.02 Intr - 211229 211095 135 1 0 61 116 114 0.854 10.26 7.01 Intr - 224451 224367 85 1 1 58 89 107 0.212 5.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:151703279_151928176|GENSCAN_predicted_peptide_1|184_aa MENDFDELREEGFRRSNYSELREDIQTKGKEVENFEKNLEECITRITNTEKCLKELMELK TKARELREECRSLRSRCDQLEERVSAMEDEMNEMKQEGKFREKRIKRNEQSLQEVWDYVK RPNLRLIGVPESDGENGTKLENTLQDIIQENFPNLARQANVQIQEIQRTPQRYSLRRATP RHII >gi568815596r:151703279_151928176|GENSCAN_predicted_CDS_1|552_bp atggagaatgactttgacgagctgagagaagaaggcttcagacgatcaaattactctgag ctacgggaggacattcaaaccaaaggcaaagaagttgaaaactttgaaaaaaatttagaa gaatgtataactagaataaccaatacagagaagtgcttaaaggagctgatggagctgaaa accaaggctcgagaactacgtgaagaatgcagaagcctcaggagccgatgcgatcaactg gaagaaagggtatcagcaatggaagatgaaatgaatgaaatgaagcaagaagggaagttt agagaaaaaagaataaaaagaaatgagcaaagcctccaagaagtatgggactatgtgaaa agaccaaatctacgtctgattggtgtacctgaaagtgatggggagaatggaaccaagttg gaaaacactctgcaggatattatccaggagaacttccccaatctagcaaggcaggccaac gttcagattcaggaaatacagagaacgccacaaagatactccttgagaagagcaactcca agacacataatt >gi568815596r:151703279_151928176|GENSCAN_predicted_peptide_2|388_aa MADDEDYEEVVEYYTEEVVYEEVPGETITKIYETTTTRTSDYEQSETSKPALAQPALAQP ASAKPVERRKVIRKKVDPSKFMTPYIAHSQKMQDLFSPNKYKEKFEKTKGQPYASTTDTP ELRRIKKVQDQLSEVKYRMDGDVAKTICHVDEKAKDIEHAKKVSQQVSKVLYKQNWEDTK DKYLLPPDAPELVQAVKNTAMFSKKLYTEDWEADKSLFYPYNDSPELRRVAQAQKALSDV AYKKGLAEQQAQFTPLADPPDIEFAKKVTNQVSKRKYQEDFENMKDQIYFMQTETPEYKM NKKAGVAASKVKYKEDYEKNKGKADYNVLPASENPQLRQLKAAGDALSDELWAPGPPYGC VAYELRAPGSPHCRRVVILTEAEGDFQR >gi568815596r:151703279_151928176|GENSCAN_predicted_CDS_2|1167_bp atggcagatgacgaagactatgaggaggtggtggagtactacacagaagaagtggtttac gaagaggtgccgggagagacaataacaaaaatttatgagactacgacaacaaggacatct gactatgagcaatcagaaacttccaaaccagctctggcacagccagcactggcacagcca gcatcagcaaagccggtggagaggaggaaggtcatccggaagaaagtggatccttcaaag ttcatgaccccctacattgcacacagtcagaaaatgcaggatctttttagcccaaataaa tacaaggagaagtttgagaaaacaaaaggacagccatacgccagcacaacagatactcca gaacttcgcagaatcaaaaaagtacaagatcaactcagtgaggttaagtatcgaatggat ggtgatgttgctaagactatatgtcacgtagatgaaaaagcaaaggatattgaacatgca aagaaagtgtcgcagcaagtcagtaaggttttatacaagcagaactgggaagacaccaag gataagtacctgcttcctcctgatgcccctgaacttgtccaggccgttaagaacaccgcc atgttcagcaagaaactgtacactgaagactgggaagcagacaaaagtttgttttacccc tataatgatagcccggaactgaggagagttgcccaggcccagaaagctctcagtgatgtt gcctacaaaaaaggtctcgctgaacagcaagctcaattcacgcctctggctgatcctcca gatatagaatttgccaagaaagtaaccaatcaagtgagcaagaggaaataccaggaagat tttgaaaacatgaaagaccagatctacttcatgcagaccgaaacaccagagtataaaatg aataaaaaagctggtgtggcagctagcaaggtaaaatacaaagaagactatgaaaagaat aaaggaaaagcagattataatgtgcttcctgcttcagagaacccacagcttaggcagctg aaggcagcaggagatgccctaagtgacgagctctgggcacctgggcctccatatggatgt gttgcttatgagctccgggcacctgggtctccacattgcagacgtgttgtgattttgaca gaagcagaaggagactttcaaagatag >gi568815596r:151703279_151928176|GENSCAN_predicted_peptide_3|62_aa MGRNQNRKAENSKNQSASSPPKEYSSSPATEQSQMENDFDELREEVFRRSVIINFSELKE DV >gi568815596r:151703279_151928176|GENSCAN_predicted_CDS_3|189_bp atggggagaaaccagaacagaaaagcagaaaattctaaaaatcagagcgcgtcttctcct ccaaaggaatacagctcctcaccagcaacggaacaaagccagatggagaatgactttgat gagctgagagaagaagtcttcagacgatcggtaataataaacttctccgagctaaaggag gatgtatga >gi568815596r:151703279_151928176|GENSCAN_predicted_peptide_4|309_aa MKQEEKFREKRVKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQDIIQENFP NLARQANIQIQEIQRTPQRYSLRRATPRHINVRFTKVEMKEKMLRAAREKDRSTRQKVNK DIKELNSALHQADLIDIYRTLHPKSTEYTFFSAPHSTYSKIDHIFGSKALLSKCKRTEII TNSLSDHSAIKLELRIKKLTQNRSTTWKLNNFLLNDYWVHNEMKAKIKMFFETNENKDTT YQNLWDTFKTVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEETHSKASRRQEITKI RAELKEIET >gi568815596r:151703279_151928176|GENSCAN_predicted_CDS_4|930_bp atgaagcaagaagagaagtttagagaaaaaagagtaaaaagaaacgaacaaagcctccaa gaaatatgggactatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgat ggggagaatggaaccaagttggaaaacactcttcaggatattatccaggagaacttcccc aacctagcaaggcaggccaacattcaaattcaggaaatacagagaacaccacaaagatac tccttgagaagagcaaccccaagacacataaatgtcagattcactaaagttgaaatgaag gaaaaaatgttaagggcagccagagagaaagacagatcaacgagacagaaggttaacaag gatatcaaggaattgaactcagctctgcaccaagcagacctaatagacatctacagaact ctccaccccaaatcaacagaatatacattcttctcagcaccacatagcacttattccaaa attgaccacatatttggaagtaaagcactcctcagcaaatgtaaaagaacagaaattata acaaactctctctcagaccacagtgcaatcaaactagaacttaggattaagaaactcact cagaaccgctcaactacatggaaactgaacaacttcctcctgaatgactactgggtacat aacgaaatgaaggcaaaaataaagatgttctttgaaaccaatgagaacaaagacacaaca taccagaatctctgggacacattcaaaacagtgtgtagagggaaatttatagcactaaat gcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaagaa ctagagaagcaagaggaaacacattcaaaagctagccgaaggcaagaaataactaagatc agagcagaactgaaggagatagagacataa >gi568815596r:151703279_151928176|GENSCAN_predicted_peptide_5|430_aa MAQWRRFEGNINTRGRNSKQSEAICPNLRSSQPQSQEHKVIIVGLDNAGKTTILYQFSMN EVVHTSPTIGSNVEEIVINNTRFLMWDIGGQESLRSSWNTYYTNTEFVIVVVDSTDRERI SVTREELYKMLAHEDLRKAGLLIFANKQDVKECMTVAEISQFLKLTSIKDHQWHIQACCA LTGEGVYLDPSNLQIHAACVIVQMMSERSPVGYAGKVASAPKNGKERKKEAPYRSDLRIQ SLKNKWGRCQLERAETGKIPPCQAYGPVLGSERPEDRLVLAPPRLVLAREGHIQSKLNRE SSTEFRARNRSEGERDHRENELIHSMKQAPAVPTLSSPFPLGEVRSHPDRQLYNGRGYSP CCVKYPQDRVKRKLWSPLSSDIAVKGCFSQAHCLFKQGHWRDLKNPAEFTCDGVIQALLE TLEELNTLAF >gi568815596r:151703279_151928176|GENSCAN_predicted_CDS_5|1293_bp atggcgcagtggagaagattcgagggaaatataaatactcggggacgcaacagcaagcag tcggaagccatctgccccaacttgcggtcttcacagcctcagtcccaggagcacaaagtt atcattgttgggctggataatgcagggaaaactaccattctttaccaattttctatgaat gaagttgtacatacatctcctacaataggaagtaatgtagaagagatagtgattaataat acacgtttcctaatgtgggatattggtggccaagaatctcttcgttcttcctggaacact tactatactaacacagagtttgtaatagttgttgtggacagtacagacagagagaggatt tctgtaactagagaagaactctataaaatgttagcgcatgaggacctaagaaaagctgga ttgctgatttttgctaataaacaagatgttaaagaatgcatgactgtagcagaaatctcc cagtttttgaagctaacttctattaaagatcaccagtggcatatccaggcatgctgtgct ctaactggcgagggtgtttatttagaccctagtaacctacagattcatgctgcctgtgtg attgttcagatgatgtctgaaaggtctcctgttggatacgcaggaaaggtagcttctgct cctaaaaacggaaaagaaaggaaaaaggaagcaccgtataggtctgacctcagaatacaa agtctgaaaaacaaatggggaaggtgccagcttgagagggcagagacaggaaaaatccca ccctgccaagcctatggtcccgttctaggctctgaacgacctgaagacagacttgtatta gcaccgcctagacttgtattagcacgggaaggtcacattcagagcaagctgaacagagaa tcaagtacagagtttagagcaaggaacaggagcgagggtgagcgagatcaccgggaaaat gaactaattcacagcatgaaacaagcccctgcagtcccaactcttagcagcccgttcccc ctgggagaggtcaggtcacacccagatagacagctctacaacggaagggggtacagtcca tgctgtgtgaaatacccacaagaccgtgtcaagaggaagctatggtcccctctgagctct gatattgctgtaaagggctgtttttcacaagcccattgcttgtttaagcagggccactgg agggacctcaaaaaccctgcagaatttacttgtgatggagttattcaggcacttcttgaa acacttgaggaacttaacacattggctttctag >gi568815596r:151703279_151928176|GENSCAN_predicted_peptide_6|253_aa MSFLGYQLESILKFPMMEKNKGSEVQSEIERIFELARSLQLVVLDADTINHPAQLIKTSL APIIVHVKVSSPKVLQRLIKSRGKSQSKHLNVQLVAADKLAQCPPEMFDVILDENQLEDA CEHLGEYLEAYWRATHTTSSTPMTPLLGRNLGSTALSPYPTAISGLQSQRMRHSNHSTEN SPIERRSLMTSDENYHNERARKSRNRLSSSSQHSRDHYPLVEEDYPDSYQDTYKPHRNRG SPGGYSHDSRHRL >gi568815596r:151703279_151928176|GENSCAN_predicted_CDS_6|762_bp atgtctttcctggggtaccagttggaaagcatcctaaaattccccatgatggagaagaat aaaggctcggaagtacaaagtgaaattgaaagaatctttgagttggcaagatctttgcaa ctggttgttcttgatgcagacaccatcaatcacccagcacaacttataaagacttcctta gcaccaattattgttcatgtaaaagtctcatctccaaaggttttacagcggttgattaaa tctagaggaaagtcacaaagtaaacacttgaatgttcaactggtggcagctgataaactt gcacaatgccccccagaaatgtttgatgttatattggatgaaaatcagcttgaggatgca tgtgaacatctaggggagtacctggaggcgtactggcgtgccacccacacaaccagtagc acacccatgaccccgctgctgggaaggaatttgggctccacggcactctcaccatatccc acagcaatttctgggttacagagtcagcgaatgaggcacagcaaccactccacagagaac tctccaattgaaagacgaagtctaatgacctctgatgaaaattatcacaatgaaagggct cggaagagtaggaaccgcttgtcttccagttctcagcatagccgagatcattaccctctt gtggaagaagattaccctgactcataccaggacacttacaaaccccataggaaccgagga tcacctgggggatatagccatgactcccgacataggctttga >gi568815596r:151703279_151928176|GENSCAN_predicted_peptide_7|438_aa XPALLQLDKTWVKHAIALTHLQFVIIDLKAQNWTEGQMDELTEVGFRRWVIKNYTELKEH VLTQCKEAKNLDKRFLKPSRVDNARKGSADSYTSRPSDSDVSLEEDREAIRQEREQQAAI QLERAKSKPVAFAVKTNVSYCGALDEDVPVPSTAISFDAKDFLHIKESHKYMDPTLENKY NNDWWIGRLVKEGCEIGFIPSPLRLENIRIQQEQKRGRFHGGKSSGNSSSSLGEMVSGTF RATPTSTAKQKQKVTEHIPPYDVVPSMRPVVLVGPSLKGYEVQISSVTLQFLIEHPGDSH VDACSSRVCFHVGPDGSQCSAWEGRGVGNVRGMSRRVVYKNPAMGMENFRQHTQTSRKAE SSAGLSSLAAPHWRQAQLRRAKDTRHLCTKPACCLLAAVLSPNEGRGKREEGLVNTAGLF LNTLEENQGGWKLAIAHG >gi568815596r:151703279_151928176|GENSCAN_predicted_CDS_7|1317_bp ncacctgcgttgctgcagctggataagacttgggtgaagcacgcaattgcgctgactcac ctgcagttcgtgatcattgatctcaaggcacagaactggacagagggtcagatggatgaa ttgacagaagtaggcttcagaagatgggtaataaaaaactacactgagctaaaggagcat gttctaacccaatgcaaagaagctaagaaccttgataaaagattcctaaaaccaagcagg gttgacaatgccaggaagggttcagcggattcctacacaagcaggccgtctgactccgat gtctctttggaagaggaccgggaagcaattcgacaggagagagaacagcaagcagctatc cagcttgagagagcaaagtccaaacctgtagcatttgccgtgaagacaaatgtgagctac tgcggcgccctggacgaggatgtgcctgttccaagcacagctatctcctttgatgctaaa gactttctacatattaaagagagccataaatatatggatccaactttggaaaataaatat aacaatgattggtggataggaaggctggtgaaagagggctgtgaaattggcttcattcca agtccactcagattggagaacatacggatccagcaagaacaaaaaagaggacgttttcac ggagggaaatcaagtggaaattcttcttcaagtcttggagaaatggtatctgggacattc cgagcaactcccacatcaacagcaaaacagaagcaaaaagtgacggagcacattcctcct tacgatgttgtaccgtcaatgcgtccggtggtgttagtggggccgtcactgaaaggttac gaggtacaaatatcatctgttactcttcagttcttgatcgagcacccaggagacagtcat gttgacgcttgctcctcaagggtctgtttccacgtgggtcctgatggaagccagtgctca gcatgggaaggcaggggagttggaaatgtcagagggatgagcaggagagtggtctacaaa aatccagccatgggcatggagaatttccgccaacatacacaaactagcaggaaggctgag tccagtgctggcctcagttctctagctgctcctcactggaggcaggcacaactcaggcgt gccaaggatacccggcacctgtgtacaaagccagcctgttgtctcctggcagcagttcta tcccctaacgagggaagagggaagagggaagagggtctggtgaacactgctggtttattc ctcaacaccctggaggagaaccaaggggggtggaaattagcaattgcacatggttaa