GENSCAN 1.0	Date run: 3-Nov-116	Time: 21:40:45

Sequence gi568815575r:154059360_154196473 : 137114 bp : 47.97% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2293   2430  138  0  0   85   75   126 0.576  11.14
 1.02 Term +  28393  28479   87  0  0   87   44    45 0.028  -2.34
 1.03 PlyA +  29852  29857    6                               1.05

 2.05 PlyA -  31316  31311    6                               1.05
 2.04 Term -  31822  31772   51  1  0  112   38    59 0.491   0.73
 2.03 Intr -  35130  35019  112  2  1   63   57    25 0.146  -2.72
 2.02 Intr -  38496  38245  252  2  0  104   74   184 0.875  15.15
 2.01 Init -  50061  49997   65  0  2   73   81    62 0.789   4.62
 2.00 Prom -  50683  50644   40                              -7.66

 3.02 PlyA -  51691  51686    6                               1.05
 3.01 Sngl -  52805  52467  339  2  0   40   42   226 0.984   9.23
 3.00 Prom -  59247  59208   40                              -8.46

 4.03 PlyA -  59374  59369    6                               1.05
 4.02 Term -  62964  62848  117  0  0   94   42   100 0.858   4.54
 4.01 Init -  63387  63253  135  0  0   76  110    85 0.951   9.65
 4.00 Prom -  70984  70945   40                              -5.46

 5.00 Prom +  72005  72044   40                              -5.46
 5.01 Init +  72260  72308   49  1  1   86   58     8 0.519  -3.21
 5.02 Intr +  72825  72979  155  1  2   84   41   149 0.862   9.59
 5.03 Intr +  79492  79702  211  0  1   67   33   100 0.452   0.89
 5.04 Intr +  80041  80105   65  2  2   73   87   116 0.914   8.44
 5.05 Intr +  84850  85036  187  0  1   21   83   226 0.755  14.66
 5.06 Intr +  91297  91593  297  2  0  129   91   522 0.931  53.45
 5.07 Intr +  93581  93749  169  0  1  118  110   120 0.994  16.20
 5.08 Intr +  95215  95380  166  1  1   82  103   259 0.998  26.76
 5.09 Intr +  96935  97174  240  1  0  127  115   380 0.999  42.35
 5.10 Term +  99457  99567  111  0  0  116   51   123 0.992   9.96
 5.11 PlyA +  99643  99648    6                               1.05

 6.09 PlyA -  99659  99654    6                              -4.33
 6.08 Term - 100354  99998  357  1  0   93   49   150 0.741   6.11
 6.07 Intr - 101109 100974  136  2  1   55   72   116 0.265   7.27
 6.06 Intr - 109263 109148  116  0  2   70   32    98 0.169   1.65
 6.05 Intr - 109594 109572   23  2  2   85  113    12 0.676   0.66
 6.04 Intr - 110507 110408  100  2  1   80   80    54 0.659   3.48
 6.03 Intr - 114689 114650   40  1  1   98   37    13 0.070  -4.57
 6.02 Intr - 116919 116263  657  2  0   69   66   737 0.043  60.55
 6.01 Init - 117161 117103   59  2  2   65   75    21 0.148  -0.53
 6.00 Prom - 119897 119858   40                              -7.16

 7.00 Prom + 121343 121382   40                              -3.76
 7.01 Init + 123319 123430  112  0  1   93   83   140 0.543  14.38
 7.02 Intr + 128411 128707  297  0  0  129   91   576 0.911  58.85
 7.03 Intr + 130695 130863  169  1  1  118  110   120 0.994  16.20
 7.04 Intr + 132329 132494  166  2  1   82  103   279 0.998  28.76
 7.05 Intr + 134049 134288  240  2  0  127  115   311 0.999  35.45
 7.06 Term + 136571 136681  111  1  0  116   51   123 0.977   9.96
 7.07 PlyA + 136757 136762    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:154059360_154196473|GENSCAN_predicted_peptide_1|74_aa
MKVITGDLMEQLGRAVAAKEQLEKVEEGVETGTGVLLVKENRDARQVLQKSFKIIMEENY
HFYQYKTFIASQPS

>gi568815575r:154059360_154196473|GENSCAN_predicted_CDS_1|225_bp
atgaaggtcatcactggagacctcatggagcagcttggcagagctgtggcagcaaaggaa
caactggagaaggttgaagagggtgtggagacagggactggagtcttgctggtaaaggaa
aacagagatgctagacaggtgttacagaaaagcttcaaaatcatcatggaagaaaattat
catttttaccagtacaaaacttttattgcatcccaaccaagctaa

>gi568815575r:154059360_154196473|GENSCAN_predicted_peptide_2|159_aa
MRRQYTPTRMAKIQNTDNTKCCPAITANDGRARSGAEGGARARRCSSARAGQEGGARRRP
CGVPASAARALPPLGERAVVKAVRKMAAAAAAAPSGGGGGGEEERLALSPLRWKDLVLGA
SIPALSSETTAVPASPCACASFRWFLTPQCEAVVDIHKK

>gi568815575r:154059360_154196473|GENSCAN_predicted_CDS_2|480_bp
atgagacgccagtacacacctactagaatggccaaaatccagaacactgataacaccaaa
tgctgcccggccatcacagccaatgacgggcgggctcgcagcggcgccgagggcggggcg
cgggcgcgcaggtgcagcagcgcgcgggccggccaagagggcggggcgcgacgtcggccg
tgcggggtcccggcgtcggcggcgcgcgcgctccctcctctcggagagagggctgtggta
aaagccgtccggaaaatggccgccgccgccgccgccgcgccgagcggaggaggaggagga
ggcgaggaggagagactagcattatcaccgctgcgatggaaggaccttgtcctcggtgcc
tccataccagccctttccagtgaaactactgctgtcccggcaagcccctgtgcgtgtgca
tcattccggtggtttctgacacctcagtgtgaagctgttgtagacatccataagaaatag

>gi568815575r:154059360_154196473|GENSCAN_predicted_peptide_3|112_aa
MRQKLVERQGEIDESIDKVRDFNTPLSEINRSSRQKIRKDIVELTNTINQPNIIDIYRIL
YPTAAEYMFFSSSRGTFTEMDHILGHKAHLNKCKRIKSHSVFSQTTLELNKK

>gi568815575r:154059360_154196473|GENSCAN_predicted_CDS_3|339_bp
atgaggcaaaaactggtagaacgacaaggagaaattgatgagtccattgacaaagttaga
gacttcaatacccctctgtcagaaattaacagatccagcaggcagaaaatcaggaaggac
atagttgaactcaccaacaccatcaatcaacctaatataattgacatctatagaatactt
tatccaacagcagcagagtacatgttcttctcaagctcccgtgggacattcaccgagatg
gaccacattctgggccataaagcacaccttaacaaatgtaaaagaataaaatcacacagt
gttttctctcagaccacattggaattaaacaagaaatga

>gi568815575r:154059360_154196473|GENSCAN_predicted_peptide_4|83_aa
MLESVKEASPEDMSPSPVTVPGLRVFDGYRVSRCPYRVGQLRNPQYKTQTRMTMIEMSRV
PSSHPPQEPNSSQESIIPGTAVL

>gi568815575r:154059360_154196473|GENSCAN_predicted_CDS_4|252_bp
atgctggagagtgtcaaggaggcctcccctgaggacatgagtccctcccctgtgactgtg
cccggactgagggtctttgatggctacagagtttcacggtgtccctacagagttggtcag
ctcaggaacccccagtacaagactcagacccgcatgaccatgattgagatgtctcgggtg
ccctcgagccaccccccccaggagccaaattcttcccaagagtccattatcccggggaca
gctgtcttgtga

>gi568815575r:154059360_154196473|GENSCAN_predicted_peptide_5|549_aa
MGFHHVDQAGLKLLTSGQLCSSLKQLTRVSWQLSLVLDFSKISPQINNNTDKGLRTTVFG
TWLLGTIQPPGNPVDSTYKSSPESATSYHPTVASQTWSPRFCPGPYSSQTAARGVHPNQL
QIASLFLSKSDKAPHSIQKRQLQLRSTGSPSAAASRDLEQASIKRRDPQVMRQGRLPSGT
GLSIAMAQQWSLQRLAGRHPQDSYEDSTQSSIFTYTNSNSTRGPFEGPNYHIAPRWVYHL
TSVWMIFVVTASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAETVIASTISIVNQV
SGYFVLGHPMCVLEGYTVSLCGITGLWSLAIISWERWMVVCKPFGNVRFDAKLAIVGIAF
SWIWAAVWTAPPIFGWSRYWPHGLKTSCGPDVFSGSSYPGVQSYMIVLMVTCCIIPLAII
MLCYLQVWLAIRAVAKQQKESESTQKAEKEVTRMVVVMIFAYCVCWGPYTFFACFAAANP
GYAFHPLMAALPAYFAKSATIYNPVIYVFMNRQFRNCILQLFGKKVDDGSELSSASKTEV
SSVSSVSPA

>gi568815575r:154059360_154196473|GENSCAN_predicted_CDS_5|1650_bp
atggggtttcaccatgttgaccaggctggcctcaaacttctgacctcagggcagctgtgc
tcctctcttaagcaactgacccgtgtctcctggcaactctctttggtactggacttctcc
aagatcagtcctcaaattaataacaacactgacaagggtcttcggaccaccgtgtttggt
acctggctgcttgggaccatccagccaccaggaaatcctgttgactccacttacaaaagc
tctccagagtctgctacctcttaccaccctactgtcgcctcccagacctggtctccccgc
ttctgccctggcccctacagctcgcagacagctgccagaggagtccatccgaatcaactt
cagattgcgtcactttttctttcaaaatccgacaaagctcctcattccattcagaaaagg
cagctgcagctgcggagcacgggatctccatctgcagctgcatcccgggacctagaacag
gccagtataaagcgccgtgaccctcaggtgatgcgccagggccggctgccgtcggggaca
gggctttccatagccatggcccagcagtggagcctccaaaggctcgcaggccgccatccg
caggacagctatgaggacagcacccagtccagcatcttcacctacaccaacagcaactcc
accagaggccccttcgaaggcccgaattaccacatcgctcccagatgggtgtaccacctc
accagtgtctggatgatctttgtggtcactgcatccgtcttcacaaatgggcttgtgctg
gcggccaccatgaagttcaagaagctgcgccacccgctgaactggatcctggtgaacctg
gcggtcgctgacctagcagagaccgtcatcgccagcactatcagcattgtgaaccaggtc
tctggctacttcgtgctgggccaccctatgtgtgtcctggagggctacaccgtctccctg
tgtgggatcacaggtctctggtctctggccatcatttcctgggagagatggatggtggtc
tgcaagccctttggcaatgtgagatttgatgccaagctggccatcgtgggcattgccttc
tcctggatctgggctgctgtgtggacagccccgcccatctttggttggagcaggtactgg
ccccacggcctgaagacttcatgcggcccagacgtgttcagcggcagctcgtaccccggg
gtgcagtcttacatgattgtcctcatggtcacctgctgcatcatcccactcgctatcatc
atgctctgctacctccaagtgtggctggccatccgagcggtggcaaagcagcagaaagag
tctgaatccacccagaaggcagagaaggaagtgacgcgcatggtggtggtgatgatcttt
gcgtactgcgtctgctggggaccctacaccttcttcgcatgctttgctgctgccaaccct
ggttacgccttccaccctttgatggctgccctgccggcctactttgccaaaagtgccact
atctacaaccccgttatctatgtctttatgaaccggcagtttcgaaactgcatcttgcag
cttttcgggaagaaggttgacgatggctctgaactctccagcgcctccaaaacggaggtc
tcatctgtgtcctcggtatcgcctgcatga

>gi568815575r:154059360_154196473|GENSCAN_predicted_peptide_6|495_aa
MSITSSVCVGPILNAQILRVEDGPSGPSSLADGGLAHNLQDSVRHRILYLSEQLRVEKAS
RDGNTVSYLKLVSKADRHQVPHIQQAFEKVNQRASATIAQIEHRLHQCHQQLQELEEGCR
PEGLLLMAESDPANCEPPSEKALLSEPPEPGGEDGPVNLPHASRPFILESRFQSLQQGTC
LETEDVAQQQNLLLQKVKAELEEAKRFHISLQESYHSLKERSLTDLQLLLESLQEEKCRL
LLQILEACTSDPKGETETRTHPEGERRVETEAEADLPWQKPRKPAASSPFELHSTAGWVV
YKERKFNVKALAAGEDLCTASSYGGRQKGKRAQALMEEQVNGRLQGQLNEIYNLKHNLAC
SEERMAYLSYERAKEIWEITETFKSRISKLEMLQQVTQLEAAEHLQSRPPQMLFKFLSPR
LSLATVLLVFVSTLCACPSSLISSRLCTCTMLMLIGLGVLAWQRWRAIPATDWQEWVPSR
CRLYSKDSGPPADGP

>gi568815575r:154059360_154196473|GENSCAN_predicted_CDS_6|1488_bp
atgagcatcacctcctcagtctgtgtcgggcccatcttgaatgcacaaatcttgcgtgtc
gaagacggccccagtggcccttccagcctcgcagatggaggcctagcccacaacttacag
gatagtgtcaggcaccgcatcctctacctctcagagcagctgagagtggagaaggccagt
cgggatggcaacactgtgagctacctcaagctggtatccaaagcagaccggcaccaggtg
ccgcacatccagcaggcctttgagaaggtgaaccagcgcgcctctgccaccatcgcccag
atcgagcacaggctccaccagtgtcaccagcagctccaggagctggaggaaggctgcagg
cccgagggcttactgctgatggcagaaagcgacccagccaactgcgagccacccagtgag
aaggccctgctttcagagccccccgagccaggtggggaagacgggccggtcaacctgcct
catgccagcaggcccttcatcttggagagtcgcttccagagcttacagcaggggacgtgc
ttagagacagaggatgtggcccagcaacaaaacctgctgttgcagaaggtaaaggcagag
ctggaagaagccaagaggttccacatcagcctccaggagtcctatcacagcctaaaggag
aggtctctgactgacctgcagctgttgctggagtcccttcaggaggagaagtgtagactt
ttgctgcaaattctagaagcctgcacttctgacccgaaaggggagacggagacacggaca
cacccagagggagagcgccgtgtggagacagaggctgaggccgacctgccgtggcagaag
ccaaggaagcccgcagcttcatctccttttgaacttcacagtaccgcaggctgggtggtt
tataaagaaaggaagttcaatgtcaaggcgctggcagctggtgaggacctttgtacggca
tcatcctatggcggaaggcagaagggcaagagggcgcaagcattgatggaagaacaggtg
aatggtcgcctgcagggacagctgaatgagatttacaacctcaaacacaatctggcctgc
agcgaagagagaatggcctatctatcctatgagagagccaaggaaatatgggagatcacg
gagaccttcaagagccgaatatccaagctggagatgctacagcaagtcacccaactggag
gcagcggagcacctccaaagccgtcccccgcagatgttgttcaagttcctgagtccgcgc
ctctcactggcaaccgtcctcttggtctttgtctccaccttgtgtgcctgcccctcgtca
ctgatcagctcacgcctgtgcacctgcaccatgctgatgctgatcgggcttggggtcctg
gcctggcagaggtggcgcgccatccctgccacagactggcaggaatgggtcccctccagg
tgtagactgtactccaaggactctgggcctccagcagatggaccttaa

>gi568815575r:154059360_154196473|GENSCAN_predicted_peptide_7|364_aa
MAQQWSLQRLAGRHPQDSYEDSTQSSIFTYTNSNSTRGPFEGPNYHIAPRWVYHLTSVWM
IFVVIASVFTNGLVLAATMKFKKLRHPLNWILVNLAVADLAETVIASTISVVNQVYGYFV
LGHPMCVLEGYTVSLCGITGLWSLAIISWERWMVVCKPFGNVRFDAKLAIVGIAFSWIWA
AVWTAPPIFGWSRYWPHGLKTSCGPDVFSGSSYPGVQSYMIVLMVTCCITPLSIIVLCYL
QVWLAIRAVAKQQKESESTQKAEKEVTRMVVVMVLAFCFCWGPYAFFACFAAANPGYPFH
PLMAALPAFFAKSATIYNPVIYVFMNRQFRNCILQLFGKKVDDGSELSSASKTEVSSVSS
VSPA

>gi568815575r:154059360_154196473|GENSCAN_predicted_CDS_7|1095_bp
atggcccagcagtggagcctccaaaggctcgcaggccgccatccgcaggacagctatgag
gacagcacccagtccagcatcttcacctacaccaacagcaactccaccagaggccccttc
gaaggcccgaattaccacatcgctcccagatgggtgtaccacctcaccagtgtctggatg
atctttgtggtcattgcatccgtcttcacaaatgggcttgtgctggcggccaccatgaag
ttcaagaagctgcgccacccgctgaactggatcctggtgaacctggcggtcgctgacctg
gcagagaccgtcatcgccagcactatcagcgttgtgaaccaggtctatggctacttcgtg
ctgggccaccctatgtgtgtcctggagggctacaccgtctccctgtgtgggatcacaggt
ctctggtctctggccatcatttcctgggagagatggatggtggtctgcaagccctttggc
aatgtgagatttgatgccaagctggccatcgtgggcattgccttctcctggatctgggct
gctgtgtggacagccccgcccatctttggttggagcaggtactggccccacggcctgaag
acttcatgcggcccagacgtgttcagcggcagctcgtaccccggggtgcagtcttacatg
attgtcctcatggtcacctgctgcatcaccccactcagcatcatcgtgctctgctacctc
caagtgtggctggccatccgagcggtggcaaagcagcagaaagagtctgaatccacccag
aaggcagagaaggaagtgacgcgcatggtggtggtgatggtcctggcattctgcttctgc
tggggaccatacgccttcttcgcatgctttgctgctgccaaccctggctaccccttccac
cctttgatggctgccctgccggccttctttgccaaaagtgccactatctacaaccccgtt
atctatgtctttatgaaccggcagtttcgaaactgcatcttgcagcttttcgggaagaag
gttgacgatggctctgaactctccagcgcctccaaaacggaggtctcatctgtgtcctcg
gtatcgcctgcatga