GENSCAN 1.0 Date run: 8-Nov-116 Time: 07:48:54 Sequence gi568815587r:58407179_58608105 : 200927 bp : 37.61% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 210 294 85 2 1 72 56 105 0.688 6.73 1.02 Term + 6491 7197 707 0 2 28 38 241 0.373 5.79 1.03 PlyA + 7379 7384 6 1.05 2.05 PlyA - 8674 8669 6 1.05 2.04 Term - 15452 15154 299 0 2 105 38 68 0.473 -2.06 2.03 Intr - 15898 15669 230 0 2 72 19 219 0.349 9.99 2.02 Intr - 20928 20841 88 1 1 85 116 12 0.647 1.81 2.01 Init - 21277 21124 154 1 1 94 49 112 0.748 8.09 2.00 Prom - 26145 26106 40 -5.55 3.02 PlyA - 28217 28212 6 1.05 3.01 Sngl - 32973 32029 945 0 0 52 32 311 0.970 18.19 3.00 Prom - 46435 46396 40 -3.65 4.00 Prom + 50510 50549 40 -5.05 4.01 Init + 50693 50768 76 1 1 83 99 54 0.723 7.30 4.02 Intr + 59348 59436 89 0 2 53 40 53 0.032 -4.23 4.03 Term + 76592 76813 222 1 0 91 43 165 0.709 8.23 4.04 PlyA + 77709 77714 6 1.05 5.05 PlyA - 78183 78178 6 1.05 5.04 Term - 86576 86458 119 0 2 76 47 46 0.022 -2.98 5.03 Intr - 88677 88597 81 1 0 46 86 85 0.025 2.79 5.02 Intr - 98455 98291 165 2 0 120 86 27 0.264 4.71 5.01 Init - 100927 100042 886 1 1 107 -19 480 0.281 33.82 5.00 Prom - 104528 104489 40 -5.85 6.00 Prom + 105422 105461 40 -9.05 6.01 Init + 108450 108536 87 2 0 88 88 28 0.212 3.55 6.02 Term + 120246 120554 309 2 0 35 44 329 0.549 17.28 6.03 PlyA + 121160 121165 6 1.05 7.15 PlyA - 121425 121420 6 1.05 7.14 Term - 124921 124686 236 2 2 38 49 213 0.003 7.90 7.13 Intr - 125763 125632 132 1 0 36 39 165 0.003 6.10 7.12 Intr - 142689 142608 82 0 1 108 86 15 0.185 1.59 7.11 Intr - 142968 142795 174 0 0 93 95 115 0.976 11.91 7.10 Intr - 144054 143887 168 0 0 69 100 124 0.984 10.92 7.09 Intr - 147762 147663 100 2 1 106 113 152 0.973 18.69 7.08 Intr - 157023 156977 47 0 2 54 68 43 0.048 -4.81 7.07 Intr - 163535 163378 158 0 2 83 56 131 0.521 8.21 7.06 Intr - 166956 166832 125 2 2 66 63 109 0.518 5.51 7.05 Intr - 171920 171804 117 1 0 68 19 103 0.000 0.06 7.04 Intr - 172302 172149 154 1 1 21 -6 170 0.000 -0.89 7.03 Intr - 172817 172639 179 1 2 44 87 110 0.000 5.14 7.02 Intr - 190776 190636 141 2 0 73 85 126 0.571 9.35 7.01 Intr - 199723 199618 106 2 1 58 113 30 0.335 0.85 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 123913 124055 143 0 2 73 98 118 0.893 10.95 S.002 Intr + 124420 124893 474 1 0 -1 36 287 0.801 5.85 S.003 Init - 150073 149988 86 1 2 72 13 57 0.865 -3.06 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:58407179_58608105|GENSCAN_predicted_peptide_1|263_aa MEISGDVPCERIYSSSFYFPGEEAIKAAEKLSGSEFTAEFYQRYKEKLVPFLLKLFQSVE KEGMLHNSFYEASIILIPKPGTDTTKKENFKPISLMNINAKILNKILANRIQQHIKKLIH HDQVGFISGMQGWFKICKSINVIQHINRTKDKNHMIISIDAEKAFDKIQQPFMLKTLNKL GLDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNTMLEVLARAIR QEKEIKGIQSGEEEVKLSLLQMT >gi568815587r:58407179_58608105|GENSCAN_predicted_CDS_1|792_bp atggagatctctggagatgtcccctgtgagaggatctacagctcctcattctactttcct ggtgaagaagcaattaaagctgccgagaagttatctggttccgaattcacggccgaattc taccagaggtacaaggagaagctggtaccattccttctgaaactattccaatcagtagaa aaagagggaatgctccataattcattttatgaggccagcatcattctgataccaaagcct ggcacagacacaacaaaaaaagagaattttaaaccaatatccctgatgaacatcaatgca aaaatcctcaataaaatactggccaaccgaatccagcagcacatcaaaaagcttatccac catgatcaagtgggcttcatctctgggatgcaaggctggttcaaaatatgcaaatcaata aatgtaatccagcatataaacagaaccaaagacaaaaaccacatgattatctcaatagat gcagaaaaagccttcgacaaaattcaacagcccttcatgctaaaaactctcaataaatta ggtcttgatgggacgtatctcaaaataataagagctatttatgacaaacccacagccaat atcatactgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacaggga tgccctctctcaccactcctattcaacacaatgttggaagttctggccagggccatcagg caggagaaagaaataaagggtattcaatcaggagaagaggaagtcaaattgtccctgttg cagatgacatga >gi568815587r:58407179_58608105|GENSCAN_predicted_peptide_2|256_aa MACTSAGPYKVQRAILRPLKKRSSNKQSNCCYANDPMRERVHPRLHGLSRGSGFPIPLST AEAHAILASPFWPCGFLLQSSNLSLVDFGYSSAVTPKVMAGFLRGDKVISYNACAVQMFF FVALATVENYLLASMAYDRYAAVCKPLHYTTTMTASVVIFISYLFIFITILKMHSAKGHQ KALSTCASHFTAVSVFYGTVIFIYLQPSSSHSMDTDKMASVFYAMIIPMLNPVVYSLRNR EVQNAFKKVLRRQKFL >gi568815587r:58407179_58608105|GENSCAN_predicted_CDS_2|771_bp atggcatgtacaagtgctggcccttacaaggttcagagagcaatcctcagacctctgaag aaacgctccagcaataagcagagcaactgctgctatgccaatgaccccatgagggaaaga gtacatcctaggctccatggcctatcaaggggtagtggattcccaattccattgtccaca gctgaagcccatgctattcttgcctctccattctggccatgtggattcctcctccaatca agtaacctgtctctggtggactttggatactcctcagctgtcactcccaaggtcatggct gggttccttagaggagacaaggtcatctcctacaatgcatgtgctgttcagatgttcttc tttgtagccttggccacggtggaaaattacttgttggcctcaatggcctatgaccgctat gcagcagtgtgcaaacccctacactacaccaccaccatgacggccagtgtagttatcttt atctcctacttgttcatattcatcaccatcttgaagatgcattcagctaagggacaccaa aaagcattgtccacctgtgcctctcacttcactgcagtctccgtcttctatgggacagta atcttcatctacttgcagcccagctccagccactccatggacacagacaaaatggcatct gtgttctatgctatgatcatccccatgctgaaccctgtggtctacagcctgaggaacaga gaagtccagaatgcattcaagaaagtgttgagaaggcaaaaatttctataa >gi568815587r:58407179_58608105|GENSCAN_predicted_peptide_3|314_aa MENNTEVTEFILVGLTDDPELQIPLFIVFLFIYLITLVGNLGMIELILLDSCLHTPMYFF LSNLSLVDFGYSSAVTPKVMVGFLTGDKFILYNACATQFFFFVAFITAESFLLASMAYDR YAALCKPLHYTTTMTTNVCACLAIGSYICGFLNASIHTGNTFRLSFCRSNVVEHFFCDAP PLLTLSCSDNYISEMVIFFVVGFNDLFSILVILISYLFIFITIMKMRSPEGRQKAFSTCA SHLTAVSIFYGTGIFMYLRPNSSHFMGTDKMASVFYAIVIPMLNPLVYSLRNKEVKSAFK KTVGKAKASIGFIF >gi568815587r:58407179_58608105|GENSCAN_predicted_CDS_3|945_bp atggagaacaacacagaggtgactgaattcatccttgtggggttaactgatgacccagaa ctgcagatcccactcttcatagtcttccttttcatctacctcatcactctggttgggaac ctggggatgattgaattgattctactggactcctgtctccacacccccatgtacttcttc ctcagtaacctctccctggtggactttggttattcctcagctgtcactcccaaggtgatg gtggggtttctcacaggagacaaattcatattatataatgcttgtgccacacaattcttc ttctttgtagcctttatcactgcagaaagtttcctcctggcatcaatggcctatgaccgc tatgcagcattgtgtaaacccctgcattacaccaccaccatgacaacaaatgtatgtgct tgcctggccataggctcctacatctgtggtttcctgaatgcatccattcatactgggaac actttcaggctctccttctgtagatccaatgtagttgaacactttttctgtgatgctcct cctctcttgactctctcatgttcagacaactacatcagtgagatggttattttttttgtg gtgggattcaatgacctcttttctatcctggtaatcttgatctcctacttatttatattt atcaccatcatgaagatgcgctcacctgaaggacgccagaaggccttttctacttgtgct tcccaccttactgcagtttccatcttttatgggacaggaatctttatgtacttacgacct aactccagccatttcatgggcacagacaaaatggcatctgtgttctatgccatagtcatt cccatgttgaatccactggtctacagcctgaggaacaaagaggttaagagtgcctttaaa aagactgtagggaaggcaaaggcctctataggattcatattttaa >gi568815587r:58407179_58608105|GENSCAN_predicted_peptide_4|128_aa MDEAGNPHSQQTNTETENQTQHVLTPAAGAPQSLNTIIIRRLTYLSREKSNSWELGLISF YQGNCEIVLILEEQKHQCVTSVRVKGYGDQVINGALAQFCLTVGPMGPQTHPVVISPVLE CVVGIDSS >gi568815587r:58407179_58608105|GENSCAN_predicted_CDS_4|387_bp atggatgaagctggaaaccctcattctcagcaaactaacacagaaacagaaaaccaaaca cagcatgttctcactccagcagctggagcacctcaaagcctgaacactataatcatcaga aggctcacctacctgagtcgagaaaaatcaaacagctgggaactgggacttatttccttt taccagggtaactgtgaaattgtactaattctggaagaacaaaaacatcagtgtgtcaca tcagtcagagtaaagggttatggagatcaggtgatcaatggagctttagctcagttttgt ctcacagtgggcccaatgggtccccaaacacatccagttgtcatttccccagttctggaa tgcgtagttggaattgatagcagctag >gi568815587r:58407179_58608105|GENSCAN_predicted_peptide_5|416_aa MENSTEVTEFILLGLTDDPNLQIPLLLAFLFIYLITLLGNGGMMVIIHSDSHLHTPMYFF LSNLSLVDLGYSSAVAPKTVAALRSGDKAISYDGCAAQFFFFVGFATVECYLLASMAYDR HAAVCRPLHYTTTMTAGVCALLATGSYVSGFLNASIHAAGTFRLSFCGSNEINHFFCDIP PLLALSCSDTRISKLVVFVAGFNVFFTLLVILISYFFICITIQRMHSAEGQKKVFSTCAS HLTALSIFYGTIIFMYLQPNSSQSVDTDKIASVFYTVVIPMLNPLIYSLRNKEVKTMRPI QQHTGRTAFTGQDPLHIGIFLIEIYRAQEYLHLEKSLGSSYPSTGCIPISDEFRDLMLFA VPAEFILEGTDSRTRQQGGTFISQDLLKSFRTGLSCLWSVLCPTYTAQVKRRERNH >gi568815587r:58407179_58608105|GENSCAN_predicted_CDS_5|1251_bp atggagaatagcacagaagtgacagagtttatcctcttgggattaacagatgaccccaat cttcagatacccctcctcctggcatttttattcatctacctcatcaccctgcttgggaat gggggaatgatggtgatcatccactcagactcccatctccacactccaatgtactttttc ctcagtaacctctcccttgtagacttgggttactcatcagctgtagcccccaaaacggtg gctgcattgcggtcaggggacaaggccatctcctacgatggatgtgcagctcagttcttc ttctttgtggggtttgccactgttgagtgctacctcctggcctccatggcctatgatcgc catgcagcggtatgtaggcctcttcattacaccaccaccatgacagcaggtgtgtgtgcc ctccttgctactggttcctatgtctctggcttcctcaatgcctctatccatgcagcaggc accttcagactctccttctgtggttctaatgagattaatcatttcttctgtgacattccc ccactcctggctctctcatgctctgacacacgcatcagcaagttggtggtctttgtggca ggcttcaacgtctttttcaccctcctggtcatccttatttcttacttcttcatatgcatc accattcagaggatgcattctgctgaagggcagaagaaagtcttctccacctgtgcttcc catctcactgctttgtccatcttctatggcacaatcatcttcatgtacttacagcccaac tccagccagtccgtggacacagacaaaatagcctctgtgttttacacagtggtgattccc atgctgaatcccttgatatacagccttaggaacaaagaagtgaaaacaatgaggcctatt cagcaacacacaggaagaacagcctttacaggccaggatccactgcatataggaatcttt ctcattgaaatctacagagctcaagagtatctgcatttggagaagagcctgggaagcagc tatccaagcactgggtgcattccaatctcagatgaatttcgtgaccttatgctctttgct gtacctgcagaattcattttggagggcacagactctagaaccagacagcagggagggacc ttcatatcccaggacctgctgaagtcattccgaacaggtttaagttgcctctggtctgtc ctctgtcccacctacacagcacaagtaaaaaggagagagagaaaccactga >gi568815587r:58407179_58608105|GENSCAN_predicted_peptide_6|131_aa MGCRMDIVLPGMKTTFISLSISIKALSDQESEEAMDQLALQWEELIEARLTIGLVILLPE NALRQLCQAESTHKVLRMELVPHGTDTTASDGLPTPMAERSPAVMVMELTEWTSIQFKEG ASRKTGEAVLG >gi568815587r:58407179_58608105|GENSCAN_predicted_CDS_6|396_bp atgggctgcagaatggatattgtattaccaggcatgaaaaccacattcatctccttgtcc atctccataaaagctttgagtgaccaggaatctgaagaggctatggatcagttggcatta cagtgggaagagcttattgaagcaaggttgacaataggtcttgtcattctgctccctgaa aatgcccttcgacaactgtgtcaggcagaaagcacacacaaagtgctcaggatggaactt gtaccccatggcactgatacaacggccagtgatgggctgcccacacccatggcagagcgt tccccggcggtgatggtaatggagctcacagaatggacgtccatccagttcaaagaagga gccagtagaaaaactggtgaagcagtcctggggtag >gi568815587r:58407179_58608105|GENSCAN_predicted_peptide_7|639_aa XINTYHEQTLSESTLWKLLTIVKGYKSQVSTESRKRAIPETLTNTTGPIRVATSDTGSAS KKQRKVTVLQEKVELLNKHRRLRACDVILESRLKERDRVTKGVRMPERGMSLRVSGRRWI TSECRGVSKDDFGEWGNGPTSEAAAAAAAAARPRSRPPLSTRLLVVPAGAAATASGGRCW GSSGAALAASPPSWSATAGGARARATDAQRANRKRGQASSPRPSQSYTQRRPRLLMATLQ QFLQIVISPVTTLETWCGSFFLLAEKWHRGQVGSNALLEELERSTLQDSDEYSNPAPLPL DQHSRKETNLDETSEILSIQDNTSPLPAQLVYTTNIQELNVYSEAQEPKESPPPSKTSAA AQLDELMAHLTEMQAKVAVRADAGKKHLPDKQDHKASLDSMLGGLEQELQDLGIATVPKG HCASCQKPIAGKVIHALGQSWHPEHFVCTHCKEEIGSSPFFERSGLAYCPNDYHQLFSPR CAYCAAPILDKVLTAMNQTWHPEHFFCSHCGEVFGAEGVKLQTFAVSVTAHKGSADPKSE QQQDLLQRVKEQNFYSVEGDPTAVSVANPLTAWGWRCWPATLSAGPADPVLPETRAGPRA PHAAPVPARASPSTPPRRQRELAPASASPEWGSHSAAAG >gi568815587r:58407179_58608105|GENSCAN_predicted_CDS_7|1920_bp naaataaacacatatcatgaacaaacactgtcagaatcaactttgtggaaactcttaaca atagttaaaggttataaaagccaagtgagcactgaatcaagaaaaagagcaatacctgaa accttgactaacactacgggacccataagagttgccactagtgatactggaagtgcttcc aagaagcagagaaaagtcacggtattacaagaaaaagttgaattgcttaataagcataga aggttgagggcatgtgatgtaatactggagagccgcctaaaggaaagggatagggtgaca aagggggtgcggatgccagaaagagggatgagcctgcgggtgtcagggaggaggtggatc actagcgagtgcaggggtgtatcaaaagatgatttcggggagtggggaaacggcccgaca agcgaagctgcggcggcggcggccgcagcggcccggcctcggtcccgacctcccctcagc acgcggctgctagtggtccctgcaggcgccgccgcgaccgcctcagggggccgttgttgg ggctcctccggagccgccttggccgcctctcccccttcctggtccgctaccgctggcggc gcgcgcgcgcgcgccacagacgctcagcgggccaatcggaaaagaggacaagcctcctca ccccgcccctcgcagtcttacacgcagcggcgccccaggctgttgatggctacacttcaa cagtttcttcaaattgtcatcagcccagtaactacccttgagacttggtgtggatctttc ttcctgctggcagagaaatggcatcgaggacaggttggctcaaatgccttattggaggaa ctggaacgctccacccttcaggacagtgatgaatattccaacccagctcctcttcccctg gatcagcattccagaaaggagactaaccttgatgagacttcggagatcctttctattcag gataacacaagtcccttgccggcgcagctcgtgtatactaccaatatccaggagctcaat gtctacagtgaagcccaagagccaaaggaatcaccaccaccttctaaaacgtcagcagct gctcagttggatgagctcatggctcacctgactgagatgcaggccaaggttgcagtgaga gcagatgctggcaagaagcacttaccagacaagcaggatcacaaggcctccctggactca atgcttgggggtctggagcaggaattgcaggaccttggcattgccacagtgcccaagggc cattgtgcatcctgccagaaaccgattgctgggaaggtgatccatgctctagggcaatca tggcatcctgagcattttgtctgtactcattgcaaagaagagattggctccagtcccttc tttgagcggagtggcttggcctactgccccaacgactaccaccaacttttttctccacgc tgtgcttactgcgctgctcccatcctggataaagtgctgacagcaatgaaccagacctgg cacccagagcacttcttctgctctcactgcggagaggtgtttggtgcagaaggagtgaag ctgcagaccttcgcggtgagtgttacagctcataaaggcagtgcggacccaaagagtgag cagcagcaagatttattgcaaagagtgaaagaacaaaacttctacagtgtggaaggggac ccaactgctgtctcagttgctaatcctctcactgcctggggctggcggtgctggccggcc actctgagtgcggggcctgctgatcccgtgctacccgaaactcgtgctggcccacgagca ccgcatgcagccccagttcctgcccgtgcctctccgtccacacctccccgcaggcagagg gagctggctccagcctcagccagcccagagtggggctcccacagtgcagcggcaggctga