GENSCAN 1.0    Date run: 5-Nov-116    Time: 00:21:15

Sequence gi568815592f:117898975_118099445 : 200471 bp : 38.41% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   8753   8925  173  1  2   91   99   266 0.789  26.96
 1.02 Intr +  17392  17510  119  1  2   28   77   113 0.077   3.39
 1.03 Term +  18857  19071  215  0  2   11   47   184 0.910   3.01
 1.04 PlyA +  20766  20771    6                               1.05

 2.06 PlyA -  21049  21044    6                              -1.95
 2.05 Term -  21274  21053  222  1  0  108   42    89 0.060   2.23
 2.04 Intr -  37348  37170  179  2  2   43   68   141 0.092   6.32
 2.03 Intr -  38240  38161   80  1  2  -16   99    62 0.184  -4.82
 2.02 Intr -  39741  39554  188  0  2   37   80    75 0.218  -0.83
 2.01 Init -  42152  42075   78  2  0  117   92    31 0.729   7.51
 2.00 Prom -  45765  45726   40                              -7.15

 3.00 Prom +  51562  51601   40                              -4.85
 3.01 Sngl +  57010  57564  555  0  0   73   38   242 0.704  13.61
 3.02 PlyA +  59182  59187    6                               1.05

 4.04 PlyA -  60961  60956    6                               1.05
 4.03 Term -  66947  66871   77  0  2   69   37    90 0.742  -0.98
 4.02 Intr -  67439  67314  126  0  0   66   78   167 0.944  13.23
 4.01 Init -  67798  67615  184  1  1   34   89   154 0.939   9.43
 4.00 Prom -  70030  69991   40                              -8.05

 5.02 PlyA -  70198  70193    6                               1.05
 5.01 Sngl -  72875  72393  483  2  0   76   48   286 0.613  18.62
 5.00 Prom -  73549  73510   40                              -5.15

 6.00 Prom +  73674  73713   40                              -5.45
 6.01 Init +  81307  81326   20  0  2   68   73    19 0.415  -2.02
 6.02 Intr +  87829  87960  132  1  0   67  110    99 0.959   8.84
 6.03 Intr +  88153  88294  142  1  1  117  101    31 0.968   6.73
 6.04 Intr +  95462  95576  115  1  1   24   79   100 0.044   1.80
 6.05 Term +  97778  98172  395  0  2  -30   44   367 0.027  14.71
 6.06 PlyA +  98256  98261    6                               1.05

 7.00 Prom +  99256  99295   40                              -6.15
 7.01 Init + 100001 100442  442  1  1  110   75   500 0.982  47.17
 7.02 Term + 100812 101380  569  1  2   46   38   250 0.879   9.39
 7.03 PlyA + 101446 101451    6                               1.05

 8.04 PlyA - 103188 103183    6                               1.05
 8.03 Term - 106144 106076   69  1  0  123   50     2 0.069  -3.14
 8.02 Intr - 112402 112358   45  1  0   92   87    59 0.389   3.89
 8.01 Init - 112786 112724   63  1  0   75   92    28 0.288   3.26
 8.00 Prom - 117655 117616   40                              -3.15

 9.07 PlyA - 119041 119036    6                               1.05
 9.06 Term - 123606 123430  177  0  0  137   46    56 0.328   3.00
 9.05 Intr - 143916 143656  261  0  0   93   92    29 0.029   0.56
 9.04 Intr - 156694 156647   48  1  0  103   50    50 0.010   0.66
 9.03 Intr - 181202 181160   43  1  1  114   82    21 0.016   1.42
 9.02 Intr - 188553 188468   86  0  2   89   94    62 0.527   4.60
 9.01 Intr - 190395 190342   54  0  0  107   55    57 0.625   2.56

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  17393  17510  118  1  1   42   77   114 0.823   6.11
S.002 Sngl +  97843  98172  330  0  0   88   44   319 0.899  23.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592f:117898975_118099445|GENSCAN_predicted_peptide_1|168_aa
MIPPEQPQQQLQPPSPAPPNHVVTTIENLPAEGSGGGGSLSASSRAGVRQRIRKVLNRMF
EKHWCQPDIQVEVRVNLDSGSASNHGWGERFLIVGASDREKEGYQEQFGGHTNPQTQKDS
AKELEKEGSEVGRKPGEYGVQKPREKMFPEGGNKQSTVLRVIEERTDK

>gi568815592f:117898975_118099445|GENSCAN_predicted_CDS_1|507_bp
atgatcccccctgagcagccgcagcagcagctgcagccgccgtcgccagccccgccgaac
catgtggtgaccaccatcgagaacctgccggccgagggcagcggcggcggcgggagcctg
tccgcctcctcccgggctggcgtgcgccagaggatccgcaaagtgctgaacagaatgttt
gagaagcactggtgccagccagatatccaagtggaagtaagagtgaatttggattctggg
agtgcatctaatcatggatggggggagcggttcctcattgtgggggcatcagatagagag
aaagaggggtaccaagaacaatttggggggcacactaaccctcaaacacagaaggactca
gcaaaggaactggagaaggaaggatctgaagtaggaaggaaaccaggagagtatggtgtc
cagaagccaagagagaagatgtttccagaaggaggcaataagcagtccactgtgctgaga
gtaattgaggaaaggacagataaatga

>gi568815592f:117898975_118099445|GENSCAN_predicted_peptide_2|248_aa
MGTPSLKITKKEINHDQGSNLTCSLQFGPKNPCSGPFTTENYQKGRKHLPSHFPRDGQAV
RSRLKLLRSLSDSAGIRPSCWQNANQGGRVWQTEEIWSRRVIRSDPNSTTYLEPTAIAEA
STPGPTQHVRTVDFGGTWTVSVHKKQKVPICRRGAFKHHTWQHIPTLTLQQRQILPPPKP
RGSRTPSPSTRVPRVCGAICGGQGPSLRAQCFPGLQQDPDRYRDSSQPREPGVTKSSTEP
AGIWASPH

>gi568815592f:117898975_118099445|GENSCAN_predicted_CDS_2|747_bp
atggggacaccaagtcttaaaataacgaagaaggagataaaccatgaccagggatcaaat
ttgacatgcagccttcagtttggacccaagaacccttgttctgggcccttcaccactgag
aattaccaaaaaggaagaaaacatcttccttcccacttcccaagagatggccaagctgtc
aggagcaggctgaagcttctcaggagccttagtgactcagcaggaatccgcccctcctgc
tggcaaaatgccaatcaggggggcagagtgtggcagactgaagaaatttggagtcgaaga
gttattagatcagaccccaactctactacttatttggaacccacagcaatagcagaagca
tccacccctggacctacacagcatgtgaggacagtagactttggagggacttggactgtc
agtgttcacaagaaacagaaagtccctatatgcagaagaggagcatttaaacaccacact
tggcagcacatcccaactctcaccctccaacaaaggcagatcctaccaccacccaagccc
cgaggctcgcgcacgccctccccgtccacccgcgttccgagggtctgcggggctatctgt
ggaggacagggcccctctctaagagcgcagtgtttccccggcctgcagcaggatccagat
aggtacagagactccagccagccccgggaacccggcgtcacaaagagctccactgagccc
gctggcatctgggcttcccctcattag

>gi568815592f:117898975_118099445|GENSCAN_predicted_peptide_3|184_aa
MRDQYRAGLNQLAVLWLMGMAMFCHLGEYELLYRKIKNLQPELNVVPASRPTITCLFCSC
IGCPRVVHKYLSFSSQTGVGLGVGQGIFLPHRMEHLKLFVSSIHSLLDSGHQAFPEGEQA
LILCLAQLERTYKGQCRAMVLSFDVLLNCLLQGGFCPLHCGCETVIKPSMSAALLGNAGT
VPGI

>gi568815592f:117898975_118099445|GENSCAN_predicted_CDS_3|555_bp
atgagggatcaatatcgtgctggcttaaaccagctagctgtgctctggctcatggggatg
gcaatgttttgtcacttgggagagtatgagttgctttataggaagattaagaacctccaa
ccagagctgaatgtagtccctgcttccaggcctactattacatgtcttttctgcagttgt
attggatgccccagggtagtgcacaaatacctgtctttcagttcacaaacaggtgttggc
ttaggagtggggcagggtatcttcctaccgcacagaatggagcacctcaagctctttgtc
tccagcattcacagccttctggacagtggacatcaggctttcccagaaggtgagcaagcc
ctgattctttgccttgcccagctggagagaacatataaggggcaatgcagagcaatggtg
ctgtcttttgatgtgctgttgaactgccttttgcagggaggattctgccccttgcattgt
ggatgtgaaactgtgattaaaccttcaatgagtgctgctttactaggcaacgcggggaca
gtgcctggaatctag

>gi568815592f:117898975_118099445|GENSCAN_predicted_peptide_4|128_aa
MALQEEEIWTQETETRPCEDTGRRWPSTSQGERPQKNQSCCVQNWLVLGLADFKNEATDP
RGVKLQIFAASVTALKGSASGVVSSCRWVRGLADFRSEAEDLCPRQKSSPGPLPDQLDIA
LIGAFTNL

>gi568815592f:117898975_118099445|GENSCAN_predicted_CDS_4|387_bp
atggctttacaagaagaggaaatttggacacaggaaacagagacaagaccatgtgaagac
acagggagaagatggccatcaacaagccaaggagagaggcctcagaagaaccaatcctgc
tgtgtccagaattggttggttcttggtctcgctgacttcaagaatgaagccacggaccct
cgtggagtgaagctgcagatcttcgcggcgagtgttacagctcttaaaggcagtgcctct
ggagttgttagttcctgccggtgggttcgtggtctcgctgacttcaggagcgaagctgaa
gacctttgccctagacagaaaagttctccaggtcccctacccgatcagttagacatagcg
ctgattggtgcatttacaaacctttag

>gi568815592f:117898975_118099445|GENSCAN_predicted_peptide_5|160_aa
MVSWARPRAPVLYVAPDMVPCVPAAIALARAKRGQCTGQAVASEGASSKPWWFTCGIGPA
GAYKSRTEVWEPSLRFQRMYGNDWMSRQKLTAGVDPSWRASARAVQKGNVGLESPHRVPT
GALLSGAVRREPPSSRCQNCDPVTACTVHLERPQTLNNSL

>gi568815592f:117898975_118099445|GENSCAN_predicted_CDS_5|483_bp
atggtttcctgggccaggcccagggcccctgtcctctatgtagccccagacatggtgccc
tgtgtcccagctgctatagctctagccagggctaaaaggggccaatgtacaggtcaggct
gttgcttcagagggtgcaagctccaagccttggtggtttacatgtggcattgggcctgca
ggtgcatacaagtcaagaactgaggtttgggaaccttcacttagatttcagaggatgtat
ggaaatgactggatgtccaggcagaagcttactgcaggggtggacccctcatggagagcc
tctgctagggcagtgcagaagggaaatgttgggttggagtccccacacagagtccctact
ggggcactgcttagtggagctgtgagaagagagccaccatcctccagatgccagaactgt
gatccagtgacagcttgcactgtgcacctggaaaggccacagacactcaacaacagcctg
tga

>gi568815592f:117898975_118099445|GENSCAN_predicted_peptide_6|267_aa
MHIIKEISEELLLSTSLAAPTAGDDFQQPVLGLLGLVQFLIYQVGAHDTDRTTFAINQAA
IYVHICFGPIHSSLCAWVSSMMSLLELDDKFDILISSNGLGYGQLWEAILPSTAFIDLLT
RAISRLFKSLLQPYETERTSTPKTHLYITIIKVDKTTKMGKKQSRKTGNSKKQSASPPPK
EHSSSPATKQSWTENDFDELREEGFRRSNYSELQEEIQTKGKEVKNFEKNLDECITRITN
TEKCLKELMELKAKARELREECRSLSS

>gi568815592f:117898975_118099445|GENSCAN_predicted_CDS_6|804_bp
atgcacatcattaaagaaatctctgaagagcttttgctctcgacatcactggcagctcct
actgctggagatgatttccagcagcctgtgttgggcctcttaggcttagttcagttcctt
atctatcaggtgggtgctcatgatacagacaggaccacctttgccataaatcaggcagct
atatatgtgcacatctgcttcggccctatccactcatctctttgtgcatgggtcagttcc
atgatgtctttattggaactagatgataagtttgatatcttgatatcaagtaatggatta
ggatatggacagctttgggaagccattttgccttccacagctttcatagacctcctcaca
agagctatttccagactcttcaaatccttactacaaccctatgagacagaaaggacatcc
acaccaaaaacccatctgtacatcaccatcatcaaagtagataaaaccacaaagatgggg
aaaaaacagagcagaaaaactggaaactctaaaaagcagagtgcctctcctcctccaaag
gaacacagttcctcaccagcaacgaaacaaagctggacggagaatgactttgacgagttg
agagaagaaggcttcagacgatcaaactactccgagctacaggaggaaattcaaaccaaa
ggcaaagaagttaaaaactttgaaaaaaatttagatgaatgtataactagaataaccaat
acagagaagtgcttaaaggagctgatggagctgaaagccaaggctcgagaactacgtgaa
gaatgcagaagcctcagcagctga

>gi568815592f:117898975_118099445|GENSCAN_predicted_peptide_7|336_aa
MAKSKNHTTHNQSRKWHRNGIKKPRSQRYESLKGVDPKFLRNMRFAKKHNKKGLKKMQAN
NAKAMSARAEAIKALVKPKEVKPKIPKGVSCKLDRHAYVAHPKLGKRALARIAKGLRLCR
PKAKAKAKDQTKAQAAAPASVPAQAPKEIQTTIREYYKHLYANKLENLEEMDKFLDTYTL
PRLNQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQS
IEKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILANGIQQHIKKL
IYHDQGGFIPVMQGWFNICKSINVIQHINRTKDKTT

>gi568815592f:117898975_118099445|GENSCAN_predicted_CDS_7|1011_bp
atggccaagtccaagaaccacaccacacacaaccagtcccgaaaatggcacagaaatggt
atcaagaaaccccgatcacaaagatacgaatctcttaagggggtggaccccaagttcctg
aggaacatgcgctttgccaagaagcacaacaaaaagggcctaaagaagatgcaggccaac
aatgccaaggccatgagtgcacgtgccgaggctatcaaggccctcgtaaagcccaaggag
gttaagcccaagatcccaaagggagtcagctgcaagctcgatcgacatgcctacgttgcc
caccccaagcttgggaagcgtgctcttgcccgtattgccaaggggctcaggctgtgccgg
ccaaaggccaaggccaaggccaaggatcaaaccaaggcccaggctgcagctccagcttca
gttccagctcaggctcccaaagaaatacaaactaccatcagagaatactacaaacacctc
tacgcaaataaactagaaaatctagaagaaatggataaattcctcgacacatataccctc
ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctctgaaatt
gtggcaataatcaatagcttaccaaccaaaaagagtccaggaccagatggattcacagcc
gaattctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatca
atagaaaaagagggaatcctccctaactcattttacgaggccagcatcatcctgatacca
aagcctggcagagatacaaccaaaaaagagaattttagaccaatatccttgatgaacatt
gatgcaaaaatcctcaataaaatactggcaaacggaatccagcagcacatcaaaaagctt
atctaccatgatcaagggggcttcatccctgtgatgcaaggctggttcaatatatgcaaa
tcaataaatgtaatccagcatataaacagaaccaaagacaaaaccacatga

>gi568815592f:117898975_118099445|GENSCAN_predicted_peptide_8|58_aa
MNWYWLVACLELGSTAGGEWWPCEDVPASPSPSSMILYITLSFCKTSSGNSSQNDESN

>gi568815592f:117898975_118099445|GENSCAN_predicted_CDS_8|177_bp
atgaactggtactggttggtggcctgtttggaattgggcagcacagcaggaggtgagtgg
tggccatgtgaagatgtacctgcttccccttcaccttcctccatgattctatacatcaca
ttgtctttctgtaagacctcctctggaaactcttctcagaatgacgagtctaactga

>gi568815592f:117898975_118099445|GENSCAN_predicted_peptide_9|222_aa
VLGIRKDEPSHGCSMGLQVKRLNKQQKPLHKCDCDSNSDFCYQLLKSSPLTELMKLLLQE
LTSWGSHLVNTVMADPELMGTSFTQASPSKSSPFPICSTFNWSPILVASKAELSLRSSPS
CSFILPLLKCMLKYLPSFIWTIATDQHDPSSSLPIQDHPPYCHQVRALPQVRALPLCRPL
RLNIPAALNLPSKPGPSSVFLVLVNVITLLPTCPNQTLGLPS

>gi568815592f:117898975_118099445|GENSCAN_predicted_CDS_9|669_bp
gttttgggcatacgtaaggatgaaccttctcatggctgctctatggggctccaagtaaag
aggctaaacaaacaacaaaaaccactccacaaatgtgactgtgactctaattctgacttt
tgctaccagctgctaaagagttccccacttactgaattgatgaaactattacttcaggag
ctgacctcatggggctctcatctggttaacacagtgatggcagatcctgagttaatggga
acatcattcacccaagccagcccttccaaaagctctccttttcccatatgttccacattc
aattggtcaccaatactggtagcttctaaggcagaattgtctcttagatcgagcccttcc
tgttcattcattctgccattgctgaaatgtatgcttaaatacttgccatcattcatctgg
actattgcaacagaccaacatgatccctcatcttctctccctattcaagatcatcctcca
tactgccaccaggtgagagctctgcctcaggtgagagctctgcctctctgcaggcctctc
agactcaatattccagcagccctcaacttgccatccaagcctggtccttcttcagtcttc
cttgtgttggtcaatgtcatcactttgctccccacttgtccaaaccagacacttggtctt
ccttcttga