GENSCAN 1.0    Date run: 4-Nov-116    Time: 08:37:46

Sequence gi568815578f:49883060_50088053 : 204994 bp : 50.69% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    787   1007  221  2  2   89   49   607 0.914  54.92
 1.02 Intr +   3693   3839  147  2  0   69   59   322 0.846  27.83
 1.03 Term +   4680   4877  198  2  0  113   48   245 0.760  20.40
 1.04 PlyA +   8358   8363    6                               1.05

 2.03 PlyA -  12647  12642    6                               1.05
 2.02 Term -  23786  22560 1227  2  0  126   41  1451 0.999 135.82
 2.01 Init -  25431  25096  336  0  0   33  100   473 0.379  40.28
 2.00 Prom -  26913  26874   40                              -7.76

 3.00 Prom +  27214  27253   40                              -3.16
 3.01 Init +  28652  28793  142  1  1   68   77    52 0.510   2.40
 3.02 Term +  32427  32653  227  1  2   59   43   208 0.718  10.34
 3.03 PlyA +  33126  33131    6                               1.05

 4.00 Prom +  35581  35620   40                              -6.76
 4.01 Init +  37642  37715   74  0  2   63   68    70 0.282   1.05
 4.02 Intr +  45666  45756   91  0  1  117  117     9 0.771   6.70
 4.03 Intr +  53219  53476  258  1  0   69   49   214 0.014  13.26
 4.04 Intr +  58524  58652  129  2  0   84   48   128 0.501   9.29
 4.05 Intr +  62338  62429   92  0  2   63   94    85 0.528   5.39
 4.06 Intr +  63077  63191  115  2  1   92   89    56 0.998   6.55
 4.07 Intr +  66189  66296  108  2  0   46  103   125 0.994  10.28
 4.08 Intr +  69017  69052   36  1  0  100   41    87 0.011   3.66
 4.09 Intr +  73744  74178  435  0  0   74  -56   582 0.002  36.18
 4.10 Term +  74506  74973  468  0  0  -32   46   422 0.014  20.97
 4.11 PlyA +  75013  75018    6                               1.05

 5.00 Prom +  80592  80631   40                              -3.76
 5.01 Init +  87694  87828  135  0  0   84   93    53 0.477   5.68
 5.02 Intr +  98459  98541   83  1  2   94   77    62 0.878   4.14
 5.03 Intr +  99669  99827  159  0  0   57   46   151 0.924   6.90
 5.04 Intr + 100024 100082   59  1  2  102   96   127 0.740  13.43
 5.05 Intr + 100765 101292  528  2  0  137   92   512 0.910  49.41
 5.06 Term + 104813 104997  185  0  2  123   46   228 0.999  19.81
 5.07 PlyA + 105799 105804    6                               1.05

 6.00 Prom + 111757 111796   40                              -5.96
 6.01 Init + 115288 115346   59  0  2   50   61    57 0.134  -0.02
 6.02 Intr + 118922 119155  234  2  0   57   97    93 0.151   3.90
 6.03 Intr + 126916 127101  186  1  0   76   66    99 0.129   5.30
 6.04 Intr + 131195 131243   49  2  1   77   89    34 0.065   1.08
 6.05 Intr + 132524 132571   48  1  0   65   86    66 0.059   2.98
 6.06 Term + 140264 140335   72  1  0   92   50    32 0.202  -2.29
 6.07 PlyA + 143905 143910    6                               1.05

 7.12 PlyA - 146076 146071    6                               1.05
 7.11 Term - 146429 146274  156  2  0   90   41    54 0.374  -1.17
 7.10 Intr - 150715 150689   27  1  0   95   87    45 0.652   3.41
 7.09 Intr - 155668 155548  121  0  1   71   67    54 0.664   2.10
 7.08 Intr - 158436 158302  135  2  0   61   70    92 0.726   4.38
 7.07 Intr - 168999 168854  146  0  2   52   53    70 0.044  -1.02
 7.06 Intr - 170042 169990   53  0  2   97   59    59 0.191   2.53
 7.05 Intr - 170287 170137  151  1  1   39  -11   139 0.043  -1.16
 7.04 Intr - 171500 171298  203  0  2   52   80    70 0.010   1.60
 7.03 Intr - 179421 179262  160  0  1   80   77    70 0.272   4.66
 7.02 Intr - 201195 201070  126  0  0  101  103    78 0.952  11.48
 7.01 Intr - 202588 202505   84  1  0   78   46    56 0.505   0.42

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  69017  69082   66  1  0  100   53   167 0.955  12.34
S.002 Sngl +  74563  74973  411  0  0   74   46   362 0.976  26.89
S.003 Intr + 173516 173661  146  2  2  106   53   108 0.873   9.03

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:49883060_50088053|GENSCAN_predicted_peptide_1|188_aa
XLRGAIPYALSLHLDLEPMEKRQLIGTTTIVIVLFTILLLGGSTMPLIRLMDIEDAKAHR
RNKKDVNLSKTEKMGNTVESEHLSELTEEEYEAHYIRRQDLKGFVWLDAKYLNPFFTRRL
TQETPHTKHPVASPPFHGPAPDALTAWLCLSTQDLHHGRIQMKTLTNKWYEEVRQGPSGS
EDDEQELL

>gi568815578f:49883060_50088053|GENSCAN_predicted_CDS_1|567_bp
ngcctgcggggagccatcccctatgccctgagcctacacctggacctggagcccatggag
aagcggcagctcatcggcaccaccaccatcgtcatcgtgctcttcaccatcctgctgctg
ggcggcagcaccatgcccctcattcgcctcatggacatcgaggacgccaaggcacaccgc
aggaacaagaaggacgtcaacctcagcaagactgagaagatgggcaacactgtggagtcg
gagcacctgtcggagctcacggaggaggagtacgaggcccactacatcaggcggcaggac
cttaagggcttcgtgtggctggacgccaagtacctgaaccccttcttcactcggaggctg
acgcaggagacaccccacacaaaacacccagtagcatcccctcccttccatggccctgcc
cctgacgccctgacggcttggttgtgtctctcgacccaggacctgcaccacgggcgcatc
cagatgaaaactctcaccaacaagtggtacgaggaggtacgccagggcccctccggctcc
gaggacgacgagcaggagctgctctga

>gi568815578f:49883060_50088053|GENSCAN_predicted_peptide_2|520_aa
MGKPSSMDTKFKDDLFRKYVQFHESKVDTTTSRQRPGSDECLRVAASTLLSLHKVDPFYR
FRLIQFYEVVESSLRSLSSSSLRALHGAFSMLETVGINLFLYPWKKEFRSIKTYTGPFVY
YVKSTLLEEDIRAILSCMGYTPELGTAYKLRELVETLQVKMVSFELFLAKVECEQMLEIH
SQVKDKGYSELDIVSERKSSAEDVRGCSDALRRRAEGREHLTASMSRVALQKSASERAAK
DYYKPRVTKPSRSVDAYDSYWESRKPPLKASLSLRKEPVATDVGDDLKDEIIRPSPSLLT
MASSPHGSPDVLPPASPSNGPALLRGTYFSTQDDVDLYTDSEPRATYRRQDALRPDVWLL
RNDAHSLYHKRSPPAKESALSKCQSCGLSCSSSLCQRCDSLLTCPPASKPSAFPSKASTH
DSLAHGASLREKYPGQTQGLDRLPHLHSKSKPSTTPTSRCGFCNRPGATNTCTQCSKVSC
DACLSAYHYDPCYKKSELHKFMPNNQLNYKSTQLSHLVYR

>gi568815578f:49883060_50088053|GENSCAN_predicted_CDS_2|1563_bp
atggggaagcccagttcaatggatactaaattcaaggatgacttatttcggaagtacgtg
cagttccatgagagcaaagtggataccaccaccagcaggcagcggcctggcagcgatgag
tgcctgcgggtggcagcctcaaccctgctcagcctgcacaaggtggatcccttttatcga
ttccggctgatccagttctatgaggtggtggagagctccttgcgctcgctcagctcctct
agcctgcgggctctgcacggcgccttcagcatgctggagacggtgggcatcaacctcttc
ctctacccgtggaagaaggaattcagaagcatcaagacctacacgggcccttttgtttat
tatgtcaagtcgacattactggaagaggacatccgagccatcctgagctgcatgggctac
acacctgagctgggcactgcatacaagctcagagagctcgtggagaccctccaggtgaag
atggtctcctttgagctctttctggccaaagtcgagtgtgagcagatgctagaaatccac
tcacaagtgaaggacaagggctactccgagctggacattgtgagcgagcgcaagagcagt
gcagaggatgtgcgcggctgctcggacgccctgcggcggcgggcagagggccgggagcac
ctgacggcctccatgtcacgagtggcactccagaagtcggccagcgagcgggcggccaag
gactactacaagccccgcgtgaccaagccctcgaggtcagtggatgcctatgacagctac
tgggagagccggaagccacccctgaaggcctcattgagtcttcggaaggagcctgtggca
acggatgtgggggacgacctcaaggatgagatcatccgcccatccccttcgctgctgacc
atggccagctccccccacggcagcccggatgtgcttccacccgcctcccccagcaacggc
ccggccctgctgcgcggtacctacttctccactcaggatgacgtggatctgtacacagac
tctgaacccagggccacctaccgtcggcaggatgctctgcggccggatgtgtggctgctc
agaaacgatgcccactccctctaccacaagcgctcgccccctgccaaagagtccgccctc
tccaagtgccaaagctgcgggctgtcctgcagctcctccctctgccagcgctgtgacagc
ctgctcacctgtcctccagcttccaagcccagcgccttccccagcaaggcctcgactcat
gacagcctggcccacggggcatctctgcgggagaagtacccaggccagactcagggcctc
gaccgcctcccgcaccttcactccaaatccaagccctccaccacgcccacttcccgctgt
ggcttctgcaaccgcccaggcgccaccaacacctgcacccagtgttcaaaagtctcatgt
gacgcctgcctcagcgcttaccattatgacccctgctacaaaaagagtgagctgcacaag
ttcatgcccaacaaccagctgaactacaagtccacccagctctcccatctcgtgtacaga
tag

>gi568815578f:49883060_50088053|GENSCAN_predicted_peptide_3|122_aa
MDAAKDVCIHDAITLVSHWYLPRILGDQLESKETGSERWHEVRQVAQGVLRPVPGGQQRP
GNTRPGNDVTMRRARREGGPRLRAGSGKRSMPAADGDYNPEAEDKAEGRRARTKPAEPHF
GA

>gi568815578f:49883060_50088053|GENSCAN_predicted_CDS_3|369_bp
atggatgctgcaaaggatgtctgcattcacgatgccatcactttagtttctcattggtac
ctcccaagaatcctgggagatcagctggagagcaaagaaacgggctcagaaagatggcat
gaggtacgacaagtagctcaaggggtactgagaccagtacccgggggccagcagcgaccc
ggaaacactcggcccggaaatgatgtcaccatgaggcgggcccgaagagagggtggacca
cggctgcgcgctggctccgggaagcggtcgatgcccgcggccgacggagactacaaccca
gaggcggaggacaaagcggaaggccgaagagcgaggacgaaaccggcggaaccgcacttt
ggagcctaa

>gi568815578f:49883060_50088053|GENSCAN_predicted_peptide_4|601_aa
MVITGVWLRLCLALGARSAPCTKDGQCTVSHDGQRQEILRELERIKEPVFSSQGPLACVR
GAPRGGPPASPAPNRSPPREPRPLGLLLIGRRCAAQSGSKMAAQQRDCGGAAQLAGPAAE
ADPLGRFTCPVCLEVYEKPVQECLKPKKPVCGVCRSALAPGVRAVELERQIESTETSCHG
CRKNIRSHVATCSKYQNYIMEGVKATIKDASLQPRNVPNRYTFPCPYCPEKNFDQEGLVE
HCKLFHSTDTKSVVCPICASMPWGDPNYRSANFREHIQRRHRFSYDTFVDYDVDEEDMMN
QAPSYGARPVSSMVSVYAGARGSGSRISESHSTSFWGGMGSGDLAGGMAGDLAGMGGIQN
EKETMQSLNDHLASYLDRMRSLETKNWKPESKIREHLEKKGPQVRDWSHHFKTIEDLRAQ
IFTNTVDNACIVLQINNACLAADDFTIEENTTEVTTQSTEVGTAEMTHRTETCSPVLGDR
PGLHEKSEGQLGEQPEGGGGPLFPADGTAQWDTAVPGVRAGTDPGRGTVPGPGVGSPAEH
KVKLEAEITTYCRLLEDSEDFNLGDALDSRNSMQTIQKTTTRQTVDGKVVSETNDTKVLR
H

>gi568815578f:49883060_50088053|GENSCAN_predicted_CDS_4|1806_bp
atggttatcacaggagtgtggctgaggctgtgtttggccctgggtgccaggtctgcaccc
tgcactaaggatgggcaatgtacagtaagccatgatgggcagaggcaggaaatacttaga
gaactggaaagaataaaggagccggtgttctcttcgcagggcccgctcgcttgcgtcaga
ggggccccgaggggcggcccacccgctagccccgcccccaaccgctcaccgccccgcgag
ccccgccccctcggcctcctcctcatcggccgccgttgcgcggcgcagagcggcagcaag
atggcggcgcaacagcgggactgcgggggtgctgcgcagctggcggggccggcggcggag
gctgaccccctaggacgcttcacgtgtcccgtgtgcttagaggtgtacgagaagccggta
caggaatgtctgaagccgaagaagcctgtctgtggggtgtgtcgcagcgctctggcacct
ggcgtccgagccgtggagctcgagcggcagatcgagagcacagagacttcttgccatggc
tgccgtaagaatatccggtcccacgtggctacttgttccaaataccagaattacatcatg
gaaggtgtgaaggccaccattaaggatgcatctcttcagccaaggaatgttccaaaccgt
tacacctttccttgtccttactgtcctgagaagaactttgatcaggaaggacttgtggaa
cactgcaaattattccatagcacggataccaaatctgtggtttgtccgatatgtgcctcg
atgccctggggagaccccaactaccgcagcgccaacttcagagagcacatccagcgccgg
caccggttttcttatgacacttttgtggattatgatgttgatgaagaggacatgatgaat
caggcgcccagctatggcgcccggccggtcagcagtatggttagcgtctatgcaggtgcc
cggggctctggttcccggatctccgagtcccactccaccagcttctggggcggcatgggg
tccggggacctggccggggggatggctggggatctggcaggaatgggaggcatccagaac
gagaaggagaccatgcaaagcctgaacgaccatctggcctcctacctggacagaatgagg
agcctggagaccaagaactggaagccggagagcaaaatccgggagcacctggagaagaag
ggaccccaggtcagagactggagccatcacttcaaaaccatcgaggacctgagggctcag
atcttcacaaatactgtggacaatgcctgcattgttctgcagatcaacaatgcctgtctt
gctgctgatgactttacaattgaggagaacactacagaagtcaccacgcagtccaccgag
gttggaactgctgagatgactcacagaactgagacatgcagtccagtccttggagatcga
cctggactccatgagaaatctgaaggccagcttggagaacagcctgagggaggtggaggc
ccgctatttcctgcagatggaacagctcagtgggatactgctgtacctggagtcagagct
ggcacagacccaggcagagggacagtgccaggcccaggagtaggaagccctgctgaacat
aaggtcaagctggaggctgagatcaccacctactgccgcctgctggaagacagcgaggac
ttcaatcttggtgatgccctggacagccgcaactccatgcaaaccatccaaaagaccacc
acccgccagacagtggatggcaaagtggtgtctgagaccaacgacaccaaagttctgaga
cattaa

>gi568815578f:49883060_50088053|GENSCAN_predicted_peptide_5|382_aa
MERAGLYQCSLMGKGPRPETVCEQVCVYEPLSPPTLQVPGDSLGKLEPGEDDFVHGCHTR
HQVTKQTVVLPFSPSTGDDPRCASEPRLGGVPARALTATRREPGQQPAHLLGEWPSAETS
LRLARRKPSDPNRKPNYSELQDSNPEFTFQQPYDQAHLLAAIPPPEILNPTASLPMLIWD
SVLAPQAQPIAWASLRLQESPRVAELTSLSDEDSGKGSQPPSPPSPAPSSFSSTSVSSLE
AEAYAAFPGLGQVPKQLAQLSEAKDLQARKAFNCKYCNKEYLSLGALKMHIRSHTLPCVC
GTCGKAFSRPWLLQGHVRTHTGEKPFSCPHCSRAFADRSNLRAHLQTHSDVKKYQCQACA
RTFSRMSLLHKHQESGCSGCPR

>gi568815578f:49883060_50088053|GENSCAN_predicted_CDS_5|1149_bp
atggagagagcagggctctatcagtgcagtctgatgggtaaggggcctaggcccgagaca
gtctgcgagcaagtgtgcgtgtatgagcccctgagcccgcccaccctgcaagtgcctgga
gactcactggggaagctagaaccaggggaggacgattttgttcacggctgtcacacccgg
caccaagtgactaaacagacagtagttctgcccttcagccccagcaccggggacgacccg
cgctgcgccagcgaaccccgcctcggaggagtccccgcccgggctctcaccgccacgcgg
cgcgagcccggccagcagccggcgcacctgctcggggagtggccttcggcggagacgagc
ctccgattggcgcggaggaagccctccgaccccaatcggaagcctaactacagcgagctg
caggactctaatccagagtttaccttccagcagccctacgaccaggcccacctgctggca
gccatcccacctccggagatcctcaaccccaccgcctcgctgccaatgctcatctgggac
tctgtcctggcgccccaagcccagccaattgcctgggcctcccttcggctccaggagagt
cccagggtggcagagctgacctccctgtcagatgaggacagtgggaaaggctcccagccc
cccagcccaccctcaccggctccttcgtccttctcctctacttcagtctcttccttggag
gccgaggcctatgctgccttcccaggcttgggccaagtgcccaagcagctggcccagctc
tctgaggccaaggatctccaggctcgaaaggccttcaactgcaaatactgcaacaaggaa
tacctcagcctgggtgccctcaagatgcacatccgaagccacacgctgccctgcgtctgc
ggaacctgcgggaaggccttctctaggccctggctgctacaaggccatgtccggacccac
actggcgagaagcccttctcctgtccccactgcagccgtgccttcgctgaccgctccaac
ctgcgggcccacctccagacccactcagatgtcaagaagtaccagtgccaggcgtgtgct
cggaccttctcccgaatgtccctgctccacaagcaccaagagtccggctgctcaggatgt
ccccgctga

>gi568815578f:49883060_50088053|GENSCAN_predicted_peptide_6|215_aa
MNEQTEGGDKDDAGTGFFANIYGQLILQLWGADIHQDHEYSPRPRIFTKTTNLHLAGAHC
CDSIIPGNELHGVTAPSRESGTELRKDKGLVQECTAIRPRPRRDECSTRPPLWGPQTARR
QANPAWKIRRWKARRLIEGAGSGGGGGEGAAGAVRPGPETPSSQIPEFSSRALEVVGRTL
TPETLVDVNSNRPSPVGESPYGSPTTGGWCSQDAG

>gi568815578f:49883060_50088053|GENSCAN_predicted_CDS_6|648_bp
atgaatgagcagactgagggaggagacaaggatgatgccgggacaggatttttcgccaat
atctatggacagctcattcttcaactctggggggccgatattcaccaagaccacgaatat
tcacccagaccacgaatcttcaccaagaccacaaatcttcaccttgcaggtgctcactgc
tgtgactcaatcattccagggaatgagctgcatggggtgacagccccttcgagagagtca
gggactgagctcagaaaagacaaaggacttgtccaagaatgcacagccattaggccgcgg
ccccggcgagacgaatgcagcacacggccgcctttatggggcccgcagacagcgcgtcgc
caggctaaccctgcgtggaaaattcggaggtggaaggcgaggcgccttattgagggggcc
ggcagcggcggcggcggcggcgagggggcggcgggggctgtgcggcccgggccggaaact
ccaagttcacagatccctgaattctcttcgagggccctggaggtggtgggtaggacccta
actccagaaaccctggtagatgtcaacagcaacaggccctcacctgttggggaaagtccc
tatgggtcacccacaactggaggctggtgtagccaggatgcaggctag

>gi568815578f:49883060_50088053|GENSCAN_predicted_peptide_7|453_aa
TTRCRCLHGSNYAVQDSFPEERARMPERTIYENRIYSLKIECGPKYPEAPPFVRFVTKIN
MNGVNSSNGVESDQECGLSHQKDWRWNSGPASHRLCAPGKCPDITDHLPRGVHGLPHAAV
GRFGTRIKMVKNGDSGLGELCLCSAEQEGMVLRGTHRATGSPTLSCCRVSWDKATTQRPP
SQDPAQFGDRQSPMNLLARLEPAAIWSDGLFMEPGGGGPGPDPANLEVISSRGWGAFGVL
IRHSSSDLLQGQERRTPAEITHIIRFSEKCEPYCRLHIREICTTHNPPHNPQPLSDCLPR
NRSLVLKRDGEDEQGFYTRRQKGTSAKENENTRPGEAAGMRAVVVLRGEAVSQCTFILAW
GGGSSKWDLFSTDTIDRGLSTHPCQLGNFGAGKGLSVATVLAGLHTGCSLCLELSLRRHP
HGFLRFVLQISLASLSETPYSTPSNRESPRAMS

>gi568815578f:49883060_50088053|GENSCAN_predicted_CDS_7|1362_bp
accactagatgccgctgtttacacgggagcaattatgcagtacaagactcctttcctgag
gagagggcacggatgccggagaggacaatttatgaaaaccgaatatacagccttaaaata
gaatgtggacctaaatacccagaagcacccccctttgtaagatttgtaacaaaaattaat
atgaatggagtaaatagttctaatggagtggaaagtgaccaagaatgtggactgagccac
cagaaggactggcgctggaattctggccctgcctctcaccggctgtgtgccccgggcaag
tgtccagacatcactgatcatctgcctcggggggtgcatggcttgcctcatgcagctgta
ggaagatttggaacgaggataaagatggtgaaaaatggggactcaggcctcggggagctg
tgtctctgctcagcggagcaggaagggatggtcctgaggggaactcacagagcaactggc
tcccccaccttgtcctgctgccgggtgtcctgggacaaggctaccacccagcggccaccc
agccaggatccagcccagtttggagacaggcagtcccccatgaacctgctcgccaggctg
gagccagccgccatctggagcgacgggctgtttatggagccaggcggcggcggccctggc
cctgacccggccaatctggaagtgattagcagccggggctggggagcatttggagttctc
atcaggcattccagctcggacctgctccaaggccaggagcgcagaacaccagcagagatc
acccacatcattagattctcagagaagtgcgaaccctattgcagattgcatatacgagag
atatgcacaacacacaaccccccacacaacccccagcccctttcagattgtcttccacga
aaccggtccctggtgctaaaaagagacggagaagatgaacagggattttataccaggcgt
cagaagggaaccagtgctaaagaaaatgaaaacaccaggccgggagaggcagctggcatg
cgggccgtggtggttttacgtggtgaggcggtatctcagtgcacgtttattctggcctgg
ggaggaggcagcagcaaatgggacttgttcagcacagacaccattgacaggggcctctcc
actcatccctgccagctggggaacttcggggctgggaagggcctttctgtggccacggtc
ctggctggccttcacactggctgttccctctgcctggaactctctctccgcagacaccca
catggcttccttcgattcgtcctgcagatctccctggcatcactttctgagacaccgtac
tcaaccccctccaacagggaaagcccccgcgctatgtcctga