GENSCAN 1.0 Date run: 3-Nov-116 Time: 07:26:39 Sequence gi568815575f:107500773_107703370 : 202598 bp : 45.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6144 6238 95 2 2 86 127 -22 0.003 0.26 1.02 Intr + 25810 25964 155 1 2 89 96 183 0.464 18.92 1.03 Intr + 29637 29739 103 1 1 151 92 112 0.970 17.13 1.04 Intr + 32733 32778 46 0 1 121 109 14 0.878 5.21 1.05 Intr + 44965 45069 105 0 0 101 92 133 0.998 15.41 1.06 Intr + 49277 49384 108 1 0 98 83 113 0.999 12.28 1.07 Intr + 52023 52154 132 2 0 115 92 34 0.966 7.34 1.08 Intr + 53613 53732 120 2 0 50 80 84 0.917 4.49 1.09 Intr + 59485 59621 137 0 2 84 64 165 0.997 13.07 1.10 Intr + 59955 60081 127 0 1 136 63 46 0.994 7.58 1.11 Intr + 61637 61726 90 1 0 66 75 36 0.550 0.29 1.12 Intr + 62339 62428 90 1 0 93 65 74 0.684 5.79 1.13 Intr + 64115 64294 180 1 0 46 121 304 0.999 29.56 1.14 Intr + 75543 75687 145 2 1 113 91 142 0.987 16.86 1.15 Intr + 96549 97370 822 1 0 63 95 915 0.132 81.09 1.16 Term + 99531 102601 3071 1 2 98 49 2983 0.884 280.81 1.17 PlyA + 104088 104093 6 1.05 2.03 PlyA - 105346 105341 6 -0.45 2.02 Term - 106875 106717 159 0 0 74 46 98 0.301 2.14 2.01 Init - 113030 112125 906 2 0 86 -13 320 0.164 15.89 2.00 Prom - 122450 122411 40 -3.16 3.00 Prom + 126863 126902 40 -4.56 3.01 Init + 127857 127978 122 2 2 94 95 174 0.999 18.36 3.02 Intr + 138523 138706 184 1 1 85 97 169 0.999 17.29 3.03 Intr + 140130 140262 133 2 1 83 81 29 0.807 1.92 3.04 Intr + 144422 144578 157 0 1 40 108 288 0.576 25.07 3.05 Intr + 146834 146993 160 2 1 70 94 106 0.933 9.39 3.06 Term + 149168 149260 93 1 0 145 38 27 0.952 1.23 3.07 PlyA + 149287 149292 6 1.05 4.00 Prom + 153899 153938 40 -5.76 4.01 Init + 158225 158303 79 1 1 73 110 28 0.964 4.72 4.02 Term + 159098 159246 149 0 2 124 38 68 0.932 3.46 4.03 PlyA + 159442 159447 6 1.05 5.04 PlyA - 161405 161400 6 1.05 5.03 Term - 173070 172928 143 1 2 83 55 79 0.398 2.19 5.02 Intr - 176236 176062 175 1 1 25 105 77 0.101 2.61 5.01 Init - 197545 197540 6 1 0 63 105 17 0.102 0.78 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 17709 17491 219 0 0 53 40 202 0.926 8.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:107500773_107703370|GENSCAN_predicted_peptide_1|1841_aa RWAQPGAAREGGNERVPGPGSSPAPARRGARRHVMQEESANDMECEQLPAEILRQVTVHR DPIYGFGFVAGSERPVVVRSVRPGGPSENKLLAGDQIVAINEEDVSEAPRERLIELIRSA KEFIVLTVLHTHQSPKSAFISAAKKAKLRSNPVKVRFSEQVAVGETDAKMMKKEALLLIP NVLKVFLENGQIKSFTFDGRTTVKDVMLTLQDRLSLRFIEHFALVLEYAGPEQNHKFLLL QDKQPLAYVVQRTHYHGMKCLFRISFFPKDPVELLRRDPAAFEYLYIQSRNDVIRERFGM DPKPEMLLGLAALHIYITVSATRPSQKISLKNVEKEWGLEPFLPPSLLQVIKEKNLRKSL SQQLKAHQTHPSCGTKDAPSNSVEALEGKRETSVSPEGNLAVTLTQGSAIQAKLQYLRIL NELPTFTGVLFNTVGLDEKQSATTLLVGPRHGISHVIDLKTNLTTVLSEFSKISKIQLFR ENQGVARVETSIMDAKPLVLLMEWPEATNFACLIAGYCRLLLDSRKMVFSRPASQPLPPP MIKADYMHSAHRPVTGGHLGKKESSYVGSVGTSPRKSSRCTPPPADSELVSFCYLHMREQ RKEQESRTDVNENLIFFEETRPRTKSDPTSKSSGQGYEVVPDDFDAASLDHEPCASRARS YTLDNSLGAEALNFYCDSCKAKLQEQLGPRKGGKPGSSRDNIVDLMSLPPPGSEEEEEEE DETTSLLPAIAAPPPGFRDNSSDEDDPKRRAVQSQEQGRHLRGLLYDEIPVTLIDSVQTR TVRDHAQELDDALVSTLQALEALAASEDGPHPPPPQTAGLIVLATITPESSLDSGHETNS SELTDMSEMMSAMKQHQNTTYFLAQHLNKDSLLARKDLPFRIQSCAAQAVLTAPYSLGRP DPNPSLQPIATGQSPGPPGARRKLPQSEGQVQGERTYSLAVHPALSPQLSEQKNLSLLSP VPEDKGPGHTRAGLEMSLRAATSSLSEEQVSELRDNLPKEVRLSPKLILDPKSSVTPAII SAALQQVVHNKSLVTAGGALGNPPSRGERRLEASMGRPEVSMMSSSASKNLKFKISPSAP ETSWNSQHQLGAEVSSSPRAPTGSRADSLHLSQQEDSLPVQNFPPKSYLLRTSRESVGKQ ATGEVAGKGGPVGGKPTLQKQGTISSQGEKAQLESTPKRSKLEETSLVPRATYPMALQSP SCQSRSHSPSCQPHGHSPSSQSRGQSPSCQPRGQSPLRSQAASRQVSTMPSRKLETTLNG AHSTSEGPAKPKSSRGPFRLRNLFSATFPTRQKKETDERQAQLQKVKQYELEFLEELLKP PSQGELPGTEYLQPPAPGRCSCQLRSSPVQQGPGMSREQRRSCDCKRICRGGRPQATQTP VPSLRGRERDRVLPSQRQPEAGPGVSLSSPINVQRIRSTSLESRECRSDPESGVSCLTTC ASGGECLGAPNYRKLMRRYSISELDQGDRASLTSDVYPHPPLGMLPREAKEVEASLPIAL GPKSRSLESPTLGDPSYVQVAPETKGPRQMAVFSLPEEVYRKPAELDEDSESSKCCSIRY CFYYRKCDMADDASDGKDELSYSIPMKILPGMKLDEQVVPVVSRTLQVLDAATCSSSSPE ASRTQEIDLRVSTFEGSLAKINALRAHAYGLPDGFLAARLDTNELLTVLRQCVASPEARA PKPYVSQISEYKLELALKFKELRASCRRVANVDKSPTHMLAAITGSFQVLSSLIETFVRL VFIVRSEAQRQELLAKVEEVVRNYTFLLRAAEESTARNLNQQQQQQQQQQQQQQQQQQQQ QQQQQQQVAAAAGAATEHPPGSPTSATVMSTFTHSLKTLIK >gi568815575f:107500773_107703370|GENSCAN_predicted_CDS_1|5526_bp cgctgggcacagccaggggcagcgcgagagggaggcaacgagagggttcccgggccgggc agcagcccagccccggcccggaggggggcccgacgtcatgtgatgcaggaagagagcgca aatgatatggaatgtgagcagctgccagcagagatactgcgacaagtgaccgttcaccga gaccctatatatggctttggcttcgtggctggcagtgagaggcctgtggtggttcgatct gtgaggccaggaggcccctctgagaacaagctcctggctggtgaccagattgtggctatt aatgaggaagacgtgagtgaagccccgagggagagactcatagaacttatcaggagcgct aaggaattcatcgttcttacagttctgcacactcatcagtcccccaaatctgctttcatc agtgctgcgaagaaggccaagttgaggtccaatcctgtgaaggttcgattttctgagcag gtggcagttggagaaacagatgcaaaaatgatgaagaaggaagctctcctcctcatccct aatgtcctgaaggttttcttagaaaatgggcagatcaagtcattcacatttgatggtcgg accactgttaaggatgtgatgttgacattacaggaccgcctttccctgaggttcattgag cactttgctcttgtccttgagtatgccgggccagaacagaatcacaagtttctgcttctt caggacaagcaacccctggcttatgtggtacagcggacacactatcatggaatgaaatgc ctcttccgaataagcttctttcccaaagaccctgtggagctgctgcgtcgggatcctgct gcttttgagtacctatacatccagagtcggaatgatgttattcgagaacgctttggaatg gatcccaagccagagatgcttttgggccttgctgcgctccacatctatatcactgtctca gccactcgacctagtcagaagatctcgctcaagaatgtggagaaggagtggggcctggaa ccctttcttcccccctccctcctgcaggtcatcaaagagaagaacctccggaaatctctc tctcagcaactgaaggctcaccaaacacatccttcctgcggcaccaaggatgctccctct aacagtgtggaggcacttgagggaaagagggaaacaagtgtcagtcctgaaggtaacctg gctgtcaccctgacacagggctctgcaattcaggcaaagctccagtatctacgaattctg aatgaacttcctaccttcacgggcgttttgttcaacactgtaggcctggatgagaagcag tcggccaccacgctcctggtgggaccccgtcatggcatcagccatgttattgacctcaaa accaacctcaccactgtgctgtccgagttcagcaagatcagcaagatccagctgttccgg gagaaccagggcgtggcccgggtggagaccagcatcatggatgccaagcctctggtgctg ttgatggagtggcctgaagccaccaactttgcctgcctgatcgcggggtactgccgcctc ttgctggattccaggaagatggtcttctccaggcctgccagccagccacttccacctcca atgatcaaggcagattacatgcacagcgcccaccgccctgtcactgggggccacctgggg aaaaaggagagtagttatgtgggcagcgtgggcaccagccccaggaaatcgagccgctgc acgcccccacctgccgactctgagcttgtcagcttctgctacctccatatgcgggaacaa aggaaggagcaggaaagccggacagatgtcaacgagaacctaatcttctttgaggagacc aggccccgaaccaagtctgaccccacatccaaaagctctggccaaggttatgaggtagtc cctgatgactttgatgcagctagcctagaccacgagccttgtgccagcagggcccggtcc tacaccttggacaattcccttggggctgaagccctgaatttctactgtgactcttgcaaa gccaaacttcaggagcagctgggccctcgcaaaggtgggaagcctggctcctctcgtgac aatatagtagatttgatgtccctcccaccacctgggagtgaggaggaggaggaggaggaa gatgagacaacttctctgttgccagccattgctgccccaccccctggtttccgagacaac agctctgatgaggatgaccccaagcgccgggctgtccagagccaggaacaaggacgccac ctgcgtgggcttctgtacgatgagattccagtgacattgattgacagtgtgcagacccgg acagttcgagatcatgcccaggagctagatgatgccctggtgtccactctgcaggctcta gaagccctggctgcatccgaggatggaccacacccaccacccccacagactgcaggtctg attgtgctggccacaatcactcctgaatcatcgctggactcaggtcatgaaaccaactct tcagagctcacagacatgtcagagatgatgtcggccatgaagcagcaccagaacaccacc tacttcctggcccagcacctcaacaaggacagcctccttgcccgcaaggacctgcccttc cggatccagagctgtgcagcccaggccgttcttacggccccttactctcttgggcgcccg gatcccaacccatctctccaacccattgccacaggccagagtcctggcccccctggcgct cggaggaagctgccccagtcagagggccaggtacagggagaacgaacatactccttggca gtgcacccagcactgtccccacagcttagtgaacagaagaatctgagtctgctgtcccca gttcctgaggacaaagggcctggccacactagggcaggcctagaaatgtcactgagggca gccacatcatccctcagtgaagagcaggtctctgagctgagggacaacctgcccaaggag gtcaggttgagccccaagcttatcctcgacccaaagagcagtgtgacccctgccatcatc tcggccgccctacagcaagtggttcacaataagagtctagtcactgctggtggggctttg gggaacccccccagcaggggtgagagaaggctggaggccagcatggggaggccagaggtt agcatgatgagcagcagtgccagtaagaatctgaagttcaaaattagccccagtgctcca gagacctcatggaattctcaacatcagctgggtgcagaggtctcttccagccccagagca cccacaggcagccgggctgacagcctgcacctctcccaacaagaggacagtctgcctgtt caaaatttccctcccaaaagctatcttttgcgaacaagccgagagtcagtgggcaagcaa gctacaggggaggtggcaggcaaaggcgggccagtgggtggtaagcccaccctgcagaag cagggcaccatctccagccaaggggagaaggcgcagctggagagcacacccaaaagaagc aagctcgaagagaccagcctggttccccgagctacctaccccatggctctgcagagcccc agctgccagtcaagaagccacagccccagctgccagcctcatggccacagccccagcagc cagtctcgaggtcagagccccagctgccaacctcgaggccagagcccactgaggtctcag gctgccagccggcaggtgagcaccatgccctctaggaagcttgaaacaactctcaatgga gcccactcgacctctgaaggccctgccaaacccaagtcatcccgaggtcctttccggcta cgcaatttattctctgccaccttcccaacccgccagaagaaggagacagatgagcggcag gcccaactgcagaaggtaaagcagtatgaactggagttccttgaggaacttctaaagcca ccaagccagggggagctgccaggcaccgagtacctgcaacctccagcacctggccgctgc agctgccagctccgcagcagccctgtgcagcaggggcctggcatgtcccgtgagcagagg cgcagctgtgactgcaagcgcatctgccgggggggccggccacaagccacccagacacca gtgcccagcctccgggggagggaaagggacagagtcctccctagccagaggcagccagag gctggcccaggcgtgagcctcagcagccccatcaatgtccagcgcattcgttctaccagc ctggagtcccgagagtgccgatcggaccctgagagtggtgtttcgtgcctgaccacgtgt gcctcggggggcgagtgtctgggagctcccaattacaggaaactgatgcgccgctacagt atcagtgagctggaccagggtgacagggcctcgctgacctcggatgtctacccacatcct cccctgggcatgctgcccagggaggccaaggaggtagaggcaagcctccccatagccttg ggtcccaaaagcaggtctctggagtcaccgacgctgggagacccctcctacgtccaggtt gccccagagaccaaaggccccagacagatggccgtgttctcactgcccgaggaggtgtac cggaagcctgccgagctagacgaggacagtgagagcagcaagtgctgctccatccgctac tgcttctactaccgcaagtgtgacatggcagatgatgccagtgatggcaaggatgagctc tcctactctatccccatgaagatcctgcctggcatgaagctggacgagcaggtggtgcct gtggtgagcaggaccctgcaggtgctggatgctgctacctgcagcagcagcagccctgag gcctcccgcactcaggagattgacctccgtgtgtccaccttcgaggggagcctggccaag atcaatgccctgcgggcccatgcctatggcctccctgatggcttcctggctgcccggctg gacaccaacgagctgctgacagtcctgcggcagtgtgtggccagccccgaggcccgtgcc cccaagccctatgtgtctcagatctccgagtataagcttgagctagctctcaagttcaag gagctccgggcctcctgccgccgtgtggccaatgtggacaagagcccaactcacatgctg gcagccatcacgggcagcttccaggtgctgagcagcctcattgagaccttcgtgcggctg gtgttcattgtgcgctccgaggcccagcgccaagagctgctggccaaggtagaagaggtg gtgaggaactacaccttcctgctgcgtgcagctgaggagtccacagcccgtaaccttaac cagcagcagcagcaacaacaacagcagcagcagcaacaacaacagcagcagcaacaacaa cagcagcagcagcagcagcaggtggcagcagctgcaggggcagccacagagcatccacca ggctccccaacttcggcgactgttatgagcacattcacccactccttaaaaacccttatt aagtag >gi568815575f:107500773_107703370|GENSCAN_predicted_peptide_2|354_aa MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPD FKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDF MSKTPKAMATKAKIDKWDLIKLKSFCTVKETTIRVNRQPTTWEKIFATYSSDKGLISRIY NELKQIYKKKTNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYHL TPMYTHSLASRLQDEEWDVAITLESDVVITLTLLAVRRNSILRHKCMPPPNAGS >gi568815575f:107500773_107703370|GENSCAN_predicted_CDS_2|1065_bp atggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaactat ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaatcaattcaagatggattaaagatttaaacgttagacctaaaaccata aaaaccctagaagaaaacctaggcattaccattcaggacataggcgtgggcaaggacttc atgtccaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagtaaaagaaactaccatcagagtgaacaggcaacctaca acatgggagaaaattttcgcaacctactcatctgacaaagggctaatatccagaatctac aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaag gacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaagaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccactatgagatatcatctc acaccaatgtacacccacagtctggcttccaggctgcaggatgaggaatgggatgtggct attaccttggaatcagatgtggttattacccttaccctactggctgttcggaggaattcg atcctaagacataaatgtatgccccctcccaatgctggcagttga >gi568815575f:107500773_107703370|GENSCAN_predicted_peptide_3|282_aa MPNIKIFSGSSHQDLSQKIADRLGLELGKVVTKKFSNQETCVEIGESVRGEDVYIVQSGC GEINDNLMELLIMINACKIASASRVTAVIPCFPYARQDKKDKSRAPISAKLVANMLSVAG ADHIITMDLHASQIQVSVEAKYWCLKDRLNVDFALIHKERKKANEVDRMVLVGDVKDRVA ILVDDMADTCGTICHAADKLLSAGATRVYAILTHGIFSGPAISRINNACFEAVVVTNTIP QEDKMKHCSKIQVIDISMILAEAIRRTHNGESVSYLFSHVPL >gi568815575f:107500773_107703370|GENSCAN_predicted_CDS_3|849_bp atgccgaatatcaaaatcttcagcggcagctcccaccaggacttatctcagaaaattgct gaccgcctgggcctggagctaggcaaggtggtgactaagaaattcagcaaccaggagacc tgtgtggaaattggtgaaagtgtacgtggagaggatgtctacattgttcagagtggttgt ggcgaaatcaatgacaatttaatggagcttttgatcatgattaatgcctgcaagattgct tcagccagccgggttactgcagtcatcccatgcttcccttatgcccggcaggataagaaa gataagagccgggcgccaatctcagccaagcttgttgcaaatatgctatctgtagcaggt gcagatcatattatcaccatggacctacatgcttctcaaattcaggtatcagtggaagct aaatattggtgtttgaaagacaggctgaatgtggactttgccttgattcacaaagaacgg aagaaggccaatgaagtggaccgcatggtgcttgtgggagatgtgaaggatcgggtggcc atccttgtggatgacatggctgacacttgtggcacaatctgccatgcagctgacaaactt ctctcagctggcgccaccagagtttatgccatcttgactcatggaatcttctccggtcct gctatttctcgcatcaacaacgcatgctttgaggcagtagtagtcaccaataccatacct caggaggacaagatgaagcattgctccaaaatacaggtgattgacatctctatgatcctt gcagaagccatcaggagaactcacaatggagaatccgtttcttacctattcagccatgtc cctttataa >gi568815575f:107500773_107703370|GENSCAN_predicted_peptide_4|75_aa MKRGRNYVVKDEVNSGAWHISQPSNKEKFATASRFMKLPFSLQTAEGLISTNSMSLANIW EGKFAKVLEAQQNFD >gi568815575f:107500773_107703370|GENSCAN_predicted_CDS_4|228_bp atgaagagagggaggaattacgtggtgaaggatgaagtgaacagcggtgcttggcatata agccagccctcaaacaaggagaaatttgctacagcctctagatttatgaaattgcccttc tctctgcagacagcagagggcctcattagcacaaactcaatgtcactggcaaatatatgg gaaggaaagtttgccaaggttctagaagcacagcaaaactttgattaa >gi568815575f:107500773_107703370|GENSCAN_predicted_peptide_5|107_aa MLAASKPMLISITCLWPPSPVTGSTCRPTRPRLGERKACGPPCGRQTRAGPGRFALPGAP EPLKQGFPFSASTRHLHIPLAIALTARFSGLCTFIVIISIVISATTD >gi568815575f:107500773_107703370|GENSCAN_predicted_CDS_5|324_bp atgctggcggcttccaagcccatgctgatcagcatcacttgcctctggccaccctcccct gtgaccggcagcacatgtcgccccacgcggccccggcttggggagcgtaaggcatgtggg cctccatgcggccgccagacgagggcgggaccgggacgcttcgccctgcctggggctcct gagcccctgaaacagggctttcctttctctgccagcactaggcatctccacattcctctg gccattgctctgactgctaggttctcgggcctctgcaccttcatcgttattattagcatc gttatatcagctaccactgattga