GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:34:25 Sequence gi568815578f:49836413_50052138 : 215726 bp : 49.18% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 1041 1036 6 1.05 1.01 Sngl - 9146 8751 396 2 0 78 54 253 0.959 17.15 1.00 Prom - 11832 11793 40 -6.36 2.00 Prom + 16225 16264 40 -3.16 2.01 Init + 18506 18729 224 1 2 78 -13 153 0.776 0.26 2.02 Intr + 19026 19169 144 0 0 95 108 189 0.958 21.00 2.03 Intr + 23088 23191 104 0 2 110 61 -12 0.216 -1.88 2.04 Intr + 28316 28432 117 0 0 84 111 35 0.794 5.84 2.05 Intr + 38293 38409 117 2 0 112 89 149 0.486 17.84 2.06 Intr + 47434 47654 221 2 2 89 49 607 0.915 54.92 2.07 Intr + 50340 50486 147 2 0 69 59 322 0.846 27.83 2.08 Term + 51327 51524 198 2 0 113 48 245 0.760 20.40 2.09 PlyA + 55005 55010 6 1.05 3.03 PlyA - 59294 59289 6 1.05 3.02 Term - 70433 69207 1227 2 0 126 41 1451 0.999 135.82 3.01 Init - 72078 71743 336 0 0 33 100 473 0.379 40.28 3.00 Prom - 73560 73521 40 -7.76 4.00 Prom + 73861 73900 40 -3.16 4.01 Init + 75299 75440 142 1 1 68 77 52 0.510 2.40 4.02 Term + 79074 79300 227 1 2 59 43 208 0.718 10.34 4.03 PlyA + 79773 79778 6 1.05 5.00 Prom + 82228 82267 40 -6.76 5.01 Init + 84289 84362 74 0 2 63 68 70 0.282 1.05 5.02 Intr + 92313 92403 91 0 1 117 117 9 0.771 6.70 5.03 Intr + 99866 100123 258 1 0 69 49 214 0.014 13.26 5.04 Intr + 105171 105299 129 2 0 84 48 128 0.501 9.29 5.05 Intr + 108985 109076 92 0 2 63 94 85 0.528 5.39 5.06 Intr + 109724 109838 115 2 1 92 89 56 0.998 6.55 5.07 Intr + 112836 112943 108 2 0 46 103 125 0.994 10.28 5.08 Intr + 115664 115699 36 1 0 100 41 87 0.011 3.66 5.09 Intr + 120391 120825 435 0 0 74 -56 582 0.002 36.18 5.10 Term + 121153 121620 468 0 0 -32 46 422 0.014 20.97 5.11 PlyA + 121660 121665 6 1.05 6.00 Prom + 127239 127278 40 -3.76 6.01 Init + 134341 134475 135 0 0 84 93 53 0.477 5.68 6.02 Intr + 145106 145188 83 1 2 94 77 62 0.878 4.14 6.03 Intr + 146316 146474 159 0 0 57 46 151 0.924 6.90 6.04 Intr + 146671 146729 59 1 2 102 96 127 0.740 13.43 6.05 Intr + 147412 147939 528 2 0 137 92 512 0.910 49.41 6.06 Term + 151460 151644 185 0 2 123 46 228 0.999 19.81 6.07 PlyA + 152446 152451 6 1.05 7.00 Prom + 158404 158443 40 -5.96 7.01 Init + 161935 161993 59 0 2 50 61 57 0.134 -0.02 7.02 Intr + 165569 165802 234 2 0 57 97 93 0.151 3.90 7.03 Intr + 173563 173748 186 1 0 76 66 99 0.129 5.30 7.04 Intr + 177842 177890 49 2 1 77 89 34 0.065 1.08 7.05 Intr + 179171 179218 48 1 0 65 86 66 0.058 2.98 7.06 Term + 186911 186982 72 1 0 92 50 32 0.202 -2.29 7.07 PlyA + 190552 190557 6 1.05 8.06 PlyA - 192723 192718 6 1.05 8.05 Term - 193076 192921 156 2 0 90 41 54 0.381 -1.17 8.04 Intr - 197362 197336 27 1 0 95 87 45 0.676 3.41 8.03 Intr - 202315 202195 121 0 1 71 67 54 0.691 2.10 8.02 Intr - 205083 204949 135 2 0 61 70 92 0.756 4.38 8.01 Init - 211136 211033 104 2 2 75 94 12 0.027 0.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 115664 115729 66 1 0 100 53 167 0.955 12.34 S.002 Sngl + 121210 121620 411 0 0 74 46 362 0.976 26.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:49836413_50052138|GENSCAN_predicted_peptide_1|131_aa MDGPEKVRKAPVGELKETKVLGHMSPSSQDTVYPLYIYQSVSRKTSQCIICEPLNITSAI FLGKAPIYGKHWLSEPYCLKPLKQPIHHVCSSPEVRHCSPASTEHNQEGAELLLLLLLNY TQAFKELALQT >gi568815578f:49836413_50052138|GENSCAN_predicted_CDS_1|396_bp atggatgggcctgagaaggttagaaaggccccagtgggggagctgaaagaaacaaaagtc ctggggcacatgtccccgtcgtcccaggacacagtctatcctctttacatctatcagtct gtgtccaggaagacgagtcaatgcatcatctgtgagccacttaacatcacttcagccatc ttcctgggcaaagcacccatttatggcaaacactggctcagtgagccatactgcctgaaa ccattaaagcagccaattcatcatgtctgctcctctccagaagtcaggcactgcagtcca gcttccactgagcacaaccaggaaggagcagagctgctgctgctgctgctgctaaattat acccaagcttttaaggaacttgccttacaaacatga >gi568815578f:49836413_50052138|GENSCAN_predicted_peptide_2|423_aa MASVFLSAVYATHPQSTLGLLLLFDTFLDLANLSAGWGRSRTAFQGAGGEGQSRGQMQEL MGHLALAALTRGTGRFAFGSLISAVDPVATIAIFNALHVDPVLNMLVFGESILNDAVSIV LTNTEEKYAPGSCCHLSLVIRIIPGFEEVYFINDHLQIVIYVLKHIDLRKTPSLEFGMMI IFAYLPYGLAEGISLSGIMAILFSGIVMSHYTHHNLSPVTQILMQQTLRTVAFLCGLRGA IPYALSLHLDLEPMEKRQLIGTTTIVIVLFTILLLGGSTMPLIRLMDIEDAKAHRRNKKD VNLSKTEKMGNTVESEHLSELTEEEYEAHYIRRQDLKGFVWLDAKYLNPFFTRRLTQETP HTKHPVASPPFHGPAPDALTAWLCLSTQDLHHGRIQMKTLTNKWYEEVRQGPSGSEDDEQ ELL >gi568815578f:49836413_50052138|GENSCAN_predicted_CDS_2|1272_bp atggcctcagtgtttctctcggctgtctacgccactcacccccaaagcacactggggttg ctgctgctttttgacactttcttagatcttgcaaatctgagtgcaggctggggtcggtca cggacagcattccaaggggctggtggcgaggggcagagcagaggtcagatgcaggagctc atgggccatttggctctagctgctctgacccgtggaactgggcgttttgcgtttggctcc ctaatatctgctgtcgatccagtggccactattgccattttcaatgcacttcatgtggac cccgtgctcaacatgctggtctttggagaaagtattctcaacgatgcagtctccattgtt ctgaccaatacagaagaaaaatatgctcctgggagctgttgtcacctttccttggttatc agaatcatacctggttttgaagaggtatacttcataaacgatcatctccaaattgtcatt tacgtgctgaagcatattgacttgaggaaaacgccttccttggagtttggcatgatgatc atttttgcttatctgccttatgggcttgcagaaggaatctcactctcaggcatcatggcc atccttttctcaggcatcgtgatgtcccactacacgcaccataacctctccccagtcacc cagatcctcatgcagcagaccctccgcaccgtggccttcttatgtggcctgcggggagcc atcccctatgccctgagcctacacctggacctggagcccatggagaagcggcagctcatc ggcaccaccaccatcgtcatcgtgctcttcaccatcctgctgctgggcggcagcaccatg cccctcattcgcctcatggacatcgaggacgccaaggcacaccgcaggaacaagaaggac gtcaacctcagcaagactgagaagatgggcaacactgtggagtcggagcacctgtcggag ctcacggaggaggagtacgaggcccactacatcaggcggcaggaccttaagggcttcgtg tggctggacgccaagtacctgaaccccttcttcactcggaggctgacgcaggagacaccc cacacaaaacacccagtagcatcccctcccttccatggccctgcccctgacgccctgacg gcttggttgtgtctctcgacccaggacctgcaccacgggcgcatccagatgaaaactctc accaacaagtggtacgaggaggtacgccagggcccctccggctccgaggacgacgagcag gagctgctctga >gi568815578f:49836413_50052138|GENSCAN_predicted_peptide_3|520_aa MGKPSSMDTKFKDDLFRKYVQFHESKVDTTTSRQRPGSDECLRVAASTLLSLHKVDPFYR FRLIQFYEVVESSLRSLSSSSLRALHGAFSMLETVGINLFLYPWKKEFRSIKTYTGPFVY YVKSTLLEEDIRAILSCMGYTPELGTAYKLRELVETLQVKMVSFELFLAKVECEQMLEIH SQVKDKGYSELDIVSERKSSAEDVRGCSDALRRRAEGREHLTASMSRVALQKSASERAAK DYYKPRVTKPSRSVDAYDSYWESRKPPLKASLSLRKEPVATDVGDDLKDEIIRPSPSLLT MASSPHGSPDVLPPASPSNGPALLRGTYFSTQDDVDLYTDSEPRATYRRQDALRPDVWLL RNDAHSLYHKRSPPAKESALSKCQSCGLSCSSSLCQRCDSLLTCPPASKPSAFPSKASTH DSLAHGASLREKYPGQTQGLDRLPHLHSKSKPSTTPTSRCGFCNRPGATNTCTQCSKVSC DACLSAYHYDPCYKKSELHKFMPNNQLNYKSTQLSHLVYR >gi568815578f:49836413_50052138|GENSCAN_predicted_CDS_3|1563_bp atggggaagcccagttcaatggatactaaattcaaggatgacttatttcggaagtacgtg cagttccatgagagcaaagtggataccaccaccagcaggcagcggcctggcagcgatgag tgcctgcgggtggcagcctcaaccctgctcagcctgcacaaggtggatcccttttatcga ttccggctgatccagttctatgaggtggtggagagctccttgcgctcgctcagctcctct agcctgcgggctctgcacggcgccttcagcatgctggagacggtgggcatcaacctcttc ctctacccgtggaagaaggaattcagaagcatcaagacctacacgggcccttttgtttat tatgtcaagtcgacattactggaagaggacatccgagccatcctgagctgcatgggctac acacctgagctgggcactgcatacaagctcagagagctcgtggagaccctccaggtgaag atggtctcctttgagctctttctggccaaagtcgagtgtgagcagatgctagaaatccac tcacaagtgaaggacaagggctactccgagctggacattgtgagcgagcgcaagagcagt gcagaggatgtgcgcggctgctcggacgccctgcggcggcgggcagagggccgggagcac ctgacggcctccatgtcacgagtggcactccagaagtcggccagcgagcgggcggccaag gactactacaagccccgcgtgaccaagccctcgaggtcagtggatgcctatgacagctac tgggagagccggaagccacccctgaaggcctcattgagtcttcggaaggagcctgtggca acggatgtgggggacgacctcaaggatgagatcatccgcccatccccttcgctgctgacc atggccagctccccccacggcagcccggatgtgcttccacccgcctcccccagcaacggc ccggccctgctgcgcggtacctacttctccactcaggatgacgtggatctgtacacagac tctgaacccagggccacctaccgtcggcaggatgctctgcggccggatgtgtggctgctc agaaacgatgcccactccctctaccacaagcgctcgccccctgccaaagagtccgccctc tccaagtgccaaagctgcgggctgtcctgcagctcctccctctgccagcgctgtgacagc ctgctcacctgtcctccagcttccaagcccagcgccttccccagcaaggcctcgactcat gacagcctggcccacggggcatctctgcgggagaagtacccaggccagactcagggcctc gaccgcctcccgcaccttcactccaaatccaagccctccaccacgcccacttcccgctgt ggcttctgcaaccgcccaggcgccaccaacacctgcacccagtgttcaaaagtctcatgt gacgcctgcctcagcgcttaccattatgacccctgctacaaaaagagtgagctgcacaag ttcatgcccaacaaccagctgaactacaagtccacccagctctcccatctcgtgtacaga tag >gi568815578f:49836413_50052138|GENSCAN_predicted_peptide_4|122_aa MDAAKDVCIHDAITLVSHWYLPRILGDQLESKETGSERWHEVRQVAQGVLRPVPGGQQRP GNTRPGNDVTMRRARREGGPRLRAGSGKRSMPAADGDYNPEAEDKAEGRRARTKPAEPHF GA >gi568815578f:49836413_50052138|GENSCAN_predicted_CDS_4|369_bp atggatgctgcaaaggatgtctgcattcacgatgccatcactttagtttctcattggtac ctcccaagaatcctgggagatcagctggagagcaaagaaacgggctcagaaagatggcat gaggtacgacaagtagctcaaggggtactgagaccagtacccgggggccagcagcgaccc ggaaacactcggcccggaaatgatgtcaccatgaggcgggcccgaagagagggtggacca cggctgcgcgctggctccgggaagcggtcgatgcccgcggccgacggagactacaaccca gaggcggaggacaaagcggaaggccgaagagcgaggacgaaaccggcggaaccgcacttt ggagcctaa >gi568815578f:49836413_50052138|GENSCAN_predicted_peptide_5|601_aa MVITGVWLRLCLALGARSAPCTKDGQCTVSHDGQRQEILRELERIKEPVFSSQGPLACVR GAPRGGPPASPAPNRSPPREPRPLGLLLIGRRCAAQSGSKMAAQQRDCGGAAQLAGPAAE ADPLGRFTCPVCLEVYEKPVQECLKPKKPVCGVCRSALAPGVRAVELERQIESTETSCHG CRKNIRSHVATCSKYQNYIMEGVKATIKDASLQPRNVPNRYTFPCPYCPEKNFDQEGLVE HCKLFHSTDTKSVVCPICASMPWGDPNYRSANFREHIQRRHRFSYDTFVDYDVDEEDMMN QAPSYGARPVSSMVSVYAGARGSGSRISESHSTSFWGGMGSGDLAGGMAGDLAGMGGIQN EKETMQSLNDHLASYLDRMRSLETKNWKPESKIREHLEKKGPQVRDWSHHFKTIEDLRAQ IFTNTVDNACIVLQINNACLAADDFTIEENTTEVTTQSTEVGTAEMTHRTETCSPVLGDR PGLHEKSEGQLGEQPEGGGGPLFPADGTAQWDTAVPGVRAGTDPGRGTVPGPGVGSPAEH KVKLEAEITTYCRLLEDSEDFNLGDALDSRNSMQTIQKTTTRQTVDGKVVSETNDTKVLR H >gi568815578f:49836413_50052138|GENSCAN_predicted_CDS_5|1806_bp atggttatcacaggagtgtggctgaggctgtgtttggccctgggtgccaggtctgcaccc tgcactaaggatgggcaatgtacagtaagccatgatgggcagaggcaggaaatacttaga gaactggaaagaataaaggagccggtgttctcttcgcagggcccgctcgcttgcgtcaga ggggccccgaggggcggcccacccgctagccccgcccccaaccgctcaccgccccgcgag ccccgccccctcggcctcctcctcatcggccgccgttgcgcggcgcagagcggcagcaag atggcggcgcaacagcgggactgcgggggtgctgcgcagctggcggggccggcggcggag gctgaccccctaggacgcttcacgtgtcccgtgtgcttagaggtgtacgagaagccggta caggaatgtctgaagccgaagaagcctgtctgtggggtgtgtcgcagcgctctggcacct ggcgtccgagccgtggagctcgagcggcagatcgagagcacagagacttcttgccatggc tgccgtaagaatatccggtcccacgtggctacttgttccaaataccagaattacatcatg gaaggtgtgaaggccaccattaaggatgcatctcttcagccaaggaatgttccaaaccgt tacacctttccttgtccttactgtcctgagaagaactttgatcaggaaggacttgtggaa cactgcaaattattccatagcacggataccaaatctgtggtttgtccgatatgtgcctcg atgccctggggagaccccaactaccgcagcgccaacttcagagagcacatccagcgccgg caccggttttcttatgacacttttgtggattatgatgttgatgaagaggacatgatgaat caggcgcccagctatggcgcccggccggtcagcagtatggttagcgtctatgcaggtgcc cggggctctggttcccggatctccgagtcccactccaccagcttctggggcggcatgggg tccggggacctggccggggggatggctggggatctggcaggaatgggaggcatccagaac gagaaggagaccatgcaaagcctgaacgaccatctggcctcctacctggacagaatgagg agcctggagaccaagaactggaagccggagagcaaaatccgggagcacctggagaagaag ggaccccaggtcagagactggagccatcacttcaaaaccatcgaggacctgagggctcag atcttcacaaatactgtggacaatgcctgcattgttctgcagatcaacaatgcctgtctt gctgctgatgactttacaattgaggagaacactacagaagtcaccacgcagtccaccgag gttggaactgctgagatgactcacagaactgagacatgcagtccagtccttggagatcga cctggactccatgagaaatctgaaggccagcttggagaacagcctgagggaggtggaggc ccgctatttcctgcagatggaacagctcagtgggatactgctgtacctggagtcagagct ggcacagacccaggcagagggacagtgccaggcccaggagtaggaagccctgctgaacat aaggtcaagctggaggctgagatcaccacctactgccgcctgctggaagacagcgaggac ttcaatcttggtgatgccctggacagccgcaactccatgcaaaccatccaaaagaccacc acccgccagacagtggatggcaaagtggtgtctgagaccaacgacaccaaagttctgaga cattaa >gi568815578f:49836413_50052138|GENSCAN_predicted_peptide_6|382_aa MERAGLYQCSLMGKGPRPETVCEQVCVYEPLSPPTLQVPGDSLGKLEPGEDDFVHGCHTR HQVTKQTVVLPFSPSTGDDPRCASEPRLGGVPARALTATRREPGQQPAHLLGEWPSAETS LRLARRKPSDPNRKPNYSELQDSNPEFTFQQPYDQAHLLAAIPPPEILNPTASLPMLIWD SVLAPQAQPIAWASLRLQESPRVAELTSLSDEDSGKGSQPPSPPSPAPSSFSSTSVSSLE AEAYAAFPGLGQVPKQLAQLSEAKDLQARKAFNCKYCNKEYLSLGALKMHIRSHTLPCVC GTCGKAFSRPWLLQGHVRTHTGEKPFSCPHCSRAFADRSNLRAHLQTHSDVKKYQCQACA RTFSRMSLLHKHQESGCSGCPR >gi568815578f:49836413_50052138|GENSCAN_predicted_CDS_6|1149_bp atggagagagcagggctctatcagtgcagtctgatgggtaaggggcctaggcccgagaca gtctgcgagcaagtgtgcgtgtatgagcccctgagcccgcccaccctgcaagtgcctgga gactcactggggaagctagaaccaggggaggacgattttgttcacggctgtcacacccgg caccaagtgactaaacagacagtagttctgcccttcagccccagcaccggggacgacccg cgctgcgccagcgaaccccgcctcggaggagtccccgcccgggctctcaccgccacgcgg cgcgagcccggccagcagccggcgcacctgctcggggagtggccttcggcggagacgagc ctccgattggcgcggaggaagccctccgaccccaatcggaagcctaactacagcgagctg caggactctaatccagagtttaccttccagcagccctacgaccaggcccacctgctggca gccatcccacctccggagatcctcaaccccaccgcctcgctgccaatgctcatctgggac tctgtcctggcgccccaagcccagccaattgcctgggcctcccttcggctccaggagagt cccagggtggcagagctgacctccctgtcagatgaggacagtgggaaaggctcccagccc cccagcccaccctcaccggctccttcgtccttctcctctacttcagtctcttccttggag gccgaggcctatgctgccttcccaggcttgggccaagtgcccaagcagctggcccagctc tctgaggccaaggatctccaggctcgaaaggccttcaactgcaaatactgcaacaaggaa tacctcagcctgggtgccctcaagatgcacatccgaagccacacgctgccctgcgtctgc ggaacctgcgggaaggccttctctaggccctggctgctacaaggccatgtccggacccac actggcgagaagcccttctcctgtccccactgcagccgtgccttcgctgaccgctccaac ctgcgggcccacctccagacccactcagatgtcaagaagtaccagtgccaggcgtgtgct cggaccttctcccgaatgtccctgctccacaagcaccaagagtccggctgctcaggatgt ccccgctga >gi568815578f:49836413_50052138|GENSCAN_predicted_peptide_7|215_aa MNEQTEGGDKDDAGTGFFANIYGQLILQLWGADIHQDHEYSPRPRIFTKTTNLHLAGAHC CDSIIPGNELHGVTAPSRESGTELRKDKGLVQECTAIRPRPRRDECSTRPPLWGPQTARR QANPAWKIRRWKARRLIEGAGSGGGGGEGAAGAVRPGPETPSSQIPEFSSRALEVVGRTL TPETLVDVNSNRPSPVGESPYGSPTTGGWCSQDAG >gi568815578f:49836413_50052138|GENSCAN_predicted_CDS_7|648_bp atgaatgagcagactgagggaggagacaaggatgatgccgggacaggatttttcgccaat atctatggacagctcattcttcaactctggggggccgatattcaccaagaccacgaatat tcacccagaccacgaatcttcaccaagaccacaaatcttcaccttgcaggtgctcactgc tgtgactcaatcattccagggaatgagctgcatggggtgacagccccttcgagagagtca gggactgagctcagaaaagacaaaggacttgtccaagaatgcacagccattaggccgcgg ccccggcgagacgaatgcagcacacggccgcctttatggggcccgcagacagcgcgtcgc caggctaaccctgcgtggaaaattcggaggtggaaggcgaggcgccttattgagggggcc ggcagcggcggcggcggcggcgagggggcggcgggggctgtgcggcccgggccggaaact ccaagttcacagatccctgaattctcttcgagggccctggaggtggtgggtaggacccta actccagaaaccctggtagatgtcaacagcaacaggccctcacctgttggggaaagtccc tatgggtcacccacaactggaggctggtgtagccaggatgcaggctag >gi568815578f:49836413_50052138|GENSCAN_predicted_peptide_8|180_aa MQGEAFSAIHAWSGGRGVFLRKGPSSASFPSCLGRDGEDEQGFYTRRQKGTSAKENENTR PGEAAGMRAVVVLRGEAVSQCTFILAWGGGSSKWDLFSTDTIDRGLSTHPCQLGNFGAGK GLSVATVLAGLHTGCSLCLELSLRRHPHGFLRFVLQISLASLSETPYSTPSNRESPRAMS >gi568815578f:49836413_50052138|GENSCAN_predicted_CDS_8|543_bp atgcagggagaggccttcagtgccatccacgcctggagcggaggtaggggcgtcttcctg agaaagggacccagcagcgcctcgttcccatcttgcttgggcagagacggagaagatgaa cagggattttataccaggcgtcagaagggaaccagtgctaaagaaaatgaaaacaccagg ccgggagaggcagctggcatgcgggccgtggtggttttacgtggtgaggcggtatctcag tgcacgtttattctggcctggggaggaggcagcagcaaatgggacttgttcagcacagac accattgacaggggcctctccactcatccctgccagctggggaacttcggggctgggaag ggcctttctgtggccacggtcctggctggccttcacactggctgttccctctgcctggaa ctctctctccgcagacacccacatggcttccttcgattcgtcctgcagatctccctggca tcactttctgagacaccgtactcaaccccctccaacagggaaagcccccgcgctatgtcc tga