GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:36:32 Sequence gi568815592f:137771228_137981316 : 210089 bp : 40.84% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8928 9124 197 0 2 126 73 69 0.070 7.34 1.02 Intr + 20574 20734 161 1 2 80 71 73 0.016 3.59 1.03 Term + 28438 28569 132 0 0 -27 52 216 0.029 3.61 1.04 PlyA + 30510 30515 6 1.05 2.00 Prom + 34915 34954 40 -6.85 2.01 Init + 37878 38055 178 2 1 49 77 114 0.021 5.87 2.02 Intr + 40628 40803 176 0 2 43 42 127 0.018 2.34 2.03 Term + 45155 45280 126 1 0 67 50 153 0.058 6.70 2.04 PlyA + 46360 46365 6 1.05 3.00 Prom + 47369 47408 40 -6.95 3.01 Init + 47511 47550 40 2 1 77 110 43 0.365 5.80 3.02 Intr + 49028 49210 183 0 0 96 95 18 0.195 2.14 3.03 Term + 53016 53134 119 1 2 49 42 168 0.501 6.02 3.04 PlyA + 53550 53555 6 1.05 4.06 PlyA - 57692 57687 6 1.05 4.05 Term - 62422 62324 99 1 0 68 36 112 0.834 1.15 4.04 Intr - 64347 63995 353 1 2 121 45 278 0.297 20.92 4.03 Intr - 68133 68075 59 2 2 69 58 40 0.008 -3.29 4.02 Intr - 71276 71126 151 0 1 76 88 127 0.027 9.80 4.01 Init - 96679 96415 265 1 1 86 74 261 0.458 21.72 4.00 Prom - 99311 99272 40 -8.85 5.00 Prom + 99328 99367 40 -8.05 5.01 Init + 100001 100295 295 1 1 89 107 309 0.928 30.29 5.02 Intr + 103195 103242 48 2 0 85 58 64 0.629 0.83 5.03 Intr + 103618 103808 191 2 2 138 77 144 0.999 16.68 5.04 Intr + 104461 104697 237 0 0 78 92 156 0.467 11.89 5.05 Intr + 104858 104939 82 1 1 46 85 39 0.471 -2.31 5.06 Intr + 105849 106029 181 1 1 93 80 207 0.999 18.40 5.07 Intr + 107205 108124 920 0 2 101 92 477 0.989 39.36 5.08 Intr + 108784 109025 242 2 2 32 85 114 0.456 1.85 5.09 Intr + 109808 109967 160 1 1 106 72 138 0.684 12.64 5.10 Term + 110523 110701 179 1 2 55 37 146 0.575 3.07 5.11 PlyA + 111337 111342 6 1.05 6.08 PlyA - 111470 111465 6 1.05 6.07 Term - 116419 115654 766 0 1 28 49 225 0.046 4.49 6.06 Intr - 119722 119304 419 1 2 78 71 248 0.190 14.00 6.05 Intr - 130162 130071 92 2 2 64 115 1 0.074 -0.81 6.04 Intr - 154268 154107 162 0 0 -11 70 201 0.160 7.43 6.03 Intr - 154936 154643 294 2 0 104 58 91 0.483 3.76 6.02 Intr - 174575 174442 134 1 2 109 43 98 0.592 6.77 6.01 Init - 175878 175814 65 0 2 104 73 52 0.933 5.97 6.00 Prom - 178301 178262 40 -6.35 7.06 PlyA - 178321 178316 6 1.05 7.05 Term - 180143 179986 158 0 2 46 39 130 0.367 1.01 7.04 Intr - 186467 186335 133 2 1 53 106 35 0.034 1.10 7.03 Intr - 195819 195698 122 1 2 88 37 61 0.140 0.29 7.02 Intr - 200330 200199 132 0 0 -4 92 92 0.008 0.10 7.01 Intr - 209423 209373 51 0 0 61 131 52 0.470 4.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 37878 38318 441 2 0 49 43 242 0.946 11.80 S.002 Sngl - 140938 140615 324 1 0 94 48 130 0.837 5.39 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:137771228_137981316|GENSCAN_predicted_peptide_1|163_aa XHGPPSVWPLLWWTCLVTGHPGPEALKVVQPHLHNNITILLLSLPLAFTLLLWDLGTKLW PQSTGLAEFLLLPQMLAKKIQVKSLGRIDECHNQSHIIKNEADLSYITDEEPYRIDACYQ NIAGAGDHQLQVSALKSNRAAALDKGLAATDTRIPVSATCELS >gi568815592f:137771228_137981316|GENSCAN_predicted_CDS_1|492_bp nnacatgggcctccctccgtgtggcctctgctgtggtggacctgcctggtcacaggacat ccagggccagaagctctgaaggtagtgcagcctcatctccacaataacatcacaatcctg ctgctgtcactgccattggccttcactctcctactttgggacctgggtactaagttatgg cctcagagtacaggccttgcagagtttcttttgctcccacaaatgctagccaagaaaatc caagtgaaatctctggggagaatagatgaatgtcacaaccagagtcacattatcaagaat gaggctgacttgagttatataacagatgaagaaccttataggattgatgcctgttatcag aacattgctggagctggagaccaccagctgcaggtttcagccctgaagagcaacagggca gcagctctggacaaggggctggctgctacagatacacggattcctgtcagcgcaacctgt gaactgtcatga >gi568815592f:137771228_137981316|GENSCAN_predicted_peptide_2|159_aa MGTGCRVQAYTYSVSRPLTSARNSSSAPEVHGANHSSAYSRAQRAGSSGEAMSLQRATQG NESFIRATPFVKGLLPFSHISGMSSVRQTVVTKCVLWNPNDASRVEKEFQNYRKGAMRMP TTCTGMSETAKDSDHEIFSLVMEKDDDQEIKANKYTTTN >gi568815592f:137771228_137981316|GENSCAN_predicted_CDS_2|480_bp atggggacagggtgcagagtccaggcctacacgtacagtgtaagcaggcccctgacatca gccaggaactccagctcagcacctgaggtgcatggggctaatcactcctctgcgtacagc agagctcagagagcaggctcctcaggagaagccatgagtctgcagagggcaacacaaggc aatgaatcatttatccgggcaacccctttcgtcaagggtcttcttcctttcagccacatt tcggggatgtcttctgtaaggcaaacggttgtgaccaagtgcgtactctggaatccaaat gatgcaagtagagttgaaaaggaatttcaaaattacaggaaaggagctatgaggatgcct acaacgtgcacagggatgagtgaaaccgccaaggactctgaccatgaaattttcagccta gtcatggagaaagatgatgaccaagaaataaaagcaaataaatacacaaccacaaattga >gi568815592f:137771228_137981316|GENSCAN_predicted_peptide_3|113_aa MVLDTHNTLAHKTDLKLSLDAQSSTSVKLLLNRGTQSQGSEWGEAWKQKKVLCLLKVCEK RHLRSLGNKKSTNHGFTRRKSLVAASPRTATHDAQKSPYSSPGGGDDAGKCHA >gi568815592f:137771228_137981316|GENSCAN_predicted_CDS_3|342_bp atggtattggacactcacaatacgttagcgcataaaacagatctcaagctttccttagat gctcagagttcaacttcagtgaagttgctgctgaacagaggcacacagagccaggggagt gagtggggagaagcatggaagcagaagaaggtactgtgccttctcaaagtctgtgaaaag agacacctgagatcattgggaaacaaaaaatctacaaatcatggttttacccgcaggaaa agtctggtggcagccagtcccaggacagccacgcatgacgcacagaagagtccttatagt tcccctggagggggtgatgatgcaggcaaatgccacgcctga >gi568815592f:137771228_137981316|GENSCAN_predicted_peptide_4|308_aa MSMQRLAPKKTRKEQSANDHPIGGPEGRLFTSQLQLKFRALSERNSWLEVSRAVTPTSAA VTSTPSTSKPRQKRPTNSQSRSAAKPTPERTQMRRNQKTNPGNMTKQGLSTPPINHTSSP AMDPNQEEIPDLPEKEPRSTGNPSQNNQARERNKGNPNAFPDKTISFLQGFKGFDLNMYY PNHEDSDGRVINDDDLGGNGLEIQADMAASPSYKGRDKTPRGKKCRVIGFGTEERTTENV HGKKILIKQMKYYEQSNVNQLVQYFLNLLDNKDHQESFPPYFDAAHKPSPEAKQMPAPCF LYSLQNMS >gi568815592f:137771228_137981316|GENSCAN_predicted_CDS_4|927_bp atgagcatgcaacgcttggctccaaaaaagacacgaaaggagcaaagcgccaacgaccac ccgatcggagggcccgaggggcgcctcttcaccagtcagctgcagcttaagttccgtgca ttatctgaaaggaacagctggctggaggtatccagggctgtcactccaacctctgcagca gtgacctcaactcccagcacttcaaaacccagacagaaacgtccaacaaactcccagtcc aggagcgctgcaaaaccaacgccagagcgtacccaaatgagaaggaaccagaaaaccaac cctggtaatatgacaaaacaaggcttgtcaacaccccccataaatcacactagttcacca gcaatggatccaaaccaagaagaaatccctgatttacctgaaaaagaacccaggagtact ggaaatcctagccagaacaatcaggcaagagaaagaaataaagggaatccaaatgctttt cctgataaaactatttcctttctccaaggttttaaaggttttgacctgaacatgtattac ccgaaccatgaggactctgatggaagagttatcaatgatgacgacttgggtggaaatggt ctagagattcaggcagatatggctgcatcaccatcctataagggaagagataagactcct agagggaaaaaatgtagggtaatagggtttgggactgaagaaaggaccactgaaaatgtc catggtaaaaagatcttaattaagcaaatgaaatattatgaacagagcaacgtgaatcag ttagttcagtatttcctaaacctgcttgataacaaggatcaccaggagtcttttccacca tattttgatgcagcacacaagccctcaccagaagctaagcagatgccggcaccttgcttc ttgtacagcctgcagaacatgagctaa >gi568815592f:137771228_137981316|GENSCAN_predicted_peptide_5|844_aa MAEQVLPQALYLSNMRKAVKIRERTPEDIFKPTNGIIHHFKTMHRYTLEMFRTCQFCPQF REIIHKALIDRNIQATLESQKKLNWCREVRKLVALKTNELESSNASAFTSKSRDGDGNCL MHATSQYMWGVQDTDLVLRKALFSTLKETDTRNFKFRWQLESLKSQEFVETGLCYDTRNW NDEWDNLIKMASTDTPMARSGLQYNSLEEIHIFVLCNILRRPIIVISGEMPADHGSVLKC FQPYALAPGENHTAKVQECYRYPIVLGYDSHHFVPLVTLKDSGPEIRAVPLVNRDRGRFE DLKVHFLTDPENEMKEKLLKEYLMVIEIPVQGWDHGTTHLINAAKLDEANLPKEINLVDD YFELVQHEYKKWQENSEQGRREGHAQNPMEPSVPQLSLMDVKCETPNCPFFMSVNTQPLC HECSERRQKNQNKLPKLNSKPGPEGLPGMALGASRGEAYEPLAWNPEESTGGPHSAPPTA PSPFLFSETTAMKCRSPGCPFTLNVQHNGFCERCHNARQLHASHAPDHTRHLDPGKCQAC LQDVTRTFNGICSTCFKRTTAEASSSLSTSLPPSCHQRSKSDPSRLVRSPSPHSCHRAGN DAPAGCLSQAARTPGDRTGTSKCRKAGCVYFGTPENKGFCTLCFIEYRENKPSLYRWGDP YVVLTSIHSHVDFAAASGKVSPTASRFQNTIPCLGRECGTLGSTMFEGYCQKCFIEAQNQ RFHEAKRTEEQLRSSQRRDVPRTTQSTSRPKCARASCKNILACRSEELCMECQHPNQRMG PGAHRASLRDVKPDPHGSEALCKKLKEAQGKWTYSESVCSSWFFPTCPVPFLRTRQKCRT IHGL >gi568815592f:137771228_137981316|GENSCAN_predicted_CDS_5|2535_bp atggctgaacaagtccttcctcaggctttgtatttgagcaatatgcggaaagctgtgaag atacgggagagaactccagaagacatttttaaacctactaatgggatcattcatcatttt aaaaccatgcaccgatacacactggaaatgttcagaacttgccagttttgtcctcagttt cgggagatcatccacaaagccctcatcgacagaaacatccaggccaccctggaaagccag aagaaactcaactggtgtcgagaagtccggaagcttgtggcgctgaaaacgaacgaacta gagagcagcaatgccagtgccttcaccagcaaatcaagggatggtgacggcaattgcctc atgcatgccacttctcagtacatgtggggcgttcaggacacagacttggtactgaggaag gcgctgttcagcacgctcaaggaaacagacacacgcaactttaaattccgctggcaactg gagtctctcaaatctcaggaatttgttgaaacggggctttgctatgatactcggaactgg aatgatgaatgggacaatcttatcaaaatggcttccacagacacacccatggcccgaagt ggacttcagtacaactcactggaagaaatacacatatttgtcctttgcaacatcctcaga aggccaatcattgtcatttcaggtgagatgcctgcagatcacggatctgtacttaaatgc tttcagccttatgccttggctcctggagaaaaccacactgccaaagttcaggaatgctac agataccccattgttctcggctatgacagccatcattttgtacccttggtgaccctgaag gacagtgggcctgaaatccgagctgttccacttgttaacagagaccggggaagatttgaa gacttaaaagttcactttttgacagatcctgaaaatgagatgaaggagaagctcttaaaa gagtacttaatggtgatagaaatccccgtccaaggctgggaccatggcacaactcatctc atcaatgccgcaaagttggatgaagctaacttaccaaaagaaatcaatctggtagatgat tactttgaacttgttcagcatgagtacaagaaatggcaggaaaacagcgagcaggggagg agagaggggcacgcccagaatcccatggaaccttccgtgccccagctttctctcatggat gtaaaatgtgaaacgcccaactgccccttcttcatgtctgtgaacacccagcctttatgc catgagtgctcagagaggcggcaaaagaatcaaaacaaactcccaaagctgaactccaag ccgggccctgaggggctccctggcatggcgctcggggcctctcggggagaagcctatgag cccttggcgtggaaccctgaggagtccactggggggcctcattcggccccaccgacagca cccagcccttttctgttcagtgagaccactgccatgaagtgcaggagccccggctgcccc ttcacactgaatgtgcagcacaacggattttgtgaacgttgccacaacgcccggcaactt cacgccagccacgccccagaccacacaaggcacttggatcccgggaagtgccaagcctgc ctccaggatgttaccaggacatttaatgggatctgcagtacttgcttcaaaaggactaca gcagaggcctcctccagcctcagcaccagcctccctccttcctgtcaccagcgttccaag tcagatccctcgcggctcgtccggagcccctccccgcattcttgccacagagctggaaac gacgcccctgctggctgcctgtctcaagctgcacggactcctggggacaggacggggacg agcaagtgcagaaaagccggctgcgtgtattttgggactccagaaaacaagggcttttgc acactgtgtttcatcgagtacagagaaaacaaaccatctctgtatcggtggggtgacccc tatgtggtactaactagcatccattctcatgtagattttgctgctgcctcagggaaagtc agtcccacagcgtccaggttccagaacaccattccgtgcctggggagggaatgcggcacc cttggaagcaccatgtttgaaggatactgccagaagtgtttcattgaagctcagaatcag agatttcatgaggccaaaaggacagaagagcaactgagatcgagccagcgcagagatgtg cctcgaaccacacaaagcacctcaaggcccaagtgcgcccgggcctcctgcaagaacatc ctggcctgccgcagcgaggagctctgcatggagtgtcagcatcccaaccagaggatgggc cctggggcccaccgggcatctctcagagatgtgaagccagatcctcatggcagcgaggcc ctctgcaagaagctcaaggaagctcagggaaaatggacgtattcagagagtgtttgtagt tcatggtttttccctacctgcccggttcctttcctgaggacccggcagaaatgcagaacc atccatggactgtga >gi568815592f:137771228_137981316|GENSCAN_predicted_peptide_6|643_aa MATSLEFYSHAMAKLNTVKAEQDTHLPTERPPNRDINTETSAEITRSPYANPCKKKPRET DAQMKRALPLYSESGPVLFMFFRGPPTNISKTPSTKEEINACVWGLGRKWARRGGNWSNF GNKAHSRRPERKTSGGQDTLGVPVLYGEELESGGKGGERQSGRAAIKHKTGIIPGPVELV PVHIVRLADNECEDLSSEPGTLARNVGCCRYAHEDEVTGPSSTERLVPSNSGFFSCPHVS LAKILQTFQALKRAVDLPAQHSSSAKGQTASSSGSLIPMPPTGRHLPAGVNRHPIQENSG WHLAGDPLGQSFQRKEQAAIFAVLQPLLAIPIKQTGLGVDLQQTPADLQQRGLTVRRKTN KPKGIASISTKSTSTPKPHPKVTNIKEGRTNDKNHMIISTDAEKAFNKIQHPFMLKTLNK LGIDGTYLKIIRAIYNKSIANILNGQKLEAFPLKMSTRHGCLLSPLLFNMVLEVLARAIK QEKEIKSIRIGREEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQT FLYTNNRQTESQIISELLFTIATKRIKYLGIQLTRDVKDLFKENYKPLLKEIREDTNKWK NISCSWIRRINIVKMAILPKVIYRFNAVPIKLPLTYFTELEKF >gi568815592f:137771228_137981316|GENSCAN_predicted_CDS_6|1932_bp atggccaccagtttggagttttattctcatgccatggcaaagctgaatacagtgaaagct gagcaggatacacacctacctaccgagagacctccaaacagagacatcaacacagaaacc tccgcagagataaccagatctccatatgcaaacccatgtaaaaagaaacctagagaaaca gatgcccagatgaagagggctttaccactttattctgagtcaggtccagttttgtttatg tttttcagagggccacctacaaatatcagcaaaacccctagcacaaaagaagagatcaat gcctgtgtctgggggttagggaggaagtgggctcgaaggggaggaaattggagcaatttt gggaacaaagcacacagccggaggcctgagaggaaaacttcgggtgggcaggacactttg ggggtcccagtcctgtatggagaggagctggagtctggtggaaagggaggggaaaggcag agtggcagagcagctataaaacataaaaccgggataataccaggacctgttgaattggta cctgtgcacattgtgaggttagctgacaatgaatgtgaagatcttagctcagagcctggc acattggcgagaaatgttggctgctgccgttacgcacatgaggatgaagtcacgggtccc agttctacagaaaggttagtccccagtaattctggcttcttttcctgtcctcatgtctcc cttgccaagatcctgcaaactttccaggctctgaagagagcagtagatctcccagcacag cactcgagctctgctaagggacagactgcctcctcaagtggatccctgattcccatgcct cctactgggagacacctcccagcaggggtcaacagacaccccatacaggagaactctggc tggcatctggcgggtgaccctctggggcaaagcttccagaggaaggaacaggcagcaatc tttgctgttctgcagcctttgctggcgatacccataaagcaaacaggtttgggagtggac ctccagcaaactccagcagacctgcagcagagaggcctgactgttagaaggaaaactaac aaaccgaaaggaatagcatcaatatcaacaaaaagcacatccacaccaaaaccccatcca aaggtcaccaacatcaaagagggaagaaccaatgacaaaaaccacatgattatctcaaca gatgcagaaaaggccttcaacaaaattcaacaccccttcatgctaaaaactctcaacaaa ctaggtattgatggaacatatctcaaaataataagagctatttataacaaatccatagct aatatactgaatgggcaaaagctggaagcatttcctttgaaaatgagcacaagacacgga tgccttctctcaccactcctattcaacatggtgttggaagttctggccagggcaatcaag caagagaaagaaataaagagtattcgaataggaagagaagaagtcaaattgtctctgttt gcagatgacatgattgtatatttagaaaaccccatcgtctcagcccaaaatctccttaag ctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaaaca ttcctatacaccaataatagacaaacagagagccaaatcataagtgaactcctattcaca attgctacaaaaagaataaaatacctaggaatacaacttacaagggatgtgaaggacctc ttcaaggagaactacaaaccactgctcaaggaaatcagagaggacacaaacaaatggaaa aacatttcatgctcatggataagaagaatcaatattgtgaaaatggccatactgcccaaa gtaatttatagattcaatgctgtccctattaagctaccattgacttacttcacagaatta gaaaaattctga >gi568815592f:137771228_137981316|GENSCAN_predicted_peptide_7|198_aa XVSSMRMNPNEYQEGEAEGKESAMRISRLRVHQSEGLAKARIYLVHLQDSKKGNMAGVES VGKFSSPPQWPSPVLQDCSVTTISSPPSAFSFPFPRPILVLNQLTLISLILVAFWIKRRA LLRSFAFAMASTLAPPDDLHIAGFLLGGRAGHSGMVLYIVPLMDNAAGARRRKVAHQNQE EKLAFFCNAAPVASADKA >gi568815592f:137771228_137981316|GENSCAN_predicted_CDS_7|597_bp naagtcagctctatgaggatgaatcccaatgaataccaagaaggagaggccgaggggaaa gagtcagctatgagaatatcaaggctaagagtgcaccagtcagaaggattagcaaaggca agaatatacctggtgcatctccaggacagcaagaaaggcaatatggctggagtggagtca gtgggaaagttctcctctccaccacagtggccgtccccagtccttcaagactgttctgtc accaccatctccagtccaccctctgccttctccttcccattccctcgtccaatcctggtg cttaatcaattgactctaataagcctgattcttgttgcattttggataaagagaagggcc ttgctgaggtcatttgcatttgctatggcctccactcttgctcctccggatgacctccac attgctggcttcttgttgggtggtagggcagggcacagcgggatggtcttatacatagta ccactgatggataatgcagctggagcaagaagaagaaaagtagcacaccaaaatcaggaa gagaaactcgctttcttctgcaatgctgctccagtggcatctgctgacaaagcttag