GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:48:46 Sequence gi568815597f:96346930_96548315 : 201386 bp : 38.26% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 92 87 6 1.05 1.02 Term - 1364 1264 101 0 2 43 44 111 0.618 -0.49 1.01 Init - 4055 3878 178 2 1 54 119 99 0.861 9.07 1.00 Prom - 23056 23017 40 -2.85 2.00 Prom + 28639 28678 40 -5.75 2.01 Init + 30346 30436 91 0 1 78 98 93 0.991 10.00 2.02 Term + 33301 34829 1529 2 2 67 42 429 0.099 25.85 2.03 PlyA + 35326 35331 6 1.05 3.00 Prom + 35620 35659 40 -8.05 3.01 Init + 37612 37720 109 0 1 45 93 129 0.652 9.53 3.02 Intr + 43721 43947 227 0 2 25 30 196 0.054 4.28 3.03 Intr + 55742 55870 129 1 0 12 103 125 0.289 6.37 3.04 Term + 72255 72395 141 2 0 64 44 131 0.239 3.25 3.05 PlyA + 72438 72443 6 1.05 4.02 PlyA - 72580 72575 6 1.05 4.01 Sngl - 83880 83482 399 0 0 25 38 279 0.979 12.61 4.00 Prom - 85828 85789 40 -6.95 5.02 PlyA - 86523 86518 6 1.05 5.01 Sngl - 88296 87202 1095 0 0 60 39 269 0.565 15.93 5.00 Prom - 88621 88582 40 -12.62 6.04 PlyA - 88721 88716 6 1.05 6.03 Term - 89985 89189 797 1 2 26 32 391 0.620 19.75 6.02 Intr - 91817 91675 143 1 2 41 71 91 0.164 1.78 6.01 Init - 93039 92900 140 0 2 37 78 119 0.143 5.46 6.00 Prom - 93785 93746 40 -8.05 7.00 Prom + 95473 95512 40 -8.55 7.01 Init + 100001 100164 164 1 2 86 29 145 0.663 7.65 7.02 Intr + 100249 100561 313 1 1 -53 -40 382 0.582 6.56 7.03 Term + 100637 101389 753 1 0 22 42 1011 0.725 82.35 7.04 PlyA + 102158 102163 6 1.05 8.06 PlyA - 103082 103077 6 1.05 8.05 Term - 108228 107942 287 1 2 16 53 186 0.883 2.48 8.04 Intr - 127123 126940 184 1 1 58 85 53 0.094 0.44 8.03 Intr - 128231 128139 93 2 0 90 59 44 0.193 0.94 8.02 Intr - 131493 131378 116 1 2 64 79 57 0.131 1.65 8.01 Init - 137445 137370 76 0 1 74 69 100 0.294 8.00 8.00 Prom - 178054 178015 40 -5.05 9.00 Prom + 178995 179034 40 -4.25 9.01 Sngl + 182045 182452 408 1 0 62 32 307 0.851 18.54 9.02 PlyA + 182462 182467 6 1.05 10.00 Prom + 182885 182924 40 -3.65 10.01 Init + 198873 198936 64 2 1 64 115 50 0.382 6.66 10.02 Term + 199153 199229 77 2 2 47 48 76 0.269 -3.48 10.03 PlyA + 200368 200373 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 33240 32518 723 0 0 31 37 327 0.844 17.87 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:96346930_96548315|GENSCAN_predicted_peptide_1|92_aa MGEDFPPVWAPENKTHKGIGISLRLDPQEFLTLKLVHTQPPSIRQLPFKCSYQLPAAAPE KADKNISKDMEDLNNTINKSDLMGTYNSMPNN >gi568815597f:96346930_96548315|GENSCAN_predicted_CDS_1|279_bp atgggagaggattttcctccagtctgggcccctgaaaataaaactcacaaaggtataggg atctccctaagactggatccccaggaatttttaactctcaagctagtccacactcaacct ccatcaattcgtcaattaccatttaagtgttcctaccagttacctgctgcagctcctgag aaagcagacaaaaacattagtaaggacatggaggatttgaacaacacaattaacaagtca gatctaatgggcacctacaactctatgcccaacaattag >gi568815597f:96346930_96548315|GENSCAN_predicted_peptide_2|539_aa MDPNQEEIPDLPEKQSRRSVIKLIKEAPEKVLEVLVREIRQEKEIKGIQLGKEEVKLSLF ADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFT IASKRIKYLGIQLTRDVKDLFKQNYKLLLNEIKEDTKKWKNIQCSWIGRTNIVKMAILPK VIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARITKSIRSQKNKAGGITLPDFKLYYK ATVTKTAWYWYQNRDIDQWNRTEPSEITLNIYSYLIFDKPEKNKQWGKDSLFNKWCWENW LAICRKLKLDPFLTPYTEINSRWIKDLNVRTKTIKTLEENLGITIQDIGMGKDFMSKTPK AMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFTTYSSDKGLISRIYNELKQI YKKKTNNPIKKWVKDMNRHFSKEDIYAAKKHMKKCSPSLAIREMQIKTTMRYHLTPVRMA VIKMSGNNRCWRGCGEIGTLLHCWWDCKLVQPLWKSVWQFLRDLELEIPFDPAIPLLGI >gi568815597f:96346930_96548315|GENSCAN_predicted_CDS_2|1620_bp atggatccaaaccaagaagaaatccctgatttacctgaaaaacaatccagaaggtcagtt attaagctaatcaaggaggcaccagagaaagtgttggaagttctggtcagggaaattagg caggagaaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgttt gcagatgacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaag ctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagca ttcttatacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcaca attgcttcaaagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctc ttcaagcagaactacaaactacttctcaatgaaataaaagaggatacaaagaaatggaag aacattcaatgctcatggataggaagaaccaatattgtgaaaatggccatactgcccaag gtaatttatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattg gaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcaccaagtcaatc cgaagccaaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaag gctacagtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaac agaacagagccctcagaaataacgctgaatatctacagctatctgatctttgacaaacct gagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaattgg ctagccatatgtagaaagctgaaactggatcccttccttacaccttatacagaaattaat tcaagatggattaaagacttaaacgttagaactaaaaccataaaaaccctagaagaaaac ctaggcattaccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaa gcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagcttctgc acagcaaaagaaactaccatcagagtgaacaggcaacctacaaaatgggagaaaattttc acaacctactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatt tacaagaagaaaacaaacaaccccatcaaaaagtgggtgaaggacatgaacagacacttc tcaaaagaagacatttatgcagccaaaaaacacatgaaaaaatgctcaccatcactggcc atcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagttagaatggca gtcattaaaatgtcaggaaacaacaggtgctggagaggatgtggagaaataggaacactt ttacactgttggtgggactgtaaactagttcaaccattgtggaagtcagtgtggcaattc ctcagggatctagaactagaaataccatttgacccagccatcccattactgggtatatag >gi568815597f:96346930_96548315|GENSCAN_predicted_peptide_3|201_aa MNIVMDALKREHFYTAGRNVNQYNHYGKQCDDSLKNLDLNSPFKPPPGSVRMPNIPNTQP AIMKPTEKHPVYTLIAKKHALAQAELKHQEEVERKDAELVIGNEKCKTSLNMRSFNQSTS LRILKSNASAKNKRKSYGNTDRDWLVEWRLVKSLQLLEKLQVLTYCAITMTSLAVLVRLL PAMDVVTGTIFINSCHSRINA >gi568815597f:96346930_96548315|GENSCAN_predicted_CDS_3|606_bp atgaatattgtcatggatgcattgaaaagggaacatttctacactgctggtaggaatgta aaccagtacaaccactacggaaaacagtgtgatgattccttaaagaacttggatctcaac agtcccttcaagcctccaccaggcagtgtgaggatgcctaacatacccaatacacaacca gcaataatgaaaccaacagagaaacatccagtttatacgctgattgcaaaaaaacatgca ttggcccaagctgaacttaaacaccaggaagaagtagaaagaaaagatgcagaattagtc atcgggaatgagaaatgcaaaacctcactcaacatgagaagtttcaaccagagcactagc ctccgcattctgaaatccaacgccagtgccaaaaataagaggaagagctatggtaacact gacagagactggttggtggagtggagactagtgaagagtttacagcttttggaaaagctc caggtcctcacgtattgtgcaataacaatgacttccttggcggttttggtacgtttattg ccggcaatggacgttgtaacaggaacaattttcattaactcctgccactcaaggattaat gcatga >gi568815597f:96346930_96548315|GENSCAN_predicted_peptide_4|132_aa MTNPQANIILNGEKSKASPPRTGTRQGCPLSPFLFNIVLEVLAKALRQEKEIKGIQIHKE EVKLSLFADDMIVYLENPKDSSKKLLELINEFSEVSEYKINVHRSVALLYTNSDHTENQI KNSTLFTIAAKK >gi568815597f:96346930_96548315|GENSCAN_predicted_CDS_4|399_bp atgacaaacccacaagccaacattatactgaatggggaaaagtcgaaagcatcccctcca agaaccggaacaagacaaggatgcccactttcaccatttctattcaacatagtactggaa gtcctagccaaagcactcagacaagagaaagaaataaagggcatccaaatccataaagag gaagtcaaactatcactgtttgctgatgatatgattgtatacctagaaaaccctaaagat tcatccaaaaagctcctggaactgataaatgaattcagtgaagtttcagaatacaaaatt aatgtacacagatcagtagctctgctatacaccaacagcgaccatactgagaatcaaatc aagaactcgacccttttcacaatagctgcaaaaaaataa >gi568815597f:96346930_96548315|GENSCAN_predicted_peptide_5|364_aa MSEPPFTIASKIIKYLQIQLTRDVKDLFKENYKPLLNKIKEDTNKWKNIPCSWIGRINIV KMAILPKVICRFNAISTKLPITFFTELEKTTLKFIWNQKRACIAKTILSKKNKAGGIMLP DLKLYYKATVSKTAWYWYQNRDIDQGNRIEPSEIISYIYNHLIFDKPDKNKKWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKD FMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNMQPTEWEKIFAIYPSDKGLISRI YKELKQIYKKKTKQPNQKVAKGYEQTLLERRHLCSQQTREKMLILTGHQRNANQNHSEIP SHTS >gi568815597f:96346930_96548315|GENSCAN_predicted_CDS_5|1095_bp atgagtgaacccccattcacaattgcttcaaagataataaaatacctacaaatccaactt acaagggatgtgaaggacctcttcaaagagaactacaaaccactgctcaacaaaataaaa gaggacacaaacaaatggaagaatattccatgctcatggataggaagaatcaatattgtg aaaatggccatactgcccaaggtaatttgtagattcaatgccatctccaccaagctacca ataaccttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcctgcattgccaagacaatcctaagcaaaaagaacaaagctggaggcatcatgttacct gacttgaaactatactacaaagctaccgtaagcaaaacagcatggtactggtaccaaaac agagatatagaccaagggaacagaatagagccctcagaaataatatcatacatctacaac catctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaat aaatggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaattaattcaagatggattaaagatttaaatgttagacctaaaacc ataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacatgcaacct acagaatgggagaaaatttttgcaatctacccatctgacaaagggctaatatccagaatc tacaaagaacttaaacaaatttacaagaaaaaaaccaaacaacccaatcaaaaagtggcc aaagggtatgaacagacacttctcgaaagaagacatttatgcagccaacagacacgtgaa aaaatgctcatcctcactggccatcagagaaatgcaaatcaaaaccacagtgagatacca tctcacaccagttag >gi568815597f:96346930_96548315|GENSCAN_predicted_peptide_6|359_aa MRKNQKNNSGNMTERVSLTPPKDHTSSPAMDPSQDEISELPEKEFRRKTGSGVDLQQIPT DQQLMVLTVRRKTNKQKGHPHQNSICTSPSSKTKAQRCTYYKTDHIVGSKALLSKCKITE IITNYLSDHSAIKLELRIKKLTQNCSTTWKLNNLLLNDYWVNNEMKAEIKMFFETNENKD TTYQNLWDTFKAVCRGKFIALSAHKRKQEISKIDTLTSQLKELEKQEQTHSKASRKQEIS KIRVELKEIETQKTLQKINESRSWFFEKINKIDRLPARLIKKKREKNQIDAIKSDKGAIT TDPTEIQTTIREYYKQLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPKQALKLRQ >gi568815597f:96346930_96548315|GENSCAN_predicted_CDS_6|1080_bp atgagaaagaaccagaaaaacaattctggtaatatgacagaacgagtttccttaacaccc ccaaaagatcataccagctcaccagcaatggatccaagccaagatgaaatttctgaattg ccagaaaaagaattcaggaggaaaacagggtctggagtggacctccagcaaattccaaca gaccagcagctgatggtcctgactgttagaaggaaaactaacaaacagaaaggacatcca caccaaaactccatctgtacgtcaccatcatcaaagaccaaagcacaacgctgtacttat tacaaaactgaccacatagttggaagtaaagcactcctcagcaaatgtaaaataacagaa attataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaaa ctcactcaaaactgctcaactacatggaaactgaacaacctgctcctgaatgactactgg gtaaataatgaaatgaaggcagaaataaaaatgttctttgaaaccaatgagaacaaagac acaacgtaccaaaatctctgggacacatttaaagcagtgtgtagagggaaatttatagca ctaagtgcccacaagagaaagcaagaaatatctaaaattgacaccctaacatcacaatta aaagaactagagaagcaagagcaaacacattcaaaagctagcagaaaacaagaaataagt aagatcagagtagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaa tctaggagctggttttttgaaaagatcaacaaaattgatagactgccagcaagactaata aagaagaaaagagagaagaatcaaatagatgcaataaaaagtgataaaggggctatcacc actgatcccacagaaatacaaaccaccatcagagaatattataaacaactctacgcaaat aaactagaaaatctagaagaaatggataaattcctggacacatacactctcccaagacta aaccaggaagaagttgaatctctgaatagaccaaaacaggctctgaaattgaggcaataa >gi568815597f:96346930_96548315|GENSCAN_predicted_peptide_7|409_aa MGKEKTHINIVVIGHVDSGKSTTSGHLIYKCGGIDKRTIEKLQKEAAEMGKGSFNKYYVT IIDAPGHRDFIKNMITGTSQAGCAVLIVAAGVGEFEAGISKNGQTREHALLAYTLGVKQL IVDVNKMDSTEPPYSQKRYEEIVKEISTYIKKIGYNPNTGWKVTHKDGNASGTMLLEALD CILPPTRPTDKPLRLPLQDVYKIGGIGTVLVGRVETGILKPGMVVTFAPVSVTTEVKSVE MHHEALNEALPGDNMGFNVKNVSVKDVHRGNVAGDSKNDPPMEAAGFTAQVIILNHPGQI SSGYAPVLDCHTAHIACKFAELKEKIDRRSGKKLEDGPKFLKSGDAAIADMVPGKPMCVE SFSDYPPLGRFTVHDMRQTLAVGVIKAVDKKAAGAGKVTKSAQKAQKAK >gi568815597f:96346930_96548315|GENSCAN_predicted_CDS_7|1230_bp atgggaaaagaaaagactcatatcaacattgtcgtcattggacacgtagattcgggcaag tccaccacttctggccatctgatctacaaatgcggtggcatcgacaaaagaacaattgaa aaattgcagaaagaggctgctgagatgggaaagggctccttcaacaagtactatgtgact atcattgatgccccaggacacagagacttcatcaaaaacatgattacagggacatctcag gctggttgtgctgtcctaattgttgctgctggtgttggtgaatttgaagctggtatctcc aagaatgggcagacccgagagcatgcccttctggcatatacactgggtgtgaaacaacta attgttgatgttaacaaaatggattccactgagccaccctacagccagaagagatatgag gaaattgttaaggaaatcagcacttacattaagaaaattggctacaaccccaacacagga tggaaagtcacccataaggatggcaatgccagtggaaccatgctgcttgaggctctggac tgcatcctaccaccaactcgtccaactgacaagcccttgcgcctgcctctccaggatgtc tacaaaattggtggtattggtactgttcttgttggccgagtggagactggtattctcaaa cctggtatggtggtcacctttgctccagtcagcgttacaacagaagtaaaatctgtcgaa atgcaccatgaagctttgaatgaagctcttcctggggacaatatgggcttcaatgtcaag aatgtgtctgtcaaggatgttcatcgtggcaacgttgctggtgacagcaaaaatgaccca ccaatggaagcagctggcttcactgctcaggtgattatcctgaaccatccgggccaaata agctctggctatgcacctgtattggattgccacacagctcacattgcatgcaagtttgct gagctgaaggaaaagattgatcgccgttctggtaaaaagctggaagatggccctaaattc ttgaagtctggtgatgctgccattgctgatatggttcctggcaagcccatgtgtgttgag agcttctcagactatccacctttgggtcgctttactgttcatgatatgagacagacactt gcggtgggtgtcatcaaagcagtggacaagaaggctgctggagctggcaaggtcaccaag tctgcccagaaagctcagaaggctaaatga >gi568815597f:96346930_96548315|GENSCAN_predicted_peptide_8|251_aa MGHGGSVSSFERLNEALGFKKTVQRWCVTSRSFSPWNLTVALEPIQEGLQNTQAKGQCSL DLSVVPPSYVSERSLPPHHSTGMPNPTCGLVWRKALYLSYTLPKGMKMGVSGEFTSESSL TDSLPGITEAVSKCKSVSPIVPEFIVHQKALQWESGDRPKSKTLTTPNANEDVEQQELSL LVGIENGIATLEDSLAVSYKTKYALSMQSSNCPPWYLPKWLKTYVHTKTCTWMSIAALCI ISKTWKQPRYP >gi568815597f:96346930_96548315|GENSCAN_predicted_CDS_8|756_bp atgggtcatgggggatctgtgagtagttttgaacggctgaatgaagcattaggatttaag aagactgtgcagagatggtgtgtgacttctagaagcttttcaccctggaatctcactgtt gcactagagccaatccaagaaggtttacaaaacacgcaagccaaaggacaatgcagttta gacttaagtgtggtgccccctagctatgtctctgagcggtccctaccaccccaccacagt actggcatgcccaatcccacatgtggcctagtgtggaggaaggctttatacctctcctac acactgcccaaggggatgaaaatgggagtcagcggggaattcacatctgagtcatctctc accgactcactccctgggatcactgaggctgtgtctaaatgcaaaagtgtctcccccatt gttcctgaattcattgtgcatcagaaagcattgcaatgggagtcaggagacaggccaaaa tccaaaacactgaccacaccaaatgctaatgaggatgtggagcaacaggaactttcattg ctcgtgggaatagaaaatggtatagccactttagaagacagcttggcagtttcttacaaa actaaatatgctcttagcatgcaatccagtaactgccctccttggtatttacctaaatgg ctgaaaacttatgtccacacaaaaacatgcacgtggatgtctatagcagctttatgcata atttccaaaacttggaagcaaccaagatatccttga >gi568815597f:96346930_96548315|GENSCAN_predicted_peptide_9|135_aa MWESLELSRDLLNGYDQNADDDMDNEIQAELISVGDEELVGNWSKRDSCYALAKRLAAFC PCPRDLWNFELERDDLGYLAEEISKQQTIQEVTWVLLKAFSCIKEAEHSSSENLQPDNVI EKKNPLSEEKFKPAA >gi568815597f:96346930_96548315|GENSCAN_predicted_CDS_9|408_bp atgtgggaaagtttggaactctctagagacttgttgaatggctatgatcaaaatgctgat gatgatatggacaatgaaatacaggctgagttgatctcagttggagatgaggaacttgtt gggaattggagcaaacgtgactcttgttatgctttagcaaagagactggcagcattttgc ccctgccctagagatttgtggaactttgaacttgagagagatgatttagggtatctggca gaagaaatttctaagcagcaaaccattcaagaggtaacttgggtgctgttaaaggcattc agttgtataaaggaagcagagcatagcagttcagaaaatttgcagcctgacaatgtgata gaaaagaaaaacccactttctgaggagaaattcaagccagctgcataa >gi568815597f:96346930_96548315|GENSCAN_predicted_peptide_10|46_aa MGLTDPVLLTVGITDEETLNPGSLTPTCVDKATCANQTQGSFASQT >gi568815597f:96346930_96548315|GENSCAN_predicted_CDS_10|141_bp atgggtctgactgatccagtcctgcttacagtagggataacagatgaggagacactgaat ccaggatcactgacacctacatgtgtggataaagctacatgtgccaaccagacccaagga tcctttgccagccagacctaa