GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:49:54 Sequence gi568815587f:103936823_104138010 : 201188 bp : 36.82% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 14 9 6 -0.45 1.04 Term - 2633 1966 668 0 2 29 54 174 0.227 1.10 1.03 Intr - 6828 6630 199 0 1 37 116 247 0.784 20.40 1.02 Intr - 10902 10840 63 0 0 131 89 48 0.613 7.20 1.01 Init - 35462 35280 183 2 0 73 20 101 0.011 0.97 1.00 Prom - 38161 38122 40 -2.85 2.06 PlyA - 38920 38915 6 1.05 2.05 Term - 39901 39491 411 1 0 34 48 161 0.068 0.96 2.04 Intr - 50880 50824 57 0 0 101 82 23 0.453 1.16 2.03 Intr - 59423 59243 181 1 1 60 105 147 0.982 12.55 2.02 Intr - 63415 63229 187 2 1 18 97 186 0.597 10.33 2.01 Init - 64375 64294 82 1 1 77 14 76 0.765 0.28 2.00 Prom - 64563 64524 40 -9.15 3.00 Prom + 64635 64674 40 -7.75 3.01 Init + 67028 67095 68 1 2 99 75 -1 0.459 0.32 3.02 Intr + 69795 70003 209 0 2 44 85 209 0.649 14.00 3.03 Intr + 70054 70268 215 2 2 40 67 104 0.565 1.01 3.04 Term + 90555 90728 174 2 0 93 32 78 0.417 -0.52 3.05 PlyA + 91905 91910 6 1.05 4.00 Prom + 93248 93287 40 -5.65 4.01 Sngl + 100001 101191 1191 1 0 93 42 1331 0.966 124.64 4.02 PlyA + 101917 101922 6 1.05 5.07 PlyA - 102212 102207 6 1.05 5.06 Term - 105247 105221 27 1 0 118 49 20 0.024 -1.80 5.05 Intr - 113967 113708 260 1 2 87 27 135 0.139 3.56 5.04 Intr - 125580 125427 154 0 1 1 65 126 0.019 0.32 5.03 Intr - 130772 130703 70 1 1 92 83 48 0.010 2.97 5.02 Intr - 140546 140413 134 2 2 104 68 21 0.027 0.22 5.01 Init - 141418 141101 318 1 0 55 58 190 0.648 10.08 5.00 Prom - 141533 141494 40 -5.65 6.00 Prom + 147599 147638 40 -3.55 6.01 Init + 163547 163726 180 1 0 70 41 138 0.555 6.53 6.02 Intr + 164002 164683 682 0 1 -8 86 311 0.618 11.53 6.03 Intr + 164739 166040 1302 1 0 75 66 257 0.034 8.56 6.04 Term + 180157 180701 545 2 2 83 31 242 0.752 11.44 6.05 PlyA + 180821 180826 6 1.05 7.03 PlyA - 181166 181161 6 1.05 7.02 Term - 187623 187517 107 1 2 117 48 97 0.547 6.19 7.01 Intr - 189557 189393 165 0 0 50 64 81 0.329 0.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 164739 166051 1313 1 2 75 47 257 0.914 11.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:103936823_104138010|GENSCAN_predicted_peptide_1|370_aa MDVLLASGGRRTGMLVNILQYTGHPSPTLANTYSTRSVNNAEFEKTSSKRFVHILPFRVT LEDFQPAAASETNWESVTSSISGVSYNSPSVTDPTLIADALDKKIAEFDTVEDLLKYFNP ESWQEDLENMYLDTPRYRGRSYHDRKSKAQNLLKLISNFSKVSGYKINVQKSQAFLYTNN RQTESQITSELPFTIASKRIKYLGIQLTRDMKDLFKENYKPLLNEIKEDTKKWKNIPCSW VGIINIVKMAILPKVIYRFNAIPIKLPITFFTELEKTTLKFMWNQKRAHIAKSILSQKNK AGGIMLPDFKLHYKATVTKTTWYWYQNRDINQWNRTEPSEIMPHIYNYLIFDKPEKNKQW GKDSLFNKWC >gi568815587f:103936823_104138010|GENSCAN_predicted_CDS_1|1113_bp atggatgtgctactggcatctggtgggcggaggacaggaatgctggtgaatattctacaa tacacaggccacccatccccaactctagcaaatacttattcaacccgaagtgtaaataat gctgagtttgagaaaacctcctctaagcgatttgtacacattttaccattcagagttacc ctggaagatttccaacccgcagcagcttcagagaccaactgggaatctgtcacaagctct atttcaggggtatcctataactctccatcagtaacggatcccactctgattgcggatgct ctggacaaaaaaattgcagaatttgatacagtggaagatctgctcaagtacttcaatcca gagtcatggcaagaagatcttgagaatatgtatctggacacccctcggtatcgaggcagg tcataccatgaccggaagtcaaaagcccaaaatctcctcaagctgataagcaatttcagc aaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcctatacaccaataac agacaaacagagagccaaatcacgagtgaactcccattcacaattgcttcaaagagaata aaatacctaggaatccaacttacaagggacatgaaggacctcttcaaggagaactacaaa ccactgctcaatgaaataaaagaggatacaaagaaatggaagaacattccatgctcatgg gtaggaataatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattcaat gccatccccatcaagctaccaattactttcttcacagaattggaaaaaactactttaaag ttcatgtggaaccaaaaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaa gctggaggcatcatgctacctgacttcaaactacactacaaggctacagtaaccaaaaca acatggtactggtaccaaaacagagatatcaaccaatggaacagaacagagccctcagaa ataatgccgcatatctacaactatctgatctttgacaaacctgagaaaaacaagcaatgg ggaaaggattccctatttaataaatggtgctga >gi568815587f:103936823_104138010|GENSCAN_predicted_peptide_2|305_aa MIDGIIINMSGLNLQSVFHPQRLGERSDLYRRDETIQVKGNGYVQSPRFPNSYPRNLLLT WRLHSQENTRIQLVFDNQFGLEEAENDICRYDFVEVEDISETSTIIRGRWCGHKEVPPRI KSRTNQIKITFKSDDYFVAKPGFKIYYSLLLLGKRYKKPKGLKSSSFINFIEINYSVERI SIIRAIYEKPIANIVLNWQKLEAFHLKTGTRQGCPLLPLLFNILLEVLARAIRQEKEIKG IQIGREKVKLSLFADDMIVYLENPIVSAQKLLKLISNFSKVSGYKINVQKSQTFLYTNNK QRAKS >gi568815587f:103936823_104138010|GENSCAN_predicted_CDS_2|918_bp atgattgatggaatcattatcaacatgagtggactcaatcttcaatccgtctttcatccc cagaggttgggagagagatcagacttgtaccgaagagatgagaccatccaggtgaaagga aacggctacgtgcagagtcctagattcccgaacagctaccccaggaacctgctcctgaca tggcggcttcactctcaggagaatacacggatacagctagtgtttgacaatcagtttgga ttagaggaagcagaaaatgatatctgtaggtatgattttgtggaagttgaagatatatcc gaaaccagtaccattattagaggacgatggtgtggacacaaggaagttcctccaaggata aaatcaagaacgaaccaaattaaaatcacattcaagtccgatgactactttgtggctaaa cctggattcaagatttattattctttgctgctactaggcaaaagatacaagaagccaaaa ggactcaaaagtagcagtttcattaattttattgagataaactattcagtagaacgtatc tcaataataagagctatttatgagaaacccatagccaatatcgtactgaactggcaaaag ctggaagcattccatttgaaaactggcacaagacaaggatgccctctcttaccactcctg ttcaacatactattggaagttctggccagggcaatcaggcaagagaaagaaataaagggt attcagataggaagagagaaagtcaaattgtctctgtttgcagatgacatgattgtatat ttagaaaaccccatcgtctcagcccaaaaactcctgaagctgataagcaacttcagcaaa gtctcaggatacaaaatcaatgtgcaaaaatcacaaacattcctatacaccaataataaa caaagagccaaatcatga >gi568815587f:103936823_104138010|GENSCAN_predicted_peptide_3|221_aa MPWEWQMTEGVPTWSNGWEGSCVPQAPEADEEETNRSAEEWTNSVAERREGASEYQEEFS WGRLERRPAAGQPNSREEYIPTPSPFQLPIHPVHVEPDSSWMPDKDLGTKRAWSWLILKL SVDSKAKSAHCITCPFGLWELQIPTPGCCYGSGPQGSHLRAPPPVINYCFHSYEEKMTYF GRVLFYLQDKFKLSVKSTPKSYGKEIIVIIPILHKRIMRLG >gi568815587f:103936823_104138010|GENSCAN_predicted_CDS_3|666_bp atgccctgggagtggcagatgacagaaggagtgcccacctggagcaatggctgggaagga agctgtgtaccccaggctccagaagcagatgaggaggagacgaacagaagtgcagaagaa tggacgaacagcgtggcagagagaagagaaggagcatctgaatatcaagaggagttcagc tgggggcggctagagaggagacctgctgctggacagccaaactccagggaagagtatatt cccactccatcccccttccagctccctatccatcctgtccatgtggaacctgattcttcc tggatgccagacaaggacctgggtaccaagagggcatggagctggttaatacttaagctg tctgtggacagcaaggctaaaagtgcacactgtatcacgtgcccatttgggctttgggag ttgcagatacccactcctggatgttgctatgggagtggaccccaggggtctcatctgcgt gctccccctcctgttattaattattgttttcactcctacgaggaaaaaatgacatacttt ggtagagtgcttttctatttacaagacaaattcaaattatccgttaagtccacaccaaaa tcctatggaaaagagattattgtcatcatccccattttacataagaggataatgaggctc ggataa >gi568815587f:103936823_104138010|GENSCAN_predicted_peptide_4|396_aa MLITVYCVRRDLSEVTFSLQVSPDFELRNFKVLCEAESRVPVEEIQIIHMERLLIEDHCS LGSYGLKDGDIVVLLQKDNVGPRAPGRAPNQPRVDFSGIAVPGTSSSRPQHPGQQQQRTP AAQRSQGLASGEKVAGLQGLGSPALIRSMLLSNPHDLSLLKERNPPLAEALLSGSLETFS QVLMEQQREKALREQERLRLYTADPLDREAQAKIEEEIRQQNIEENMNIAIEEAPESFGQ VTMLYINCKVNGHPLKAFVDSGAQMTIMSQACAERCNIMRLVDRRWAGVAKGVGTQRIIG RVHLAQIQIEGDFLQCSFSILEDQPMDMLLGLDMLRRHQCSIDLKKNVLVIGTTGTQTYF LPEGELPLCSRMVSGQDESSDKEITHSVMDSGRKEH >gi568815587f:103936823_104138010|GENSCAN_predicted_CDS_4|1191_bp atgctgatcaccgtgtactgcgtgcggagggacctctccgaggtcaccttctctctccag gtcagccccgactttgagctccgaaacttcaaggtcctctgcgaagcggagtccagagtc cccgtcgaagagatccagatcatccacatggagcgactcctcatcgaggaccactgttcc ctgggctcctacggcctcaaagatggcgatatcgtggttttactgcagaaggacaatgtg ggacctcgggctccagggcgtgccccgaaccagcctcgtgtagacttcagtggcattgcg gtgcctgggacgtccagctcccgtccacagcaccctggacagcagcagcagcgcacaccc gctgcccagcggtcacagggcttggcgtcaggagagaaggtggccggcctgcaaggtctg ggcagccccgccctgatccgcagcatgctgctctccaacccccacgatctgtccctgctc aaggaacgcaaccctcccttggcggaagccctgctcagcggaagccttgagaccttttct caggtgctgatggagcagcaaagggaaaaggccttgagagagcaagagaggcttcgtctc tacacagccgacccactggatcgggaagctcaggccaaaatagaagaggaaatccggcag caaaacattgaagaaaacatgaatatagcgatagaagaggcccccgagagttttggacaa gtgacgatgctctacattaactgcaaagtgaatgggcatcctttgaaggcttttgttgac tcgggcgcccagatgaccattatgagccaggcttgtgccgagcgatgtaacatcatgagg ctggtggaccgacggtgggctggggttgctaaaggagtgggcacacagagaattattggc cgtgttcatctagctcagattcaaattgaaggtgatttcttacagtgctctttctccata cttgaggatcaacccatggatatgcttctaggcctagatatgctccggagacatcaatgt tccatcgatttgaagaaaaatgtgctggtcatcggcaccactggcacgcagacttatttt cttcctgagggagagttgcccttatgctctaggatggtaagtgggcaagatgagtcttcg gacaaggaaattacacattcagtcatggattcaggacgaaaagagcattaa >gi568815587f:103936823_104138010|GENSCAN_predicted_peptide_5|320_aa MHLMEKFPITQPLTHPTAMDPKKCFMDWLMSADLWSIEDTENEAFERQWQGLRCTVADAN LHQRPLRTSGITSIFPVPSALRQERTWDMSGETIGQHGSRPSSPAENLLSISTYTAALHP SFFISTLVDLSIFLTSLSPGLSPVMSSTEVFLVILLMEDQEILYDMFADTLDRKTESRKK DRKSQILRHAEKEQMEEFAFSVAKNRVEKYPRKNLPKVVTGMLVEGVCAAGDGGGTVICL ARPFRTLKDIEYPRPAVLNADICQDIPKCPTHFWVFPSGADPHVITLLDEQTEVQKGLVP CPRVYVQEVAKLAFASSKNN >gi568815587f:103936823_104138010|GENSCAN_predicted_CDS_5|963_bp atgcatttaatggagaagtttcccattacacaacccttaacccacccaacagccatggat ccaaagaagtgttttatggactggctgatgagtgcagacctgtggtcaatagaggacact gagaatgaggcatttgagagacaatggcaaggtctgaggtgcacagtggcagatgcaaac ctacatcaaaggccgctgaggaccagtggcataaccagcatatttcctgtgcccagtgct ctgagacaagagcggacttgggacatgtctggagagaccatagggcagcatggatcacgc cctagctcacctgctgagaatttgctctctatttctacttacactgctgccttacatcct tccttcttcatctctaccctggttgaccttagtatcttcttgacctccctgtctccaggc ttgtcaccagtcatgtcgtctacagaggtatttcttgttatcttactaatggaagatcaa gagattctttatgatatgtttgctgatactttggatagaaagacggagtcaaggaagaaa gataggaagagccagatcttacgacatgcagagaaagaacagatggaggaatttgccttt tctgtggccaagaacagggttgagaaatatcccaggaagaacttgcccaaggttgtaact ggaatgctggttgaaggagtatgtgctgctggagatgggggagggacagttatttgtctt gcaagacccttccgtacactaaaggacattgagtaccccagacctgcagtactgaatgct gatatttgccaggacatcccaaagtgccccacacatttctgggtgtttccgagtggagca gacccacacgttatcactttacttgatgagcaaactgaggtccagaaaggtttggtgcct tgcccaagggtgtatgtccaggaagtggcaaaactggcttttgcaagttcaaagaataac tag >gi568815587f:103936823_104138010|GENSCAN_predicted_peptide_6|902_aa MDKLLDTYTLPRLNQEEVESLNRTITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEQL HINTTNDKNHMIISIHAEKAFDKILQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILN GQKLEAFPLKTGTRQGCPLAPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDM IVYLENPIVSAQNLLKLIGNLRKVSGCKINVQKSQAFLHTNNRHTESQIMSELPLTIASK RIKYLGIQLTRDMKDLFKGNYKPLLNEIKEDMNKWKNQYRENGHTAQELEKTTLKFIWNQ KRALIAKSILSQKNKAGGSMLPDFKLYYKATVTKTAWYWYQQRDIDQWNRTEPPEIMPHI YNYLIFDKPDKNKKWGKDFLFNKWSWENWLAICRKLKLDPFLTPYTKINSRWIKDLHVRP KTIKTLEENLGNTIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRENR QPTEWENIFAIYSSDKGLISRIYNELKQIYKKKTKNPIKKWAKDMNRHFSKEDIYAAKKH MKICSSSLVIREMQIKTTMRYHLTPVRMAIIKKSGNNRCWRGCGEIGTLLHCWWDCKLVQ PLWKSVWPFLKDLELEIPFDPAIPLLDIYSKDYKSCCYKDTCTRMFIAALFTTAKTWNQP KCPSVIDWIKKMWHIYAVEYYAAIKKDDFMSFVGTWMKLETIILSKLLQGQKTKHRVFSV IVSSMSLLAVCKRTNIIIHEDQVGFIPGMQGWFNICKSINVIHHINRIKNKNQMIISIDR EKAFDKIHHPFMIKTLSKIDIQGTYLNVINAIYDKPTVKTILNEEKLKAFPLRTGTRQGC PPSPLLFNIVLEVLATAIRQEKEIKGIQIGKEEVELSLFADDMIAYVENHKDSSRKFLEL IK >gi568815587f:103936823_104138010|GENSCAN_predicted_CDS_6|2709_bp atggataaattactcgatacatacaccctcccaagactaaaccaggaagaagttgaatct ctgaatagaacaataacaggctctgaaattgaggcaataattaatagcttaccaaccaaa aaaagtccaggaccagatggattcacagctgaattctaccagaggtacaaggagcagctg catataaacacaaccaacgacaaaaaccacatgattatctcaatacatgcagaaaaggcc tttgacaaaattttacaacccttcatgctaaaaactctcaataaattaggtatcgatggg atgtatctcaaaataataagagctatttatgacaaacccacagccaatatcatactgaat gggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctcgca ccactcctattcaacatagtgttggaagttctggccagggcaatccggcaggagaaggaa ataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatg attgtatatctagaaaaccccattgtctcagcccaaaatctcctcaagctgataggcaac ctcaggaaagtctcaggatgcaaaatcaatgtgcaaaaatcacaagcatttttacacacc aataatagacacacagagagccaaatcatgagtgaacttccattgacaattgcttcaaag agaataaaatacctaggaatccaacttacaagggacatgaaggacctcttcaaggggaac tacaaaccactgctcaatgaaataaaagaggatatgaacaaatggaagaatcaatatcgt gaaaatggtcatactgcccaagaattggaaaaaactactttaaagttcatatggaaccaa aaaagagccctcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcagcatg ctacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtac caacaaagagatatagaccaatggaacagaacagagcccccagaaataatgccgcatatc tacaactatctgatctttgacaaacctgacaaaaacaagaaatggggaaaggatttccta tttaataaatggtcctgggaaaactggctagccatatgtagaaagctgaaactggatccc ttccttacaccttatacaaaaattaattcaagatggattaaagacttacatgttagacct aaaaccataaaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggc aaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgg gatctaattaaactaaaaagcttctgcacggcaaaagaaactaccataagagagaacagg caacctacagaatgggaaaacatttttgcaatctactcatctgacaaagggctaatatcc agaatctacaatgaactcaaacaaatttacaagaaaaaaacaaagaaccccatcaaaaag tgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacac atgaaaatatgctcatcatcactggtcatcagagaaatgcaaatcaaaaccacaatgaga taccatctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacaggtgctgg agaggatgtggagaaataggaacacttttacactgttggtgggactgtaaactagttcaa ccattgtggaagtcagtttggccattcctcaaggatctagaactagaaataccatttgac ccagccattccattactggatatatactcaaaggattataaatcatgctgctataaagac acatgcacacgtatgtttattgcagcactattcacaacagcaaagacttggaaccaaccc aaatgtccatcagtgatagactggattaagaaaatgtggcacatatacgccgtggaatac tatgcagccataaaaaaggatgatttcatgtcctttgtagggacatggatgaagctggaa accatcattctcagcaaactactgcaaggacaaaaaaccaaacaccgcgtgttctcagtc atagtctcgagtatgtctttattagcagtgtgcaaacggaccaatataataatccatgag gatcaagtaggtttcataccagggatgcagggatggtttaacatatgcaagtcaatcaat gtgatacaccacataaacagaattaaaaacaaaaatcaaatgatcatctcaatagacaga gaaaaagcatttgacaaaatccaccatccctttatgattaaaactctcagcaaaatcgac atacaagggacatacctcaatgtaataaatgctatttatgacaaacccacagtcaaaaca atactgaatgaagaaaagttgaaagcattccctctgagaactggaacaagacaaggatgc ccaccctcaccacttctcttcaacatagtactggaagtcctagccacagcaatcagacaa gagaaagaaataaagggcattcaaattggtaaagaggaagttgaactgtcactgtttgct gatgatatgattgcttatgtagaaaaccataaagattcctccagaaagttcctagaactg ataaaataa >gi568815587f:103936823_104138010|GENSCAN_predicted_peptide_7|90_aa XVRFLTAYGVPVLDPSPVGLEEGVPVSCSTEIQAGRSTGEFVPPAYRLLCPKCKGRISSE TTLMEVDTGLPVKGNSPPYYTARSQLVAQY >gi568815587f:103936823_104138010|GENSCAN_predicted_CDS_7|273_bp naagtgaggtttcttactgcttatggggtcccagtattagatcctagccctgttggactg gaggagggcgtccctgtgtcctgctccactgaaatccaggcagggaggagcacaggagag tttgtgcctccagcttacaggctgctctgtcccaagtgcaagggcagaatatcttctgag accactctaatggaagttgatacaggacttccagtaaaagggaattcaccaccatattat acagcccgttcacagttagtggctcaatattaa