GENSCAN 1.0    Date run: 6-Nov-116    Time: 02:22:48

Sequence gi568815575f:50810784_51016604 : 205821 bp : 39.91% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.03 Intr -   2511   2351  161  1  2   86    9   293 0.077  19.99
 1.02 Intr -   2897   2642  256  2  1   -5   46   316 0.758  14.19
 1.01 Init -   3235   3119  117  1  0   80  116    80 0.997  10.28
 1.00 Prom -   4854   4815   40                              -4.45

 2.00 Prom +   7522   7561   40                              -5.65
 2.01 Init +  17827  17981  155  0  2   79   47   106 0.581   5.10
 2.02 Intr +  28992  29216  225  0  0   99   61   122 0.693   6.78
 2.03 Intr +  29341  29468  128  1  2   97   93    25 0.711   3.30
 2.04 Term +  30923  31005   83  0  2   54   55    77 0.546  -2.12
 2.05 PlyA +  32846  32851    6                               1.05

 3.05 PlyA -  33602  33597    6                               1.05
 3.04 Term -  40024  39792  233  2  2   47   37   197 0.223   6.15
 3.03 Intr -  42286  42107  180  2  0   15   99    75 0.063   0.22
 3.02 Intr -  59187  59086  102  1  0   84   51    76 0.025   2.73
 3.01 Init -  61913  61832   82  2  1   94    5    83 0.311   1.78
 3.00 Prom -  68861  68822   40                              -2.35

 4.00 Prom +  83123  83162   40                              -5.05
 4.01 Init +  84901  84925   25  0  1  106   98    -2 0.295   2.34
 4.02 Term +  94608  95083  476  1  2  101   47   419 0.900  33.16
 4.03 PlyA +  96493  96498    6                               1.05

 5.00 Prom +  97539  97578   40                              -9.45
 5.01 Init + 100001 100328  328  1  1   73  105   309 0.841  28.63
 5.02 Term + 104974 105824  851  2  2  133   48   538 0.971  46.32
 5.03 PlyA + 106919 106924    6                               1.05

 6.03 PlyA - 107064 107059    6                               1.05
 6.02 Term - 116584 116345  240  1  0   48   47   151 0.788   2.04
 6.01 Init - 120559 120173  387  1  0   99   85   379 0.948  35.55
 6.00 Prom - 128096 128057   40                              -2.75

 7.04 PlyA - 128192 128187    6                               1.05
 7.03 Term - 134242 134012  231  1  0   32   39   119 0.043  -3.11
 7.02 Intr - 141939 141723  217  2  1   85  110   114 0.369  10.88
 7.01 Init - 152671 152571  101  1  2   43   79    61 0.060   0.52
 7.00 Prom - 157474 157435   40                              -4.95

 8.00 Prom + 169320 169359   40                              -6.15
 8.01 Init + 172251 172310   60  2  0   88   37    76 0.433   3.80
 8.02 Intr + 172998 173294  297  2  0  124   36   243 0.578  18.85
 8.03 Term + 180109 180183   75  0  0   86   43    73 0.126  -0.54
 8.04 PlyA + 180444 180449    6                               1.05

 9.04 PlyA - 180539 180534    6                               1.05
 9.03 Term - 184371 184277   95  1  2   69   48   115 0.038   2.61
 9.02 Intr - 191723 191644   80  1  2   86   56    93 0.025   4.18
 9.01 Init - 198418 198270  149  1  2   83   23    96 0.288   2.21
 9.00 Prom - 202360 202321   40                              -3.65

10.03 PlyA - 202644 202639    6                              -0.45
10.02 Term - 204000 202701 1300  2  1   10   49   306 0.714   8.96
10.01 Intr - 204595 204206  390  0  0   40   69   285 0.607  14.71

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -   2511   2336  176  1  2   86   49   261 0.915  18.94
S.002 Term + 122816 123084  269  0  2   55   49   230 0.964  10.47

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:50810784_51016604|GENSCAN_predicted_peptide_1|178_aa
MENRPGSFQYVPVQLQGGAPWGFTLKGGLEHCEPLTVSKNLARRHTREGAGAALAAPRRG
GYFPQFGPKEVEAVGDFSLEASASLRDSEGLKGQSSRLGSSLGSVVPSIDRRAVPPRQVA
GLGAESCPSAGEISCTLRSVEETCAMERSCAGATAVAAAAAAAAAAAAATAAAAAAAR

>gi568815575f:50810784_51016604|GENSCAN_predicted_CDS_1|534_bp
atggagaaccggcctgggtccttccagtacgtccctgtgcagctgcaagggggggcaccc
tggggcttcacccttaaggggggtctggaacactgtgagccgctcacagtgtctaagaac
cttgcgcgcagacacactcgcgagggcgctggggcagcccttgctgccccacgtcggggc
ggctacttccctcagtttgggccgaaggaagtggaggctgttggggatttctccttggag
gcctctgcgtccctgcgggacagtgaaggactgaagggacagtcatcgcggcttggcagc
tctcttggcagcgttgtcccctctatcgaccggcgggccgtccctcccaggcaggttgca
gggctaggggctgagtcctgccccagcgccggcgagatttcctgcacgttgagaagtgtg
gaggaaacttgcgccatggagcgcagctgtgctggagctactgctgttgctgctgccgct
gccgccgccgccgccgccgccgccgccactgccgccgccgccgccgccgccagg

>gi568815575f:50810784_51016604|GENSCAN_predicted_peptide_2|196_aa
MAKIAIKNYRFASYHLATPVETQIHFLKSPSESSRIEYMSEPTTMTKETESRIKAQIQDG
GQGRWPQCGLGTLTAAGEAAHVSLGRQRFRSPILDPTGFPWPPRDQAGLELGSQETDRAD
PRHQSVRFPSPAFLPSAPHVSSKWARMDLEAPFLSLQAQDSTCQNVLSAEVEGTLFSLAL
DHEIRICYTLRKEEKA

>gi568815575f:50810784_51016604|GENSCAN_predicted_CDS_2|591_bp
atggccaagatagccatcaagaactatagatttgcatcttaccatttagcaaccccagtg
gaaacacagatccattttcttaaaagtcccagcgaaagttccagaattgagtacatgtct
gaaccaaccacaatgaccaaagaaacagaatctaggattaaagctcagattcaggatggt
gggcaggggagatggccacagtgtggacttgggaccctgactgctgctggagaagcagcc
catgtgtctttgggaagacagaggttccggtctcccatcctggaccccacaggcttccct
tggccacccagggatcaggcaggactagagctgggcagccaagagacagacagagctgac
ccaaggcaccagagtgtcaggtttccctccccagccttcctgcccagtgctccccatgta
tcttccaagtgggctaggatggatctggaagctcccttcctctctctgcaggcacaggat
tccacttgtcagaacgttctttctgctgaagtggaagggactctgtttagccttgccctt
gaccatgaaattcgaatatgctacacattgagaaaagaggagaaggcctga

>gi568815575f:50810784_51016604|GENSCAN_predicted_peptide_3|198_aa
MSERESSGEEDKESSSSPATLVLKKEGTRLLFPKQNTLNSFSQSSSMPKELMLILVSEKY
YINENPVLNSRARGCFSGKTPEYSLDRLSAAELRGETLIATVHIHVCARSVQTAQRTEWE
AGSGQGEPTGRLQIWELVWDCPCGYLPMVHRHPPGDGSRGQPKWPPSSLGLGSDSGSLLM
LYLVLLTQCALDLIAMEK

>gi568815575f:50810784_51016604|GENSCAN_predicted_CDS_3|597_bp
atgagcgagagagagagcagtggggaggaggataaggagtcaagcagcagtccagcaact
ctggttctcaagaaggaagggactaggcttctcttccctaaacaaaatactctcaactct
ttcagccagtcttcatcaatgccaaaggagttgatgctgatactggtgtctgagaagtac
tacattaatgaaaatcctgtactgaatagcagggcaagggggtgtttctcaggtaaaacc
cctgagtatagtttggatcgtctgagtgcagcagagttaaggggtgagaccttaatagcc
acagtgcatatacatgtatgtgcccgaagtgttcagacagctcagagaactgaatgggag
gctggcagtgggcaaggtgaacccactgggcggttacaaatttgggagctcgtctgggat
tgcccttgtggctacctgcccatggttcatcgccatccacctggcgatggatccagaggc
cagcccaagtggccacctagttctcttggactggggtctgactctggttctctacttatg
ctctatctggtgctgctgacccaatgtgcattggatttaattgcaatggagaaatag

>gi568815575f:50810784_51016604|GENSCAN_predicted_peptide_4|166_aa
MDLTGDHWELHLKQKGDNEGGLYTMARTRWTARKSTGGIAPRKQLATKATCKSAPSTGGV
KKPHRYRPGTVALREIRHYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQRAAIGTLQEA
SEAYLVGLFEDTNLCAIHATRVTIMPKDIQLARHIRGERVSEYTMM

>gi568815575f:50810784_51016604|GENSCAN_predicted_CDS_4|501_bp
atggatttaactggagatcactgggagctccacctgaagcagaagggggataacgaagga
ggtctctataccatggctcgtacaaggtggactgcccgcaaatctaccggtggtatagca
cccaggaagcaactggctacaaaagccacttgtaagagtgcgccctctactggaggggtg
aagaaacctcatcgttacagacctggtactgtggcactccgtgaaattagacattatcag
aagtccactgaacttctgattcgcaaacttcccttccagcgtctggtgcgagaaatcgct
caggactttaaaacagatctgcgcttccagagggcagctatcggtactttgcaggaggca
agtgaggcctatctagttggcctttttgaagacaccaacctgtgtgctatccatgccaca
cgtgtaacaattatgccaaaagacatccagctagcacgccacatacgtggagaacgtgtt
tcagaatacaccatgatgtga

>gi568815575f:50810784_51016604|GENSCAN_predicted_peptide_5|392_aa
MVLLSILRILFLCELVLFMEHRAQMAEGGQSSIALLAEAPTLPLIEELLEESPGEQPRKP
RLLGHSLRYMLELYRRSADSHGHPRENRTIGATMVRLVKPLTNVARPHRGTWHIQILGFP
LRPNRGLYQLVRATVVYRHHLQLTRFNLSCHVEPWVQKNPTNHFPSSEGDSSKPSLMSNA
WKEMDITQLVQQRFWNNKGHRILRLRFMCQQQKDSGGLELWHGTSSLDIAFLLLYFNDTH
KSIRKAKFLPRGMEEFMERESLLRRTRQADGISAEVTASSSKHSGPENNQCSLHPFQISF
RQLGWDHWIIAPPFYTPNYCKGTCLRVLRDGLNSPNHAIIQNLINQLVDQSVPRPSCVPY
KYVPISVLMIEANGSILYKEYEGMIAESCTCR

>gi568815575f:50810784_51016604|GENSCAN_predicted_CDS_5|1179_bp
atggtcctcctcagtattcttagaattctttttctttgtgaactcgtgcttttcatggaa
cacagggcccaaatggcagaaggagggcagtcctctattgcccttctggctgaggcccct
actttgcccctgattgaggagctgctagaagaatcccctggcgaacagccaaggaagccc
cggctcctagggcattcactgcggtacatgctggagttgtaccggcgttcagctgactcg
catgggcaccctagagagaaccgcaccattggggccaccatggtgaggctggtgaagccc
ttgaccaatgtggcaaggcctcacagaggtacctggcatatacagatcctgggctttcct
ctcagaccaaaccgaggactataccaactagttagagccactgtggtttaccgccatcat
ctccaactaactcgcttcaatctctcctgccatgtggagccctgggtgcagaaaaaccca
accaaccacttcccttcctcagaaggagattcctcaaaaccttccctgatgtctaacgct
tggaaagagatggatatcacacaacttgttcagcaaaggttctggaataacaagggacac
aggatcctacgactccgttttatgtgtcagcagcaaaaagatagtggtggtcttgagctc
tggcatggcacttcatccttggacattgccttcttgttactctatttcaatgatactcat
aaaagcattcggaaggctaaatttcttcccaggggcatggaggagttcatggaaagggaa
tctcttctccggagaacccgacaagcagatggtatctcagctgaggttactgcctcttcc
tcaaaacatagcgggcctgaaaataaccagtgttccctccaccctttccaaatcagcttc
cgccagctgggttgggatcactggatcattgctccccctttctacaccccaaactactgt
aaaggaacttgtctccgagtactacgcgatggtctcaattcccccaatcacgccattatt
cagaaccttatcaatcagttggtggaccagagtgtcccccggccctcctgtgtcccgtat
aagtatgttccaattagtgtccttatgattgaggcaaatgggagtattttgtacaaggag
tatgagggtatgattgctgagtcttgtacatgcagatga

>gi568815575f:50810784_51016604|GENSCAN_predicted_peptide_6|208_aa
MAHYEREMKIYIPPKAETKMKFEDPNAPKSPPLAFFMFSSEDCPKIKEHPGLSISDVAKK
LGEMWNYIAEDDKHPYEKKAVKLKEKYEKDIAAFGGKGKPDAAKKGAIKAEKCKKKKEED
KMRNMKIKKANLSGPRLWTTLSAKPAPGFGPTPVPGKPPSDPVSRHIQGQSQSQDSRPQT
SPSTLAQDSIHSPSLQASTNKPILQASP

>gi568815575f:50810784_51016604|GENSCAN_predicted_CDS_6|627_bp
atggcccattatgaaagagaaatgaaaatctatatccctccaaaagcagagaccaaaatg
aagttcgaagatcccaatgcacccaagagccctcctttggcctttttcatgttctcttct
gaggattgcccaaaaatcaaagaacatcctggcctatcaattagtgatgttgcaaaaaaa
ctgggagagatgtggaattacattgctgaagatgacaagcacccttatgaaaagaaggct
gtgaagctgaaggaaaaatatgaaaaggatattgctgcatttggaggtaaaggaaaacct
gatgctgcaaaaaagggagccatcaaggctgaaaaatgcaagaaaaagaaggaagaggac
aaaatgaggaatatgaaaattaaaaaggcaaacctcagtggccccaggctctggaccacc
ctcagtgccaagccagcaccaggctttgggcccacaccagtgccaggcaagcccccctca
gacccagtctccaggcacatccagggccaaagccaatcccaggactctaggcctcaaact
tcacccagcaccctagcacaggacagcatccacagccctagtcttcaggccagcaccaat
aaacccatcctccaggccagcccctga

>gi568815575f:50810784_51016604|GENSCAN_predicted_peptide_7|182_aa
MKDPILLDLLSFSLAGFEEASFHVVRQLMERAMCCFSSLREFLVRFRKDKAFVLLLQRIT
QQVKTENHTSLEQVHTASSGTRNPPQECGLTSSRLPLSQGKGDQARCNHKKYITQIPIEK
HSPKFLTSAPQNCQHHQKQGKSENLSQPKGAPGDMMTKGTVLSWMGSWNRKKSIQSKLRK
SE

>gi568815575f:50810784_51016604|GENSCAN_predicted_CDS_7|549_bp
atgaaagaccccatcttgctagacttgctgtctttctctcttgctggctttgaagaagca
agtttccatgttgtgaggcagcttatggagagggctatgtgttgcttctcttccctgaga
gaattcctggttaggttccgtaaagataaggcctttgtgttactccttcagagaatcacc
caacaggtcaaaacagaaaaccatacttctttagaacaagttcatactgcttcctctggc
accagaaacccaccacaggaatgtgggctgacatcttccagactgccactgagccaggga
aagggggatcaggcaaggtgtaatcataagaagtatatcacacaaatcccaattgagaaa
cattctccaaaattcctaacaagtgcacctcaaaattgtcagcatcatcagaaacaagga
aagtctgagaacctgtcacagccaaaaggagccccaggagacatgatgactaaaggcact
gtgttgtcttggatgggatcctggaacagaaaaaaaagcattcagtcaaaactaagaaaa
tctgaataa

>gi568815575f:50810784_51016604|GENSCAN_predicted_peptide_8|143_aa
MADSCRSSQAQASCFMDKGHVIKLQVAMQPEPLMMAPSAGNLTQASEGALTAVSPKQHPC
QQEAVKIGLHTYPYSKGSWMCFFRGANEAAKWEGVPGKTPTSLHTGVEPWEVHAICNREI
NPDDWQHTSNFTLIVGEIHQGPF

>gi568815575f:50810784_51016604|GENSCAN_predicted_CDS_8|432_bp
atggctgattcctgccgatcaagccaagcccaagcttcttgtttcatggataaaggccac
gtcatcaaactccaagtggccatgcaaccagagcctctgatgatggccccttctgctggg
aaccttacacaggcctctgagggagctttaactgccgtttccccaaaacagcacccctgt
cagcaggaagcagttaagatcggtcttcatacttatccttattctaagggcagttggatg
tgcttctttagaggggcgaatgaggcagccaagtgggagggggtccctggaaaaactcca
accagcctgcacactggggtggagccttgggaagttcatgccatttgcaacagggagata
aacccagatgactggcagcacactagtaatttcaccctcattgtaggtgaaattcaccaa
ggacctttttaa

>gi568815575f:50810784_51016604|GENSCAN_predicted_peptide_9|107_aa
MDEAGSYHSQQTNTGTENQTLHVVIHKWELNNENIWAQEGEHHTPGPVEGGKGKGPRSGH
EAPLHLALTLSWIPQASNWLLPREARGREVVLLKPKFKWKEVAYLVS

>gi568815575f:50810784_51016604|GENSCAN_predicted_CDS_9|324_bp
atggatgaagctggaagctatcattctcagcaaactaacacaggaacagaaaaccaaaca
ctgcatgttgtcattcataagtgggagttgaacaatgagaacatatgggcacaggaaggg
gaacatcacacaccagggcctgtcgaggggggaaagggcaaagggccacgaagcggtcat
gaggctcctctccacctggcattgaccctcagctggattccacaggcatcaaactggctg
ctcccaagagaagcccgtggccgagaagttgtcctgctgaaaccaaaattcaagtggaag
gaggtggcttacttggtgtcatag

>gi568815575f:50810784_51016604|GENSCAN_predicted_peptide_10|563_aa
XLNQEETERLNRPITGSEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSI
EKEGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILDKILANRIQQHIKKLI
HHDQVGFIPGIQYHTEWAKTGSIPFENWHKTGMPSLTILFNIVLEVLTRAIRQEKEIKGI
QLGNEEVKLSLFPDNMIVYLENPIVSAQNLLKLIGNFSKVSGYKINVQKSQAFLYTNNRQ
TESQIMSELPFTIASKRVKYLGIQLTRDVKDLFKENYKPLLNEIKEDTKKCKNIPCSWVG
RIYIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRACIAKSILSQKNKAG
GITLPDFKLYYKAIVTKTAWYWYQNRDIDQWNRTEPSEVMSHIYNHLIFDKPDKNKKWGK
DSLFNKWCWENWLAVCRKVKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDI
GMGKDFMSKTPKAMATKAKIDKWDLIQLKSFCTAKETIIRVNRQPTEWEKIFVIYSSDKG
LISRIYNELKQIYKKKTTPSKSG
>gi568815575f:50810784_51016604|GENSCAN_predicted_CDS_10|1692_bp
nnactaaaccaggaagaaactgaacgtctgaatagaccaataacaggctctgaaattgag
gcaataattaatagcttaccaaccaaaaaaagtccaggaccagatggattcacagccgaa
ttctaccagaggtacaaggaggaactggtaccattccttctgaaactattccaatcaata
gaaaaagagggaatcctccctaactcattttatgaagccagcatcatcctgataccaaag
cctggcagagacacaacaaaaaaagagaattttagaccaatatccctgatgaatatcgat
gcaaaaatcctcgataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatc
caccatgatcaagtgggcttcatccctgggatccaatatcatactgaatgggcaaaaact
ggaagcattccctttgaaaactggcacaagacagggatgccgtctctcaccattctattc
aacatagtattggaagttctgaccagggcaatcaggcaggagaaggaaataaagggtatc
caattaggaaacgaggaagtcaaattgtccctgtttccagataacatgattgtatatcta
gaaaaccccattgtctcagcccaaaatctccttaagctgataggcaacttcagcaaagtc
tcaggatacaaaattaatgtgcaaaaatcacaagcattcttatacaccaataacagacaa
acagagagtcaaatcatgagtgaactcccattcacaattgcttcaaagagagtaaaatac
ctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactg
ctcaatgaaataaaagaggatacaaagaaatgtaagaacattccatgctcatgggtggga
agaatctatatcgtgaaaatggccatattgcccaaggtaatttatagattcaatgccatc
cccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcata
tggaaccaaaaaagagcatgcattgccaagtcaatcctaagccaaaagaacaaagctgga
ggcatcacgctacctgacttcaaactatactacaaggctatagtaaccaaaacagcatgg
tactggtaccaaaacagagatatagaccaatggaacagaacagagccctcagaagtaatg
tcgcatatctacaaccatctgatctttgacaaacctgacaaaaacaagaaatggggaaag
gattccctatttaataaatggtgctgggaaaactggctagccgtatgtagaaaggtgaaa
ctggatccctttcttacaccttatacaaaaattaattcaagatggattaaagacttaaat
gttagacctaaaaccataaaaaccctagaagaaaacctgggcaataccattcaggacata
ggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaatt
gacaaatgggatctaattcaactaaagagcttctgcacagcaaaagaaactatcatcaga
gtgaacaggcaacctacagaatgggaaaaaatttttgtaatctactcatctgacaaaggg
ctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaaacaaccccatca
aaaagtgggtga