GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:28:15 Sequence gi568815591f:73334073_73535780 : 201708 bp : 47.68% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6566 6760 195 1 0 154 109 234 0.999 31.71 1.02 Intr + 7201 7310 110 0 2 68 71 95 0.992 4.88 1.03 Term + 8735 8825 91 2 1 103 49 111 0.984 5.89 1.04 PlyA + 9661 9666 6 1.05 2.00 Prom + 17503 17542 40 -1.66 2.01 Init + 18434 18570 137 1 2 67 34 136 0.574 3.81 2.02 Term + 25380 25506 127 0 1 93 38 90 0.144 2.26 2.03 PlyA + 26328 26333 6 -0.45 3.00 Prom + 26612 26651 40 -2.46 3.01 Init + 32203 32483 281 0 2 74 87 152 0.406 8.19 3.02 Intr + 32871 32908 38 0 2 128 80 33 0.812 4.41 3.03 Term + 39741 39967 227 1 2 102 43 46 0.164 -1.56 3.04 PlyA + 40204 40209 6 1.05 4.03 PlyA - 41929 41924 6 1.05 4.02 Term - 42715 42598 118 0 1 84 46 82 0.125 1.61 4.01 Init - 61795 61743 53 1 2 93 43 97 0.074 4.37 4.00 Prom - 84164 84125 40 0.14 5.00 Prom + 94748 94787 40 -5.96 5.01 Sngl + 99936 101711 1776 2 0 88 42 3224 0.999 310.38 5.02 PlyA + 102030 102035 6 1.05 6.19 PlyA - 103283 103278 6 1.05 6.18 Term - 108481 108124 358 0 1 95 44 367 0.496 26.98 6.17 Intr - 108756 108653 104 0 2 129 109 186 0.999 23.77 6.16 Intr - 110057 109912 146 0 2 71 85 114 0.927 9.40 6.15 Intr - 113307 113192 116 2 2 88 95 279 0.997 28.69 6.14 Intr - 115617 115470 148 1 1 109 94 93 0.989 11.29 6.13 Intr - 116922 116775 148 0 1 67 71 118 0.985 7.81 6.12 Intr - 125646 125464 183 0 0 81 63 137 0.891 10.48 6.11 Intr - 129027 128850 178 2 1 94 57 125 0.997 9.92 6.10 Intr - 131493 131367 127 1 1 104 94 29 0.706 4.84 6.09 Intr - 135578 135445 134 1 2 48 73 104 0.937 5.19 6.08 Intr - 136411 136273 139 2 1 62 68 178 0.999 12.72 6.07 Intr - 144497 142796 1702 2 1 67 93 1363 0.799 122.02 6.06 Intr - 155319 155122 198 0 0 77 116 147 0.988 15.95 6.05 Intr - 158849 158728 122 0 2 36 96 11 0.765 -3.09 6.04 Intr - 164626 164425 202 1 1 110 30 253 0.726 20.56 6.03 Intr - 174399 174255 145 2 1 27 105 83 0.141 4.18 6.02 Intr - 176780 176664 117 1 0 28 87 91 0.130 2.58 6.01 Init - 187861 187707 155 1 2 116 37 162 0.082 13.38 6.00 Prom - 200493 200454 40 -2.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:73334073_73535780|GENSCAN_predicted_peptide_1|131_aa ALLLLRRRSAPPEEQHLVEAAKLPVLLNLSFTYLKLDRPTIALCYGEQALIIDQKNAKAL FRCGQACLLLTEYQKARDFLVRAQKEQPFNHDINNELKKLASCYRDYVDKEKEMWHRMFA PCGDGSTAGES >gi568815591f:73334073_73535780|GENSCAN_predicted_CDS_1|396_bp gccctattgcttctgcgccggcgatcagcaccccctgaagagcagcacctggtggaggcc gccaagcttcctgttctcctgaacctgtcctttacatacctgaagctagaccgacccacc atagccctgtgctatggagagcaggctttgatcattgaccaaaagaatgccaaggccctc ttcaggtgtggacaggcttgtcttctcctgactgagtatcaaaaggcccgggattttcta gttcgagcccagaaggagcaacccttcaatcatgacatcaataatgagctgaagaaactg gctagctgttacagggactatgtggataaagagaaagaaatgtggcaccgcatgttcgcg ccctgtggcgatggttctacagcaggagaaagttga >gi568815591f:73334073_73535780|GENSCAN_predicted_peptide_2|87_aa MSPRQLCWLTPILQVLILFSRPLVVAVIDCHTFDAPLRSQAATDPGVPSTLASPLSASDS LPRCRPISLLICSGHGIQQAFLCPDRT >gi568815591f:73334073_73535780|GENSCAN_predicted_CDS_2|264_bp atgagtccccgccagctttgctggctgacccccattctgcaggtgctgatcctgttctca agacctttggttgtagctgttattgactgtcatacttttgatgctcccctacgctcccaa gcagccaccgacccaggggtgccgtctacactggcttctccactctcagccagtgactcc ttgccccgctgccgtcccatctcgctgctgatctgctcgggccatgggatccagcaggca tttctttgtcccgatcggacctga >gi568815591f:73334073_73535780|GENSCAN_predicted_peptide_3|181_aa MRAVPRELLSLCAGRAVGAQAVPVECEVADADLHVGSHIEVGDEMEKSQRWLQSRSVPEQ ARQTWVGWFFSKRLLHRSRDRHTHSVDLSTLHTSPIRLEALGFVASDASSWVNSSTPKAP ASIFTSEPPEPAPPASPSSSLTAPPWTFSSTPLGPQNYPFRFLLRNLPPTFLRVVGDSSR G >gi568815591f:73334073_73535780|GENSCAN_predicted_CDS_3|546_bp atgagggctgtcccaagggagcttctgtcactctgtgctgggcgcgctgtgggtgctcaa gcagtgccagtggagtgtgaagtcgctgacgctgaccttcatgtgggctctcacattgag gtgggcgatgagatggagaagtctcagaggtggctgcagtcgcggtctgttcctgaacag gctcgccaaacttgggttggttggtttttttcaaaacgcctcctacacaggtcaagagac cgccacacacacagtgtggacctcagtaccctccacacaagccccatccgacttgaggcc ttgggctttgtggcttcagatgcttcgtcttgggtaaactcctcgactcccaaggcccca gcctccatctttacctcagagcctcctgaacctgctcctccagcctcaccttcctccagc ctcaccgctcctccctggactttcagctccacacccctggggcctcagaactaccccttc cggtttctcctgcgtaaccttccgcctaccttcctgagagtggttggtgacagcagccgg ggctag >gi568815591f:73334073_73535780|GENSCAN_predicted_peptide_4|56_aa MPWALRVEALVLAEAAGTTSSRGSYLQRRSSTKPGGQSGSTTELLIGPGMSSDESS >gi568815591f:73334073_73535780|GENSCAN_predicted_CDS_4|171_bp atgccctgggcgctgagggtagaggctctcgtcctggcagaggctgcaggcacgacctcc tcacgtggctcctacctccagcgcagaagcagcactaaaccaggtggtcaatcagggagc accaccgagcttctgattggtccagggatgagcagtgatgagtcaagctaa >gi568815591f:73334073_73535780|GENSCAN_predicted_peptide_5|591_aa MAVAPLRGALLLWQLLAAGGAALEIGRFDPERGRGAAPCQAVEIPMCRGIGYNLTRMPNL LGHTSQGEAAAELAEFAPLVQYGCHSHLRFFLCSLYAPMCTDQVSTPIPACRPMCEQARL RCAPIMEQFNFGWPDSLDCARLPTRNDPHALCMEAPENATAGPAEPHKGLGMLPVAPRPA RPPGDLGPGAGGSGTCENPEKFQYVEKSRSCAPRCGPGVEVFWSRRDKDFALVWMAVWSA LCFFSTAFTVLTFLLEPHRFQYPERPIIFLSMCYNVYSLAFLIRAVAGAQSVACDQEAGA LYVIQEGLENTGCTLVFLLLYYFGMASSLWWVVLTLTWFLAAGKKWGHEAIEAHGSYFHM AAWGLPALKTIVILTLRKVAGDELTGLCYVASTDAAALTGFVLVPLSGYLVLGSSFLLTG FVALFHIRKIMKTGGTNTEKLEKLMVKIGVFSILYTVPATCVIVCYVYERLNMDFWRLRA TEQPCAAAAGPGGRRDCSLPGGSVPTVAVFMLKIFMSLVVGITSGVWVWSSKTFQTWQSL CYRKIAAGRARAKACRAPGSYGRGTHCHYKAPTVVLHMTKTDPSLENPTHL >gi568815591f:73334073_73535780|GENSCAN_predicted_CDS_5|1776_bp atggccgtggcgcctctgcggggggcgctgctgctgtggcagctgctggcggcgggcggc gcggcactggagatcggccgcttcgacccggagcgcgggcgcggggctgcgccgtgccag gcggtggagatccccatgtgccgcggcatcggctacaacctgacccgcatgcccaacctg ctgggccacacgtcgcagggcgaggcggctgccgagctagcggagttcgcgccgctggtg cagtacggctgccacagccacctgcgcttcttcctgtgctcgctctacgcgcccatgtgc accgaccaggtctcgacgcccattcccgcctgccggcccatgtgcgagcaggcgcgcctg cgctgcgcgcccatcatggagcagttcaacttcggctggccggactcgctcgactgcgcc cggctgcccacgcgcaacgacccgcacgcgctgtgcatggaggcgcccgagaacgccacg gccggccccgcggagccccacaagggcctgggcatgctgcccgtggcgccgcggcccgcg cgccctcccggagacctgggcccgggcgcgggcggcagtggcacctgcgagaaccccgag aagttccagtacgtggagaagagccgctcgtgcgcaccgcgctgcgggcccggcgtcgag gtgttctggtcccggcgcgacaaggacttcgcgctggtctggatggccgtgtggtcggcg ctgtgcttcttctccaccgccttcactgtgctcaccttcttgctggagccccaccgcttc cagtaccccgagcgccccatcatcttcctctccatgtgctacaacgtctactcgctggcc ttcctgatccgtgcggtggccggagcgcagagcgtggcctgtgaccaggaggcgggcgcg ctctacgtgatccaggagggcctggagaacacgggctgcacgctggtcttcctactgctc tactacttcggcatggccagctcgctctggtgggtggtcctgacgctcacctggttcctg gctgccgggaagaaatggggccacgaggccatcgaggcccacggcagctatttccacatg gctgcctggggcctgcccgcgctcaagaccatcgtcatcctgaccctgcgcaaggtggcg ggtgatgagctgactgggctttgctacgtggccagcacggatgcagcagcgctcacgggc ttcgtgctggtgcccctctctggctacctggtgctgggcagtagtttcctcctgaccggc ttcgtggccctcttccacatccgcaagatcatgaagacgggcggcaccaacacagagaag ctggagaagctcatggtcaagatcggggtcttctccatcctctacacggtgcccgccacc tgcgtcatcgtttgctatgtctacgaacgcctcaacatggacttctggcgccttcgggcc acagagcagccatgcgcagcggccgcggggcccggaggccggagggactgctcgctgcca gggggctcggtgcccaccgtggcggtcttcatgctcaaaattttcatgtcactggtggtg gggatcaccagcggcgtctgggtgtggagctccaagactttccagacctggcagagcctg tgctaccgcaagatagcagctggccgggcccgggccaaggcctgccgcgcccccgggagc tacggacgtggcacgcactgccactataaggctcccaccgtggtcttgcacatgactaag acggacccctctttggagaaccccacacacctctag >gi568815591f:73334073_73535780|GENSCAN_predicted_peptide_6|1473_aa MAPLLGRKPFPLVKPLPGEEPLFTIPHTQEAFRTREYPFPAARAGLGRAGPGEYEARLER YSERIWTCKSTGSSQLTHKEAWEEEQEVAELLKEEFPAWYEKLVLEMVHHNTASLEKLVD TAWLEIMTKYAVGEECDFEVGKEKMLKVKIVKIHPLEKVDEEATEKKSDGACDSPSSDKE NSSQIAQDHQKKETVVKEDEGRRESINDRARRSPRKLPTSLKKGERKWAPPKFLPHKYDV KLQNEDKIISNVPADSLIRTERPPNKEIVRYFIRHNALRAGTGENAPWVVEDELVKKYSL PSKFSDFLLDPYKYMTLNPSTKRKNTGSPDRKPSKKSKTDNSSLSSPLNPKLWCHVHLKK SLSGSPLKVKNSKNSKSPEEHLEEMMKMMSPNKLHTNFHIPKKGPPAKKPGKHSDKPLKA KGRSKGILNGQKSTGNSKSPKKGLKTPKTKMKQMTLLDMAKGTQKMTRAPRNSGGTPRTS SKPHKHLPPAALHLIAYYKENKDREDKRSALSCVISKTARLLSSEDRARLPEELRSLVQK RYELLEHKKRWASMSEEQRKEYLKKKREELKKKLKEKAKERREKEMLERLEKQKRYEDQE LTGKNLPAFRLVDTPEGLPNTLFGDVAMVVEFLSCYSGLLLPDAQYPITAVSLMEALSAD KGGFLYLNRVLVILLQTLLQDEIAEDYGELGMKLSEIPLTLHSVSELVRLCLRRSDVQEE SEGSDTDDNKDSAAFEDNEVQDEFLEKLETSEFFELTSEEKLQILTALCHRILMTYSVQD HMETRQQMSAELWKERLAVLKEENDKKRAEKQKRKEMEAKNKENGKVENGLGKTDRKKEI VKFEPQVDTEAEDMISAVKSRRLLAIQAKKEREIQEREMKVKLERQAEEERIRKHKAAAE KAFQEGIAKAKLVMRRTPIGTDRNHNRYWLFSDEVPGLFIEKGWVHDSIDYRFNHHCKDH TVSGDEDYCPRKLSGLFCPYRFLCDSQKELDELLNCLHPQGIRESQLKERLEKRYQDIIH SIHLARKPNLGLKSCDGNQELLNFLRSDLIEVATRLQKGGLGYVEETSEFEARVISLEKL KDFGECVIALQASVIKKFLQGFMAPKQKRRKLQSEDSAKTEEVDEEKKMVEEAKVASALE KWKTAIREAQTFSRMHVLLGMLDACIKWDMSAENARCKVCRKKGEDDKLILCDECNKAFH LFCLRPALYEVPDGEWQCPACQPATARRNSRGRNYTEESASEDSEDDESDEEEEEEEEEE EEEDYEVAGLRLRPRKTIRGKHSVIPPAARSGRRPGKKPHSTRRSQPKAPPVDDAEVDEL VLQTKRSSRRQSLELQKCEEILHKIVKYRFSWPFREPVTRDEAEDYYDVITHPMDFQTVQ NKCSCGSYRSVQEFLTDMKQVFTNAEVYNCRGSHVLSCMVKTEQCLVALLHKHLPGHPYV RRKRKKFPDRLAEDEGDSEPEAVGQSRGRRQKK >gi568815591f:73334073_73535780|GENSCAN_predicted_CDS_6|4422_bp atggcgccgctcctgggccgcaagcccttcccgctggtgaagccgttgcccggagaggag ccgctcttcaccatcccgcacactcaggaggccttccgcacccgggagtatccttttcca gccgcgcgggccgggctgggccgggctgggccgggagagtatgaagcccgcttggaaagg tacagtgagcgcatttggacgtgcaagagtactggaagcagtcagctaacacacaaggaa gcctgggaggaagaacaggaagttgctgagcttttgaaggaggagtttcctgcctggtat gagaagcttgttctggaaatggttcaccataacacagcctccttagagaagttagtagat actgcttggttggagatcatgaccaaatatgctgtgggagaagagtgtgacttcgaggtt gggaaggagaaaatgctcaaggtgaagattgtgaagattcatcctttggagaaagtggat gaagaggccactgagaagaaatctgatggtgcctgtgattctccatcaagtgacaaagag aactccagtcagattgctcaggaccatcagaagaaggagacagttgtgaaagaggatgaa ggaaggagagagagtattaatgacagagcacgtagatcgccacgaaaacttcctacttca ttaaaaaaaggagaaaggaaatgggctcctccaaaatttctgcctcacaaatatgatgtg aaactacaaaatgaagataagatcatcagtaacgtgccagcagacagcttgattcgtaca gagcgcccaccaaataaggagatagttcgatactttatacggcataatgcattacgagct ggtactggtgaaaatgcaccttgggtcgtagaagatgaattggtgaagaaatactctctg cccagcaagttcagtgactttttacttgatccatacaagtatatgactctcaacccttct actaagaggaagaatactggatccccagacaggaagccctcaaagaaatccaagacagac aactcttctcttagttcaccactaaatcctaagttatggtgtcacgtacacttgaagaag tcattgagtggctcgccactcaaagtgaagaactcaaagaattccaaatctcctgaagaa catctagaagaaatgatgaagatgatgtcgcccaataagctgcacactaactttcacatt cctaaaaaaggcccacctgccaagaaaccagggaagcacagtgacaagcctttgaaggca aagggcagaagcaaaggcatcctgaatggacagaaatccacagggaattccaaatctccc aaaaaaggactgaagactcctaaaaccaaaatgaagcagatgactttgttggatatggcc aaaggcacgcagaagatgacacgagccccacggaattctgggggtacacctaggacctct agtaaacctcataaacatctgcctcctgcagccctacacctcattgcatactacaaagaa aacaaagacagggaggacaagaggagcgccctgtcctgtgttatctccaaaacagctcgt cttctctctagtgaagatagagctcgtctcccagaagaattgcgaagtcttgttcaaaaa cgctatgaacttctagagcacaaaaagaggtgggcttctatgtctgaagaacaacggaaa gaatatttgaaaaagaaacgggaggagctgaaaaagaagttgaaggaaaaagccaaagaa cgaagagagaaagaaatgcttgagagattagaaaaacagaagcggtatgaggaccaagag ttaactggcaaaaaccttccagcattcagattggtggatacccctgaagggctgcccaac acgctgtttggggatgtggccatggtggtggaattcttgagctgttattctgggctactt ttaccagatgctcagtatcctattactgctgtgtcccttatggaagccttgagtgcagat aagggtggctttttataccttaacagggtgttggtcatcctcttacagaccctcctacaa gatgagatagcagaagactatggtgaattgggaatgaagctgtcggaaatccccttgact ctgcattctgtttcagagctggtgcggctctgcttgcgcagatctgatgttcaggaggaa agcgagggctcagacacagatgacaataaagattcagctgcatttgaggataatgaggta caagatgagttcctagaaaagctggagacctctgaattttttgagctgacgtcagaggag aagctacagatcttgacagcactgtgccaccggatcctcatgacatactcagtgcaagac cacatggagaccagacagcagatgtctgcagagttgtggaaggaacggcttgctgtgttg aaggaagaaaatgataagaagagagcagagaaacagaaacggaaagaaatggaagccaaa aataaagaaaatggaaaagttgagaatgggttaggcaaaactgataggaaaaaagaaatt gtgaagtttgagccccaagtagatacagaagctgaagacatgattagtgctgtgaagagc agaaggttgcttgccattcaagctaagaaggaacgggaaatccaggaaagagaaatgaaa gtgaaactggaacgccaagctgaagaagaacgaatacggaagcacaaagcagctgctgag aaagctttccaggaagggattgccaaggccaaactagtcatgcgcaggactcctattggc acagatcgaaaccataatagatactggctcttctcagatgaagttccaggattattcatt gaaaaaggctgggtacatgacagcattgactaccgattcaaccatcactgcaaagaccac acagtctctggtgatgaggattactgtcctcgcaagttatcaggtctcttttgcccctac aggtttttatgtgatagtcaaaaggagctggatgagttgctaaactgtcttcaccctcag ggaataagagaaagtcaacttaaagagagactagagaagaggtaccaggacattattcac tctattcatctagcacggaagccaaatttgggtctaaaatcttgtgatggcaaccaggag cttttaaacttccttcgtagtgatctcattgaagttgcaacaaggttacaaaaaggagga cttggatatgtggaagaaacatcagaatttgaagcccgggtcatttcattagagaaattg aaggattttggtgagtgtgtgattgcccttcaggccagtgtcataaagaaatttctccaa ggcttcatggctcccaagcaaaagagaagaaaactccaaagtgaagattcagcaaaaact gaggaagtggatgaagagaagaaaatggtagaggaagcaaaggttgcatctgcactggag aaatggaagacagcaatccgggaagctcagactttctccaggatgcacgtgctgcttggg atgcttgatgcctgtatcaagtgggatatgtccgcagaaaatgctaggtgcaaagtttgt cgaaagaaaggtgaggatgacaaattgatcttgtgtgatgagtgtaataaagccttccac ctgttttgtctgaggccggccctctatgaagtaccagatggtgagtggcagtgcccagct tgccagcccgctactgccaggcgcaactcccgtggcaggaactatactgaagagtctgct tctgaggacagtgaagatgatgagagtgatgaagaggaggaggaggaagaagaggaggag gaggaagaagattatgaggtggctggtttgcgattgagacctcgaaagaccatccggggc aagcacagcgtcatcccccctgcagcaaggtcaggccggcgcccgggtaagaagccacac tctaccaggaggtctcagcccaaggcaccacctgtggatgatgctgaggtggatgagctg gtgcttcagaccaagcggagctcccggaggcaaagcctggagctgcagaagtgtgaagag atcctccacaagatcgtgaagtaccgcttcagctggcccttcagggagcctgtgaccaga gatgaggccgaggactactatgatgtgatcacgcaccccatggactttcagacagtgcag aacaaatgttcctgtgggagctaccgctctgtgcaggagtttcttactgacatgaagcaa gtgtttaccaatgctgaggtttacaactgccgtggcagccatgtgctaagctgcatggtg aagacagaacagtgtctagtggctctgttgcataaacaccttcctggccacccatatgtc cgcaggaagcgcaagaagtttcctgataggcttgctgaagatgaaggggacagtgagcca gaggccgttggacagtccaggggacgaagacagaagaagtag