GENSCAN 1.0 Date run: 6-Nov-116 Time: 00:38:21 Sequence gi568815584f:92823360_93034881 : 211522 bp : 49.88% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4662 4974 313 2 1 59 54 147 0.769 5.89 1.02 Intr + 7729 7865 137 2 2 28 35 109 0.776 -0.11 1.03 Intr + 9763 9988 226 0 1 73 84 186 0.903 14.26 1.04 Intr + 12200 12305 106 0 1 78 87 128 0.888 10.97 1.05 Term + 30711 30933 223 0 1 -1 48 198 0.033 3.29 1.06 PlyA + 31350 31355 6 1.05 2.00 Prom + 31885 31924 40 -4.96 2.01 Sngl + 33683 34396 714 1 0 58 42 268 0.848 15.43 2.02 PlyA + 34759 34764 6 -0.45 3.00 Prom + 35133 35172 40 -2.46 3.01 Init + 50132 50315 184 1 1 83 75 91 0.878 6.58 3.02 Intr + 53283 53343 61 1 1 102 103 24 0.740 3.09 3.03 Term + 54822 55029 208 0 1 35 46 141 0.649 1.41 3.04 PlyA + 55250 55255 6 1.05 4.04 PlyA - 55748 55743 6 1.05 4.03 Term - 68380 68222 159 1 0 57 45 136 0.476 4.14 4.02 Intr - 73506 73457 50 1 2 94 75 32 0.447 0.90 4.01 Init - 75054 75009 46 0 1 90 92 47 0.591 4.34 4.00 Prom - 76755 76716 40 -5.66 5.00 Prom + 76880 76919 40 -4.26 5.01 Init + 79496 79665 170 1 2 100 78 23 0.346 1.51 5.02 Intr + 83878 84039 162 1 0 88 111 23 0.487 3.69 5.03 Intr + 92080 92188 109 1 1 12 110 69 0.175 1.79 5.04 Term + 95713 95865 153 0 0 135 45 79 0.815 6.22 5.05 PlyA + 96422 96427 6 1.05 6.00 Prom + 99755 99794 40 -1.76 6.01 Init + 100001 100046 46 1 1 87 105 137 0.996 14.14 6.02 Intr + 100840 100886 47 2 2 81 94 13 0.739 -0.67 6.03 Intr + 103246 103339 94 0 1 93 64 53 0.793 2.94 6.04 Intr + 104191 104259 69 2 0 108 63 17 0.553 0.35 6.05 Intr + 106358 106456 99 0 0 90 66 141 0.995 12.18 6.06 Intr + 107891 108394 504 0 0 97 35 445 0.750 33.05 6.07 Intr + 109011 109492 482 1 2 81 73 667 0.987 57.25 6.08 Term + 111442 111525 84 0 0 91 53 134 0.997 7.85 6.09 PlyA + 111904 111909 6 1.05 7.08 PlyA - 113577 113572 6 1.05 7.07 Term - 118545 118202 344 1 2 107 42 452 0.991 37.17 7.06 Intr - 123134 122972 163 2 1 103 109 430 0.999 46.15 7.05 Intr - 128654 128587 68 0 2 117 90 91 0.994 10.82 7.04 Intr - 135007 134842 166 1 1 87 93 330 0.173 32.93 7.03 Intr - 139036 138996 41 2 2 64 105 3 0.037 -2.46 7.02 Intr - 139490 139392 99 0 0 99 32 123 0.057 7.88 7.01 Init - 147798 147750 49 0 1 94 99 22 0.039 3.91 7.00 Prom - 149008 148969 40 -7.76 8.17 PlyA - 149238 149233 6 1.05 8.16 Term - 149968 149893 76 0 1 120 54 32 0.231 0.31 8.15 Intr - 158445 158320 126 2 0 102 71 120 0.769 11.49 8.14 Intr - 159594 159564 31 1 1 97 70 -6 0.434 -4.31 8.13 Intr - 161512 161366 147 2 0 110 38 35 0.261 0.91 8.12 Intr - 163302 163173 130 0 1 53 63 52 0.642 -0.53 8.11 Intr - 163782 163596 187 2 1 93 66 125 0.987 10.49 8.10 Intr - 169394 169286 109 0 1 18 95 108 0.907 3.84 8.09 Intr - 170638 170521 118 1 1 73 97 259 0.476 25.34 8.08 Intr - 170991 170920 72 0 0 94 76 30 0.550 2.00 8.07 Intr - 174584 174462 123 2 0 58 89 29 0.220 0.78 8.06 Intr - 185375 185282 94 1 1 90 64 47 0.475 2.47 8.05 Intr - 186427 186308 120 0 0 46 60 86 0.288 1.21 8.04 Intr - 191950 191837 114 0 0 85 99 31 0.046 3.46 8.03 Intr - 192287 192175 113 2 2 64 0 73 0.081 -4.72 8.02 Intr - 193442 193317 126 2 0 94 109 250 0.983 28.58 8.01 Intr - 207807 207703 105 0 0 98 78 5 0.142 0.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 31978 32634 657 0 0 49 36 194 0.855 6.48 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:92823360_93034881|GENSCAN_predicted_peptide_1|334_aa MQSEAASANGETAASYLGDLAKIIDENGCTKQQIFNVDETAFYWKKMLSRTFIAPSPGEE KSLPGFKASKDGLTLLLGANAAGDFKLKPVLIYNCGNPKTLKNDEWLKWKRLAVACVGED VVQLELSPTAGGSAQQHNHFGKLAVSDKANLTNKTLSNSSQSELENRLHQLTETLIQKQT MLESLSTEKNSLVFQLERLEQQMNSASGSSSNGSSINMSGIDNGEGTRLRNVPVLFNDTE TNLAGMYGKVRKAASSIDQFRITNAEKSLKDLMELKTKARELPDECTSLSSRFNQLEERV SVMEDEMNEMKREEKFREKRIKRNEQSLQEYGTT >gi568815584f:92823360_93034881|GENSCAN_predicted_CDS_1|1005_bp atgcaaagtgaagcagcaagtgctaatggagaaactgctgcaagttatctgggagatcta gctaagatcattgatgaaaatggctgcactaaacaacagattttcaatgtagatgaaaca gccttctattggaagaaaatgctatctaggactttcatagctccttctccaggagaggag aagtcactgcctggattcaaagcttcaaaggacgggctgactctcttgttaggggctaat gctgctggtgactttaagttaaagccagtgctcatttacaattgtggaaatcctaagacc cttaagaatgatgaatggctgaaatggaaaagattggcagtagcatgtgttggtgaggat gtggtgcagctggaactctcacccactgctggtggaagtgcacagcagcacaaccacttt ggaaaactggctgtttctgataaggctaaccttaccaataaaactttaagcaatagcagt cagtctgagttagaaaatcgactccatcagctaacagagactctcatccagaaacagacc atgctggagagtctcagcacagaaaagaactccctggtctttcaactggagcgcctcgaa cagcagatgaactccgcctctggaagtagtagtaatgggtcttcgattaatatgtctgga attgacaatggtgaaggcactcgtctgcgaaatgttcctgttctttttaatgacacagaa actaatctggcaggaatgtacggaaaagttcgcaaagctgctagttcaattgatcagttt agaataacgaacgcagagaaatccttaaaggacctgatggagctgaaaaccaaggcacga gaactacctgacgaatgcacaagcctcagtagccgattcaatcaactggaagaaagggta tcagtgatggaagatgaaatgaatgaaatgaagcgagaagagaagtttagagaaaaaaga ataaaaagaaacgaacaaagcctccaagaatatgggactacatga >gi568815584f:92823360_93034881|GENSCAN_predicted_peptide_2|237_aa MIVYLENPNVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGIINIVKMAILPKVIY RFNAIPIKLPMTFFTELEKTTLKFIWNQKRACIAKSILSQKNKDGGITLLDFKLCYKATV TKTAWYWYQNRDIDQRNRTEPSEIMPHIYNHLIFDKPDKNKQWGKDSLLINGAGKTG >gi568815584f:92823360_93034881|GENSCAN_predicted_CDS_2|714_bp atgattgtgtatctagaaaaccccaacgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggag aactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaatattcca tgctcatgggtaggaataatcaatatcgtgaaaatggccatactgcccaaggtaatttat agattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcctgcattgccaagtcaatcctaagccaa aagaacaaagatggaggcatcacgctacttgacttcaaactatgctacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagaccaacggaacagaacagag ccctcagaaataatgccacatatctacaaccatctgatctttgacaaacctgacaaaaac aagcaatggggaaaggattccctattaataaatggtgctgggaaaactggctag >gi568815584f:92823360_93034881|GENSCAN_predicted_peptide_3|150_aa MAEDEGRAKDVLRGGGKKSLCKGTPVCKAIGSRETYSLPREQHGKNPAPRFNYLHLALPL TGPGSHHSTFRRYEFGHCGCLIFVSKRPPAECQQRSAASKVAIDQRPFQRVPGIRGLLCV PGEGIVVQWKAPRAHCGGPTELRMQTGFTC >gi568815584f:92823360_93034881|GENSCAN_predicted_CDS_3|453_bp atggcagaagatgaaggaagagcaaaagatgtcttacgtggtggcgggaaaaagagcttg tgcaagggaactcccgtttgtaaagccatcggatctcgtgagacttactcactaccccga gaacagcatgggaaaaacccagccccacgattcaattatctccacctggccctgcccttg acaggacctggcagccaccattctactttccgtcgctatgaatttggccactgtggatgt ctcatttttgtttccaagaggccacctgctgagtgtcagcagcggtcagcggcctccaag gttgctattgatcaacgtccgttccagcgggttcctggcatccgaggcctgctctgtgtc ccgggagaagggattgtggtgcaatggaaagctcccagggcacactgcgggggccccaca gagctcaggatgcagacgggcttcacctgctga >gi568815584f:92823360_93034881|GENSCAN_predicted_peptide_4|84_aa MGLSFRLHPTQLVTLGVFLTPMFLLPVLSLLEMLQDINPRFLTLTFQCLRFRADPDADYD VQLYSPTLIQPGTEEVLNKYLLSP >gi568815584f:92823360_93034881|GENSCAN_predicted_CDS_4|255_bp atggggctcagtttccgcctccatcctacacagcttgtgacactgggtgtgtttctgacg cccatgttcttactcccagtcctgagtctgctggagatgctacaggatatcaaccccaga ttcctaaccctgaccttccagtgcctgagattcagggctgacccggatgcagactatgat gtgcagctctacagccccaccctgatccagccaggcacagaggaggttctcaacaaatat ttgctgagcccctga >gi568815584f:92823360_93034881|GENSCAN_predicted_peptide_5|197_aa MAQSNSVYYRCQGRSNACTKPRGNPTQTRHWPIKGSARLRTAVTLFSSNLLSRFGRRFCV GEMRTATLSHDDQTSLKLGGIDCFASMLVAASVLKTALSTYICTCLKCAIWDEVLIMEFR HRRKRSLLGQVDEDEDTRSVPGLNWVQVRMMIVTASQVREPFTCMDCFTLHTLVSNSIIL LLMGSHGKVAGMGRSGS >gi568815584f:92823360_93034881|GENSCAN_predicted_CDS_5|594_bp atggctcagagcaacagcgtttactacagatgccaagggagaagcaacgcctgtacgaag ccacgtggaaacccgacccaaacccggcactggccaatcaagggcagtgcgcgactcagg acagctgtcacactattttcttccaacctcctttcccgatttggaaggaggttctgtgtg ggtgagatgaggacagcaacgctctcccatgatgaccaaaccagcttgaaactgggaggc atcgactgttttgcctctatgctcgtggcagcctctgttttaaaaacagctttatcgaca tacatttgtacatgtttaaagtgcgcaatttgggatgaagtcctcatcatggaattcaga cacagaaggaagagaagtcttcttgggcaggtggatgaagatgaagacaccaggtcagtg cctggattgaactgggtccaggtgaggatgatgatagtgactgcatcacaagtacgtgag cctttcacctgcatggactgcttcactcttcacacacttgtgagcaattctatcatcctc ctcctcatgggaagtcatgggaaagtggctggcatgggccgctcagggagttga >gi568815584f:92823360_93034881|GENSCAN_predicted_peptide_6|474_aa MRSAAVLALLLCAGQVTALPVNSPMNKGDTEVMKCIVEVISDTLSKPSPMPVSQECFETL RGDERILSILRHQNLLKELQDLALQGAKERAHQQKKHSGFEDELSEVLENQSSQAELKEA VEEPSSKDVMEKREDSKEAEKSGEATDGARPQALPEPMQESKAEGNNQAPGEEEEEEEEA TNTHPPASLPSQKYPGPQAEGDSEGLSQGLVDREKGLSAEPGWQAKREEEEEEEEEAEAG EEAVPEEEGPTVVLNPHPSLGYKEIRKGESTYDGEDLNERVWERGMGRSEALAVDGAGKP GAEEAQDPEGKGEQEHSQQKEEEEEMAVVPQGLFRGGKSGELEQEEERLSKEWEDSKRWS KMDQLAKELTAEKRLEGQEEEEDNRDSSMKLSFRARAYGFRGPGPQLRRGWRPSSREDSL EAGLPLQVRGYPEEKKEEEGSANRRPEDQELESLSAIEAELEKVAHQLQALRRG >gi568815584f:92823360_93034881|GENSCAN_predicted_CDS_6|1425_bp atgcgctccgccgctgtcctggctcttctgctctgcgccgggcaagtcactgcgctccct gtgaacagccctatgaataaaggggataccgaggtgatgaaatgcatcgttgaggtcatc tccgacacactttccaagcccagccccatgcctgtcagccaggaatgttttgagacactc cgaggagatgaacggatcctttccattctgagacatcagaatttactgaaggagctccaa gacctcgctctccaaggcgccaaggagagggcacatcagcagaagaaacacagcggtttt gaagatgaactctcagaggttcttgagaaccagagcagccaggccgagctgaaagaggcg gtggaagagccatcatccaaggatgttatggagaaaagagaggattccaaggaggcagag aaaagtggtgaagccacagacggagccaggccccaggccctcccggagcccatgcaggag tccaaggctgaggggaacaatcaggcccctggggaggaagaggaggaggaggaggaggcc accaacacccaccctccagccagcctccccagccagaaatacccaggcccacaggccgag ggggacagtgagggcctctctcagggtctggtggacagagagaagggcctgagtgcagag ccagggtggcaggcaaagagagaagaggaggaggaggaggaggaggaggctgaggctgga gaggaggctgtccccgaggaagaaggccccactgtagtgctgaacccccacccgagcctt ggctacaaggagatccggaaaggcgagagtacgtatgatggcgaagacctcaacgaacgt gtctgggagagggggatgggtcggtcggaggctctggctgtggatggagctgggaagcct ggggctgaggaggctcaggaccccgaagggaagggagaacaggagcactcccagcagaaa gaggaggaggaggagatggcagtggtcccgcaaggcctcttccggggtgggaagagcgga gagctggagcaggaggaggagcggctctccaaggagtgggaggactccaaacgctggagc aagatggaccagctggccaaggagctgacggctgagaagcggctggaggggcaggaggag gaggaggacaaccgggacagttccatgaagctctccttccgggcccgggcctacggcttc aggggccctgggccgcagctgcgacgaggctggaggccatcctcccgggaggacagcctt gaggcgggcctgcccctccaggtccgaggctaccccgaggagaagaaagaggaggagggc agcgcaaaccgcagaccagaggaccaggagctggagagcctgtcggccattgaagcagag ctggagaaagtggcccaccagctgcaggcactacggcggggctga >gi568815584f:92823360_93034881|GENSCAN_predicted_peptide_7|309_aa MSSVGPLGGTARGLLPDDRICSPPFMELTSLCGDDTMRLLEKNGLTFPFICKTRVAHGTN SHEMAIVFNQEGLNAIQPPCVVQNFINHNAVLYKVFVVGESYTVVQRPSLKNFSAGTSDR ESIFFNSHNVSKPESSSVLTELDKIEGVFERPSDEVIRELSRALRQALGVSLFGIDIIIN NQTGQHAVIDINAFPGYEGVSEFFTDLLNHIATVLQGQSTAMAATGDVALLRHSKLLAEP AGGLVGERTCSASPGCCGSMMGQDAPWKAEADAGGTAKLPHQRLGCNAGVSPSFQQHCVA SLATKASSQ >gi568815584f:92823360_93034881|GENSCAN_predicted_CDS_7|930_bp atgagcagcgtggggcccttgggtggcacagcaagggggctgctcccagacgacaggatc tgctcgccacccttcatggagctcacgagcctgtgcggggatgacaccatgcggctgctg gagaagaacggcttgactttcccattcatttgcaaaaccagagtggctcatggcaccaac tctcacgagatggctatcgtgttcaaccaggagggcctgaacgccatccagccaccctgc gtggtccagaatttcatcaaccacaacgccgtcctgtacaaggtgttcgtggttggcgag tcctacaccgtggtccagaggccctcactcaagaacttctccgcaggcacatcagaccgt gagtccatcttcttcaacagccacaacgtgtcaaagccggagtcgtcatcggtcctgacg gagctggacaagatcgagggcgtgttcgagcggccgagcgacgaggtcatccgggagctc tcccgggccctgcggcaggcactgggcgtgtcactcttcggcatcgacatcatcatcaac aaccagacagggcagcacgccgtcattgacatcaatgccttcccaggctacgagggcgtg agcgagttcttcacagacctcctgaaccacatcgccactgtcctgcagggccagagcaca gccatggcagccacaggggacgtggccctgctgaggcacagcaagcttctggccgagccg gcgggcggcctggtgggcgagcggacatgcagcgccagccccggctgctgcggcagcatg atgggccaggacgcgccctggaaggctgaggccgacgcgggcggcaccgccaagctgccg caccagagactcggctgcaacgccggcgtgtcgcccagcttccagcagcattgtgtggcc tccctggccaccaaggcctcctcccagtag >gi568815584f:92823360_93034881|GENSCAN_predicted_peptide_8|596_aa AFLREALDSFASAAVTKGHKLGSLNSRNAPLPVPELNLSRPIEEQGPLDVIIHKLTDVIL EADQNDSQSLELVHRFQGMSHVVPSRASALPVAPSPFHLLNTVSLIGLETVGKTCPQCRL TSLVDKQEVEAWKHVQLVHRDPAEPGVSLDFLSWCRVTMRLLASRCYGRQSWTHLAALVP EKQIMSCSGVAVRLCARDKAHCVCLCLHVINPIATSPAKSTAGRILSGSLAKGQLSLLAS FHGLGASENPERACAVGGHRDALPQSPYQCGAWGRHSQRSQCLEWICFREYIDAHPETIV LDPLPAIRTLLDRSKSYELIRKIEAYMEGSPAPIPELDFHVHGQRALDLEEPEMGTTEDP GLESRTGVCAWRTVAREPGSAFPGEVVRSRPPKGTGRDETDAGFSAKLLISTAVGTVDRA AAPGILMRPPAQWLLDEIPHLVQVGLPQAFLQSFLKTSVPSQPTGTPPLGASVDGCCHCY CGIYRTVIIQNYKNCVPTTPVALEVTGSPCSFADIHRAPVFSLHSLPVTRPFAALIAHSF HCFVIVCWPLSSPPDRDSLEDRGECVFGDHDSAYGSLIPSINLSSHVHMCQRDGSY >gi568815584f:92823360_93034881|GENSCAN_predicted_CDS_8|1791_bp gccttccttagagaggccttggattcatttgccagtgctgctgtcacaaagggccacaaa ctgggcagcttaaatagcagaaatgcacccctcccagttccggagctgaaccttagccgg ccgatcgaggagcagggccccctggacgtcatcatccacaagctgactgacgtcatcctt gaagccgaccagaatgatagccagtccctggagctggtgcacaggttccagggcatgtcc cacgttgtgccctccagggcgtctgccctgccagtggccccaagtccattccacttgctg aacactgtctccttgatcggactggagactgtgggaaaaacatgccctcagtgcagactc accagccttgttgataagcaggaagtggaggcgtggaagcatgtgcagcttgtccacagg gacccagcagagccgggggtcagtcttgactttctcagctggtgccgtgtgacgatgaga ctgctggccagcagatgctacgggaggcagagttggacgcatctggcagccctggtccca gagaaacagatcatgtcctgttctggggttgctgtgaggctgtgtgcccgtgataaagcc cactgtgtctgcctttgtcttcatgtaataaatcccattgccacctcaccagccaagagc acagccggccggattctcagtggcagcttagcaaagggccagttgtctctgttggcatca ttccatgggcttggggcctctgaaaaccctgagagagcttgcgctgtaggtgggcaccgt gacgctctgccccagagtccttatcagtgcggggcatggggccgccactcccagaggagc cagtgcctggagtggatctgcttccgggagtacatcgatgcccaccctgagaccatcgtc ctggacccgctccctgccatcagaaccctgcttgaccgctccaagtcctatgagctcatc cggaagattgaggcctacatggaaggctccccggcccctattcctgagcttgactttcat gtccatggccagagggccctggatctagaagaacctgaaatgggtaccactgaggaccct ggcctggagagtaggactggggtatgtgcctggaggacagtggcccgggagcctggatca gcttttcctggagaggttgtcaggtccaggcctcctaagggcactggcagagatgaaaca gatgctggattctctgccaagctcctcatttccacagctgtgggcactgtggacagggca gcagcccctggcatcctgatgcgccctccagctcagtggcttctggacgagataccccac ttggtccaggtgggcctgccccaggccttcctgcagagctttctgaaaacctctgtcccc tctcaacccacaggcacccctcctctaggagcatccgtggatgggtgttgccattgttac tgtggtatttaccgaacagtcatcatacagaattataagaactgtgtcccaaccacccca gttgccctagaagtgacaggatccccttgcagctttgcagacatccacagggcaccagtg ttttctctccacagcttgcctgtcaccagaccctttgctgccctcatcgcacattcattc cactgtttcgtcattgtctgctggccgctgtcttccccacctgaccgtgacagccttgag gacaggggcgagtgtgtctttggggaccatgacagtgcttacggctcacttatcccatcc atcaacttatcttcacacgtgcacatgtgccagagagatggttcttactga