GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:20:44

Sequence gi568815576f:31299684_31533968 : 234285 bp : 45.58% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  17937  18003   67  2  1   74   62   107 0.885   7.93
 1.02 Term +  21866  21993  128  0  2   22   53   115 0.584  -0.16
 1.03 PlyA +  22772  22777    6                               1.05

 2.05 PlyA -  23093  23088    6                               1.05
 2.04 Term -  27626  27208  419  0  2  100   54   386 0.943  31.84
 2.03 Intr -  36180  36009  172  0  1  105   74   167 0.703  16.52
 2.02 Intr -  43277  43214   64  1  1  108   91    35 0.812   4.52
 2.01 Init -  45919  44649 1271  1  2  111   94  1499 0.987 144.58
 2.00 Prom -  64707  64668   40                              -0.96

 3.02 PlyA -  66741  66736    6                               1.05
 3.01 Sngl -  85908  85543  366  0  0   76   41   169 0.976   7.10
 3.00 Prom -  88811  88772   40                              -5.96

 4.00 Prom +  90318  90357   40                              -4.96
 4.01 Init + 100001 100042   42  1  0   98   80    33 0.484   3.92
 4.02 Intr + 100937 101060  124  1  1   99   76    82 0.969   8.26
 4.03 Intr + 103346 103521  176  0  2   82  109   105 0.998  11.66
 4.04 Intr + 111329 111398   70  1  1   95  109    47 0.944   6.25
 4.05 Intr + 120573 120742  170  1  2   73   93   169 0.999  15.57
 4.06 Intr + 123597 123727  131  2  2  107   84   205 0.992  21.49
 4.07 Intr + 126932 127099  168  2  0   95  101   115 0.999  12.56
 4.08 Intr + 127377 127499  123  0  0   70   92    91 0.939   7.40
 4.09 Term + 134189 134288  100  2  1  102   47    50 0.800  -0.10
 4.10 PlyA + 136069 136074    6                               1.05

 5.13 PlyA - 137030 137025    6                               1.05
 5.12 Term - 140438 140197  242  0  2  110   53   217 0.829  16.59
 5.11 Intr - 141185 141021  165  0  0  120   96    10 0.669   4.93
 5.10 Intr - 143411 143120  292  2  1   90   42   234 0.541  15.71
 5.09 Intr - 145007 144923   85  1  1   71   61    64 0.853   1.82
 5.08 Intr - 147885 147743  143  0  2  124   99    66 0.705  10.45
 5.07 Intr - 154693 154461  233  2  2   73   94   195 0.717  16.09
 5.06 Intr - 155632 155453  180  2  0   55   87    76 0.840   4.04
 5.05 Intr - 156304 156169  136  1  1   80   98   -10 0.517  -0.66
 5.04 Intr - 158967 158792  176  1  2  106  109   128 0.967  16.36
 5.03 Intr - 163450 163249  202  1  1   75   86   178 0.999  15.16
 5.02 Intr - 164284 163998  287  2  2  108    9   293 0.999  20.46
 5.01 Init - 170898 170829   70  0  1   72   77     8 0.249  -2.69
 5.00 Prom - 174341 174302   40                              -0.66

 6.00 Prom + 174919 174958   40                              -4.56
 6.01 Init + 188952 189100  149  2  2   79   98    52 0.248   4.96
 6.02 Intr + 189728 190076  349  2  1   33  -12   250 0.028   4.86
 6.03 Intr + 196955 197120  166  1  1   72   38    62 0.020  -0.87
 6.04 Intr + 197291 197434  144  0  0   42   90    60 0.647   1.85
 6.05 Term + 198094 198251  158  2  2   66   41    96 0.906   0.80
 6.06 PlyA + 200619 200624    6                               1.05

 7.00 Prom + 207704 207743   40                              -0.96
 7.01 Init + 212309 212359   51  1  0   58  103     0 0.045  -0.34
 7.02 Intr + 221552 221718  167  1  2   73   53   102 0.051   3.96
 7.03 Intr + 229007 229180  174  2  0   54   99   149 0.950  11.65
 7.04 Intr + 229815 229910   96  0  0   57   87    44 0.418   0.32
 7.05 Term + 231311 231326   16  2  1  125   46    21 0.647  -0.69
 7.06 PlyA + 232386 232391    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 221627 221516  112  1  1   42   43   175 0.925   6.43

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815576f:31299684_31533968|GENSCAN_predicted_peptide_1|64_aa
MAKIQTTNADKDVEQQELSFIAAFSSSFYSLKGPFVSLGDMKRFVTSNVSCYLDLKGKFA
HVRL

>gi568815576f:31299684_31533968|GENSCAN_predicted_CDS_1|195_bp
atggccaaaatccagacaacaaatgctgacaaggatgtggagcaacaggaactttcattc
attgctgcattttccagcagcttctacagtctgaaagggccctttgtctcccttggtgac
atgaagaggtttgtcacttcaaatgtcagctgttacttggacttgaagggcaagtttgcc
catgtccgcctgtga

>gi568815576f:31299684_31533968|GENSCAN_predicted_peptide_2|641_aa
MERVNDASCGPSGCYTYQVSRHSTEMLHNLNQQRKNGGRFCDVLLRVGDESFPAHRAVLA
ACSEYFESVFSAQLGDGGAADGGPADVGGATAAPGGGAGGSRELEMHTISSKVFGDILDF
AYTSRIVVRLESFPELMTAAKFLLMRSVIEICQEVIKQSNVQILVPPARADIMLFRPPGT
SDLGFPLDMTNGAALAANSNGIAGSMQPEEEAARAAGAAIAGQASLPVLPGVDRLPMVAG
PLSPQLLTSPFPSVASSAPPLTGKRGRGRPRKANLLDSMFGSPGGLREAGILPCGLCGKV
FTDANRLRQHEAQHGVTSLQLGYIDLPPPRLGENGLPISEDPDGPRKRSRTRKQVACEIC
GKIFRDVYHLNRHKLSHSGEKPYSCPVCGLRFKRKDRMSYHVRSHDGSVGKPYICQSCGK
GFSRPDHLNGHIKQVHTSERPHKCQTCNASFATRDRLRSHLACHEDKVPCQVCGKYLRAA
YMADHLKKHSEGPSNFCSICNREGQKCSHQDPIESSDSYGDLSDASDLKTPEKQSANGSF
SCDMAVPKNKMESDGEKKYPCPECGSFFRSKSYLNKHIQKVHVRALGGPLGDLGPALGSP
FSPQQNMSLLESFGFQIVQSAFASSLVDPEVDQQPMGPEGK

>gi568815576f:31299684_31533968|GENSCAN_predicted_CDS_2|1926_bp
atggagcgggtgaacgacgcttcgtgcggcccgtctggctgctacacataccaggtgagc
agacacagcacggagatgctgcacaacctgaaccagcagcgcaaaaacggcgggcgcttc
tgcgacgtgctcttgcgggtaggcgacgagagcttcccagcgcaccgcgccgtgctggcc
gcctgcagcgagtactttgagtcggtgttcagcgcccagttgggcgacggcggagctgcg
gacgggggtccggctgatgtagggggcgcgacggcagcaccaggcggcggggccgggggc
agccgggagctggagatgcacactatcagctccaaggtatttggggacattctggacttc
gcctacacttcccgcatcgtggtgcgcttggagagctttcccgaactcatgacggccgcc
aagttcctgctgatgaggtcggttatcgagatctgccaggaagtcatcaaacagtccaac
gtacagatcctggtaccccctgcccgcgccgatataatgctctttcgcccccctgggacc
tcggacttgggcttccctttggacatgaccaacggggcagccttggcagccaacagcaat
ggcatcgccggcagcatgcagccagaggaggaggcagctcgggcggctggtgcagccatt
gcaggccaagcctctttgcctgtgttacctggggtggaccgcttgcccatggtggctgga
cccctatccccccaactgctgacttccccattccccagtgtggcatccagtgcccctccc
ctgactggcaagcgaggccggggccgcccaaggaaggccaacctgctggactcaatgttt
gggtccccagggggcctgagggaggcaggcatccttccatgcggtctatgtggtaaggtg
ttcactgatgccaaccggctccggcagcacgaggcccagcacggtgtcaccagcctccag
ctgggctacatcgaccttcctcctccgaggctgggtgagaatgggctacccatctctgaa
gaccccgacggcccccgaaagaggagccggaccaggaagcaggtggcttgtgagatctgc
ggcaagatcttccgtgatgtgtatcatcttaaccggcacaagctgtcccactctggggag
aagccctactcctgccctgtgtgtgggttgcggttcaagagaaaagaccgcatgtcctac
catgtgcggtcccatgatgggtccgtgggcaagccttacatctgccagagctgtgggaaa
ggcttctccaggcctgatcacttgaacggacatatcaagcaggtgcacacttctgagcgg
cctcacaagtgtcagacctgcaatgcttcttttgccacccgagaccgtctgcgctcccac
ctggcctgtcatgaagacaaggtgccctgccaggtgtgtgggaagtacttgcgggcagca
tacatggcagaccacctgaagaagcacagcgaggggcccagcaacttctgcagtatctgt
aaccgagaaggccagaaatgctcacatcaggatccgattgagagctctgactcctatggt
gacctctcagatgccagcgacctgaagacgccagagaagcagagtgccaatggctctttc
tcctgcgacatggcagtccccaaaaacaaaatggagtctgatggggagaagaagtaccca
tgccctgaatgtgggagcttcttccgctctaagtcctacttgaacaaacacatccagaag
gtgcatgtccgggctctcgggggccccctgggggacctgggccctgcccttggctcacct
ttctctcctcagcagaacatgtctctcctcgagtcctttgggtttcagattgttcagtcg
gcatttgcgtcatctttagtagatcctgaggttgaccagcagcccatggggcctgaaggg
aaatga

>gi568815576f:31299684_31533968|GENSCAN_predicted_peptide_3|121_aa
MDTFLDTYTFPKLKQEEIDSLNRTIMSSKMKSVINSLLTKKSPGPDEVTAEFHQMYKDGI
VLFLLKLFQKIEAEGLLPNSFCEPGIILTLKPGRDTPKIENFKPISLMNIDAEILNKILA
N

>gi568815576f:31299684_31533968|GENSCAN_predicted_CDS_3|366_bp
atggatacattcctggacacatacaccttcccaaaactgaaacaagaagaaatagattcc
ctgaacagaacaataatgagctccaaaatgaaatcagtaataaatagcctactaacaaag
aaaagcccaggaccagatgaagtcacagccgaattccaccagatgtacaaagacgggata
gtactatttcttctgaaactatttcaaaaaattgaggcagagggactcctccccaactca
ttctgtgagcccggcatcatcctgacactaaaacctggcagagacacacccaaaatagaa
aacttcaagccaatatccttgatgaacatcgatgcagaaatcctcaacaaaatacttgca
aattga

>gi568815576f:31299684_31533968|GENSCAN_predicted_peptide_4|367_aa
MSSTLAKIAEIEAEMARTQKNKATAHHLGLLKARLAKLRRELITPKGGGGGGPGEGFDVA
KTGDARIGFVGFPSVGKSTLLSNLAGVYSEVAAYEFTTLTTVPGVIRYKGAKIQLLDLPG
IIEGAKDGKGRGRQVIAVARTCNLILIVLDVLKPLGHKKIIENELEGFGIRLNSKPPNIG
FKKKDKGGINLTATCPQSELDAETVKSILAEYKIHNADVTLRSDATADDLIDVVEGNRVY
IPCIYVLNKIDQISIEELDIIYKVPHCVPISAHHRWNFDDLLEKIWDYLKLVRIYTKPKG
QLPDYTSPVVLPYSRTTVEDFCMKIHKNLIKEFKYALVWGLSVKHNPQKVGKDHTLEDED
VIQIVKK

>gi568815576f:31299684_31533968|GENSCAN_predicted_CDS_4|1104_bp
atgagcagcaccttagctaagatcgcggagatagaagcagagatggctcggactcaaaag
aacaaggccacagcacaccacttagggctgcttaaggctcgtcttgctaagcttcgtcga
gaactcattactccaaagggtggtggtggtggaggtccaggagaaggttttgatgtggcc
aagacaggtgatgctcgaattggatttgttggttttccatctgtggggaagtcaacactg
cttagtaacctggcaggggtatattctgaggtggcagcctatgaattcactactctgacc
actgtgcctggtgtcatcagatacaaaggtgccaagatccagctcctggatctcccaggt
atcattgaaggtgccaaggatgggaaaggtagaggtcgtcaagtcattgcagtggcccga
acctgtaacttgatcttgattgttctggatgtcctgaaacctttgggacataagaagata
attgaaaatgagctggaaggctttggcattcgcttgaacagcaaaccccccaacattggc
tttaagaagaaggacaagggaggcattaatctcacagccacttgcccccagagtgagctg
gatgctgaaactgtgaagagcattctggctgaatacaagattcataatgccgatgtgact
ctacgtagtgatgctacagctgatgacctcattgatgtggtggaaggaaacagagtttat
atcccctgtatctatgtgttaaataagattgaccaaatctccattgaggaattggatatc
atctataaggtgcctcactgtgtacccatctctgcccatcaccgctggaattttgatgac
ctattggaaaagatctgggactatctgaaactagtgagaatttacaccaaacccaaaggc
cagttaccagattacacatccccagtggtgcttccttactccaggaccacagtggaggat
ttctgcatgaagattcacaaaaatcttatcaaagaatttaaatatgctctggtctggggt
ctctctgtgaaacacaatcctcagaaagtgggtaaagaccatacgttggaggatgaggat
gtcattcaaattgtgaagaagtga

>gi568815576f:31299684_31533968|GENSCAN_predicted_peptide_5|736_aa
MGQVQWLTPVIPALWEARAGRSQDPRERVKEDDLDVVLSPQRRSFGGGCHVTAAVSSRRS
GSPLEKDSDGLRLLGGRRIGSGRIISARTFEKDHRLSDKDLRDLRDRDRERDFKDKRFRR
EFGDSKRVFGERRRNDSYTEEEPEWFSAGPTSQSETIELTGFDDKILEEDHKGRKRTRRR
TASVKEGIVECNGGVAEEDEVEVILAQEPAADQEVPRDAVLPEQSPGDFDFNEFFNLDKV
PCLASMIEDVLGEGSVSASRFSRWFSNPSRSGSRSSSLGSTPHEELERLAGLEQAILSPG
QNSGNYFAPIPLEDHAENKVDILEMLQKAKVDLKPLLSSLSANKEKLKESSHSGVVLSVE
EVEAGLKGLKVDQQVKNSTPFMAEHLEETLSAVTNNRQLKKDGDMTAFNKLVSTMKASGT
LPSQPKVSQMSQLELQQAALEGLALPHDLAVQAANFYQPGFGKPQVDRTRDGFRNRQQRV
TKSPAPVHRGNSSSPAPAASITSMLSPSFTPTSVIRKMYESKEKSKEEPASGKAALGDSK
EDTQKASEGTAELLRALLKKVCDGENMGLSADQPDILYKALGKTTSFSGLWFCVSEKVLC
DSVLPPGMDLSHLQGISGPILGQPFYPLPAASHPLLNPRPGTPLHLAMVQQQLQRSVLHP
PGSGSHAAAVSVQTTPQNVPSRSGLPHMHSQLEHRPSQRSSSPVGLAKWFGSDVLQQPLP
SMPAKVISVDELEYRQ

>gi568815576f:31299684_31533968|GENSCAN_predicted_CDS_5|2211_bp
atgggccaggtgcagtggctcacacctgtaatcccagcactttgggaggccagggcgggc
agatcacaagatccacgagagcgtgtgaaagaagatgacttagatgttgttctcagccct
cagagacggagctttggagggggctgccacgtgacagccgctgttagctcccggcgctca
ggaagtccattagagaaagatagtgatgggcttcgtctgcttggtggacgtaggattggc
agtgggaggataatctctgcccggacctttgagaaggatcaccgtcttagcgataaggac
ctgcgggacttgagagacagagaccgagagagggacttcaaggacaagcgtttcaggaga
gagtttggagatagtaagcgtgtctttggtgagcgtagaagaaatgattcttacacagaa
gaagaaccagagtggttctctgctggacccacaagtcagtctgaaaccatcgaactgact
ggctttgatgataagatactagaagaagatcacaaagggagaaaaagaacaaggcgacgg
acagcctctgtgaaggaaggtatagtagagtgcaatggaggagtggccgaagaggatgaa
gtggaggtcatccttgcacaggagcctgcggctgatcaggaagtgccaagggatgctgtc
ttgcctgagcagtccccaggagactttgactttaatgagttctttaaccttgataaggtg
ccatgcttggcttcgatgatagaagatgttttgggagaagggtcagtctctgccagtcgg
ttcagtaggtggttctctaacccgagcagatcaggaagccgatccagcagtcttgggtca
acaccacatgaagagctagagagacttgcaggtctggagcaagccatcctctctcctgga
cagaactcggggaattactttgctcctataccattggaagaccatgctgaaaataaagtg
gatattttagaaatgctacagaaagccaaagtggatttgaaacctcttctttccagcctt
tctgcaaataaagaaaaacttaaagaaagctcacattcaggggttgtgctttcagtggag
gaggtagaagcaggtctgaagggcttgaaggttgaccagcaagtgaagaattcaactccc
ttcatggcagaacacctagaagagaccttgagtgccgtaaccaacaatcgacaactgaag
aaagacggagacatgactgcgttcaacaagctagtgagcacaatgaaggcaagtgggact
ttgccttctcagcccaaagtcagccagatgagccagctggagttgcaacaggcagcttta
gaagggctggccttgccacatgaccttgctgtacaggcagcaaacttctaccagcctggt
tttggcaaaccacaggtggacagaaccagagatggattcagaaacaggcaacagcgagtg
accaagtcaccagcacccgtgcatcgagggaattcctcttcccctgcccctgctgcctcc
atcacaagcatgctttctccttcctttacccctacctcagtgattcgtaagatgtacgag
agcaaagagaaaagcaaggaggagccagcatctggaaaagcagctcttggtgacagtaaa
gaggatactcagaaggccagtgaaggtactgcagagctgttaagggcactgctcaagaaa
gtatgtgatggagaaaacatgggcctgagcgctgaccagcctgatattctatacaaagcc
ctgggcaagaccactagcttctctggactatggttctgtgtcagtgagaaggttctgtgt
gacagtgtgcttcctcctgggatggacttgagtcatttacagggaatatctggccccatc
ctgggtcagcccttttaccctttacctgctgctagtcaccctctcttaaaccctcgtcct
ggaacacctctgcatctggcaatggtgcaacagcagctacagcgctcagttctgcatcct
ccaggctctggttcccatgcagcagctgtcagcgttcagacaacccctcagaacgtgccc
agccggtcaggcctgccccacatgcactcccagctggagcatcgccccagccagaggagc
agctcccctgtgggccttgccaaatggtttggctcagatgtgctacagcaacccctgccc
tccatgcccgccaaagttatcagtgtagatgaattggaataccgacagtga

>gi568815576f:31299684_31533968|GENSCAN_predicted_peptide_6|321_aa
MGAFGGRRLLQVKESISTFCFTHTSPIHGSLVYNALHLEDKSAQNCHREQEEGGRISRPG
RFKLELAQARSRPPRPGGSVPASQPAGYPPAALRASRPCPPPAGAPAPALTHATPGAMRP
RPFYTREHRARGAPDPAATAADPYSVPPPLGGPLPRPPLPSPLPAPVRGPALTSPPSSGL
GLSGSEPGKDLSRVPPSPVKSVRASASSPESAGDVVYLRVCACADPNQGFPPPGSVAQER
KGWYMGDLTPTEPAGSNAVNKPGSLPRVGVWFTEYFKPTIETYCSEKRIPFKILLVINKA
PGHQGTLMKMYKEINVTLHVC

>gi568815576f:31299684_31533968|GENSCAN_predicted_CDS_6|966_bp
atgggggcatttggaggcaggaggcttcttcaggtcaaggaaagcatctccactttctgt
ttcacccatacttctcctatccatggctccttggtctacaatgctctgcacctggaagat
aaatcggcacaaaattgtcaccgagaacaggaggagggaggccgaatctcccggcccggg
cgcttcaagctggagctagcgcaggcccgcagccgtcctccccggcccggcggctcggtc
cccgccagccagcccgccggctacccgccggccgccttgcgggcctcgcgcccctgccca
ccccctgcgggcgccccggcgcccgcgctcacgcacgcgaccccgggagcaatgcgcccg
cgccccttctacacccgggagcaccgggcgcgcggtgccccggacccagccgccaccgcc
gccgacccttactcggtgcctccgccgctcggcggccctttgcctcggccgccgctcccc
tccccgctgcccgcaccggtacgaggcccggccctaacgtccccaccctcttctgggctc
ggcctttccggcagtgagcccggcaaggacctcagtcgggttcctccgagcccggtgaaa
tctgtccgagcctccgcttcctcccctgaaagtgcaggtgatgttgtctacctgcgggtc
tgtgcgtgcgcggatccaaatcaggggtttcctcctcccggctccgtggcccaggagcgg
aagggctggtatatgggcgaccttactcccactgagcccgcagggagtaacgcagtaaac
aaacctggaagccttccccgcgtgggagtgtggtttactgaatattttaagcctactatc
gagacctactgttcagaaaaaaggattcctttcaagatattactggtcattaacaaggca
cctggtcaccaaggaactctgatgaagatgtacaaagagattaatgttactttacatgtc
tgctaa

>gi568815576f:31299684_31533968|GENSCAN_predicted_peptide_7|167_aa
MYRLRYHFETNNVTITQRLANVWNDTITSNCSLDERVQFFFFTNSRLQMMRGNSFTFKSF
GAFDVTSSISAVRYFKDGAVKKPYSAKTLSNKKSSASFGIRRELPSTSHLVQYRGTHTCT
RQGRLRELRISLGESETLSQNNNNRFIWCLPMPGVTLDAEHIKKLLM

>gi568815576f:31299684_31533968|GENSCAN_predicted_CDS_7|504_bp
atgtatcgtttaagatatcattttgaaacaaacaatgtaaccataacacagagacttgcg
aatgtgtggaatgacaccatcaccagcaattgtagccttgatgagagagtccaattcttc
ttcttcactaatagcaggttgcaaatgatgaggggtaatagctttacctttaaatctttt
ggtgcctttgatgtcacttcaagtatctctgcagtgaggtattttaaggatggtgcagtt
aagaaaccttattctgcaaagacactgtccaacaagaagtcttctgcatcctttgggatc
cggagggagttacctagtaccagtcatctagtgcagtatcgtggcacacatacttgtacc
cgacagggccggttaagagaactgcgcatcagcctgggcgagagtgagactctgtctcag
aacaacaacaacagatttatctggtgcttacccatgccaggtgttacgctagatgctgag
catataaaaaaactgctgatgtaa