GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:36:02 Sequence gi568815588r:80147222_80347683 : 200462 bp : 45.87% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.16 PlyA - 145 140 6 1.05 1.15 Term - 1467 1249 219 0 0 32 46 160 0.054 3.24 1.14 Intr - 10542 10420 123 0 0 55 72 287 0.987 24.58 1.13 Intr - 10804 10746 59 2 2 65 75 61 0.959 1.10 1.12 Intr - 11974 11879 96 2 0 73 106 192 0.965 19.48 1.11 Intr - 14978 14714 265 2 1 134 110 147 0.805 18.59 1.10 Intr - 16184 16128 57 2 0 91 109 80 0.762 9.48 1.09 Intr - 16392 16313 80 1 2 110 80 87 0.963 9.37 1.08 Intr - 16922 16832 91 2 1 72 94 141 0.984 12.67 1.07 Intr - 18976 18863 114 1 0 30 93 117 0.959 7.04 1.06 Intr - 19763 19657 107 0 2 128 23 208 0.843 18.23 1.05 Intr - 20092 20005 88 1 1 2 103 143 0.997 6.74 1.04 Intr - 22137 21748 390 0 0 122 109 241 0.992 24.52 1.03 Intr - 23694 23579 116 1 2 72 101 103 0.793 10.17 1.02 Intr - 25648 25586 63 2 0 111 110 38 0.512 6.99 1.01 Init - 30880 30727 154 1 1 62 65 92 0.286 2.75 1.00 Prom - 37349 37310 40 -5.66 2.00 Prom + 40230 40269 40 -4.76 2.01 Init + 48079 48260 182 0 2 67 77 114 0.716 6.86 2.02 Intr + 51598 51714 117 1 0 46 84 55 0.399 0.48 2.03 Intr + 54114 54142 29 0 2 102 105 0 0.586 0.86 2.04 Intr + 57035 57152 118 0 1 75 84 57 0.551 3.52 2.05 Term + 60488 60551 64 2 1 101 39 70 0.116 0.76 2.06 PlyA + 60566 60571 6 1.05 3.04 PlyA - 62121 62116 6 1.05 3.03 Term - 78274 78045 230 2 2 -39 40 370 0.761 16.29 3.02 Intr - 86639 86417 223 2 1 59 101 40 0.001 0.10 3.01 Init - 93234 93178 57 0 0 76 100 29 0.019 4.21 3.00 Prom - 93399 93360 40 -6.26 4.05 PlyA - 93575 93570 6 1.05 4.04 Term - 100399 99998 402 1 0 47 36 568 0.061 42.65 4.03 Intr - 105378 105172 207 0 0 -50 80 247 0.723 9.37 4.02 Intr - 105837 105583 255 0 0 -44 53 465 0.520 27.44 4.01 Init - 106162 106094 69 1 0 56 75 104 0.870 6.95 4.00 Prom - 106873 106834 40 -1.46 5.10 PlyA - 111150 111145 6 1.05 5.09 Term - 126662 126560 103 1 1 38 43 90 0.379 -2.75 5.08 Intr - 127432 127299 134 1 2 18 103 251 0.733 19.04 5.07 Intr - 127978 127796 183 1 0 125 115 216 0.999 27.98 5.06 Intr - 129373 129155 219 1 0 71 57 488 0.995 42.50 5.05 Intr - 133095 132952 144 0 0 78 99 160 0.999 16.58 5.04 Intr - 133592 133459 134 0 2 89 70 150 0.623 13.66 5.03 Intr - 136817 136695 123 0 0 116 72 218 0.989 23.56 5.02 Intr - 138368 138291 78 0 0 91 83 91 0.994 8.42 5.01 Init - 142202 142112 91 2 1 69 114 53 0.984 6.65 5.00 Prom - 152520 152481 40 -4.26 6.00 Prom + 153519 153558 40 -4.46 6.01 Sngl + 157848 158129 282 2 0 88 37 153 0.785 5.69 6.02 PlyA + 158731 158736 6 1.05 7.00 Prom + 159257 159296 40 -6.36 7.01 Sngl + 160705 162090 1386 0 0 42 32 458 0.887 31.69 7.02 PlyA + 162199 162204 6 -0.45 8.00 Prom + 162482 162521 40 -2.46 8.01 Init + 162919 163051 133 0 1 78 47 90 0.551 4.20 8.02 Term + 168470 168555 86 0 2 131 55 99 0.992 8.72 8.03 PlyA + 168965 168970 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 8691 8632 60 0 0 92 47 81 0.922 2.20 S.002 Init - 79464 79455 10 0 1 71 82 9 0.854 -1.16 S.003 Intr + 86443 86562 120 0 0 129 113 97 0.987 16.99 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:80147222_80347683|GENSCAN_predicted_peptide_1|673_aa MAALGCQERLLQGQPLAVASFEKMQRSQLPEDWTMSRFLNFQRKGGEECCFYLTMSYPGY PPPPGGYPPAAPGGGPWGGAAYPPPPSMPPIGLDNVATYAGQFNQDYLSGMAANMSGTFG GANMPNLYPGAPGAGYPPVPPGGFGQPPSAQQPVPPYGMYPPPGGNPPSRMPSYPPYPGA PVPGQPMPPPGQQPPGAYPGQPPVTYPGQPPVPLPGQQQPVPSYPGYPGSGTVTPAVPPT QFGSRGTITDAPGFDPLRDAEVLRKAMKGFGTDEQAIIDCLGSRSNKQRQQILLSFKTAY GKASCGDLIKDLKSELSGNFEKTILALMKTPVLFDIYEIKEAIKGVGTDEACLIEILASR SNEHIRELNRAYKAEFKKTLEEAIRSDTSGHFQRLLISLSQGNRDESTNVDMSLAQRDAQ GCYLRPSRHFRGHLVQRLQIGKYFRLKPSEVYQALLGDLGLGSGCGLCHMCVFLSLQELY AAGENRLGTDESKFNAVLCSRSRAHLVAVFNEYQRMTGRDIEKSICREMSGDLEEGMLAV VKCLKNTPAFFAERLNKAMRGAGTKDRTLIRIMVSRSETDLLDIRSEYKRMYGKSLYHDI SDCTGSDLKASTALALTQGLQLPLPGYCLCLLKALGLYNRLVAEPARLVSFPSEQQGPPG PSWAQRCHLGASN >gi568815588r:80147222_80347683|GENSCAN_predicted_CDS_1|2022_bp atggcagccctggggtgccaggaacgacttctccaaggacagccccttgcggtagcctct tttgaaaaaatgcagaggtcacagttaccagaagactggaccatgagcagatttcttaat tttcaaagaaaaggaggagaagaatgctgcttctatctaaccatgagctaccctggctat cccccgcccccaggtggctacccaccagctgcaccaggtggtggtccctggggaggtgct gcctaccctcctccgcccagcatgccccccatcgggctggataacgtggccacctatgcg gggcagttcaaccaggactatctctcgggaatggcggccaacatgtctgggacatttgga ggagccaacatgcccaacctgtaccctggggcccctggggctggctacccaccagtgccc cctggcggctttgggcagcccccctctgcccagcagcctgttcctccctatgggatgtat ccacccccaggaggaaacccaccctccaggatgccctcatatccgccatacccaggggcc cctgtgccgggccagcccatgccaccccccggacagcagcccccaggggcctaccctggg cagccaccagtgacctaccctggtcagcctccagtgccactccctgggcagcagcagcca gtgccgagctacccaggatacccggggtctgggactgtcacccccgctgtgcccccaacc cagtttggaagccgaggcaccatcactgatgctcccggctttgaccccctgcgagatgcc gaggtcctgcggaaggccatgaaaggcttcgggacggatgagcaggccatcattgactgc ctggggagtcgctccaacaagcagcggcagcagatcctactttccttcaagacggcttac ggcaaggcgagctgcggggatttgatcaaagatctgaaatctgaactgtcaggaaacttt gagaagacaatcttggctctgatgaagaccccagtcctctttgacatttatgagataaag gaagccatcaagggggttggcactgatgaagcctgcctgattgagatcctcgcttcccgc agcaatgagcacatccgagaattaaacagagcctacaaagcagaattcaaaaagaccctg gaagaggccattcgaagcgacacatcagggcacttccagcggctcctcatctctctctct cagggaaaccgtgatgaaagcacaaacgtggacatgtcactcgcccagagagatgcccag ggctgctacctgaggcctagcaggcactttagaggccatctagttcagaggttgcaaatt ggcaaatactttaggctcaaaccttcagaagtttaccaggctctcctgggtgacctgggc ctggggtctgggtgtggcctgtgccacatgtgcgtcttcctctctctccaggagctgtat gcggccggggagaaccgcctgggaacagacgagtccaagttcaatgcggttctgtgctcc cggagccgggcccacctggtagcagttttcaatgagtaccagagaatgacaggccgggac attgagaagagcatctgccgggagatgtccggggacctggaggagggcatgctggccgtg gtgaaatgtctcaagaataccccagccttctttgcggagaggctcaacaaggccatgagg ggggcaggaacaaaggaccggaccctgattcgcatcatggtgtctcgcagcgagaccgac ctcctggacatcagatcagagtataagcggatgtacggcaagtcgctgtaccacgacatc tcggactgcactgggtcagacctgaaagccagcacagcactggctctcacccaaggcctg cagttaccactccctggctactgcctgtgtctgctcaaggccctggggctctacaatcgg ctggtggcagagccagccaggcttgtgtccttcccttcagaacagcaaggtcctccaggc cccagctgggcccaaaggtgccatctgggagccagcaactag >gi568815588r:80147222_80347683|GENSCAN_predicted_peptide_2|169_aa MSPVSRMQLEVQGPAAHSGGENTFTWEWADPSAAGILSCVKQSGAKSHSHWATEEQHENR RWRGEDDDEKLNPTAKIRKGMHASIISNKQDAKNKIKYIWLIFLYSAVRGNTLQHKECLE ERGPVNPLESPTIDCRIKGMRKAMRASEGDREGQLLRKAHFSPWLSECS >gi568815588r:80147222_80347683|GENSCAN_predicted_CDS_2|510_bp atgtccccagtcagccgcatgcaactggaagttcaaggacctgcagctcactctggagga gaaaacaccttcacttgggaatgggctgacccttctgcagctggcatcctaagctgtgtc aagcagtctggagccaagagtcactcacactgggccactgaggaacagcatgaaaacaga aggtggcgtggggaagatgacgatgagaaacttaacccaacagcaaaaatcaggaaagga atgcatgcaagtataataagtaataagcaagatgcaaagaataaaatcaagtatatctgg ctcatcttcctttactcagctgttcgagggaacaccttgcagcacaaagaatgccttgag gaaaggggccctgttaatcccctggaatccccaacaattgactgccgcataaaaggaatg aggaaagcaatgagagccagtgaaggcgaccgcgaaggccagcttcttcgaaaggcccat ttttctccgtggctgagcgaatgctcctag >gi568815588r:80147222_80347683|GENSCAN_predicted_peptide_3|169_aa MSKGESRKCNEENVSKSSKPPYETWGRSTGSYSGAPDPFHLPSLTFLGDLHETVTDFTCC CSSLVSWSLSRDRNCGVRTINTFTTCRTRAEAYERGRRRRRRRRRRRRRRRRKEEKKKKK KKKEKEKEKEKEKEKKRKEEEEEEEEEEEEEEEEEEKMYAYGCPSQKNK >gi568815588r:80147222_80347683|GENSCAN_predicted_CDS_3|510_bp atgagcaagggtgaaagtagaaaatgcaatgaggaaaatgtatccaagtcttcaaagcca ccctatgagacatggggcagaagcactgggagctattcaggggcaccagaccccttccat ctgccctcactcaccttccttggggacttgcatgagactgtcactgactttacatgctgc tgcagctccttggtcagctggtccttgtcacgggacagaaactgtggggtcaggacaatc aacaccttcacgacctgcagaacaagggcagaggcttatgaaagaggaagaagaagaagg agaaggagaaggagaaggagaaggagaagaagaagaaaagaagaaaagaagaagaagaag aagaagaaggagaaggagaaggagaaggagaaggagaaggagaagaaaagaaaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaaaaatgtatgca tatggatgtccaagccagaaaaacaaataa >gi568815588r:80147222_80347683|GENSCAN_predicted_peptide_4|310_aa MEIIEVTEEIHVKVVIEGPDATQEVDEVPEDQQLQEVDEVPEDLQLQGVDEVPEDQQLQE VNEVPEDQQLQKVDEVPEDHQLQEVDEVPEDHQLREVDEVPEDRQLQEELDKAPENNRVE EVVKFSGDSLVQEVAEFPEDSRVEVVEFPEDSPVEEFVEVPENLQMEGVFEFPDNTQCSA LRKNGFVVLKGWPCKIVEMSASKTGKHGHAKVHLVGIDIFTGKKYEDICPSTHNMDVPNI RRNDFQLIGIQDGYLSLLQDSGEVPEDLRLPEGDLGKETEQKYDCGEEILITVLSAMTEE AAVAIKAMAK >gi568815588r:80147222_80347683|GENSCAN_predicted_CDS_4|933_bp atggagattattgaagtaaccgaggagattcatgtgaaggtggttattgaggggccagac gccacccaggaggtggatgaggtcccagaggaccaacagctgcaggaggtggatgaggtc ccagaagacctacagctgcagggtgtggatgaagtcccagaggaccaacagctgcaggag gtgaatgaggtcccggaagaccaacagctgcaaaaggtggatgaggtcccggaggaccac cagctgcaggaggtggatgaggttccggaggaccaccagcttcgagaggtggatgaggtc ccggaggaccgacagctgcaggaggagctggataaggccccagagaacaatcgagtggag gaggtggttaagttttcaggggactctctagtgcaggaggtggctgagttcccagaggac agtcgagtggaggtggttgaattcccagaggactctccagtggaggagtttgttgaggtc ccagaaaaccttcagatggagggagtgtttgagttcccagacaacacccagtgctcagca ttacgtaagaatggctttgtggtgctcaaaggctggccatgtaagatcgtggagatgtct gcttcgaagactggcaagcacggccacgccaaggtccatctggttggtattgacatcttt actgggaagaaatatgaagatatctgcccgtcaactcataatatggatgtccccaacatc agaaggaatgacttccagctgattggcatccaggatgggtacctatcactgctccaggac agcggggaggtaccagaggaccttcgtctccctgagggagaccttggcaaggagactgag cagaagtacgactgtggagaagagatcctgatcacggtgctgtctgccatgacagaggag gcagctgttgcaatcaaggccatggcaaaataa >gi568815588r:80147222_80347683|GENSCAN_predicted_peptide_5|402_aa MNGPVDGLCDHSLSEGVFMFTSESVGEGHPDKICDQISDAVLDAHLKQDPNAKVACETVC KTGMVLLCGEITSMAMVDYQRVVRDTIKHIGYDDSAKGSLCFRLGFDFKTCNVLVALEQQ SPDIAQCVHLDRNEEDVGAGDQGLMFGYATDETEECMPLTIILAHKLNARMADLRRSGLL PWLRPDSKTQVTVQYMQDNGAVIPVRIHTIVISVQHNEDITLEEMRRALKEQVIRAVVPA KYLDEDTVYHLQPSGRFVIGGPQGDAGVTGRKIIVDTYGGWGAHGGGAFSGKDYTKVDRS AAYAARWVAKSLVKAGLCRRVLVQVSYAIGVAEPLSISIFTYGTSQKTERELLDVVHKNF DLRPGVIVRDLDLKKPIYQKTACYGHFGRSEFPWEVPRKLVF >gi568815588r:80147222_80347683|GENSCAN_predicted_CDS_5|1209_bp atgaatggaccggtggatggcttgtgtgaccactctctaagtgaaggagtcttcatgttc acatcggagtctgtgggagagggacacccggataagatctgtgaccagatcagtgatgca gtgctggatgcccatctcaagcaagaccccaatgccaaggtggcctgtgagacagtgtgc aagaccggcatggtgctgctgtgtggtgagatcacctcaatggccatggtggactaccag cgggtggtgagggacaccatcaagcacatcggctacgatgactcagccaagggctcactt tgcttccgcctaggctttgacttcaagacttgcaacgtgctggtggctttggagcagcaa tccccagatattgcccagtgcgtccatctggacagaaatgaggaggatgtgggggcagga gatcagggtttgatgttcggctatgctaccgacgagacagaggagtgcatgcccctcacc atcatccttgctcacaagctcaacgcccggatggcagacctcaggcgctccggcctcctc ccctggctgcggcctgactctaagactcaggtgacagttcagtacatgcaggacaatggc gcagtcatccctgtgcgcatccacaccatcgtcatctctgtgcagcacaacgaagacatc acgctggaggagatgcgcagggccctgaaggagcaagtcatcagggccgtggtgccggcc aagtacctggacgaagacaccgtctaccacctgcagcccagtgggcggtttgtcatcgga ggtccccagggggatgcgggtgtcactggccgtaagattattgtggacacctatggcggc tggggggctcatggtggtggggccttctctgggaaggactacaccaaggtagaccgctca gctgcatatgctgcccgctgggtggccaagtctctggtgaaagcagggctctgccggaga gtgcttgtccaggtttcctatgccattggtgtggccgagccgctgtccatttccatcttc acctacggaacctctcagaagacagagcgagagctgctggatgtggtgcataagaacttc gacctccggccgggcgtcattgtcagggatttggacttgaagaagcccatctaccagaag acagcatgctacggccatttcggaagaagcgagttcccatgggaggttcccaggaagctt gtattttag >gi568815588r:80147222_80347683|GENSCAN_predicted_peptide_6|93_aa MGRNQSRKAENSKKQSAPSSPKDRSSSPAMEQSRTENDFDELTEVGFRKLVINFFELKED VQTHRKEAKNFEKRLNEWLTKINSVEKTLNDLM >gi568815588r:80147222_80347683|GENSCAN_predicted_CDS_6|282_bp atggggagaaaccagagcagaaaagctgaaaattctaaaaaacagagcgccccttcttct ccaaaggatcgcagctcctcgccagcaatggaacaaagcaggacagagaatgattttgat gagttgacagaagtaggcttcagaaagttggtaataaacttctttgagctaaaggaggat gttcaaacccatcgcaaggaagctaaaaactttgaaaaaagattaaatgaatggctaact aaaataaacagtgtagagaagaccttaaatgacctgatgtag >gi568815588r:80147222_80347683|GENSCAN_predicted_peptide_7|461_aa MIISIDAEKAFNKIQQPFIIKVFNKLGIDGTYLKVIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNITLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIAYLENPVVS AQNLLKLISNFSKVSAYKINVQKSQAFLDTNNRQTESQIMSELPLTIATKIIKYLGIQLT KDVKDLFKENYKPLFNEIKEDTNKWKNIPCSWTGKINIMKTAILPKVIYRFSAITIKLPM TFFTELEKTTLKFIWSQKRARIAKTILSQKNKAGGIMLPDFKLYYKATVTQTAWYWYQNR DIDQWNRTEASEITPHIYNPLIFDKPDKNKKWGKDSLFNKWCWENWLAICRKLKLDPFLT PYTKINSRWIKDLNVRPKTIKALEENLGKTIQDIDMGKDFMTKTSKAMATKAKIDKWDLI KLKSFCTAKETIIRVNRHPTEWDKSFAIYSSDKRLISRIYK >gi568815588r:80147222_80347683|GENSCAN_predicted_CDS_7|1386_bp atgattatctcaatagatgcagaaaaggccttcaacaaaattcaacagcccttcataata aaagttttcaataaactaggtattgatgggacatatctcaaagtaataagagctatttat gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacaaggatgccctctctcaccactcctattcaacataacgttggaagtt ctggccagggcaatcaggcaggagaaagaaataaagggtattcagttaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgcgtatttagaaaaccccgtcgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcagcatacaaaatcaat gtgcaaaaatcacaggcattcttagacaccaataacagacaaacagagagccaaatcatg agtgaactcccactcacaattgctacaaagataataaaatacctaggaatccaacttaca aaggatgtgaaggacctcttcaaggagaactacaaaccactgttcaatgaaataaaagag gacacaaacaaatggaagaacattccatgctcatggacaggaaaaatcaatatcatgaaa acggccatactgcccaaggtaatttatagattcagtgccatcaccatcaaactaccaatg acattcttcacagaattggaaaaaactactttaaagttcatatggagccaaaaaagagcc cgcattgccaagacaatcctaagccaaaagaacaaagctggaggcatcatgctacctgac ttcaaactatactacaaggctacagtaacccaaacagcatggtactggtaccaaaacaga gatatagaccaatggaacagaacagaggcctcagaaataacaccacacatctacaaccct ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttacacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata aaagccctagaagaaaacctaggcaaaaccattcaggacatagacatgggcaaggacttc atgactaaaacatcaaaagcaatggcaacaaaagccaaaatagacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactatcatcagagtgaacaggcatcctaca gaatgggacaaaagttttgcaatctactcatctgacaaaaggctaatatccagaatctac aaataa >gi568815588r:80147222_80347683|GENSCAN_predicted_peptide_8|72_aa MEYYAAIKKDEFMSFVGTRMKLETIILSKLSQGQKNKHHTSSLIGGLLRSPSLPNEGCSL PQGAQSHQLPKG >gi568815588r:80147222_80347683|GENSCAN_predicted_CDS_8|219_bp atggaatactatgcagccataaagaaggatgagttcatgtcctttgtagggacacggatg aagctggaaaccatcattctgagcaaactatcacaaggacagaaaaacaaacaccatacg tcctcactcataggtgggctcctgcgcagcccgagcctccccaacgagggctgctccctg ccccagggcgcccagtcccatcaactgcccaagggctga