GENSCAN 1.0 Date run: 5-Nov-116 Time: 20:22:31 Sequence gi568815593f:123445664_123714393 : 268730 bp : 36.24% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 11597 12058 462 1 0 42 41 189 0.906 5.51 1.02 PlyA + 12109 12114 6 1.05 2.02 PlyA - 13904 13899 6 1.05 2.01 Sngl - 23326 23054 273 1 0 69 44 300 0.997 18.78 2.00 Prom - 26815 26776 40 -2.35 3.00 Prom + 51283 51322 40 -3.85 3.01 Init + 60875 60996 122 1 2 70 94 75 0.779 6.01 3.02 Intr + 62725 62780 56 1 2 61 90 62 0.447 1.40 3.03 Intr + 65705 65901 197 0 2 36 95 92 0.426 2.91 3.04 Intr + 66318 66414 97 2 1 49 81 83 0.805 2.36 3.05 Intr + 66631 66907 277 2 1 -3 88 202 0.574 6.75 3.06 Intr + 99952 100178 227 1 2 60 82 317 0.845 25.01 3.07 Intr + 127730 127878 149 0 2 94 119 48 0.048 7.53 3.08 Intr + 142764 142848 85 2 1 96 95 51 0.089 5.07 3.09 Intr + 144747 144895 149 1 2 22 97 110 0.084 4.33 3.10 Intr + 159061 159167 107 0 2 73 55 112 0.165 4.49 3.11 Intr + 171543 171604 62 0 2 50 110 34 0.004 -0.74 3.12 Intr + 179946 180012 67 1 1 24 66 143 0.041 2.74 3.13 Intr + 183371 183554 184 2 1 49 57 78 0.265 -0.33 3.14 Term + 183686 183892 207 1 0 50 34 172 0.350 4.36 3.15 PlyA + 184270 184275 6 1.05 4.00 Prom + 185107 185146 40 -9.05 4.01 Init + 188072 188284 213 1 0 63 20 153 0.885 4.79 4.02 Intr + 190486 190737 252 0 0 62 42 201 0.951 9.61 4.03 Intr + 190840 191344 505 0 1 42 29 829 0.514 64.02 4.04 Intr + 191356 191559 204 2 0 -42 48 250 0.141 6.25 4.05 Intr + 191606 191760 155 0 2 -66 42 232 0.117 2.07 4.06 Term + 202765 202884 120 0 0 49 55 130 0.506 3.29 4.07 PlyA + 202988 202993 6 1.05 5.04 PlyA - 204223 204218 6 1.05 5.03 Term - 212375 212253 123 2 0 75 48 87 0.021 0.80 5.02 Intr - 230783 230484 300 2 0 18 86 128 0.093 1.61 5.01 Init - 232644 231475 1170 0 0 44 41 515 0.171 35.97 5.00 Prom - 232853 232814 40 -9.85 6.02 PlyA - 232958 232953 6 1.05 6.01 Sngl - 234015 233722 294 0 0 88 54 295 0.983 21.55 6.00 Prom - 237517 237478 40 -4.65 7.00 Prom + 240741 240780 40 -3.15 7.01 Init + 248057 248063 7 1 1 88 81 2 0.119 0.60 7.02 Intr + 254192 254354 163 0 1 102 57 121 0.145 8.51 7.03 Intr + 260885 261009 125 2 2 43 95 93 0.149 4.81 7.04 Term + 266061 266107 47 1 2 142 43 7 0.023 -2.01 7.05 PlyA + 268477 268482 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 144752 144895 144 1 0 69 97 105 0.908 9.67 S.002 Term + 148189 148296 108 0 0 88 49 64 0.862 0.03 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:123445664_123714393|GENSCAN_predicted_peptide_1|153_aa MIISIDVEKAFDKIQQRFMLKTLNKLRIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLK TGTRQGCPLSPLLFNIMLEVLTRAIRQEKEIKDIQLGKEEVKLSLFADDMIVYLENLIVS AQNLLKLISNFSKVSGYTINVQKSQAFLYTNNR >gi568815593f:123445664_123714393|GENSCAN_predicted_CDS_1|462_bp atgattatctcaatagatgtagaaaaggcctttgacaaaattcaacaacgcttcatgcta aaaactctcaataaattacgtattgatgggacatatctcaaaataataagagctatctat gacaaacccacagccaatatcatattgaatgggcaaaaactggaagcattccctttgaaa actggcacaagacagggatgccctctctcaccactcctattcaacataatgttggaagtt ctgaccagggcaattaggcaggagaaggaaataaaggatattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatgattgtatatctagaaaacctcatcgtctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacacaatcaat gtgcaaaaatcacaagcattcttatacaccaataacagataa >gi568815593f:123445664_123714393|GENSCAN_predicted_peptide_2|90_aa MRNNLSDSEEHSYITKAAKLKEKYEKDVADCKHKGKFDGTKHPTNIAWKKVEAEDEEDEE NEEEEAVEEDDFKTIYLSLCEYLRIREDCN >gi568815593f:123445664_123714393|GENSCAN_predicted_CDS_2|273_bp atgaggaataacttaagtgacagtgaagagcactcttacatcactaaggcagcaaaactg aaggagaagtatgagaaggatgttgctgactgtaagcataaaggaaagtttgatggcaca aaacatcctactaacattgcctggaaaaaggtggaagcagaagacgaagaagatgaggag aatgaagaggaggaggcagtggaggaggatgactttaaaaccatttatctatctctatgt gaatatcttagaatacgggaggattgtaattga >gi568815593f:123445664_123714393|GENSCAN_predicted_peptide_3|661_aa MFPVTPPPLLHTYRKAQPLITEQSETEKDSSKGNLSATRKSSYAESPTPNVMEFVGPLAE VYQNDLSPSQGSNCKRENERNAVFLCRQSGLKCESETPISVCALQLDNLGDVWLLRTLGG EFEKQKDGDRARLRLYTPCQPMTAPSPFRSAIRPIQEAEARAATLRPAAAVGGTSGSAGT ESEQRCVTSLSRSFPPTSLARSLAPCVARFPPPAAGGCDVPPPLPPPALTDWGRRPRPGP TPLLAAAPLSVLVLIKEFKVEYRKLDMENKKKDKDKSDDRMARPSGRSGHNTRGTGSSSS GVLMVGPNFRVGKKIGCGNFGELRLDGIPQVYYFGPCGKYNAMVLELLGPSLEDLFDLCD RTFSLKTVLMIAIQLADTLKERYQKIGDTKRATPIEVLCENFPEEMATYLRYVRRLDFFE KPDYDYLRKLFTDLFDRKGYMFDYEYDWIGKQLVVSSTNGELNTDDPTAGRSNAPITAPT EVEVMDETKCFKGFYKMALMVRYLLRDATGEYLQTEPPDVPWHQQERIASISPAKSPLQL KPWQAQSLPAAPPPPPAPHPCDNTAMGVKLGTEKSGPFPTLSSHLHLGECSWRRPAPSTR QYPATAEKKSTLLCCCCCHCCWHMQMRTDPAAATIQKALSGTIHQSAVTQWSRSTAALPV Q >gi568815593f:123445664_123714393|GENSCAN_predicted_CDS_3|1986_bp atgttccccgttactcctcctcccttactccacacctacaggaaggcacagcctctgatc actgaacagagtgagacagaaaaagattcatcaaagggaaatctgagtgctactaggaaa agttcatatgctgaatccccaaccccaaatgtgatggaatttgtagggcctttggcagaa gtctaccaaaatgacttatctccatcccagggaagcaactgcaagagggaaaatgaaagg aatgcagtctttctatgtaggcagagtggacttaagtgtgagtcagaaacaccgatttca gtttgtgcgctgcaacttgataacttgggtgatgtttggcttctcagaactttgggaggt gaatttgaaaaacagaaagacggggatagggcgcggcttaggctttatacaccttgccag ccaatgaccgcgccgtctcctttccggtcggcgatccgaccaatccaagaagcagaggct cgggcggcgactctccggccagcggcggcggtaggaggcaccagcggcagtgcagggacc gaatccgagcagcgctgcgttacctctctctctcgctccttcccccctacctcgctcgct cgctcgctcgctccctgcgtggctcgctttcctcctccggccgccggcgggtgtgatgtg ccgccgccgctgcccccgccggcgctgacggactgggggcgccgcccgcgcccgggaccg acccctctgctcgcggccgcgcctttgagtgtgcttgtcttgattaaagaattcaaagtg gagtaccgcaaacttgatatggaaaataaaaagaaagacaaggacaaatcagatgataga atggcacgacctagtggtcgatcgggacacaacactcgaggaactgggtcttcatcgtct ggagttttaatggttggacctaactttagagttggaaaaaaaattggatgtggcaatttt ggagaattacgattagatggtatacctcaagtttactatttcggcccttgtggtaaatac aatgctatggtgctggaactgctgggacctagtttggaagacttgtttgacttgtgtgac agaacattttctcttaaaacagttctcatgatagctatacaactggctgacacattaaag gagaggtatcagaaaattggagatacaaaacgggctacaccaatagaagtgttatgtgaa aattttccagaagaaatggcaacatatcttcgttatgtaagaaggctagatttttttgaa aaaccagactatgactacttaagaaagctttttactgacttgtttgatcgaaaaggatat atgtttgattatgaatatgactggattggtaaacagttggttgtaagttctacaaatgga gagttaaacacagatgaccccaccgcaggacgttcaaatgcacccatcacagcccctact gaagtagaagtgatggatgaaaccaagtgctttaagggattttacaaaatggcattaatg gtacgctacttactgagggatgctacaggggagtacctgcagaccgaacctccagacgtg ccctggcatcagcaggaacgcatagcttctattagccccgccaagtctccattgcagtta aagccttggcaggcacagagcctgccagccgcacccccgcccccaccagcaccccacccc tgtgacaacactgccatgggagtgaaactaggcacagagaagagtgggcctttccccacc ctgagcagccatctccacctgggtgaatgctcatggaggaggccagcacccagcacccgc cagtaccctgccacagcggagaagaagtccaccctgctctgctgctgctgctgccactgc tgctggcacatgcaaatgagaacagatcctgctgctgccaccatacaaaaagctttgagt ggcaccatccatcagagtgctgtgacacagtggtcccggagcactgcggcactgccagtg cagtga >gi568815593f:123445664_123714393|GENSCAN_predicted_peptide_4|482_aa MEDNLDNIILNARIGKDFMIKMPKAIATKAKIDKCNLIKLKSFCTAKEAINEVDRQPTEW EKSFATCHLTKRSLGSVQATSYGARLVSSTARVYAGAGGSGSRISVSRSTSFQGGLESGG LAAGMAGGLAGIGGIQHEKETMQSLKDCLASYLDRTIQNLRIQIFANTVDNACIVLQIDN ARLAADDFRVKYETELAMRQSVENDIHGLRNVIDDTVTRLQLETEIEAVKEELLFMKKNH KEEAKSLQAQIASSGLMVEVDAPKSQDLAKVMADIRAQYDELAQKNGEELDKHWPQQIEE STTVVTTQSAEVGAAETTLTELRLLGDDLDSMRNLKASLESEEQPEGGGGPLRPADGAVQ WDPAAPGVRAGTDPGRGTAPGPGVGGPAERQDGEDFNFGDALDSSNSMQTIQKTTTTHRT VDGKVVSETHDTKVLRHSANRSRTEEGSSWFVVQLKRKCKICSISALHYQTIDFRESTEN VK >gi568815593f:123445664_123714393|GENSCAN_predicted_CDS_4|1449_bp atggaagacaacctagacaacatcattctgaatgcaagaataggcaaagatttcatgata aagatgccaaaagcaattgcaacgaaagcaaaaattgacaaatgtaatctaattaaacta aagagcttctgcacagccaaggaagctatcaacgaagtggacaggcaaccaacagaatgg gagaaaagttttgcaacgtgtcatctgacaaagcggtccctgggctctgtccaggcgacc agctacggcgcccggctggtcagcagcacagccagagtctatgcaggcgccgggggctct ggttcccggatctctgtgtcccgctccaccagcttccagggcggcttggagtccgggggc ctggccgcggggatggccgggggtctggcaggaataggaggcatccagcacgagaaggag accatgcaaagcctgaaagactgcctggcctcctacctggacagaaccatccagaacctg aggattcagatcttcgcaaatactgtggacaatgcctgcatagttctgcagattgacaat gcccgtcttgccgctgatgactttagagtcaagtatgagacagagctggccatgcgccag tctgtggagaacgacatccacgggctccgcaatgtcattgatgacactgtcactcggctg cagctggagacagagatcgaggctgtcaaggaggagctgctcttcatgaagaagaaccac aaagaggaagcaaaaagcctacaagcccagattgccagctctgggttgatggtggaggta gatgcccccaaatctcaggacctcgccaaggtcatggcagacatccgggcccaatatgac gagctggctcagaagaacggagaggagctggacaagcattggcctcagcagattgaagag agcaccacagtggtcaccactcagtccgccgaggttggagctgctgagacgacgctcacg gagctgagacttcttggagatgacctggactccatgagaaatctgaaggccagcttggaa tctgaagaacagcctgagggaggtggaggcccgctacgccctgcagatggagcagttcaa tgggatcctgctgcacctggagtcagagctggcacagacccgggcagagggacagcgcca ggcccaggagtaggaggacctgctgaacgtcaagatggcgaggacttcaattttggtgat gctctggacagcagcaactccatgcaaaccatccaaaagaccaccaccacccaccggaca gtggatggcaaagtggtgtctgagacccacgacaccaaagttctgaggcattcagccaac agaagcaggactgaagagggcagcagttggtttgtggtacagctaaaacgcaaatgtaaa atctgcagtatttctgcgctgcattaccagacaatagacttcagagaatccacagagaat gtaaagtga >gi568815593f:123445664_123714393|GENSCAN_predicted_peptide_5|530_aa MVKGLIQQEELTILNIYAPNTGAPRFIKQVLSDLQRDLDSHTIITGDFNTPLSTLDRSTR QKVNKDTQQLNSALHQADLIDIYRTLHPNSTEYTFFSAPHHTFSKIDHIVGSKALLSKCK RTETIINFLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDYWIHKEMKAEIKMFFETNE NKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELEKQEQTHSKASRRQ EITNIRAELKEIETQKTLQKINESRSWFFEKIKKIDRPLARLIKKKREKNQIDAIKNDKG DIATNPTEIQTTIREYYKHLYANKLENLEEMEKFLDTYTLPRLNQEEVESLNRPITGFEI EAIINSLPTKKSLGPDGFTAEFYQRYKEELLISNFSKVSGYTINVQKSQAFLYTINRQTE SQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNDIKEDTNKWKNIPCSWVGRI NIMKMAILPKGCLRNQGSTESSGEGHQWLLLSGQCLEQRITSSLFSFFTL >gi568815593f:123445664_123714393|GENSCAN_predicted_CDS_5|1593_bp atggtaaagggattaattcaacaagaagaactaactatcctaaatatatatgcacccaat acaggagcacccagattcataaagcaagtccttagtgacctacaaagagacttagactcc cacacaataataacaggagactttaacaccccactgtcaacattagacagatcaacgaga cagaaagttaacaaggatacccagcaattgaactcagctttgcaccaagcagacctaata gacatctacagaactctccaccccaattcaacagagtatacattcttttcagcaccacac cacaccttttccaaaattgaccacatagttggaagtaaagcactcctcagcaaatgtaaa agaacagaaactataataaactttctctcagaccacagtgcaatcaaactagaactcagg attaagaaactcactcaaaaccgctcaactacatggaaactgaacaacctgctcctgaat gactactggatacataaagaaatgaaggcagaaataaagatgttctttgaaaccaacgag aacaaagacacaacataccagaatctctgggacacattcaaagcagtgtgtagagggaaa ttcatagcactaaatgcccacaagagaaagcaggaaagatctaaaattgacaccctaaca tcacaattaaaagaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaa gaaataactaacatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaa atcaatgaatccaggagctggttttttgaaaagatcaagaaaattgatagaccactagca agactaataaagaagaaaagagagaagaatcaaatagacgcaataaaaaatgacaaaggg gatatcgccaccaatcccacagaaatacaaactaccatcagagaatactataaacacctc tacgcaaataaactagaaaacctagaagaaatggaaaaattcctcgacacatacactctc ccaagactaaaccaggaagaagttgaatctctgaatagaccaataacaggctttgaaatt gaggcaataattaatagcttaccaaccaaaaaaagtctaggaccagatggattcacagct gaattctaccagaggtacaaagaggagctgctgataagcaacttcagcaaagtctcagga tacacaatcaatgtgcaaaaatcacaagcattcttatacaccattaacagacaaacagaa agccaaatcatgagtgaactcccattcacaattgcttctaagagaataaaatacctagga atccaacttacaagggatgtgaaagacctcttcaaggagaactacaaaccactgctcaat gacataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatc aatatcatgaaaatggccatactgcccaagggttgtctcaggaaccaaggaagtacagag tcatcaggagaaggacaccagtggctattactgtcagggcagtgcctggaacaacgcatt acttcttccctcttcagcttcttcaccctatag >gi568815593f:123445664_123714393|GENSCAN_predicted_peptide_6|97_aa MGKKQSRKTRNSKNQSTSPPPKGHSSSPATEQSWTENDSDELREEGFRRSNYSELKEEVQ ANGKEAKNFEKKLDERITRITNAEKSLKDLMELKTKA >gi568815593f:123445664_123714393|GENSCAN_predicted_CDS_6|294_bp atggggaaaaaacagagcagaaaaaccagaaactctaaaaatcagagcacctctcctcct ccaaagggacacagctcctcaccagcaacagaacaaagctggacagagaacgactctgac gagttgagagaagaaggcttcagaaggtcaaactactccgagctaaaggaggaagttcaa gccaatggcaaagaagctaaaaactttgaaaaaaaattagacgaacggataactagaata accaatgcagagaagtccttaaaggacctgatggagctgaaaaccaaggcatga >gi568815593f:123445664_123714393|GENSCAN_predicted_peptide_7|113_aa MPGKAYWVQCGTILRTRPWVLDYYCNREINIFHGCAHQQKEAAQAEALPLAFQITCGKKV TKDPQSLVCGLVEENNPELGVKRQGFEVGLSVTKCENVGTLIIGVRAYLGNPG >gi568815593f:123445664_123714393|GENSCAN_predicted_CDS_7|342_bp atgcctggaaaggcttactgggtccagtgtggcaccatactaaggactcgaccctgggtc cttgattattactgcaacagagaaattaatatcttccatggctgtgctcatcaacagaaa gaagcagcacaagccgaggctctgcctctggccttccaaatcacatgtggtaagaaagtt accaaggatcctcagtcactggtatgtggtcttgtagaggaaaataaccctgaattggga gtcaagagacagggttttgaagttggtctctcagtcactaagtgtgaaaatgtaggcacg cttatcattggagttagggcttacctgggtaatccaggatga