GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:26:06 Sequence gi568815592r:111561742_111820051 : 258310 bp : 43.31% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.16 Intr - 1298 1224 75 2 0 96 92 23 0.085 3.21 1.15 Intr - 4819 4703 117 1 0 92 101 159 0.992 18.26 1.14 Intr - 5951 5883 69 2 0 103 95 54 0.968 6.98 1.13 Intr - 11242 11154 89 2 2 60 92 56 0.843 2.89 1.12 Intr - 14080 13902 179 0 2 101 85 0 0.808 0.56 1.11 Intr - 18648 18456 193 1 1 106 100 76 0.916 9.15 1.10 Intr - 30353 29517 837 0 0 95 103 314 0.117 24.96 1.09 Intr - 36314 36114 201 0 0 111 51 24 0.035 0.26 1.08 Intr - 43587 43379 209 2 2 76 25 68 0.049 -1.98 1.07 Intr - 44210 44035 176 2 2 90 115 59 0.600 7.54 1.06 Intr - 63424 63384 41 2 2 140 87 11 0.080 4.14 1.05 Intr - 68301 68166 136 0 1 38 20 104 0.019 -1.26 1.04 Intr - 68398 68321 78 1 0 107 -20 129 0.067 3.75 1.03 Intr - 75171 74099 1073 1 2 67 -33 347 0.269 10.45 1.02 Intr - 75384 75217 168 1 0 19 47 112 0.139 0.12 1.01 Init - 83337 83229 109 0 1 54 109 74 0.307 6.49 1.00 Prom - 99103 99064 40 -3.66 2.12 PlyA - 99575 99570 6 1.05 2.11 Term - 100206 99998 209 1 2 110 48 242 0.431 19.90 2.10 Intr - 112889 112758 132 0 0 89 110 106 0.964 13.52 2.09 Intr - 132787 132634 154 1 1 97 86 160 0.912 16.35 2.08 Intr - 132963 132887 77 1 2 78 96 60 0.999 5.03 2.07 Intr - 134715 134536 180 1 0 59 53 268 0.714 20.24 2.06 Intr - 138527 138363 165 0 0 84 59 46 0.598 1.23 2.05 Intr - 141293 141144 150 0 0 71 91 136 0.999 12.33 2.04 Intr - 142361 142258 104 1 2 53 116 86 0.999 7.72 2.03 Intr - 147416 147237 180 1 0 65 71 87 0.638 3.68 2.02 Intr - 152702 152606 97 0 1 95 99 91 0.998 9.97 2.01 Init - 158310 158064 247 0 1 50 115 289 0.978 25.56 2.00 Prom - 165145 165106 40 -4.16 3.00 Prom + 165283 165322 40 -5.16 3.01 Init + 169024 169168 145 0 1 47 86 45 0.049 0.49 3.02 Intr + 174634 174781 148 2 1 40 95 66 0.739 1.79 3.03 Intr + 176631 176735 105 0 0 111 47 67 0.895 4.23 3.04 Term + 177934 177988 55 1 1 136 43 38 0.532 1.13 3.05 PlyA + 178425 178430 6 1.05 4.00 Prom + 194804 194843 40 -3.56 4.01 Init + 195874 195930 57 0 0 78 80 47 0.646 4.21 4.02 Intr + 197359 197394 36 0 0 95 94 3 0.356 0.06 4.03 Intr + 201110 201149 40 1 1 102 80 23 0.240 0.70 4.04 Intr + 205485 205513 29 1 2 67 110 20 0.053 -0.07 4.05 Intr + 212006 212029 24 1 0 112 82 26 0.068 2.62 4.06 Term + 222018 222137 120 2 0 65 54 57 0.024 -1.53 4.07 PlyA + 223044 223049 6 1.05 5.03 PlyA - 223941 223936 6 1.05 5.02 Term - 244261 244173 89 2 2 101 54 80 0.548 3.72 5.01 Init - 256931 256874 58 2 1 38 78 66 0.190 2.07 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 68398 68315 84 1 0 107 39 137 0.873 8.35 S.002 Init - 114783 114678 106 0 1 36 85 47 0.811 -0.42 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:111561742_111820051|GENSCAN_predicted_peptide_1|1250_aa MTQLLRKQVADTTGTCELGTAMKLDWGSGDLTPVTPDAEKAFDKIQQPFMLKTLNKLGID GTYLKIIRAIYDKPTANIILNGKKLEAFPSKTVLEVLARAIRQEKEIKDIQSGKEEVKLS LFADDMIVYLENPIVSAQNLLKLISNFSNVSGYQINVQKSQAFLYTNKRQTESQIMSELP FTVASKRIKYLGIQLTRDVKELFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIVKRAIL PKIIYRFNVIPFKLPKTFFTELEKTTLKFMWNQKRAHIAKTILSQKNKAGGITLSDFKLY YKTTVTKTAWYWYQNRDIDQWNRTEPSEIISHIYNHLIFDKPEKNKQWGKDSLFNKWCWE NWLAICRELKLDPFLTPYTKINSRWIKHLNVRPKTIKTLEENLGNTIQDIGMGKDFMSKT PKAMATKAKIDKWDLIKLKSFCTAKETTIRGTTIPENSFGAFYWIPYLHLAISELMGPDL EMAFCSNSIDQNLLHAPPSSKKSGEYFLFVCLKAEMELVATSLSRDMYTEGCCLKGDFEQ RLPRLRRNLHTASTSCFRACSSTYLGPEKVEGDEKPPRADYPPGPVCLSVVDLRGEKNQQ HIFGPGSSAELCAGCEPPPGPSPQPPSSPPSLALDGAFGNKGEKCQNLILEANQASCDFQ VQDGFADEPEMTQGTASMGAPRVLVGRSPVPLRSQCLLPGSWLENRAFPLQSRHRPLVET GAKVMPELQAQTRMNRSIPVEVDESEPYPSQLLKPIPEYSPEEESEPPAPNIRNMAPNSL SAPTMLHNSSGDFSQAHSTLKLANHQRPVSRQVTCLRTQVLEDSEDSFCRRHPGLGKAFP SGCSAVSEPASESVVGALPAEHQFSFMEKRNQWLVSQLSAASPDTGHDSDKSDQSLPNAS ADSLGGSQEMVQRPQPHRNRAGLDLPTIDTGYDSQPQDVLGIRQLERPLPLTSVCYPQDL PRPLRSREFPQFEPQRYPACAQMLPPNLSPHAPWNYHYHCPGSPDHQVPYGHDYPRAAYQ QVIQPALPGQPLPGASVRGLHPVQKVILNYPSPWDHEERPAQRDCSFPGLPRHQDQPHHQ PPNRAGAPGESLECPAELRPQVPQPPSPAAVPRPPSNPPARGTLKTSNLPEELRKVFITY SMDTAMEVVKFVNFLLVNGFQTAIDIFEDRIRGIDIIKWMERYLRDKTVMIIVAISPKYK QDVEGAESQLDEDEHGLHTKYIHRMMQIEFIKQGSMNFRFIPVLFPNAKK >gi568815592r:111561742_111820051|GENSCAN_predicted_CDS_1|3750_bp atgactcaactgctgagaaagcaagtcgctgatacaacagggacttgtgagctgggaaca gccatgaaactggactggggatctggggacctgactcctgtcacaccagatgcagaaaag gcctttgacaaaattcaacagcccttcatgctaaaaactctcaataaactaggtattgat gggacgtatctcaaaataataagagctatttatgacaaacccacagccaatatcatactg aatgggaaaaaactggaagcattcccttcgaaaactgtgttggaagttctggccagggca atcaggcaggagaaagaaataaaggatattcaatcaggaaaagaggaagtcaaattgtcc ctatttgcagatgacatgattgtatatttagaaaaccccatcgtctcagcccaaaatctc cttaagctgataagcaacttcagcaatgtctcaggataccaaatcaatgtgcaaaaatca caagcattcttatacaccaataagagacaaacagagagccaaatcatgagtgaactccca ttcacagttgcttcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaa gaactcttcaaggagaactacaaaccactgctcaatgaaataaaagaggacacaaacaaa tggaagaacattccatgttcatggataggaagaatcaatatcgtgaaaagggccatactg cccaagataatttatagattcaatgtcatccccttcaagctaccaaagactttcttcaca gaattggaaaaaactactttaaagttcatgtggaaccaaaaaagagcccacattgccaag acaatcctaagccaaaagaacaaagctggaggcatcacactatctgacttcaagctatac tacaagactacagtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaa tggaacagaacagagccctcagaaataatatcacacatctacaaccatctgatctttgac aaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatggtgctgggaa aactggctagccatatgtagagagctgaaactggatcccttccttacaccttatacaaaa attaattcaagatggattaaacacttaaatgttagacctaaaaccataaaaaccctagaa gaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatgtctaaaaca ccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactaaagagc ttctgcacagcaaaagaaactaccatcagaggcaccaccatccccgagaactcctttgga gccttctactggatcccctaccttcatctggccatcagtgaactgatggggccagatctg gaaatggctttctgctcaaattcaattgaccagaacttgttgcatgccccacctagcagc aagaagtctggggaatattttctcttcgtgtgcttgaaagcagaaatggaattggtggca actagtctctccagggacatgtacacggaagggtgttgcctgaagggagattttgagcag aggcttcctaggctccgtagaaatttgcatacagcttccacttcctgcttcagagcctgt tcttctacttacctgggcccggagaaggtggagggagacgagaagccgccgagagccgac taccctccgggcccagtctgtctgtccgtggtggatctaaggggggaaaaaaatcaacag cacatttttgggccaggaagttctgctgagctctgtgctggctgtgagccgcctcccggc ccctctccgcagccgccctccagccctcccagcctggcgctggacggagcctttgggaac aagggggagaaatgccagaacttgatcttagaagctaatcaagcttcttgtgactttcaa gttcaagatggatttgctgatgagcctgagatgacccagggcactgccagcatgggggct cccagggtcctagttggcaggtctcctgttcccctgaggagtcagtgcttgctccctggg tcatggctggaaaacagggcttttccacttcaatccaggcaccgccctctggtagagact ggagcaaaggtcatgccagagttacaagcacaaactagaatgaaccgaagcattcctgtg gaggttgatgaatcagaaccatacccaagtcagttgctgaaaccaatcccagaatattcc ccggaagaggaatcagaaccacctgctccaaatataaggaacatggcacccaacagcttg tctgcacccacaatgcttcacaattcctccggagacttttctcaagctcactcaaccctg aaacttgcaaatcaccagcggcctgtatcccggcaggtcacctgcctgcgcactcaagtt ctggaggacagtgaagacagtttctgcaggagacacccaggcctgggcaaagctttccct tctgggtgctctgcagtcagcgagcctgcgtctgagtctgtggttggagccctccctgca gagcatcagttttcatttatggaaaaacgtaatcaatggctggtatctcagctttcagcg gcttctcctgacactggccatgactcagacaaatcagaccaaagtttacctaatgcctca gcagactccttgggcggtagccaggagatggtgcaacggccccagcctcacaggaaccga gcaggcctggatctgccaaccatagacacgggatatgattcccagccccaggatgtcctg ggcatcaggcagctggaaaggcccctgcccctcacctccgtgtgttacccccaggacctc cccagacctctcaggtccagggagttccctcagtttgaacctcagaggtatccagcatgt gcacagatgctgcctcccaatctttccccacatgctccatggaactatcattaccattgt cctggaagtcccgatcaccaggtgccatatggccatgactaccctcgagcagcctaccag caagtgatccagccggctctgcctgggcagcccctgcctggagccagtgtgagaggcctg caccctgtgcagaaggttatcctgaattatcccagcccctgggaccacgaagagaggccc gcacagagagactgctcctttccggggcttccaaggcaccaggaccagccacatcaccag ccacctaatagagctggtgctcctggggagtccttggagtgccctgcagagctgagacca caggttccccagcctccgtccccagctgctgtgcctagaccccctagcaaccctccagcc agaggaactctaaaaacaagcaatttgccagaagaattgcggaaagtctttatcacttat tcgatggacacagctatggaggtggtgaaattcgtgaactttttgttggtaaatggcttc caaactgcaattgacatatttgaggatagaatccgaggcattgatatcattaaatggatg gagcgctaccttagggataagaccgtgatgataatcgtagcaatcagccccaaatacaaa caggacgtggaaggcgctgagtcgcagctggacgaggatgagcatggcttacatactaag tacattcatcgaatgatgcagattgagttcataaaacaaggaagcatgaatttcagattc atccctgtgctcttcccaaatgctaagaag >gi568815592r:111561742_111820051|GENSCAN_predicted_peptide_2|564_aa MGCVQCKDKEATKLTEERDGSLNQSSGYRYGTDPTPQHYPSFGVTSIPNYNNFHAAGGQG LTVFGGVNSSSHTGTLRTRGGTGVTLFVALYDYEARTEDDLSFHKGEKFQILNSSTKKGG KEGPEPQEIRFAGRSDLLEGNHVVDCRLVEGSADTQWMSEPQRHIHGLPDVNGKRWYFGK LGRKDAERQLLSFGNPRGTFLIRESETTKGAYSLSIRDWDDMKGDHVKHYKIRKLDNGGY YITTRAQFETLQQLVQHYSERAAGLCCRLVVPCHKGMPRLTDLSVKTKDVWEIPRESLQL IKRLGNGQFGEVWMGTWNGNTKVAIKTLKPGTMSPESFLEEAQIMKKLKHDKLVQLYAVV SEEPIYIVTEYMNKGSLLDFLKDGEGRALKLPNLVDMAAQVAAGMAYIERMNYIHRDLRS ANILVGNGLICKIADFGLARLIEDNEYTARQGAKFPIKWTAPEAALYGRFTIKSDVWSFG ILLTELVTKGRVPYPGMNNREVLEQVERGYRMPCPQDCPISLHELMIHCWKKDPEERPTF EYLQSFLEDYFTATEPQYQPGENL >gi568815592r:111561742_111820051|GENSCAN_predicted_CDS_2|1695_bp atgggctgtgtgcaatgtaaggataaagaagcaacaaaactgacggaggagagggacggc agcctgaaccagagctctgggtaccgctatggcacagaccccacccctcagcactacccc agcttcggtgtgacctccatccccaactacaacaacttccacgcagccgggggccaagga ctcaccgtctttggaggtgtgaactcttcgtctcatacggggaccttgcgtacgagagga ggaacaggagtgacactctttgtggccctttatgactatgaagcacggacagaagatgac ctgagttttcacaaaggagaaaaatttcaaatattgaacagctcgaccaaaaaaggagga aaagaaggtcctgagccccaggagatcagatttgctgggaggtctgatttgcttgagggc aatcatgtcgttgactgcaggctggtggaaggttctgcagatactcagtggatgtcagag ccgcagcgccacatccatggcctccccgatgttaatgggaagaggtggtactttggaaaa cttggccgaaaagatgctgagcgacagctattgtcctttggaaacccaagaggtaccttt cttatccgcgagagtgaaaccaccaaaggtgcctattcactttctatccgtgattgggat gatatgaaaggagaccatgtcaaacattataaaattcgcaaacttgacaatggtggatac tacattaccacccgggcccagtttgaaacacttcagcagcttgtacaacattactcagag agagctgcaggtctctgctgccgcctagtagttccctgtcacaaagggatgccaaggctt accgatctgtctgtcaaaaccaaagatgtctgggaaatccctcgagaatccctgcagttg atcaagagactgggaaatgggcagtttggggaagtatggatgggtacctggaatggaaac acaaaagtagccataaagactcttaaaccaggcacaatgtcccccgaatcattccttgag gaagcgcagatcatgaagaagctgaagcacgacaagctggtccagctctatgcagtggtg tctgaggagcccatctacatcgtcaccgagtatatgaacaaaggaagtttactggatttc ttaaaagatggagaaggaagagctctgaaattaccaaatcttgtggacatggcagcacag gtggctgcaggaatggcttacatcgagcgcatgaattatatccatagagatctgcgatca gcaaacattctagtggggaatggactcatatgcaagattgctgacttcggattggcccga ttgatagaagacaatgagtacacagcaagacaaggtgcaaagttccccatcaagtggacg gcccccgaggcagccctgtacgggaggttcacaatcaagtctgacgtgtggtcttttgga atcttactcacagagctggtcaccaaaggaagagtgccatacccaggcatgaacaaccgg gaggtgctggagcaggtggagcgaggctacaggatgccctgcccgcaggactgccccatc tctctgcatgagctcatgatccactgctggaaaaaggaccctgaagaacgccccactttt gagtacttgcagagcttcctggaagactactttaccgcgacagagccccagtaccaacct ggtgaaaacctgtaa >gi568815592r:111561742_111820051|GENSCAN_predicted_peptide_3|150_aa MFLPDVDIQSSIFLPSLPSFYKNNQPEKKRKEKQKRGLLFMTSKEKQEGLEQSEDVNSCS STSKSHQSAALAEALCPSTKQNWPVPESSDGAALEDSRGLGLKAYNLNSHLFLESGAPKA HGCSQAKLLTGEGVSEVYKAMIKMKGFYFE >gi568815592r:111561742_111820051|GENSCAN_predicted_CDS_3|453_bp atgtttcttcctgatgttgatatccaatcaagtatcttcctaccctccttaccctcattc tataaaaacaatcaaccagagaaaaagagaaaagaaaaacaaaaacgtgggcttctattc atgaccagcaaagagaaacaggaaggcttagaacagtcagaggatgtaaactcttgctca tccaccagcaagagtcatcaatcggcagctctggcagaagcactttgtcccagcactaag cagaactggcctgtccctgagagctccgatggagctgccttagaagattccaggggtctt ggactgaaggcttacaacctcaactcacatctgtttttggagtcaggtgcccccaaggct catggctgcagccaggcaaagctgctcactggagagggggtgtcagaagtgtacaaagcc atgattaagatgaagggtttctattttgagtag >gi568815592r:111561742_111820051|GENSCAN_predicted_peptide_4|101_aa MASHIRKEMDAMKGDIGQELLGLYPTLDHLQVTKKVLAFKWLDTGILKNKSKNQNPFNIY IQCSWAHGSTPSWEWGRDSPRFCSVLTCNIRINQELVRAVS >gi568815592r:111561742_111820051|GENSCAN_predicted_CDS_4|306_bp atggcatcccacatcaggaaggagatggatgctatgaagggagacattggtcaagagctc cttggcttatacccaacacttgatcatttgcaggtaactaaaaaagtcctcgcattcaaa tggctggacacgggtatattgaaaaacaagagtaaaaatcagaatcccttcaacatctat attcagtgctcatgggcacatggcagcacgcccagctgggagtgggggcgggacagtcca aggttttgcagtgtgctgacctgcaacattcgcatcaaccaggagcttgtaagagctgtc agctga >gi568815592r:111561742_111820051|GENSCAN_predicted_peptide_5|48_aa MTAASEKASQQWQQQGNRPGSSTASRAVVGSEEKEKTRKSRNTLEELL >gi568815592r:111561742_111820051|GENSCAN_predicted_CDS_5|147_bp atgacagcagcatcggagaaagctagtcagcagtggcagcagcaaggcaacagaccaggg tcttcaactgcctcaagagctgttgtgggctcagaagaaaaggaaaagactaggaaaagc agaaacacgctggaggaactactgtga