GENSCAN 1.0 Date run: 3-Nov-116 Time: 18:22:05 Sequence gi568815588r:44275889_44485005 : 209117 bp : 45.73% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 1044 1083 40 0.34 1.01 Init + 17184 17241 58 2 1 69 90 93 0.524 7.07 1.02 Intr + 30902 30988 87 0 0 67 89 68 0.675 4.74 1.03 Intr + 33267 33337 71 1 2 111 87 110 0.984 12.10 1.04 Intr + 34907 35011 105 1 0 125 100 41 0.936 9.41 1.05 Intr + 37407 37484 78 2 0 61 62 61 0.094 0.55 1.06 Intr + 39949 40237 289 0 1 20 103 114 0.071 2.92 1.07 Intr + 40920 41051 132 1 0 83 65 34 0.208 1.22 1.08 Term + 56626 56750 125 2 2 75 41 86 0.044 1.15 1.09 PlyA + 57193 57198 6 1.05 2.07 PlyA - 57673 57668 6 1.05 2.06 Term - 62713 62444 270 1 0 91 41 112 0.322 2.28 2.05 Intr - 70250 70195 56 0 2 68 77 33 0.471 -1.10 2.04 Intr - 71764 71660 105 2 0 88 100 32 0.439 4.59 2.03 Intr - 72249 72211 39 1 0 93 94 15 0.417 0.80 2.02 Intr - 72549 72524 26 2 2 39 94 26 0.354 -4.03 2.01 Init - 73063 72975 89 1 2 97 110 59 0.967 9.12 2.00 Prom - 73354 73315 40 -5.36 3.07 PlyA - 73714 73709 6 1.05 3.06 Term - 78409 78204 206 2 2 82 51 82 0.251 1.43 3.05 Intr - 85939 85802 138 2 0 130 92 -17 0.320 3.34 3.04 Intr - 97933 97779 155 0 2 68 102 61 0.513 5.22 3.03 Intr - 102835 102749 87 0 0 90 82 132 0.901 11.89 3.02 Intr - 104992 104875 118 2 1 93 75 88 0.999 7.52 3.01 Init - 109117 109057 61 1 1 92 127 173 0.560 21.01 3.00 Prom - 126049 126010 40 -6.66 4.05 PlyA - 126178 126173 6 1.05 4.04 Term - 131346 130557 790 2 1 67 44 208 0.041 7.09 4.03 Intr - 134880 134802 79 1 1 90 71 80 0.022 5.11 4.02 Intr - 139029 138678 352 0 1 16 25 231 0.002 4.50 4.01 Init - 139265 139080 186 2 0 76 77 107 0.779 7.46 4.00 Prom - 139884 139845 40 0.24 5.07 PlyA - 139900 139895 6 1.05 5.06 Term - 144831 144676 156 0 0 48 49 128 0.294 2.83 5.05 Intr - 162824 162666 159 2 0 86 53 46 0.017 1.08 5.04 Intr - 164461 164408 54 1 0 86 89 44 0.060 3.48 5.03 Intr - 165981 165734 248 1 2 67 78 48 0.005 -1.12 5.02 Intr - 174624 174527 98 2 2 36 40 130 0.057 2.65 5.01 Init - 177461 177316 146 2 2 77 98 99 0.229 9.39 5.00 Prom - 179802 179763 40 -1.26 6.00 Prom + 181606 181645 40 -8.36 6.01 Init + 182744 182779 36 1 0 63 90 42 0.512 1.92 6.02 Intr + 188358 188413 56 2 2 137 103 -1 0.602 4.08 6.03 Intr + 189086 189158 73 2 1 65 18 86 0.174 -1.29 6.04 Intr + 192854 192969 116 1 2 27 99 92 0.210 3.45 6.05 Intr + 197031 197141 111 0 0 115 99 29 0.629 6.19 6.06 Intr + 200457 200562 106 0 1 105 70 41 0.805 4.22 6.07 Term + 201610 201711 102 0 0 54 55 103 0.606 1.88 6.08 PlyA + 202083 202088 6 1.05 7.00 Prom + 204319 204358 40 -9.46 7.01 Sngl + 205559 205855 297 1 0 41 54 282 0.711 15.85 7.02 PlyA + 206344 206349 6 1.05 8.02 PlyA - 208174 208169 6 1.05 8.01 Term - 208959 208307 653 1 2 67 38 254 0.459 12.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 165550 165188 363 1 0 86 44 161 0.864 7.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:44275889_44485005|GENSCAN_predicted_peptide_1|314_aa MGKREGGAGRGASGGPRVGGCTGSVVLAPASDEGLRKLTIMAESEGGAGSSLHGSAIGKW NGKAYAAVLLSQVEDERVCAGPAWERQAGQERPARKVQGKAPEAEGKMRKTQVWHAQATR SNEHAVMQQPESQHGCLLDILVCTYLAEKGLSGSCGLTEALVAFGEGSSLGRKNCRDPWG SLRQEPHWSDPRISSCLVAMLKQEGNFFLLGVKEWCDYQQDHYSYLHFSAPPTSPGPCVP AQLTLLLVSTVPFRTSNLVFGPCFSEPWPLQGGVALGTGGWGEVTFRDATEGKAKGTIQA ERGPGQSLAVTFSG >gi568815588r:44275889_44485005|GENSCAN_predicted_CDS_1|945_bp atggggaagcgagaagggggggcggggaggggggcctcgggaggaccgcgggtgggaggc tgtacaggaagtgtggtgctagcacctgcttctgatgagggcctcaggaagcttacaatc atggcagaaagtgaaggtggagcaggctccagtttacatggctctgccattgggaagtgg aatggcaaagcatatgcggccgtgctgctcagccaggtcgaggatgagcgggtgtgcgca ggcccagcgtgggagaggcaggcaggccaggagcggcccgccagaaaggtgcaagggaaa gccccagaggcggaggggaagatgagaaagacccaggtctggcatgctcaagccactcgt tcaaatgaacatgctgttatgcagcagccagaaagccagcacggatgccttctggacatc ctggtctgcacgtacctagccgagaaggggctctctggcagctgtgggctgacagaggca ctggtggcctttggggagggcagcagtttaggaagaaagaactgcagagacccctggggc tccctcaggcaggagcctcactggtcagaccctcggatctcttcatgtctagtggccatg ctgaagcaggaaggcaacttcttcctcttaggagtgaaagagtggtgtgattaccagcaa gatcattattcttatctccatttctcagcccctcccaccagccctggtccctgtgtgcca gcacaactcactttgctgctggtgagcacggtccccttcaggacctctaacctggtcttt ggaccctgcttctcggaaccatggcccttacaaggtggagtagccctaggcactggggga tggggtgaagtgaccttccgggatgctaccgagggaaaagccaaagggactattcaggca gagaggggaccagggcagtctttggcggtcacgttctcaggctga >gi568815588r:44275889_44485005|GENSCAN_predicted_peptide_2|194_aa MKSTSKRCMQKGNSDYVGSCQDVMHVSNEGQQQVLPNREAFLQGMELSGSAESSFPLPAP IQQVFDFQAGPRDELLGDTTLSPESQAEEKLQLLGQLLERGKGPVMLFQFSESQLSKRSI CQTTNSVNTPNIQICQQLASFDSLEYVAVLTECDDGHGYLRSISFVLLWGSWRYQSIEDF SDGGKFSNRGLSGG >gi568815588r:44275889_44485005|GENSCAN_predicted_CDS_2|585_bp atgaagtctacatccaagcggtgcatgcagaaaggaaacagtgactacgtaggatcatgc caggacgtgatgcatgtatcgaatgaagggcagcagcaagtattgccaaacagagaggcc tttctccagggcatggagctctcagggagtgcagaatcttctttccctctccctgctccc atccagcaggtctttgacttccaagctgggccaagggatgagctgctgggagatacgacc ctcagtccagagagtcaagctgaggagaaactgcagctcctgggccagttattagaaaga ggaaagggcccagtgatgctcttccaattcagtgaaagccaattaagtaagcgatcaatt tgccaaacgactaattcagtgaacacaccaaatatacagatttgtcaacaacttgcttct tttgactctctagagtatgtggcagttcttactgaatgtgatgatggtcacggatattta aggagtatttcatttgtgctgctgtggggatcatggaggtaccagagcatagaggacttt tctgatggaggaaaattcagcaacagaggcctgtctggaggctga >gi568815588r:44275889_44485005|GENSCAN_predicted_peptide_3|254_aa MNAKVVVVLVLVLTALCLSDGKPVSLSYRCPCRFFESHVARANVKHLKILNTPNCALQIV ARLKNNNRQVCIDPKLKWIQEYLEKALNNSPTWGHLRWHGPWLRGPEGEGLLEATKVPLG TAEARMQGSSILSLSPTPVPGKPGEAPGSCCTSSLDVSSLHPNLELLSGASESLAWPGAP HTPCKPEPILPALCDMTLKPRVAKPFINVFITDEGNLQSCHMQPPQFPYAQRLDHLMPRK DMSSDKVPRMLTAH >gi568815588r:44275889_44485005|GENSCAN_predicted_CDS_3|765_bp atgaacgccaaggtcgtggtcgtgctggtcctcgtgctgaccgcgctctgcctcagcgac gggaagcccgtcagcctgagctacagatgcccatgccgattcttcgaaagccatgttgcc agagccaacgtcaagcatctcaaaattctcaacactccaaactgtgcccttcagattgta gcccggctgaagaacaacaacagacaagtgtgcattgacccgaagctaaagtggattcag gagtacctggagaaagctttaaacaacagtcccacctggggccacctacggtggcatggc ccctggctgagaggccccgagggcgaagggttactggaagccacgaaagtgcctcttggg acagccgaggccaggatgcagggcagcagcatcctgagcctcagccccacgccggtgccg ggaaagcctggagaagccccaggttcttgctgtacttcctccctggatgtctcttctctt cacccaaacttggagctgctttctggagcatctgaatccctggcctggcctggggctcct catactccctgtaaaccagagcctattctgcctgccctctgcgatatgaccctgaagcct agggtggcaaagccttttatcaatgtctttatcactgatgagggaaatcttcagtcttgc cacatgcaacccccacagttcccatatgcccagaggctggatcacctgatgccaaggaag gacatgtcctctgataaggtccccagaatgctgactgcccactga >gi568815588r:44275889_44485005|GENSCAN_predicted_peptide_4|468_aa MKTIRSNQTVDIPENVITLKGCKVIVKGPRGTLWRDFNHISIELSLLVKKKKRLRLNNTC GENMIKGVTLGFHYKMRSVYAHFPINIVTRENGSFVEIRNFLGEKYIRRVRMRPGVACSV CQTQKHELILEGNDIELVSNSAALIQQATTVKNKHIRKFLDGIYVSEKETAQHIRKFLDG SQLLASKGQNLMENEFDKLTEVGFRRVGKTTLNFIWNQKRALIAKTILGKKNKAGGIMLP DFNLYYKATVTKTAWYWYQNRYIDQWNRMEASEITPHIYHHLTFDKPDAHKQWGKDSLFN KWCWENWLAICRKLKLDPFFTPYTKINSRWIKDLDIRPKIIKILEENLGNTILDVGMGKG FMSKTQKAMATKAKIDKWDLIQVKNICTAKETIIGVNRQPTEWQKIFAIYPSDKGLISRI CKELKQIYKKKANNPNKKMSKGYEQTLLKRRHLCNQHTYEKMLNITDH >gi568815588r:44275889_44485005|GENSCAN_predicted_CDS_4|1407_bp atgaagaccatccgcagcaatcagactgttgatattccggaaaatgtcattactctgaag ggatgcaaagttattgtgaagggccccagaggaactctgtggagggacttcaatcacatc agtatagaactcagtcttcttgtaaagaaaaaaaagaggctccggcttaacaacacttgt ggtgagaacatgatcaagggtgttacactgggcttccattacaagatgagatctgtgtat gctcacttccccatcaacatcgttacccgggagaatgggtcttttgttgaaatccgaaat ttcttgggtgaaaaatacatccgcagggttaggatgagaccaggtgttgcttgttcagta tgtcaaacccagaaacatgaattaatccttgaaggaaatgacattgagcttgtttcaaat tcagcggctttgattcagcaagcaacaacagttaaaaacaagcatatcaggaaatttttg gatggtatctatgtctctgaaaaagaaactgctcagcatatcaggaaatttttggatgga tcacagctccttgccagcaagggacaaaacttgatggagaatgagtttgacaagttgaca gaagtaggcttcagaagagttggaaaaactactttaaacttcatttggaatcaaaaaaga gccctcatagccaagacaatcctgggcaagaagaacaaagctggaggcatcatgctacct gacttcaatctatactacaaggctacagtaacaaaaacagcatggtactggtaccaaaac agatacatagaccaatggaacagaatggaggcctcagaaataacaccacacatctaccac catttgacctttgacaaacctgatgcacacaagcaatggggaaaagattccctatttaat aaatggtgttgggaaaactggctagccatatgcagaaaactgaaactggaccccttcttt acaccttatacaaaaattaactcaagatggatcaaagacttagacataagacctaaaatc ataaaaatcctagaagaaaacctgggcaataccattctggatgtaggcatgggcaaaggc ttcatgtctaaaacacaaaaagcaatggcaacaaaagccaaaattgacaaatgggatcta attcaagtaaagaacatctgcacagcaaaggaaactatcatcggagtgaataggcaacct acagaatggcagaaaatttttgcaatctatccatctgacaaagggctaatatccagaatc tgcaaagaacttaaacaaatttacaagaaaaaagcaaacaaccccaacaaaaaaatgagc aaaggatatgaacagacacttctcaaaagaagacatttatgtaaccaacatacatatgaa aaaatgctcaacatcactgatcattag >gi568815588r:44275889_44485005|GENSCAN_predicted_peptide_5|286_aa MGRNQHKKTENSKNQNASSSPKDHNSSPAREQNWMENEFDELTEVGFRRTNDKSHKIISV DEVKAFEKIQQPFMLKTLKKLVLEVLAREVRQEKERKGIQLGNEEVKLSLFADDMIVYLE HPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIAPSCELSFAENQFLSD TVKRNCNLEGEKAKCIFVHSAEIARPSSRTLLGDTDEPCRALTSGPLNRRLIPGKGTVCN TAANASFINFLRDNYFRQYSKMEYNHHLSSSSQNICWDKDISCKAT >gi568815588r:44275889_44485005|GENSCAN_predicted_CDS_5|861_bp atggggagaaaccagcacaaaaagactgaaaattccaaaaaccagaacgcctcttcttct ccaaaggatcacaactcctcaccagcaagagaacaaaactggatggagaatgagtttgac gaattgacagaagtgggcttcagaagaaccaatgacaaaagccacaagattatctcagta gatgaagtaaaggcctttgaaaaaattcaacagcccttcatgctaaaaacgctcaagaaa ctagtgttggaagttctggccagagaagtcaggcaagagaaagaaagaaagggtattcaa ttaggaaatgaggaggtcaaattgtccctgtttgcggatgacatgattgtatatctagaa caccccattgtctcagcccaaaatctccttaagctgataagcaacttcagcaaagtctca ggatacaaaatcaatgtacaaaaatcacaggcattcttatacaccaataacagacaaaca gagagccaaatcgcaccatcatgtgagctctcctttgctgagaaccagttcttgtctgac actgtgaaaagaaactgcaacttggaaggagaaaaggcaaaatgtatttttgtgcattca gcagaaatagccaggccaagcagccggacacttttaggtgacacagatgaaccctgcagg gcgctgacgtcaggccccctcaaccgaagactcattccagggaagggcactgtgtgcaac acagcagccaatgcaagttttattaacttcctccgtgataactacttcagacaatattcc aaaatggaatacaatcaccacctctccagtagttctcagaatatctgctgggataaagac atctcctgcaaagctacatga >gi568815588r:44275889_44485005|GENSCAN_predicted_peptide_6|199_aa MSGKGRVLVEDETLATTILLPASVNLIRYLISTVYLQNTAQLKVKKEETVKDDEQGVEVF PAQPLTDGYAFGTLQIARLSGDPTRGSGITSWWSGCQPPGGGLQKNKPPKQVSILTVLLP TLAHLCGEQARPSRIDSCGCSVCTVYPVFPNGSPNLTSAPVPGTPPVLHGSDPVDKFTSF ACTVDGDCVQRELAGPFGF >gi568815588r:44275889_44485005|GENSCAN_predicted_CDS_6|600_bp atgtcagggaaaggccgtgtcctggtggaggatgagaccctggcaaccaccattctcctt cctgcatctgtgaatttgattaggtacctcatctcaactgtatatttgcagaacacagca caattaaaagtcaagaaggaggagacagtaaaggacgatgagcaaggagtggaagtgttt cctgcccaaccacttactgacgggtacgcctttggaactctgcagattgctcggctctct ggggatcccacgagaggctctggaattacctcctggtggagtgggtgccagcccccggga ggagggcttcagaaaaacaagcccccaaaacaagtctccattcttactgtccttctcccc actctggcgcacctgtgtggagagcaggccaggccatcacgcattgactcatgtggttgt tcagtctgcaccgtctacccagtatttcctaatggcagccccaatctgacatcggctcca gtcccagggacacctccggtactacatgggagcgacccagtggacaaatttacatcattt gcctgcactgtggatggtgactgtgtgcagagagaactggctggtccttttggattctga >gi568815588r:44275889_44485005|GENSCAN_predicted_peptide_7|98_aa MGNDFDELTEVGFRRSVITNFSELKEDVRTHCKEAKNLEKRLGEWLTRINNVEKTLNDLT ELKSMEQELRDARTGFNSQFDQVEERVSVTEDQINEIK >gi568815588r:44275889_44485005|GENSCAN_predicted_CDS_7|297_bp atgggaaatgactttgatgagttgacagaagtaggcttcagaaggtcggtaataacaaac ttctccgagctaaaggaggatgttagaacccattgcaaagaagctaaaaaccttgaaaaa agattaggtgaatggctaactagaataaacaacgtagagaagaccttaaatgacctgacg gagctaaaatccatggaacaagaacttcgtgatgcacgcacaggcttcaatagccaattc gatcaagtggaagaaagggtatcagtgactgaagatcaaattaatgagataaagtga >gi568815588r:44275889_44485005|GENSCAN_predicted_peptide_8|217_aa XLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIISAQNLLKQISNFSTVSG YKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLK EIKEDTNKWKNIPCSWVGRINIVKMATLPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWN QKRARIAKSILSQKNKAGGITLPDFKLYYKVTVTKTA >gi568815588r:44275889_44485005|GENSCAN_predicted_CDS_8|654_bp ntgttggaagttctggccagggcaattaggcaggagaaggaaataaagggtattcaatta ggaaaagaggaagtcaaattgtccctgtttgcagacgacatgattgtatatctagaaaac cccatcatctcagcccaaaatctccttaagcagataagcaacttcagcacagtctcagga tacaaaatcaatgtacaaaaatcacaagcattcttatacaccaacaacagacaaacagag agccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctagga atccaacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaag gaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatc aatattgtgaaaatggccacactgcccaaggtaatttacagattcaatgccatccccatc aagctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaac caaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatc acactacctgacttcaaactatactacaaggttacagtaaccaaaacagcatag