GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:15:59 Sequence gi568815587r:85874708_86168780 : 294073 bp : 39.60% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7806 7968 163 2 1 65 45 169 0.891 8.93 1.02 Intr + 11493 11660 168 1 0 58 82 214 0.857 16.80 1.03 Intr + 12720 12747 28 1 1 65 92 12 0.754 -4.44 1.04 Term + 16456 16654 199 1 1 117 43 154 0.899 9.69 1.05 PlyA + 16748 16753 6 1.05 2.00 Prom + 38343 38382 40 -3.65 2.01 Init + 40183 40225 43 0 1 104 53 31 0.484 1.85 2.02 Intr + 41321 41526 206 0 2 30 116 212 0.851 16.20 2.03 Intr + 50744 50931 188 1 2 11 71 122 0.057 0.37 2.04 Intr + 51013 51171 159 1 0 50 72 157 0.553 8.48 2.05 Term + 53631 53913 283 0 1 97 37 121 0.487 1.81 2.06 PlyA + 53916 53921 6 1.05 3.04 PlyA - 55773 55768 6 1.05 3.03 Term - 60517 59886 632 2 2 72 50 441 0.986 32.19 3.02 Intr - 60821 60577 245 1 2 81 50 276 0.640 19.32 3.01 Init - 61180 61002 179 1 2 79 28 167 0.690 8.58 3.00 Prom - 64422 64383 40 -6.45 4.00 Prom + 66231 66270 40 -7.75 4.01 Init + 66905 67133 229 1 1 37 78 155 0.458 7.98 4.02 Intr + 67980 68274 295 1 1 17 38 194 0.396 2.44 4.03 Term + 68291 68597 307 2 1 50 42 180 0.393 3.20 4.04 PlyA + 70929 70934 6 1.05 5.19 PlyA - 71170 71165 6 1.05 5.18 Term - 86070 85978 93 0 0 126 35 65 0.214 1.85 5.17 Intr - 100105 100001 105 1 0 87 67 35 0.092 0.79 5.16 Intr - 106521 106422 100 2 1 48 101 69 0.705 3.39 5.15 Intr - 107068 107038 31 2 1 119 81 49 0.757 3.57 5.14 Intr - 107296 107165 132 2 0 48 92 141 0.987 10.20 5.13 Intr - 109266 109159 108 1 0 73 74 73 0.837 3.74 5.12 Intr - 115671 115543 129 1 0 69 31 85 0.488 0.65 5.11 Intr - 122222 122119 104 1 2 72 99 64 0.994 4.80 5.10 Intr - 126072 125936 137 0 2 91 91 72 0.998 6.25 5.09 Intr - 126451 126328 124 0 1 62 115 124 0.989 12.17 5.08 Intr - 128744 128659 86 2 2 49 76 68 0.802 -0.60 5.07 Intr - 132876 132835 42 0 0 69 111 36 0.575 1.52 5.06 Intr - 140256 140163 94 2 1 102 82 56 0.833 5.45 5.05 Intr - 147762 147660 103 1 1 122 96 42 0.883 6.71 5.04 Intr - 151660 151585 76 1 1 100 98 -21 0.114 -1.73 5.03 Intr - 156904 156762 143 2 2 61 99 67 0.092 4.25 5.02 Intr - 194119 193944 176 0 2 14 74 235 0.447 13.36 5.01 Init - 194603 194539 65 2 2 57 49 126 0.536 4.27 5.00 Prom - 195516 195477 40 -4.95 6.00 Prom + 197289 197328 40 -6.45 6.01 Init + 214378 214447 70 0 1 48 74 47 0.092 0.56 6.02 Intr + 229837 231211 1375 2 1 55 53 378 0.251 17.95 6.03 Intr + 261638 261662 25 2 1 92 92 29 0.002 0.81 6.04 Intr + 275471 275610 140 1 2 30 64 124 0.022 2.54 6.05 Intr + 283834 284006 173 1 2 26 25 210 0.137 7.16 6.06 Intr + 286746 286849 104 1 2 98 64 72 0.513 4.77 6.07 Intr + 287638 287791 154 0 1 74 72 63 0.498 2.02 6.08 Term + 291007 291269 263 2 2 22 42 203 0.492 3.80 6.09 PlyA + 291363 291368 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 261491 261662 172 1 1 54 92 116 0.834 6.37 S.002 Term + 265983 266086 104 1 2 87 43 115 0.879 4.36 S.003 Term - 274152 273957 196 2 1 104 54 128 0.876 6.90 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:85874708_86168780|GENSCAN_predicted_peptide_1|185_aa NSRLKEEQIWHIRHLLKELSEEKAEGLPVVTREDVEEAMKEKWKFERDQEKNLRDMRMQI SNAEKLFLEKLSEKEYWEEYKNVGSERHAKLITSLQNDINTVKENAEKMSGLSACGPICI TVLAVFSSPFLKTSGAIMHLIKHDQWPLSIPPHGSPVSPSTNCSSVTCLTQQPLLKPRGR GGENT >gi568815587r:85874708_86168780|GENSCAN_predicted_CDS_1|558_bp aatagccgcttaaaagaagaacagatttggcacatacggcatctactaaaggaactgagt gaagagaaggcagagggattgccagttgtaacaagagaggatgttgaagaagcgatgaag gaaaaatggaagtttgaaagagaccaggaaaaaaacttgagagatatgcgcatgcaaata agtaatgctgagaaactatttcttgagaaactcagtgaaaaggaatattgggaggagtac aagaatgtagggagtgaacgacatgctaaactcattacctccttacaaaatgacatcaac acagttaaagagaatgcagagaaaatgtcagggctctctgcttgtggaccaatttgtata acagtccttgcagtattctccagcccatttctaaagacatctggtgccatcatgcacctt atcaaacatgaccaatggcctttgtccataccacctcatgggtcaccagtcagccccagc acaaactgttcctcggtcacctgccttactcagcagcctctgctgaaacccagggggcga ggaggagaaaacacttag >gi568815587r:85874708_86168780|GENSCAN_predicted_peptide_2|292_aa MTFFTWQQEEVQSEEEKSELQPTEVESRDLMSSSDESTILHLSHENSIEDLQYVKIDKEE NSGTEFGDTDMKYLLYEDEKDFKPPLVIPRQTGSGVDLQQTPADLQQRGLTVRRKTNKQK GIASTSTKRTSTQKPHLKVTNIKDQRITTPRLQENKTGQRIEFDELTEVDFRRWVITNSF ELKEHVPTQCKEAKNLEKRYKEELVPFLLKLFQTIEKEGLLPNSLNEASIIVIPKPGRDN TKENFRPISLMNIDAKTLSKILANRIQQHIKKLIHHDQVSFIPGMQGWFNIC >gi568815587r:85874708_86168780|GENSCAN_predicted_CDS_2|879_bp atgaccttcttcacatggcagcaggaagaagtgcagagcgaagaagagaagtcagaattg caacccacagaagtagaaagtagagacttgatgtcctcatcagatgagagcactatctta catcttagtcatgaaaatagcatcgaagatctccagtatgtgaagatagataaagaggaa aactcaggcacagagtttggggacactgatatgaagtacttactatatgaggatgagaag gatttcaagcctccactggtgatacccaggcaaacaggatctggagtagacctccagcaa actccagcagacctgcagcagaggggcctgactgttagaaggaaaactaacaaacagaaa ggaatagcatcaacatcaacaaaaaggacgtccacacaaaaaccccatctgaaggtcacc aacatcaaagaccaaaggatcacaactcctcgcctgcaagagaacaaaactggacagaga attgagtttgacgaattgacagaagtagacttcagaaggtgggtaataacaaactccttt gagctaaaggagcatgttccaacccaatgcaaggaagctaagaaccttgaaaaaaggtac aaagaggagttggtaccattccttctgaaactattccaaacaatagaaaaagagggactc ctccctaactcacttaatgaggccagcatcatcgtgataccaaaacctggcagagacaac acaaaagaaaatttcagaccaatatccctgatgaacatcgatgcgaagaccctcagtaaa atactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtcagc ttcatccctgggatgcaaggctggttcaacatatgctaa >gi568815587r:85874708_86168780|GENSCAN_predicted_peptide_3|351_aa MAGPDSGTQWWCHTETTGETRRYRTHGLDGHNAQTRSQSSRDVERSLLALLHWNHVRQEW ICQPQSIKFRYLCASRPYLRVLITSRGVFPKTLAVHTMAPKPPVISGFMGLLGLSPRYQL CRECHDEEADGLVPALFQYGGQESSVLGVWGGLVVDEPHLDGLHGAHDHHGLSHSSAQAA QQPARAVQPSLGIPQVVAEELEHSEAGGRFGDGAVEQGAETAVQAQNAMAADCLPHPLPD ALVPRRVRGLVQLPLRLHVLGGEGDVDLNAARQAACQDRLPQVRQLGLPRRWRGGRQSVW GSRRGRGREAGRPPSGGLRSELPELRTQNELSKLTETCLKFSGFTILKVTA >gi568815587r:85874708_86168780|GENSCAN_predicted_CDS_3|1056_bp atggctgggccagacagcgggacacagtggtggtgtcacacagagaccacaggggagaca cggcgctataggacacatggacttgacggccacaatgcacagaccaggtcacagagctca agggatgtggaaaggagccttttggcactactgcactggaatcatgtgagacaagagtgg atctgccagccacagtccatcaagttccggtatttgtgcgcctccaggccctacctgcgg gtcttaatcacgtccagaggagtgtttccaaagacgctggctgtgcacacgatggctccg aagcccccagtgatcagcgggttcatgggcttgttggggttgtctcctcggtaccagttg tgcagggaatgtcatgacgaagaagcggatggcctggttccagccctgtttcagtacggt ggccaagaatcctcggtacttggggtttggggaggtctggtcgtggatgaacctcacctt gatggtctccatggggcacacgatcaccacggcctcagccactccagcgcccaggcagca cagcagcccgcgcgtgctgtccagccgtccctgggaatcccgcaggtggttgccgaggaa ctagaacattccgaagctggcggccgctttggggatggagccgtagagcagggagctgag actgcggtacaggcccagaacgccatggctgcggactgtctgccacacccactccccgat gctctggtaccgcggcgggttcgagggctagtccagctgccactgcgtcttcacgtactg ggtggggaaggtgatgtagatctcaatgccgcccgccaggccgcctgccaggatcgcctt ccccaggtgcgtcagcttggccttcccagacgctggcgcggcggccgccagagcgtgtgg ggctcgaggcgcggacgcgggcgggaggctgggcgcccgcccagtggcggccttcggtcc gagcttcctgaactccgcactcaaaatgaactttctaaattaactgagacctgcctgaaa ttttcggggttcacaatccttaaggtaacagcttga >gi568815587r:85874708_86168780|GENSCAN_predicted_peptide_4|276_aa MDGGIEIFVFDKGTGRGARLGGTLDNWRPQVELTIHWSPTNIQRVLVLVDTGTDCTLVCG NLDKFPGKAAYIDGYRVICLGKTKAIPEAITDKIQAYTRPTTMRQLQTFVGLLGYWWAFV PHLAQMVKPLCQLTKMGTTWDWDNEAEMAFLAAKWAIQQAQALQVIDRGAHLNLICFWQG LWQHTEGFRILAGFWSQLWKEAELWYSLIGKELIAAYATLQACESVMGQTTVVMPVDDLP NSGVGTFTGNDPPDWDGTDIYLSKVGSLVKTAEYPE >gi568815587r:85874708_86168780|GENSCAN_predicted_CDS_4|831_bp atggatggaggtatagaaatttttgtgtttgataagggaactggccgaggtgctcggctt gggggaacactggacaactggaggccacaagtggaattgacaatccactggtcccccacc aacatacagcgggtgctggtgctggtagacactggcacagattgtactctcgtctgtggg aacctggataaatttccaggcaaggctgcatatatagatggctatagagttatctgcttg ggtaaaacaaaggctataccagaggccatcactgataaaattcaggcatatacccggccc acaacaatgaggcaactacagacttttgtaggcctcttgggatattggtgggcatttgta ccccatttggctcagatggtaaaaccgttgtgtcagttgacaaaaatgggaactacttgg gattgggacaatgaggctgagatggcttttctggcagccaagtgggccatacagcaagca caggccctacaagtaattgatcggggcgcccatttgaacttgatatgcttttggcaaggc ctatggcagcacacagagggctttagaatactggcaggcttttggtcccaactttggaag gaagctgagctctggtattcattgatagggaaggagttaatagctgcatatgccaccctt caggcttgtgagagtgtgatgggacaaacaacagtagtcatgcctgtagatgacttaccc aatagtggggtgggtaccttcacgggtaatgaccctccggactgggatggcacagacatc taccttagcaaagtggggagcctagtcaaaacagcagagtaccctgagtaa >gi568815587r:85874708_86168780|GENSCAN_predicted_peptide_5|615_aa MRRLCTPLGRASCCARLASSITPRRLLSGWGGGGAAEMSGQSLTDRITAAQHSVTGSAVS KTVCKATTHEIMGPKKKHLDYLIQCTNEMNVNIPQLADSLFERTTNSSWVVVFKSLITTH HLMVYGNERFIQYLASRNTLFNLSNFLDKSGLQGYDMSTFIRRYSRYLNEKAVSYRQVAF DFTKVKRGADGVMRTMNTEKLLKTVPIIQNQMDALLDFNQVGIDRGDIPDLSQAPSSLLD ALEQHLASLEGKKIKDSTAASRATTLSNAVSSLASTGLSLTKVDEREKQAALEEEQARLK ALKEQRLKELAKKPHTSLTTAASPVSTSAGGIMTAPAIDIFSTPSSSNSTSKLPNDLLDL QQPTFHPSVHPMSTASQVASTWGDAVDDAIPSLNPFLTKSSGDVHLSISSDVSTFTTRTP THEMFVGFTPSPVAQPHPSAGLNVDFESVFGNKSTNVIVDSGGFDELGGLLKPTVASQNQ NLPVAKLPPSKLVSDDLDSSLANLVGNLGIGNGTTKNDVNWSQPGEKKLTGGSNWQPKVA PTTAWNAATMPPQMGSVPVMTQPTLIYSQPVMRPPNPFGPVSGAQLSAASSPSSHSPHRA SGKDPFAELSLEDFL >gi568815587r:85874708_86168780|GENSCAN_predicted_CDS_5|1848_bp atgcgccggctgtgcactcccctcgggcgggcctcctgttgcgcccgcctggcgagttcc atcactccccgccggctgctgagcgggtggggtggtggaggagctgcagagatgtccggc cagagcctgacggaccgaatcactgccgcccagcacagtgtcaccggctctgccgtatcc aagacagtatgcaaggccacgacccacgagatcatggggcccaagaaaaagcacctggac tacttaattcagtgcacaaatgagatgaatgtgaacatcccacagttggcagacagttta tttgaaagaactactaatagtagttgggtggtggtcttcaaatctctcattacaactcat catttgatggtgtatggaaatgagcgttttattcagtatttggcttcaagaaacacgttg tttaacttaagcaattttttggataaaagtggattgcaaggatatgacatgtctacattt attaggcggtatagtagatatttaaatgagaaagcagtttcatacagacaagttgcattt gatttcacaaaagtgaagagaggggctgatggagttatgagaacaatgaacacagaaaaa ctcctaaaaactgtaccaattattcagaatcagatggatgcacttcttgattttaatcaa gttggaattgacagaggtgatataccagacctttcacaggcccctagcagtcttcttgat gctttggaacaacatttagcttccttggaaggaaagaaaatcaaagattctacagctgca agcagggcaactacactttccaatgcagtgtcttccctggcaagcactggtctatctctg accaaagtggatgaaagggaaaagcaggcagcattagaggaagaacaggcacgtttgaaa gctttaaaggaacagcgcctaaaagaacttgcaaagaaacctcatacctctttaacaact gcagcctctcctgtatccacctcagcaggagggataatgactgcaccagccattgacata ttttctacccctagttcttctaacagcacatcaaagctgcccaatgatctgcttgatttg cagcagccaacttttcacccatctgtacatcctatgtcaactgcttctcaggtagcaagt acatggggagatgctgttgatgatgccattccaagcttaaatcctttcctcacaaaaagt agtggtgatgttcacctttccatttcttcagatgtatctacttttactactaggacacct actcatgaaatgtttgttggattcactccttctccagttgcacagccacacccttcagct ggccttaatgttgactttgaatctgtgtttggaaataaatctacaaatgttattgtagat tctgggggctttgatgaactaggtggacttctcaaaccaacagtggcctctcagaaccag aaccttcctgttgccaaactcccacctagcaagttagtatctgatgacttggattcatct ttagccaaccttgtgggcaatcttggcatcggaaatggaaccactaagaatgatgtaaat tggagtcaaccaggtgaaaagaagttaactgggggatctaactggcaaccaaaggttgca ccaacaaccgcttggaatgctgcaacaatgcctccacaaatgggaagtgttcctgtaatg acgcaaccaaccttaatatacagccagcctgtcatgagacctccaaacccctttggccct gtatcaggagcacagctgtcggcagcatccagcccctccagtcacagtcctcacagagct tcaggaaaggacccctttgcagagctctctttggaggatttcttataa >gi568815587r:85874708_86168780|GENSCAN_predicted_peptide_6|767_aa MELSSMPKGIDVAFSKLEIRPKHLLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVY LENPIVSAQNLLKLISNFSKVSGYKINVQKSQTFLYTNNRQTESQIMSELPFTIVSKRIK YLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNA IPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLPDFKLYYKATVTKTA WYWYQNRAIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKL KLDPFLTPYTKVNSRWIKDLNVRPKTIKTLEENLGIPIQDIGMGKDFMSKTPKAMATKDK IDKWDLIKLKSFCIAKETTIRVNRQPTKWEKIFATYSSDKGLISRIYNELKQIYKKKTNN PIKKWAKDMNRHFSKEDIYAAKKHMKKCSPSLAIREMQIKTTMRYHLTPVRMAIIKKSGN NRALATVLEKKRQNQINIVKLGTEHVEERSKREVVEGYAFGEEEEEKEFLQSLASKRNFR SSQETIESGLFQPDRVKALLSVRACHMQKHEPSCKEVMCQWGVRKYLLRYARQKVLDSWI WAPDLIPDSSRKTHAKYHGVSDNEQTLSYEMECWFHVLEDTASCQKEDLGLISMCASIQM RLSGLSWMSHGNGQTSAASWEVGTPNTRSRLDLGDTIQDIGTGKYFMTKMRRAIATKAKI DKWDLIKLKSFCTAKEAINRVNRPLIEGEKSIANYAFDKGLISSIYK >gi568815587r:85874708_86168780|GENSCAN_predicted_CDS_6|2304_bp atggaactctccagtatgccaaagggtattgatgtggctttcagtaagctggagatcaga cccaaacacttgttggaagttctggccagggcaattaggcaggagaaggaaataaagggt attcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatat ctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttcagcaaa gtctcaggatacaaaatcaatgtacaaaaatcacaaacattcttatacaccaataacaga caaacagagagccaaatcatgagtgaactcccattcacaattgtttcaaagagaataaaa tacctaggaatccaacttacaagggacgtgaaggacctcttcaaggagaactacaaacca ctgctcaacgaaataaaagaggatacaaacaaatggaagaacattccatgctcatgggta ggaagaatcaatatcatgaaaatggccatactgcccaaggtaatttacagattcaatgcc atccccatcaagctaccaatgactttcttcacagaattggaaaaaactactttaaagttc atatggaaccaaaaaagagcccacatcgccaagtcaatcctaagccaaaagaacaaagct ggaggcatcacgctacctgacttcaaactatactacaaggctacagtaaccaaaacagca tggtactggtaccaaaacagagctatagaccaatggaacagaacagagccctcagaaata atgccgcatatctacaactatctgatctttgacaaacctgagaaaaacaagcaatgggga aaggattccctatttaataaatggtgctgggaaaactggctagccatatgtagaaagctg aaactggatcccttccttacaccttatacaaaagtcaattcaagatggattaaagactta aacgttagacctaaaaccataaaaaccctagaagaaaacctaggcattcccattcaggat ataggcatgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaa attgacaaatgggatctaattaaactaaagagcttctgcatagcaaaagaaactaccatc agagtgaacaggcaacctacaaaatgggagaaaattttcgcaacctactcatctgacaaa gggctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaaaccaacaac cccatcaaaaagtgggcgaaggacatgaacagacacttctcaaaagaagacatctatgca gccaaaaaacacatgaaaaaatgctcaccatcactggccatcagagaaatgcaaatcaaa accacaatgagataccatctcaccccagttagaatggcaatcattaaaaagtcaggaaac aacagagctctggctactgtattggaaaagaaaaggcagaaccagattaacatagtaaaa ctggggacagagcatgtggaagaaaggagcaagagggaggtggtagaaggttacgcattt ggggaggaagaggaagagaaggaatttttgcagagccttgcctcaaagaggaattttcga tctagtcaagagacaatagagagtgggctcttccaacctgatcgggtaaaagcactgctg tcagtgagggcctgccacatgcaaaaacatgagccttcctgcaaggaggtgatgtgccag tggggagtaaggaagtacctcctacgatatgcaagacaaaaagtactggacagctggatt tgggctccagatctaatccctgattccagcaggaaaacacatgcaaaatatcatggagtc tcagacaatgagcagactctgtcttatgaaatggagtgctggttccatgtgctagaagac acagcttcctgccagaaagaggatctgggcctcatctccatgtgtgccagcattcagatg cggctgtctgggctttcatggatgagtcatggcaatggccagacctcagctgcaagttgg gaagtaggaacaccaaacacaagaagcaggttggacctaggtgataccattcaggacata ggcacgggcaaatatttcatgacaaagatgcgaagagcaattgcaacaaaagcaaaaatt gacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaagctatcaacaga gtaaacagaccacttatagaaggggagaaaagtattgcaaactatgcatttgacaaaggt ctaatatccagcatctataagtaa