GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:38:28 Sequence gi568815575f:52982499_53187936 : 205438 bp : 44.07% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1991 1993 3 1 0 98 103 0 0.017 2.50 1.02 Intr + 8376 8564 189 2 0 -66 89 175 0.004 1.98 1.03 Intr + 9132 9233 102 2 0 117 110 -2 0.898 5.27 1.04 Intr + 11576 11605 30 1 0 34 116 49 0.564 0.63 1.05 Intr + 13395 13493 99 2 0 36 76 80 0.726 1.91 1.06 Intr + 18057 18128 72 2 0 39 47 146 0.646 5.20 1.07 Intr + 37407 37548 142 2 1 9 63 208 0.749 10.33 1.08 Term + 45036 45220 185 1 2 81 49 137 0.987 6.81 1.09 PlyA + 45231 45236 6 1.05 2.00 Prom + 53184 53223 40 -4.26 2.01 Sngl + 94124 95245 1122 1 0 64 48 1781 0.991 168.40 2.02 PlyA + 97758 97763 6 -0.45 3.00 Prom + 98105 98144 40 -6.56 3.01 Init + 100001 100798 798 1 0 106 -17 752 0.202 61.21 3.02 Intr + 102014 102124 111 1 0 34 71 119 0.353 5.38 3.03 Intr + 102257 102368 112 1 1 39 111 201 0.891 17.45 3.04 Intr + 102456 102601 146 1 2 116 59 44 0.554 4.30 3.05 Intr + 102729 102823 95 2 2 96 47 74 0.546 2.86 3.06 Intr + 103058 103812 755 2 2 60 105 1519 0.307 141.91 3.07 Term + 105278 105441 164 0 2 109 49 201 0.894 16.40 3.08 PlyA + 106021 106026 6 1.05 4.04 PlyA - 106823 106818 6 1.05 4.03 Term - 108799 108720 80 2 2 123 42 62 0.854 3.03 4.02 Intr - 110946 110779 168 1 0 93 55 65 0.753 3.62 4.01 Init - 112874 112841 34 2 1 73 113 0 0.430 1.04 4.00 Prom - 117479 117440 40 -3.86 5.08 PlyA - 119766 119761 6 1.05 5.07 Term - 130806 130627 180 0 0 73 48 176 0.398 9.71 5.06 Intr - 145403 145377 27 2 0 83 103 16 0.013 0.91 5.05 Intr - 160682 160431 252 2 0 -14 50 499 0.025 33.53 5.04 Intr - 160807 160688 120 1 0 6 71 318 0.731 22.59 5.03 Intr - 161291 161109 183 2 0 20 10 371 0.692 22.48 5.02 Intr - 161421 161308 114 0 0 120 60 90 0.810 10.04 5.01 Init - 162972 162847 126 0 0 86 93 28 0.773 1.45 5.00 Prom - 163354 163315 40 -13.78 6.00 Prom + 163523 163562 40 -10.15 6.01 Sngl + 163631 163993 363 1 0 71 48 336 0.067 23.98 6.02 PlyA + 164390 164395 6 1.05 7.00 Prom + 164925 164964 40 -4.96 7.01 Sngl + 166101 166484 384 2 0 26 55 202 0.555 6.89 7.02 PlyA + 166503 166508 6 1.05 8.00 Prom + 167269 167308 40 -2.46 8.01 Init + 167393 167525 133 1 1 78 47 59 0.324 1.10 8.02 Term + 178099 178172 74 2 2 53 32 171 0.536 6.07 8.03 PlyA + 178190 178195 6 1.05 9.04 PlyA - 178612 178607 6 1.05 9.03 Term - 180893 180819 75 2 0 24 38 163 0.692 2.74 9.02 Intr - 182058 181943 116 1 2 50 36 111 0.274 2.27 9.01 Init - 188946 188901 46 0 1 78 97 6 0.346 1.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 8258 8564 307 2 1 42 89 164 0.956 8.35 S.002 Init - 133448 133349 100 2 1 84 94 47 0.803 5.32 S.003 Sngl + 163544 163993 450 1 0 88 48 364 0.918 28.62 S.004 Term + 198692 198765 74 0 2 106 38 47 0.863 -0.43 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:52982499_53187936|GENSCAN_predicted_peptide_1|273_aa MKEEEEEEDQEGLGKGLCSELGLMRIQKALTCEWVIEVELTVTELGEKSMIHVRWSASVW YRITGGHSLSWRLRSICLPLWPQYGDNDSYHQGLPRKRFLDRDLTAKKDRTVPSRAPLGG ANKEQLVDADTHGPFRAGPQRPLCLTVREREDPGTRLYECIPLYSSKTLSPKKIEEEEEE EEEEEEEEKEEEEEEEKESAELDLDAGIRLDFRPTLEPSEELLEPIALWAVPDEQLSIDQ QGAAWFTDGSSQGNGNCLVWKAAALKPGHGKMD >gi568815575f:52982499_53187936|GENSCAN_predicted_CDS_1|822_bp atgaaggaggaggaggaagaggaggaccaggaaggcctcggcaaaggactgtgctcggag ctggggctgatgagaatacagaaggccctcacctgtgaatgggtgatagaagtagagttg acagtcacggagttgggtgagaaaagcatgattcatgttaggtggtcagccagtgtctgg tacagaatcacgggtgggcactcactctcttggaggctaaggtccatctgtctacctctg tggccccagtatggggacaatgacagttaccatcaaggcttgccaaggaagaggttcctg gaccgagacctcacagccaaaaaggatagaactgttccatcccgcgcgccccttggtggt gcaaacaaggaacagctagtggatgcagatacccatggaccattccgggctggtccccag aggccactgtgcctgaccgtgcgcgagcgcgaagacccagggacccggctctacgagtgc atcccgctgtactctagcaaaaccctgtctccaaaaaaaatagaagaagaagaagaagaa gaggaagaggaagaggaagaagaaaaggaggaggaagaggaggaggaaaaagagagtgct gagttggaccttgatgctggaataagactagatttcagaccgactttagaaccatctgag gagctactggaaccaattgccctatgggcagtgcccgatgaacagctctcaattgaccaa caaggagctgcttggtttacagatggcagttcccaggggaatggaaactgccttgtttgg aaagctgctgcattaaaaccaggacacggaaagatggattga >gi568815575f:52982499_53187936|GENSCAN_predicted_peptide_2|373_aa MANTTGEPEEVSGALSPPSASAYVKLVLLGLIMCVSLAGNAILSLLVLKERALHKAPYYF LLDLCLADGIRSAVCFPFVLASVRHGSSWTFSALSCKIVAFMAVLFCFHAAFMLFCISVT RYMAIAHHRFYAKRMTLWTCAAVICMAWTLSVAMAFPPVFDVGTYKFIREEDQCIFEHRY FKANDTLGFMLMLAVLMAATHAVYGKLLLFEYRHRKMKPVQMVPAISQNWTFHGPGATGQ AAANWIAGFGRGPMPPTLLGIRQNGHAASRRLLGMDEVKGEKQLGRMFYAITLLFLLLWS PYIVACYWRVFVKACAVPHRYLATAVWMSFAQAAVNPIVCFLLNKDLKKCLRTHAPCWGT GGAPAPREPYCVM >gi568815575f:52982499_53187936|GENSCAN_predicted_CDS_2|1122_bp atggccaacactaccggagagcctgaggaggtgagcggcgctctgtccccaccgtccgca tcagcttatgtgaagctggtactgctgggactgattatgtgcgtgagcctggcgggtaac gccatcttgtccctgctggtgctcaaggagcgtgccctgcacaaggctccttactacttc ctgctggacctgtgcctggccgatggcatacgctctgccgtctgcttcccctttgtgctg gcttctgtgcgccacggctcttcatggaccttcagtgcactcagctgcaagattgtggcc tttatggccgtgctcttttgcttccatgcggccttcatgctgttctgcatcagcgtcacc cgctacatggccatcgcccaccaccgcttctacgccaagcgcatgacactctggacatgc gcggctgtcatctgcatggcctggaccctgtctgtggccatggccttcccacctgtcttt gacgtgggcacctacaagtttattcgggaggaggaccagtgcatctttgagcatcgctac ttcaaggccaatgacacgctgggcttcatgcttatgttggctgtgctcatggcagctacc catgctgtctacggcaagctgctcctcttcgagtatcgtcaccgcaagatgaagccagtg cagatggtgccagccatcagccagaactggacattccatggtcccggggccaccggccag gctgctgccaactggatcgccggctttggccgtgggcccatgccaccaaccctgctgggt atccggcagaatgggcatgcagccagccggcggctactgggcatggacgaggtcaagggt gaaaagcagctgggccgcatgttctacgcgatcacactgctctttctgctcctctggtca ccctacatcgtggcctgctactggcgagtgtttgtgaaagcctgtgctgtgccccaccgc tacctggccactgctgtttggatgagcttcgcccaggctgccgtcaacccaattgtctgc ttcctgctcaacaaggacctcaagaagtgcctgaggactcacgccccctgctggggcaca ggaggtgccccggctcccagagaaccctactgtgtcatgtga >gi568815575f:52982499_53187936|GENSCAN_predicted_peptide_3|726_aa MDRPDEGPPAKTRRLSSSESPQRDPPPPPPPPPLLRLPLPPPQQRPRLQEETEAAQVLAD MRGVGLGPALPPPPPYVILEEGGIRAYFTLGAECPGWDSTIESGYGEAPPPTESLEALPT PEASGGSLEIDFQVVQSSSFGGEGALETCSAVGWAPQRLVDPKSKEEAIIIVEDEDEDER ESMRSSRRRRRRRRRKQRKVKRESRERNAERMESILQALEDIQLDLEAVNIKAGKAFLRL KRKFIQMRRPFLERRDLIIQHIPGFWSILDLNCSTHQFLNHPRISILINRRDEDIFRYLT NLQVQDLRHISMGYKMKLYFQTNPYFTNMVIVKEFQRNRSGRLVSHSTPIRWHRGQEPQA RRHGNQDASHSFFSWFSNHSLPEADRIAEIIKNDLWVNPLRYYLRERGSRIKRKKQEMKK RYFPPLSAMSTPPPTNPILSHSLPYSKTRGRCEVVIMEDAPDYYAVEDIFSEISDIDETI HDIKISDFMETTDYFETTDNEITDINENICDSENPDHNEVPNNETTDNNESADDHETTDN NESADDNNENPEDNNKNTDDNEENPNNNENTYGNNFFKGGFWGSHGNNQDSSDSDNEADE ASDDEDNDGNEGDNEGSDDDGNEGDNEGSDDDDRDIEYYEKVIEDFDKDQADYEDVIEII SDESVEEEGIEEGIQQDEDIYEEGNYEEEGSEDVWEEGEDSDDSDLEDVLQVPNGWANPG KRGKTG >gi568815575f:52982499_53187936|GENSCAN_predicted_CDS_3|2181_bp atggaccgcccagatgaggggcctccggccaagacccgccgcctgagcagctccgagtct ccacagcgcgacccgcccccgccgccgccgccgccgccgctcctccgactgccgctgcct ccaccccagcagcgcccgaggctccaggaggaaacggaggcggcacaggtgctggccgat atgaggggggtgggactgggccccgcgctgcccccgccgcctccctatgtcattctcgag gagggggggatccgcgcatacttcacgctcggtgctgagtgtcccggctgggattctacc atcgagtcggggtatggggaggcgcccccgcccacggagagcctggaagcactccccact cctgaggcctcgggggggagcctggaaatcgattttcaggttgtacagtcgagcagtttt ggtggagagggggccctagaaacctgtagcgcagtggggtgggcgccccagaggttagtt gacccgaagagcaaggaagaggcgatcatcatagtggaggatgaggatgaggatgagcgg gagagtatgaggagcagcaggaggcggcggcggcggcggaggaggaagcagaggaaggtg aagagggaaagcagagagagaaatgccgagaggatggagagcatcctgcaggcactggag gatattcagctggatctggaggcagtgaacatcaaggcaggcaaagccttcctgcgtctc aagcgcaagttcatccagatgcgaagacccttcctggagcgcagagacctcatcatccag catatcccaggcttctggtcaatccttgacctaaactgctccactcatcagttcctcaac caccccagaatttcaattttgatcaaccgacgtgatgaagacattttccgctacttgacc aatctgcaggtacaggatctcagacatatctccatgggctacaaaatgaagctgtacttc cagactaacccctacttcacaaacatggtgattgtcaaggagttccagcgcaaccgctca ggccggctggtgtctcactcaaccccaatccgctggcaccggggccaggaaccccaggcc cgtcgtcacgggaaccaggatgcgagccacagctttttcagctggttctcaaaccatagc ctcccagaggctgacaggattgctgagattatcaagaatgatctgtgggttaaccctcta cgctactacctgagagaaaggggctccaggataaagagaaagaagcaagaaatgaagaaa cgttacttcccacctttgagtgctatgagtacaccacctcccaccaaccctatactcagc cacagccttccttacagtaaaaccaggggcagatgtgaggtggtgatcatggaagacgcc cctgactattatgcagtggaagacattttcagcgagatctcagacattgatgagacaatt catgacatcaagatctctgacttcatggagaccaccgactacttcgagaccactgacaat gagataactgacatcaatgagaacatctgcgacagcgagaatcctgaccacaatgaggtc cccaacaacgagaccactgataacaacgagagtgctgatgaccacgaaaccactgacaac aatgagagtgcagatgacaacaacgagaatcctgaagacaataacaagaacactgatgac aacgaagagaaccctaacaacaacgagaacacttacggcaacaacttcttcaaaggtggc ttctggggcagccatggcaacaaccaggacagcagcgacagtgacaatgaagcagatgag gccagtgatgatgaagataatgatggcaacgaaggtgacaatgagggcagtgatgatgat ggcaatgaaggtgacaatgaaggcagcgatgatgacgacagagacattgagtactatgag aaagttattgaagactttgacaaggatcaggctgactacgaggacgtgatagagatcatc tcagacgaatcagtggaagaagagggcattgaggaaggcatccagcaagatgaggacatc tatgaggaaggaaactatgaggaggaaggaagtgaagatgtctgggaagaaggggaagat tcggacgactctgacctagaggatgtgcttcaggtcccaaacggttgggccaatccgggg aagagggggaaaaccggataa >gi568815575f:52982499_53187936|GENSCAN_predicted_peptide_4|93_aa MRMEGILLITEVFRRTELMAQRGKLTGLKSHSQAPPLARSGPASSRYHSPPPPRLCVFLP YTAGSRRSLSQWGLSSLAFSEDSILELDVTLKT >gi568815575f:52982499_53187936|GENSCAN_predicted_CDS_4|282_bp atgagaatggagggaattttgctgataactgaagtattcaggcggacagaacttatggcc cagagaggcaaactgactggcttgaagtcacacagccaggccccacccctcgcgcgctcc ggccctgcctcttcccgctaccactccccaccaccgccccgtctttgcgtcttcttaccg tacaccgccggcagtcggcgcagtctcagccaatggggcctgtcttctctggccttcagc gaagacagtatcttggagctagatgtgaccttaaagacctag >gi568815575f:52982499_53187936|GENSCAN_predicted_peptide_5|333_aa MAGAPPPASLPPCSLSSDCCASNERESVGVGPSEPGAGYNLLVAMEIEIAALIIDNGSGM CKAGFAGDGWPLSCVPLHHQGVMVGMGQKDSYMGHEAQSKHDILTLKYPIEHGIITIWDD MEKIWHHTFYNELCMTPEKHPEIVRDINEKVCYIALDFEQEMATATSSSSLEKSYELPDG QVITIGNKWFWCLQVLFQPSFLGMESCGIHKTTFNSIMKCDVDIRKDLYANTVLSSSTTM CPGITTLASSTMKIKFIMPPEHKYSPPYHHQVPQAWLVAIGLGKFMKPGKVVLVLAECYS GHKAVTMKNVDKGTSIFPYSHALMAGIDCYPRK >gi568815575f:52982499_53187936|GENSCAN_predicted_CDS_5|1002_bp atggcgggtgcccctcccccagcctcgctgccaccttgcagtttgagctcagactgctgt gctagcaatgagcgagaatctgtgggcgtaggaccctccgagccaggggcgggatataat ctcctggttgcaatggaaatagagatcgccgcgctcatcattgacaatggctctggcatg tgcaaagctggctttgctggggatggatggcccctgagctgtgttcccctccatcatcag ggtgtgatggtgggcatgggccagaaggactcctacatgggccatgaggcccagagcaag cacgacatcctgaccctgaagtaccctatcgagcatggcatcatcaccatctgggatgac atggagaagatctggcaccacaccttctacaacgagctgtgcatgaccccagagaagcac ccggagattgtgcgcgacatcaacgagaaggtgtgctacatcgccctggacttcgagcag gagatggccactgccacatcctcctcctccctggagaagagctatgagctgcctgatggc caggtcatcaccatcggcaacaagtggttctggtgtctgcaggtgctgttccagccttcc ttcttgggcatggaatcctgcggcatccacaagaccaccttcaactccatcatgaagtgt gacgtggacatccgcaaagacctgtacgccaacacagtgctgtccagcagcaccaccatg tgccctggcatcaccacgctggcatccagcaccatgaagatcaagttcatcatgccccca gagcacaagtactcgcccccctaccaccaccaagtaccacaggcttggttggttgctata ggcttgggcaagttcatgaaacctgggaaagtggtgctggtcctggctgaatgctactct ggacacaaagctgtcaccatgaagaacgttgataagggcacctcaatcttcccctacagc cacgctctgatggctggaattgactgctatccccgcaaatga >gi568815575f:52982499_53187936|GENSCAN_predicted_peptide_6|120_aa MEQSWMENDELREEGFRQSNYSKLKEEVRTNGKEVKNLEKRLDKSLTRITNAEKSLKDLM ELKTTAQELSDECTSLSSRFDQLEERVSVMEDQMNEMKREEKFREKRIKRNKASKKYGTM >gi568815575f:52982499_53187936|GENSCAN_predicted_CDS_6|363_bp atggaacaaagctggatggagaatgacgagttgagagaagaaggcttcagacaatcaaac tactccaagctaaaggaggaagttcgaaccaatggcaaagaagttaaaaaccttgaaaaa agattagacaaatcactaactagaataaccaatgcagagaagtccttaaaggacctgatg gagctgaaaaccacagcacaagaactatctgatgaatgcacaagcctcagtagccgattc gatcaactggaagaaagggtatcagtgatggaagatcaaatgaatgaaatgaagcgagaa gagaagtttagagaaaaaagaataaaaagaaacaaagcctccaagaaatatgggactatg tga >gi568815575f:52982499_53187936|GENSCAN_predicted_peptide_7|127_aa MEEHSMLMDRKSQYRENAILPKVIYRFNAIPIKLPMTFFTELEKTSLKFIWNQKRACMAK SILSQKNKAGGITLPDFKLYYKAIVTKTAWYWYQNRDIDQWNRTEPSEIMPRIYNHLIFD KPEKNKQ >gi568815575f:52982499_53187936|GENSCAN_predicted_CDS_7|384_bp atggaagaacattccatgctcatggataggaagagtcaatatcgtgaaaatgccatactg cccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaacttctttaaagttcatatggaaccaaaaaagagcctgcatggccaag tcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatac tacaaggctatagtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaa tggaacagaacagagccctcagaaataatgccacgtatctacaaccatctgatctttgac aaacctgagaaaaacaagcaatga >gi568815575f:52982499_53187936|GENSCAN_predicted_peptide_8|68_aa MEYYAAIKNDEFMSFVGTWMKLETIILSKLSQGQKTKHRMFSLIEIVAQEVVYESEYEKH EKELYERK >gi568815575f:52982499_53187936|GENSCAN_predicted_CDS_8|207_bp atggaatattatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg aagctggaaaccatcattctcagcaaactatcacaaggacaaaaaaccaaacaccgcatg ttctcactcatagagatagtggcccaggaagttgtgtatgagagcgaatatgagaaacat gaaaaggagctgtatgagcgcaaataa >gi568815575f:52982499_53187936|GENSCAN_predicted_peptide_9|78_aa MASRLVLGSAPNRKAAMFSTRTKNAGEELTFDYQMKGSGDVSSDSIDHSPAKKRVSGIVA PQNRDFEDVNEGLLEQRN >gi568815575f:52982499_53187936|GENSCAN_predicted_CDS_9|237_bp atggcatcccgtttggtcctgggatcagctccaaataggaaggcagcaatgttttccacg agaaccaaaaatgctggagaagagctgacttttgattatcaaatgaaaggttctggagat gtatcttcagattctattgaccacagcccagccaaaaagagggtcagcggcatagtggcg ccccagaacagggacttcgaggacgtgaacgaaggtctgctggagcagaggaactga