GENSCAN 1.0 Date run: 4-Nov-116 Time: 00:12:38 Sequence gi568815575f:52976622_53177740 : 201119 bp : 44.10% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7868 7870 3 1 0 98 103 0 0.015 2.50 1.02 Intr + 14253 14441 189 2 0 -66 89 175 0.004 1.98 1.03 Intr + 15009 15110 102 2 0 117 110 -2 0.897 5.27 1.04 Intr + 17453 17482 30 1 0 34 116 49 0.564 0.63 1.05 Intr + 19272 19370 99 2 0 36 76 80 0.726 1.91 1.06 Intr + 23934 24005 72 2 0 39 47 146 0.646 5.20 1.07 Intr + 43284 43425 142 2 1 9 63 208 0.749 10.33 1.08 Term + 50913 51097 185 1 2 81 49 137 0.987 6.81 1.09 PlyA + 51108 51113 6 1.05 2.00 Prom + 59061 59100 40 -4.26 2.01 Sngl + 100001 101122 1122 1 0 64 48 1781 0.991 168.40 2.02 PlyA + 103635 103640 6 -0.45 3.00 Prom + 103982 104021 40 -6.56 3.01 Init + 105878 106675 798 1 0 106 -17 752 0.202 61.21 3.02 Intr + 107891 108001 111 1 0 34 71 119 0.353 5.38 3.03 Intr + 108134 108245 112 1 1 39 111 201 0.891 17.45 3.04 Intr + 108333 108478 146 1 2 116 59 44 0.554 4.30 3.05 Intr + 108606 108700 95 2 2 96 47 74 0.546 2.86 3.06 Intr + 108935 109689 755 2 2 60 105 1519 0.307 141.91 3.07 Term + 111155 111318 164 0 2 109 49 201 0.894 16.40 3.08 PlyA + 111898 111903 6 1.05 4.04 PlyA - 112700 112695 6 1.05 4.03 Term - 114676 114597 80 2 2 123 42 62 0.854 3.03 4.02 Intr - 116823 116656 168 1 0 93 55 65 0.753 3.62 4.01 Init - 118751 118718 34 2 1 73 113 0 0.430 1.04 4.00 Prom - 123356 123317 40 -3.86 5.08 PlyA - 125643 125638 6 1.05 5.07 Term - 136683 136504 180 0 0 73 48 176 0.398 9.71 5.06 Intr - 151280 151254 27 2 0 83 103 16 0.013 0.91 5.05 Intr - 166559 166308 252 2 0 -14 50 499 0.025 33.53 5.04 Intr - 166684 166565 120 1 0 6 71 318 0.731 22.59 5.03 Intr - 167168 166986 183 2 0 20 10 371 0.692 22.48 5.02 Intr - 167298 167185 114 0 0 120 60 90 0.810 10.04 5.01 Init - 168849 168724 126 0 0 86 93 28 0.773 1.45 5.00 Prom - 169231 169192 40 -13.78 6.00 Prom + 169400 169439 40 -10.15 6.01 Sngl + 169508 169870 363 1 0 71 48 336 0.067 23.98 6.02 PlyA + 170267 170272 6 1.05 7.00 Prom + 170802 170841 40 -4.96 7.01 Sngl + 171978 172361 384 2 0 26 55 202 0.555 6.89 7.02 PlyA + 172380 172385 6 1.05 8.00 Prom + 173146 173185 40 -2.46 8.01 Init + 173270 173402 133 1 1 78 47 59 0.324 1.10 8.02 Term + 183976 184049 74 2 2 53 32 171 0.539 6.07 8.03 PlyA + 184067 184072 6 1.05 9.04 PlyA - 184489 184484 6 1.05 9.03 Term - 186770 186696 75 2 0 24 38 163 0.739 2.74 9.02 Intr - 187935 187820 116 1 2 50 36 111 0.343 2.27 9.01 Init - 194823 194778 46 0 1 78 97 6 0.375 1.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 14135 14441 307 2 1 42 89 164 0.956 8.35 S.002 Init - 139325 139226 100 2 1 84 94 47 0.803 5.32 S.003 Sngl + 169421 169870 450 1 0 88 48 364 0.918 28.62 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:52976622_53177740|GENSCAN_predicted_peptide_1|273_aa MKEEEEEEDQEGLGKGLCSELGLMRIQKALTCEWVIEVELTVTELGEKSMIHVRWSASVW YRITGGHSLSWRLRSICLPLWPQYGDNDSYHQGLPRKRFLDRDLTAKKDRTVPSRAPLGG ANKEQLVDADTHGPFRAGPQRPLCLTVREREDPGTRLYECIPLYSSKTLSPKKIEEEEEE EEEEEEEEKEEEEEEEKESAELDLDAGIRLDFRPTLEPSEELLEPIALWAVPDEQLSIDQ QGAAWFTDGSSQGNGNCLVWKAAALKPGHGKMD >gi568815575f:52976622_53177740|GENSCAN_predicted_CDS_1|822_bp atgaaggaggaggaggaagaggaggaccaggaaggcctcggcaaaggactgtgctcggag ctggggctgatgagaatacagaaggccctcacctgtgaatgggtgatagaagtagagttg acagtcacggagttgggtgagaaaagcatgattcatgttaggtggtcagccagtgtctgg tacagaatcacgggtgggcactcactctcttggaggctaaggtccatctgtctacctctg tggccccagtatggggacaatgacagttaccatcaaggcttgccaaggaagaggttcctg gaccgagacctcacagccaaaaaggatagaactgttccatcccgcgcgccccttggtggt gcaaacaaggaacagctagtggatgcagatacccatggaccattccgggctggtccccag aggccactgtgcctgaccgtgcgcgagcgcgaagacccagggacccggctctacgagtgc atcccgctgtactctagcaaaaccctgtctccaaaaaaaatagaagaagaagaagaagaa gaggaagaggaagaggaagaagaaaaggaggaggaagaggaggaggaaaaagagagtgct gagttggaccttgatgctggaataagactagatttcagaccgactttagaaccatctgag gagctactggaaccaattgccctatgggcagtgcccgatgaacagctctcaattgaccaa caaggagctgcttggtttacagatggcagttcccaggggaatggaaactgccttgtttgg aaagctgctgcattaaaaccaggacacggaaagatggattga >gi568815575f:52976622_53177740|GENSCAN_predicted_peptide_2|373_aa MANTTGEPEEVSGALSPPSASAYVKLVLLGLIMCVSLAGNAILSLLVLKERALHKAPYYF LLDLCLADGIRSAVCFPFVLASVRHGSSWTFSALSCKIVAFMAVLFCFHAAFMLFCISVT RYMAIAHHRFYAKRMTLWTCAAVICMAWTLSVAMAFPPVFDVGTYKFIREEDQCIFEHRY FKANDTLGFMLMLAVLMAATHAVYGKLLLFEYRHRKMKPVQMVPAISQNWTFHGPGATGQ AAANWIAGFGRGPMPPTLLGIRQNGHAASRRLLGMDEVKGEKQLGRMFYAITLLFLLLWS PYIVACYWRVFVKACAVPHRYLATAVWMSFAQAAVNPIVCFLLNKDLKKCLRTHAPCWGT GGAPAPREPYCVM >gi568815575f:52976622_53177740|GENSCAN_predicted_CDS_2|1122_bp atggccaacactaccggagagcctgaggaggtgagcggcgctctgtccccaccgtccgca tcagcttatgtgaagctggtactgctgggactgattatgtgcgtgagcctggcgggtaac gccatcttgtccctgctggtgctcaaggagcgtgccctgcacaaggctccttactacttc ctgctggacctgtgcctggccgatggcatacgctctgccgtctgcttcccctttgtgctg gcttctgtgcgccacggctcttcatggaccttcagtgcactcagctgcaagattgtggcc tttatggccgtgctcttttgcttccatgcggccttcatgctgttctgcatcagcgtcacc cgctacatggccatcgcccaccaccgcttctacgccaagcgcatgacactctggacatgc gcggctgtcatctgcatggcctggaccctgtctgtggccatggccttcccacctgtcttt gacgtgggcacctacaagtttattcgggaggaggaccagtgcatctttgagcatcgctac ttcaaggccaatgacacgctgggcttcatgcttatgttggctgtgctcatggcagctacc catgctgtctacggcaagctgctcctcttcgagtatcgtcaccgcaagatgaagccagtg cagatggtgccagccatcagccagaactggacattccatggtcccggggccaccggccag gctgctgccaactggatcgccggctttggccgtgggcccatgccaccaaccctgctgggt atccggcagaatgggcatgcagccagccggcggctactgggcatggacgaggtcaagggt gaaaagcagctgggccgcatgttctacgcgatcacactgctctttctgctcctctggtca ccctacatcgtggcctgctactggcgagtgtttgtgaaagcctgtgctgtgccccaccgc tacctggccactgctgtttggatgagcttcgcccaggctgccgtcaacccaattgtctgc ttcctgctcaacaaggacctcaagaagtgcctgaggactcacgccccctgctggggcaca ggaggtgccccggctcccagagaaccctactgtgtcatgtga >gi568815575f:52976622_53177740|GENSCAN_predicted_peptide_3|726_aa MDRPDEGPPAKTRRLSSSESPQRDPPPPPPPPPLLRLPLPPPQQRPRLQEETEAAQVLAD MRGVGLGPALPPPPPYVILEEGGIRAYFTLGAECPGWDSTIESGYGEAPPPTESLEALPT PEASGGSLEIDFQVVQSSSFGGEGALETCSAVGWAPQRLVDPKSKEEAIIIVEDEDEDER ESMRSSRRRRRRRRRKQRKVKRESRERNAERMESILQALEDIQLDLEAVNIKAGKAFLRL KRKFIQMRRPFLERRDLIIQHIPGFWSILDLNCSTHQFLNHPRISILINRRDEDIFRYLT NLQVQDLRHISMGYKMKLYFQTNPYFTNMVIVKEFQRNRSGRLVSHSTPIRWHRGQEPQA RRHGNQDASHSFFSWFSNHSLPEADRIAEIIKNDLWVNPLRYYLRERGSRIKRKKQEMKK RYFPPLSAMSTPPPTNPILSHSLPYSKTRGRCEVVIMEDAPDYYAVEDIFSEISDIDETI HDIKISDFMETTDYFETTDNEITDINENICDSENPDHNEVPNNETTDNNESADDHETTDN NESADDNNENPEDNNKNTDDNEENPNNNENTYGNNFFKGGFWGSHGNNQDSSDSDNEADE ASDDEDNDGNEGDNEGSDDDGNEGDNEGSDDDDRDIEYYEKVIEDFDKDQADYEDVIEII SDESVEEEGIEEGIQQDEDIYEEGNYEEEGSEDVWEEGEDSDDSDLEDVLQVPNGWANPG KRGKTG >gi568815575f:52976622_53177740|GENSCAN_predicted_CDS_3|2181_bp atggaccgcccagatgaggggcctccggccaagacccgccgcctgagcagctccgagtct ccacagcgcgacccgcccccgccgccgccgccgccgccgctcctccgactgccgctgcct ccaccccagcagcgcccgaggctccaggaggaaacggaggcggcacaggtgctggccgat atgaggggggtgggactgggccccgcgctgcccccgccgcctccctatgtcattctcgag gagggggggatccgcgcatacttcacgctcggtgctgagtgtcccggctgggattctacc atcgagtcggggtatggggaggcgcccccgcccacggagagcctggaagcactccccact cctgaggcctcgggggggagcctggaaatcgattttcaggttgtacagtcgagcagtttt ggtggagagggggccctagaaacctgtagcgcagtggggtgggcgccccagaggttagtt gacccgaagagcaaggaagaggcgatcatcatagtggaggatgaggatgaggatgagcgg gagagtatgaggagcagcaggaggcggcggcggcggcggaggaggaagcagaggaaggtg aagagggaaagcagagagagaaatgccgagaggatggagagcatcctgcaggcactggag gatattcagctggatctggaggcagtgaacatcaaggcaggcaaagccttcctgcgtctc aagcgcaagttcatccagatgcgaagacccttcctggagcgcagagacctcatcatccag catatcccaggcttctggtcaatccttgacctaaactgctccactcatcagttcctcaac caccccagaatttcaattttgatcaaccgacgtgatgaagacattttccgctacttgacc aatctgcaggtacaggatctcagacatatctccatgggctacaaaatgaagctgtacttc cagactaacccctacttcacaaacatggtgattgtcaaggagttccagcgcaaccgctca ggccggctggtgtctcactcaaccccaatccgctggcaccggggccaggaaccccaggcc cgtcgtcacgggaaccaggatgcgagccacagctttttcagctggttctcaaaccatagc ctcccagaggctgacaggattgctgagattatcaagaatgatctgtgggttaaccctcta cgctactacctgagagaaaggggctccaggataaagagaaagaagcaagaaatgaagaaa cgttacttcccacctttgagtgctatgagtacaccacctcccaccaaccctatactcagc cacagccttccttacagtaaaaccaggggcagatgtgaggtggtgatcatggaagacgcc cctgactattatgcagtggaagacattttcagcgagatctcagacattgatgagacaatt catgacatcaagatctctgacttcatggagaccaccgactacttcgagaccactgacaat gagataactgacatcaatgagaacatctgcgacagcgagaatcctgaccacaatgaggtc cccaacaacgagaccactgataacaacgagagtgctgatgaccacgaaaccactgacaac aatgagagtgcagatgacaacaacgagaatcctgaagacaataacaagaacactgatgac aacgaagagaaccctaacaacaacgagaacacttacggcaacaacttcttcaaaggtggc ttctggggcagccatggcaacaaccaggacagcagcgacagtgacaatgaagcagatgag gccagtgatgatgaagataatgatggcaacgaaggtgacaatgagggcagtgatgatgat ggcaatgaaggtgacaatgaaggcagcgatgatgacgacagagacattgagtactatgag aaagttattgaagactttgacaaggatcaggctgactacgaggacgtgatagagatcatc tcagacgaatcagtggaagaagagggcattgaggaaggcatccagcaagatgaggacatc tatgaggaaggaaactatgaggaggaaggaagtgaagatgtctgggaagaaggggaagat tcggacgactctgacctagaggatgtgcttcaggtcccaaacggttgggccaatccgggg aagagggggaaaaccggataa >gi568815575f:52976622_53177740|GENSCAN_predicted_peptide_4|93_aa MRMEGILLITEVFRRTELMAQRGKLTGLKSHSQAPPLARSGPASSRYHSPPPPRLCVFLP YTAGSRRSLSQWGLSSLAFSEDSILELDVTLKT >gi568815575f:52976622_53177740|GENSCAN_predicted_CDS_4|282_bp atgagaatggagggaattttgctgataactgaagtattcaggcggacagaacttatggcc cagagaggcaaactgactggcttgaagtcacacagccaggccccacccctcgcgcgctcc ggccctgcctcttcccgctaccactccccaccaccgccccgtctttgcgtcttcttaccg tacaccgccggcagtcggcgcagtctcagccaatggggcctgtcttctctggccttcagc gaagacagtatcttggagctagatgtgaccttaaagacctag >gi568815575f:52976622_53177740|GENSCAN_predicted_peptide_5|333_aa MAGAPPPASLPPCSLSSDCCASNERESVGVGPSEPGAGYNLLVAMEIEIAALIIDNGSGM CKAGFAGDGWPLSCVPLHHQGVMVGMGQKDSYMGHEAQSKHDILTLKYPIEHGIITIWDD MEKIWHHTFYNELCMTPEKHPEIVRDINEKVCYIALDFEQEMATATSSSSLEKSYELPDG QVITIGNKWFWCLQVLFQPSFLGMESCGIHKTTFNSIMKCDVDIRKDLYANTVLSSSTTM CPGITTLASSTMKIKFIMPPEHKYSPPYHHQVPQAWLVAIGLGKFMKPGKVVLVLAECYS GHKAVTMKNVDKGTSIFPYSHALMAGIDCYPRK >gi568815575f:52976622_53177740|GENSCAN_predicted_CDS_5|1002_bp atggcgggtgcccctcccccagcctcgctgccaccttgcagtttgagctcagactgctgt gctagcaatgagcgagaatctgtgggcgtaggaccctccgagccaggggcgggatataat ctcctggttgcaatggaaatagagatcgccgcgctcatcattgacaatggctctggcatg tgcaaagctggctttgctggggatggatggcccctgagctgtgttcccctccatcatcag ggtgtgatggtgggcatgggccagaaggactcctacatgggccatgaggcccagagcaag cacgacatcctgaccctgaagtaccctatcgagcatggcatcatcaccatctgggatgac atggagaagatctggcaccacaccttctacaacgagctgtgcatgaccccagagaagcac ccggagattgtgcgcgacatcaacgagaaggtgtgctacatcgccctggacttcgagcag gagatggccactgccacatcctcctcctccctggagaagagctatgagctgcctgatggc caggtcatcaccatcggcaacaagtggttctggtgtctgcaggtgctgttccagccttcc ttcttgggcatggaatcctgcggcatccacaagaccaccttcaactccatcatgaagtgt gacgtggacatccgcaaagacctgtacgccaacacagtgctgtccagcagcaccaccatg tgccctggcatcaccacgctggcatccagcaccatgaagatcaagttcatcatgccccca gagcacaagtactcgcccccctaccaccaccaagtaccacaggcttggttggttgctata ggcttgggcaagttcatgaaacctgggaaagtggtgctggtcctggctgaatgctactct ggacacaaagctgtcaccatgaagaacgttgataagggcacctcaatcttcccctacagc cacgctctgatggctggaattgactgctatccccgcaaatga >gi568815575f:52976622_53177740|GENSCAN_predicted_peptide_6|120_aa MEQSWMENDELREEGFRQSNYSKLKEEVRTNGKEVKNLEKRLDKSLTRITNAEKSLKDLM ELKTTAQELSDECTSLSSRFDQLEERVSVMEDQMNEMKREEKFREKRIKRNKASKKYGTM >gi568815575f:52976622_53177740|GENSCAN_predicted_CDS_6|363_bp atggaacaaagctggatggagaatgacgagttgagagaagaaggcttcagacaatcaaac tactccaagctaaaggaggaagttcgaaccaatggcaaagaagttaaaaaccttgaaaaa agattagacaaatcactaactagaataaccaatgcagagaagtccttaaaggacctgatg gagctgaaaaccacagcacaagaactatctgatgaatgcacaagcctcagtagccgattc gatcaactggaagaaagggtatcagtgatggaagatcaaatgaatgaaatgaagcgagaa gagaagtttagagaaaaaagaataaaaagaaacaaagcctccaagaaatatgggactatg tga >gi568815575f:52976622_53177740|GENSCAN_predicted_peptide_7|127_aa MEEHSMLMDRKSQYRENAILPKVIYRFNAIPIKLPMTFFTELEKTSLKFIWNQKRACMAK SILSQKNKAGGITLPDFKLYYKAIVTKTAWYWYQNRDIDQWNRTEPSEIMPRIYNHLIFD KPEKNKQ >gi568815575f:52976622_53177740|GENSCAN_predicted_CDS_7|384_bp atggaagaacattccatgctcatggataggaagagtcaatatcgtgaaaatgccatactg cccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttcttcaca gaattggaaaaaacttctttaaagttcatatggaaccaaaaaagagcctgcatggccaag tcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatac tacaaggctatagtaaccaaaacagcatggtactggtaccaaaacagagatatagaccaa tggaacagaacagagccctcagaaataatgccacgtatctacaaccatctgatctttgac aaacctgagaaaaacaagcaatga >gi568815575f:52976622_53177740|GENSCAN_predicted_peptide_8|68_aa MEYYAAIKNDEFMSFVGTWMKLETIILSKLSQGQKTKHRMFSLIEIVAQEVVYESEYEKH EKELYERK >gi568815575f:52976622_53177740|GENSCAN_predicted_CDS_8|207_bp atggaatattatgcagccataaaaaatgatgagttcatgtcctttgtagggacatggatg aagctggaaaccatcattctcagcaaactatcacaaggacaaaaaaccaaacaccgcatg ttctcactcatagagatagtggcccaggaagttgtgtatgagagcgaatatgagaaacat gaaaaggagctgtatgagcgcaaataa >gi568815575f:52976622_53177740|GENSCAN_predicted_peptide_9|78_aa MASRLVLGSAPNRKAAMFSTRTKNAGEELTFDYQMKGSGDVSSDSIDHSPAKKRVSGIVA PQNRDFEDVNEGLLEQRN >gi568815575f:52976622_53177740|GENSCAN_predicted_CDS_9|237_bp atggcatcccgtttggtcctgggatcagctccaaataggaaggcagcaatgttttccacg agaaccaaaaatgctggagaagagctgacttttgattatcaaatgaaaggttctggagat gtatcttcagattctattgaccacagcccagccaaaaagagggtcagcggcatagtggcg ccccagaacagggacttcgaggacgtgaacgaaggtctgctggagcagaggaactga