GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:35:21 Sequence gi568815597f:63223059_63424492 : 201434 bp : 40.78% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8446 8500 55 0 1 75 85 80 0.710 7.90 1.02 Intr + 14914 14953 40 2 1 115 67 44 0.057 1.36 1.03 Intr + 17507 17651 145 2 1 38 41 138 0.068 3.46 1.04 Term + 17940 19433 1494 2 0 -16 43 565 0.058 31.15 1.05 PlyA + 19765 19770 6 1.05 2.03 PlyA - 22643 22638 6 1.05 2.02 Term - 36568 36452 117 1 0 110 38 73 0.760 2.06 2.01 Init - 39468 39142 327 0 0 59 98 236 0.998 19.07 2.00 Prom - 40565 40526 40 -4.65 3.00 Prom + 40852 40891 40 -8.05 3.01 Init + 41404 41458 55 0 1 75 99 -20 0.882 -0.70 3.02 Intr + 43897 44070 174 2 0 28 59 282 0.799 18.29 3.03 Term + 63953 64356 404 0 2 -15 53 320 0.106 12.53 3.04 PlyA + 64546 64551 6 1.05 4.00 Prom + 66054 66093 40 -4.65 4.01 Init + 67476 67569 94 2 1 38 87 60 0.167 1.49 4.02 Intr + 93973 94143 171 2 0 -42 30 221 0.056 2.29 4.03 Intr + 96508 96831 324 2 0 0 50 321 0.395 14.32 4.04 Intr + 97030 97252 223 2 1 51 93 134 0.499 6.46 4.05 Term + 99928 101437 1510 1 1 1 50 2071 0.704 183.59 4.06 PlyA + 102050 102055 6 1.05 5.07 PlyA - 102085 102080 6 1.05 5.06 Term - 102798 102727 72 0 0 61 38 60 0.729 -4.77 5.05 Intr - 103385 103254 132 2 0 83 59 110 0.891 7.52 5.04 Intr - 104180 104009 172 1 1 3 90 166 0.468 7.32 5.03 Intr - 105183 105051 133 1 1 97 3 74 0.702 -1.42 5.02 Intr - 106071 105949 123 1 0 82 78 68 0.876 4.84 5.01 Init - 107453 107285 169 2 1 72 23 187 0.853 10.34 5.00 Prom - 108997 108958 40 -6.55 6.00 Prom + 119427 119466 40 -7.95 6.01 Init + 121557 121671 115 2 1 61 46 111 0.646 4.62 6.02 Intr + 125171 125273 103 0 1 69 39 115 0.491 3.01 6.03 Term + 127484 127604 121 2 1 114 38 52 0.460 -0.23 6.04 PlyA + 127648 127653 6 1.05 7.04 PlyA - 128416 128411 6 1.05 7.03 Term - 137328 136871 458 1 2 -19 48 435 0.252 23.10 7.02 Intr - 140047 139884 164 0 2 89 97 101 0.271 9.80 7.01 Init - 144171 144098 74 0 2 61 38 93 0.089 2.19 7.00 Prom - 145763 145724 40 -5.85 8.03 PlyA - 146268 146263 6 1.05 8.02 Term - 157317 157253 65 1 2 88 44 67 0.363 -0.63 8.01 Init - 161266 161131 136 1 1 59 98 105 0.988 8.95 8.00 Prom - 164804 164765 40 -1.05 9.00 Prom + 164910 164949 40 -7.95 9.01 Init + 166902 167292 391 2 1 32 42 234 0.480 10.18 9.02 Term + 177946 178136 191 2 2 58 44 150 0.587 4.23 9.03 PlyA + 178700 178705 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 17970 19433 1464 2 0 42 43 543 0.902 40.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:63223059_63424492|GENSCAN_predicted_peptide_1|577_aa MTLDAGKDMEQQELYFIAACKRPTVGLYRVIVLNQEEVESLNRPITGSEIVAIINSLPTK KSPGPDGFTAEFYQRYKEELHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDG TYFKIIRAIYDKTTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKE IKGIQLGKEEVRLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYT NNRQPESQIMSELPFTIASKRIKYLGIQLTREVKDLFKENYKPLLKEIKEDTNKWKNIPC SWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQK NKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNK QWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKISSRWIKDLNVRPKTIKTLEENLGIT IEDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYS SDKGLISRIYNELKQIYKKKTNNPIKKWAKDMNKVFN >gi568815597f:63223059_63424492|GENSCAN_predicted_CDS_1|1734_bp atgacactagatgctggcaaagatatggagcaacaggaactctatttcattgctgcttgc aaacggcctactgtggggctttaccgtgtgatcgtactaaaccaggaagaagttgaatct ctgaatagaccaataacaggctctgaaattgtggcaataatcaatagtttaccaaccaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggaactg catataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaaaaagcc tttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattgatggg acgtatttcaaaataataagagctatctatgacaaaaccacagccaatatcatactgaat gggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctctca ccgctcctattcaacatagtgttggaagttctggccagggcaatcaggcaggagaaggaa ataaagggtattcaattaggaaaagaggaagtcagattgtccctgtttgcagatgacatg attgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaac ttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacacc aacaacagacaaccagagagccaaatcatgagtgaactcccattcacaattgcttcaaag agaataaaatacctaggaatccaacttacaagggaggtgaaggacctcttcaaggagaac tacaaaccactgctcaaggaaataaaagaggacacaaacaaatggaagaacattccatgc tcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttacaga ttcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactact ttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaag aacaaagctggaggcatcacactacctgacttcaaactatactacaaggctacagtaacc aaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagagccc tcagaaataatgccgcatatctacaactatctgatctttgacaaacctgagaaaaacaag caatggggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgt agaaagctgaaactggatcccttccttacaccttatacaaaaatcagttcaagatggatt aaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcattacc attgaggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagcaatggcaaca aaagccaaaattgacaaatgggatctaattaaactaaagagcttctgcacagcaaaagaa actaccatcagagtgaacaggcaacctacaacatgggagaaaattttcgcaacctactca tctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaa acaaacaaccccatcaaaaagtgggcgaaggacatgaacaaggttttcaattga >gi568815597f:63223059_63424492|GENSCAN_predicted_peptide_2|147_aa MVVSLTYDSPVKGVWSALGSVTIRRHGSQKATWIWLAKEPEIRVAPVAAANFSIACWLTF LEDVTLTAAEFPNGNNGTSCQPKLLAGPLQIYHVDAITFPFVVYHLEVKGVALVIIVQHG TSVMSCPMTHDLKAPRSCHVTFSLASH >gi568815597f:63223059_63424492|GENSCAN_predicted_CDS_2|444_bp atggtagtcagtttaacatacgatagccctgtgaaaggtgtgtggtcagccctgggatca gtaaccatcagaagacatggctcccagaaggctacctggatctggttagcaaaggaacca gaaataagagtggctccagtggcagcagcaaacttcagcatagcctgctggctgacattc ctggaggatgtgacactgacagcagctgagtttccaaatggcaacaatggcacaagctgc cagccaaagcttcttgcaggtcctcttcagatttatcatgtagatgccatcacttttcct tttgtagtgtatcatttggaagtcaagggtgtggcccttgtcatcatagtccagcatgga acttcagtcatgtcatgtcccatgacccatgacctcaaggctcccagaagctgccacgtg acattttcacttgcatcccattag >gi568815597f:63223059_63424492|GENSCAN_predicted_peptide_3|210_aa MGLLNGTLPPCGIPLWGREQDPVSKEEGGVGEREGGGGGRGGGGRGGGEGEGKGEGGGRG GEGEEGEEGGGGEGGGAWPLLIEAPGLYLQTRLVQASSCRPRIQVHPCRPRLQAYAYGQM PGLSQFLASPCRLRLKTCLGGRLAPVGSGFRKAPMDTGSASALTDTISKPVATDSVNRST PVDPGARPAPTDPESRIIPSDPSSRPAPVD >gi568815597f:63223059_63424492|GENSCAN_predicted_CDS_3|633_bp atggggttgctaaatggcacgttgcctccttgtggcataccattatgggggagagagcaa gaccctgtctcaaaagaagaaggaggagtaggagaaagagaaggaggaggaggaggaaga ggaggagggggaagaggaggaggagaaggagaaggaaaaggagaaggaggaggaagagga ggagaaggggaagaaggagaagaaggaggaggaggagaaggaggaggagcttggcccctg ttaattgaggctccagggctatacctgcaaaccaggctggtccaggccagctcctgcaga cccaggatccaggtccacccctgtagacccaggctccaggcctatgcctatggacagatg ccaggcctgtcccagtttctggccagcccctgcagactcaggctcaagacctgccttggt ggcagattggccccagtgggctcaggattcaggaaagcccccatggacacaggctctgca tctgccctcacagacacaatatctaagcctgttgcaacagactcagtcaacagatctacc ccagtggaccctggtgccagaccagcccccacagacccagaatccaggatcattccatca gacccaagctctaggcctgccccagtggactga >gi568815597f:63223059_63424492|GENSCAN_predicted_peptide_4|773_aa MCLNTIKATYDKPTASNKLNSEKLKAFPLRSEASESQANLPGRNKTEGLGAEVLPARPPE GAERQRRTSRTLQEAGRLRRPKRRRTRGAANCNYAANNLFSPTSSRSPAGPCTASGSDPS AGAASCVKASRNLPVALPGRRFASFPPPEVVQALWSAWREKAGGRFLSRERPRKNPLPPL EIDIRPARARDVGAAARKKRNKWNRQLGVVRPPPPSPGRFQGDTGEWTDQAGPWQREGRR QRPAAWRESRELASADGGIEIRLIWRLLRDRSVAAPEHPHRPLPPPRRRYQPRGGMTLSG GGSASDMSGQTVLTAEDVDIDVVGEGDDGLEEKDSDAGCDSPAGPPELRLDEADEVPPAA PHHGQPQPPHQQPLTLPKEAAGAGAGPGGDVGAPEADGCKGGVGGEEGGASGGGPGAGSG SAGGLAPSKPKNSLVKPPYSYIALITMAILQSPQKKLTLSGICEFISNRFPYYREKFPAW QNSIRHNLSLNDCFVKIPREPGNPGKGNYWTLDPQSEDMFDNGSFLRRRKRFKRHQQEHL REQTALMMQSFGAYSLAAAAGAAGPYGRPYGLHPAAAAGAYSHPAAAAAAAAAAALQYPY ALPPVAPVLPPAVPLLPSGELGRKAAAFGSQLGPGLQLQLNSLGAAAAAAGTAGAAGTTA SLIKSEPSARPSFSIENIIGGGPAAPGGSAVGAGVAGGTGGSGGGSTAQSFLRPPGTVQS AALMATHQPLSLSRTTATIAPILSVPLSGQFLQPAASAAAAAAAAAQAKWPAQ >gi568815597f:63223059_63424492|GENSCAN_predicted_CDS_4|2322_bp atgtgcctcaacacaataaaggccacatatgacaaacccacagctagcaataaactcaac agtgagaagttgaaggcatttcctctaagatcagaggcatccgaaagccaggccaacttg cctggacgtaacaagacggaagggctgggcgctgaggtcctgccagcccggccgccagag ggagctgagcgccagaggaggacaagccgaacccttcaggaggccgggcgtctccggaga ccgaagcgccggaggacccgaggagctgcaaactgcaactacgcggcaaacaatttattt agcccgacatctagccggtctccggcaggaccctgcaccgcgtcgggatcggacccttcc gctggggcggcctcctgcgtcaaggccagcaggaaccttcctgtcgccctccccggccgc cgcttcgcctccttcccgcccccggaggttgtgcaggcgctatggtccgcctggagggag aaagccggcggccggttcctgagccgagagcggccgcggaaaaatcctctgcctccgctg gaaatcgatattaggccggcgcgggcgcgggacgtcggggccgcagccagaaaaaagcgc aacaaatggaaccggcagctgggagttgttcgtcctccacccccttccccagggaggttc caaggagacaccggggaatggacggatcaggctgggccgtggcagagggagggtaggagg cagcgaccagcagcgtggagggagtccagagagctagcctctgcggacggcggaatcgaa attaggctcatttggagactacttcgagaccgctccgtggcagcccccgaacaccctcat cgcccgctgccccctccccgccgccgctaccaaccccgaggagggatgaccctctccggc ggcggcagcgccagcgacatgtccggccagacggtgctgacggccgaggacgtggacatc gatgtggtgggcgagggcgacgacgggctggaagagaaggacagcgacgcaggttgcgat agccccgcggggccgccggagctgcgcctggacgaggcggacgaggtgcccccggcggca ccccatcacggacagcctcagccgccccaccagcagcccctgacattgcccaaggaggcg gccggagccggggccggaccggggggcgacgtgggcgcgccggaggcggacggctgcaag ggcggtgttggcggcgaggagggcggcgcgagcggcggcgggcctggcgcgggcagcggt tcggcgggaggcctggccccgagcaagcccaagaacagcctagtgaagccgccttactcg tacatcgcgctcatcaccatggccatcctgcagagcccgcagaagaagctgaccctgagc ggcatctgcgagttcatcagcaaccgcttcccctactacagggagaagttccccgcctgg cagaacagcatccgccacaacctctcactcaacgactgcttcgtcaagatcccccgcgag ccgggcaacccgggcaagggcaactactggaccctggacccgcagtccgaggacatgttc gacaacggcagcttcctgcggcgccggaaacgcttcaagcgccaccagcaggagcacctg cgcgagcagacggcgctcatgatgcagagcttcggcgcttacagcctggcggcggcggcc ggcgccgcgggaccctacggccgcccctacggcctgcaccctgcggcggcggccggtgcc tattcgcacccggcagcggcggcggccgcggctgctgcggcggcgctccagtacccgtac gcgctgccgccggtggcaccggtgctgcctcccgctgtgccgctgctgccctcgggcgag ctgggccgcaaagcggccgccttcggctcacagctcggcccgggcctgcagctgcagctc aatagcctgggcgccgccgcggccgctgcgggcacagcgggcgccgcgggcaccaccgcg tcgctcatcaagtccgagccaagcgcgcggccgtcgttcagcatcgagaacatcataggt gggggccccgcggctcctgggggctcggcggtgggcgctggggtcgccggcggcactggg ggttcagggggcggcagcacggcgcagtcgtttctgcggccacccgggaccgtgcagtcg gcagcgctcatggccacccaccaaccgctgtcgctgagccggacgactgccaccatcgcg cccattcttagcgtgccactctccggacagtttctgcagcccgcagcctcggccgccgcc gctgctgcggccgccgctcaagccaaatggccggcgcaatag >gi568815597f:63223059_63424492|GENSCAN_predicted_peptide_5|266_aa MQGGKEPKLSPSIRRAPQLRSLDPGKEHGKRGVRRGGSGGAGDGGECASRCRFAAWGSKE WLVHSSEGKEGEWRREKLALGVTLPTVGGQQGEGGLLTCTTGDTGGVRGHKALEPTFFQI TYPSGQFLKGLTLKIPTSHTLMFKKLKRKRAFEGATAGPIFLTSARTLNNRWQLEGLGPR VSCLPRHKGRGPSLGRRSRDPDPEGMFTQDTIMFTVYQENRLGICKEVSFKATPSPRRQR RNEKGFGLEGEEASIGPLQKSNYVVD >gi568815597f:63223059_63424492|GENSCAN_predicted_CDS_5|801_bp atgcagggaggaaaagaaccaaagctctccccttcaataagacgtgcaccgcagctccgg tccctggaccccgggaaggagcatggcaaacgcggagtccgcagaggcggaagcggcggg gccggggatggcggagagtgtgcttctcgctgccggttcgcggcctgggggagcaaagaa tggcttgtacattcctcagagggcaaggagggggagtggcgaagagagaagttggctctg ggtgtaactctgccaactgttggagggcagcaaggggagggtggtcttttaacttgtacc accggggacacaggtggagtccggggacacaaggctctggaacccactttctttcagatc acatacccctcaggacagtttctcaagggactcacgttaaaaatacccacctctcacaca ctcatgtttaaaaagcttaaacgaaaaagagcttttgagggtgcgacggccgggcccatc ttcctgacatctgccaggactttaaacaaccgctggcagcttgagggtctgggtccgaga gtttcctgccttccacgacataaaggaaggggaccatctttaggccgtcgttcaagggac cctgatcccgaaggaatgtttactcaagacacaataatgtttaccgtttaccaggaaaac agattgggtatttgcaaggaagtgtctttcaaagcgacccccagcccacgtcgacagcgg agaaatgagaaaggctttgggttggagggagaggaagcatctataggaccccttcagaaa tccaattatgttgtggattga >gi568815597f:63223059_63424492|GENSCAN_predicted_peptide_6|112_aa MKEEQQYRLKTKNLEDKVVGARSNRHEKPLGLSREVEKEAEVIERERFEDTVVLALKMKE GAMSQGMQTAFRRLTWASLYDASVQREQEQTLQGFLRPRLQRSSYNTSATFY >gi568815597f:63223059_63424492|GENSCAN_predicted_CDS_6|339_bp atgaaagaggagcagcaatatagactaaagaccaagaacttagaagacaaagtagttgga gccagaagcaaccgccatgagaagcccctaggtctttccagagaggttgagaaggaagca gaggttatagaaagagaaagatttgaagatactgtggtgctagctttgaagatgaaggaa ggagccatgagccaaggaatgcagacagcttttagaaggttaacctgggcttctttgtac gatgccagtgttcaaagagagcaagagcagacgctgcaaggtttcctgaggcctagactc cagaggagctcatacaacacttctgccacattctattag >gi568815597f:63223059_63424492|GENSCAN_predicted_peptide_7|231_aa MPAKKKTGQPPKVRNIEHPTICESLKFRKLFTRQLTSQREHSKRQRQKLQISYTPALQVT QQPSHDTRRVEIDSASQWEAKLYYNWVMVNSKLRKLDNAIQDCTNTVKLNDTYLKACLRR AQCYMNTKQYEEAVHNYEKVYQTEKTKEHKQLLKNAQLKLKKSKKKDYYKILGVDNNASE DETKKAYHKWALMHHPDWHSGGSAEVQKEEGKFKEIGEAFTIFSDPKKKTR >gi568815597f:63223059_63424492|GENSCAN_predicted_CDS_7|696_bp atgcctgcgaagaagaagactgggcagccccctaaagtaagaaacattgaacatccaacc atttgcgagagcctaaagtttcgcaaactttttacacggcagctcacttcccagagagag cattccaagcggcaaaggcaaaagttgcagatctcttacaccccagccttacaagtcaca cagcaacctagccatgatacaaggagagtggaaatagattctgcctctcagtgggaagct aaactctactataattgggttatggttaattctaagcttaggaaactagataatgcaata caagactgcacaaatacagtgaagctcaatgacacctatttaaaagcctgcttgagaaga gctcagtgttacatgaacaccaaacagtatgaagaagcagtgcacaattatgaaaaagtg tatcagacagagaaaacaaaagaacacaaacagctcctaaaaaatgcacagctgaaactg aagaagagtaagaagaaagattactacaagattctgggagtggacaataatgcctctgag gacgagaccaagaaagcttatcacaaatgggccttgatgcaccatccagattggcacagt ggaggcagtgctgaggttcagaaggaggaggggaagttcaaggaaattggagaggccttt accatcttctctgatcccaagaaaaagactcgctag >gi568815597f:63223059_63424492|GENSCAN_predicted_peptide_8|66_aa MGKRSEYTFFKRRHTNGKQEYEKVLNITDDQRNANQNYNETSSHPATKLLPFFRSRTNVL KVAMSI >gi568815597f:63223059_63424492|GENSCAN_predicted_CDS_8|201_bp atgggcaaaagatctgaatacacatttttcaaaagaagacatacaaatggcaaacaggaa tatgaaaaagtgctcaacatcactgatgatcagagaaatgcaaatcaaaactacaatgag acatcatctcacccagccaccaaactacttcctttcttccgttcacgaacaaacgtcctg aaagttgctatgtccatttga >gi568815597f:63223059_63424492|GENSCAN_predicted_peptide_9|193_aa MGAGGGVTKVPVWPSHWDCATSDLKLAQHWVLPKACGSHCLIPADVHSRPKGSSSAGIES SQAYALSLGWQLPQSTAYVGSEMLSGSQGLELGTLIRSQAWNPGLQESAWCCAPLWPEAR AWNVGLRTLPDSSCQKGLQLTQRFLDSLLLGFKSQPDMEPKIVHMDVSGGTMATSWYHTV VLYAQMSWLSTMP >gi568815597f:63223059_63424492|GENSCAN_predicted_CDS_9|582_bp atgggagctggaggaggggtgacaaaagtacccgtgtggccatcacattgggactgtgct acatcagacctgaagctagcacagcactgggtcttacccaaggcctgcggcagtcactgc ctgattcctgccgatgttcactcaaggcccaagggttcttcatcagcaggtattgaatcc agccaggcttatgccctttccttagggtggcagcttccccagtccacagcctatgttggg tcagaaatgctgtctgggagccaaggcctggagttaggaaccttaatcaggagccaggcc tggaatccgggactccaggagtctgcctggtgctgtgctccactgtggcctgaagctagg gcctggaatgtgggcctcaggactctgcctgattcaagctgccaaaaggggctccaacta acccagagattcttagactcgctgcttcttggcttcaagtcccagccagacatggaaccc aagattgttcacatggatgtttctggagggacaatggccaccagctggtaccatactgta gtactctatgcccagatgtcttggctgtctaccatgccttga