GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:31:51 Sequence gi568815584f:35726517_35970474 : 243958 bp : 38.78% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.09 PlyA - 1722 1717 6 1.05 1.08 Term - 1994 1842 153 2 0 83 42 69 0.020 -1.26 1.07 Intr - 4881 4843 39 0 0 129 54 35 0.031 1.70 1.06 Intr - 12134 11997 138 2 0 61 45 182 0.615 10.94 1.05 Intr - 16049 15852 198 2 0 96 57 204 0.920 16.73 1.04 Intr - 22308 22069 240 0 0 98 92 213 0.999 19.52 1.03 Intr - 24174 23966 209 1 2 78 95 160 0.613 13.57 1.02 Intr - 34490 34313 178 2 1 72 69 111 0.634 6.17 1.01 Init - 36896 36810 87 2 0 42 111 25 0.353 0.89 1.00 Prom - 37163 37124 40 -6.45 2.00 Prom + 37880 37919 40 -8.55 2.01 Sngl + 39396 40178 783 2 0 20 42 418 0.918 26.11 2.02 PlyA + 41185 41190 6 1.05 3.07 PlyA - 41271 41266 6 1.05 3.06 Term - 44483 44431 53 0 2 67 35 47 0.091 -6.09 3.05 Intr - 49229 49119 111 0 0 114 116 48 0.598 9.63 3.04 Intr - 63238 63143 96 2 0 109 60 28 0.088 1.16 3.03 Intr - 82409 82214 196 2 1 -32 85 381 0.885 23.87 3.02 Intr - 82566 82526 41 1 2 96 46 -22 0.003 -8.58 3.01 Init - 93609 93441 169 0 1 97 85 186 0.429 18.95 3.00 Prom - 94661 94622 40 -2.45 4.00 Prom + 95652 95691 40 -3.75 4.01 Init + 98552 98599 48 1 0 67 80 19 0.826 0.20 4.02 Intr + 99585 99846 262 2 1 63 24 254 0.938 12.64 4.03 Intr + 99948 100142 195 1 0 7 85 272 0.890 17.26 4.04 Intr + 104894 104984 91 0 1 74 89 97 0.965 6.53 4.05 Intr + 106462 106589 128 1 2 46 91 156 0.909 11.10 4.06 Intr + 108328 108402 75 2 0 73 40 89 0.184 1.17 4.07 Term + 111527 111540 14 0 2 101 41 7 0.054 -5.31 4.08 PlyA + 111809 111814 6 1.05 5.03 PlyA - 112602 112597 6 1.05 5.02 Term - 113633 113291 343 1 1 40 47 222 0.131 6.30 5.01 Init - 116789 116644 146 2 2 68 80 158 0.107 12.64 5.00 Prom - 120636 120597 40 -2.95 6.00 Prom + 126492 126531 40 -4.25 6.01 Init + 128330 128386 57 1 0 91 66 -12 0.495 -1.74 6.02 Intr + 136074 136170 97 2 1 105 110 69 0.952 9.46 6.03 Intr + 138431 138483 53 0 2 42 86 39 0.311 -3.19 6.04 Intr + 141350 141516 167 1 2 105 93 69 0.790 6.94 6.05 Intr + 155059 155249 191 1 2 91 80 144 0.169 12.21 6.06 Term + 171379 171461 83 2 2 70 53 110 0.007 2.58 6.07 PlyA + 172649 172654 6 -3.44 7.06 PlyA - 172702 172697 6 1.05 7.05 Term - 173585 173451 135 2 0 77 47 167 0.269 8.54 7.04 Intr - 177155 177111 45 2 0 67 87 77 0.022 3.19 7.03 Intr - 195899 195805 95 0 2 72 90 77 0.424 5.06 7.02 Intr - 197140 196780 361 1 1 -6 29 278 0.114 6.37 7.01 Init - 199866 199669 198 0 0 36 87 158 0.739 9.45 7.00 Prom - 205491 205452 40 -4.95 8.00 Prom + 218865 218904 40 -5.85 8.01 Sngl + 221561 223366 1806 1 0 66 44 550 0.715 42.49 8.02 PlyA + 224067 224072 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 143844 143961 118 0 1 124 37 59 0.969 1.43 S.002 Init - 230184 230093 92 0 2 30 50 123 0.827 2.71 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:35726517_35970474|GENSCAN_predicted_peptide_1|413_aa MVSSGEERNNMIWASVIPVQLSKTLEIMQIRREGVRLFLLWLQALQNNCSKEQLWMFSCL IPGFSAPQSEHGPRTLDNLINPPLNLQENIPQMRPKPHYVVIKKDAETNEAIYCTKEPFI KARVIVIRWLVSFWLEPKPHTGPHIPGMEGEVLPKNIQRAAASLVSREESKNDNADKTDR TTEPEQSHSNTSTLTEREPSSSSLCSIDEEHLTDIEIVRRVFSSKRSNVNFVTEIFRQAF LLPICEAAAMRKVVKVYQEWIQQEEKPLFMQEPEEIVITSSDLPCIENVTDHDISMEEGE KREEENGTNTADHVRNSSWAKNGSYQGALHNASEEATEQNIRAGTQAVLQVNQGFIPGLD PLLVFIINSSNIFLLEPANEIKNLLDEHTDMCKRILNIYRYMVVQVSMDKKTW >gi568815584f:35726517_35970474|GENSCAN_predicted_CDS_1|1242_bp atggtttcctctggggaagaaaggaacaatatgatttgggcctcagtgatacctgtacag ttaagtaaaacattggagataatgcagattaggcgggaaggtgttcgtcttttcttacta tggttgcaagctcttcagaataactgtagcaaagaacagctctggatgttttcatgctta atccctggattttcagcaccacagtctgaacatggacctcgaactttagataatctcatt aatcctccactcaaccttcaagaaaacatcccgcagatgagaccaaagccacattatgtc gtgataaagaaagatgctgaaaccaatgaagcaatctattgtacaaaggagcctttcatt aaggctcgtgttattgtcattcgttggctggtttctttctggctggagccaaaaccacat acaggacctcatattcctgggatggaaggtgaagtcttgccaaagaatattcagagagca gctgctagtttagtatccagagaagaaagcaaaaatgataatgctgataaaacagacaga actacagaacccgaacagtctcattccaatacaagcactctcacggagcgagaacctagc tcatctagtctctgtagtattgatgaagaacatctcacagacattgaaatagttcgcaga gttttttcttctaaaaggagtaatgtaaactttgtgacagagatatttcgtcaggcattt ttattaccaatttgtgaagcagcagctatgagaaaagtggtaaaagtatatcaagaatgg atccaacaagaggaaaaacctttgttcatgcaagagcctgaagaaattgtgatcacttct tcagacctcccttgcattgaaaatgtcacagaccatgatatttcaatggaagaaggagaa aaaagagaagaggaaaatgggaccaatactgctgatcatgttcgaaattccagttgggca aaaaacggctcctaccaaggtgctcttcataacgcctctgaagaagccacagaacaaaac atacgagctggtacccaggcagttttgcaggtaaatcagggatttattcctggtttggat ccattgctcgtgtttattataaactcatcaaatatatttcttcttgaacctgcaaatgaa ataaaaaatcttctggatgaacacacagatatgtgtaaacgcattcttaacatttatcgg tacatggttgtacaagtatcaatggacaaaaagacttggtaa >gi568815584f:35726517_35970474|GENSCAN_predicted_peptide_2|260_aa MRSGNAEGVFGPVKNPWSYSKQYREKRKQNLHFENEDSDWLITGGSSGGSAVAISAFTCY AALGSDTGGLTRNPAAYCGLVGFKPNYGLVSHHGLISLEDLMEVQGILTRCVDDAAIVLG VLAEHDPTDSTTAQDPVKPFMIPSLTNVSKLCIGIPEEYLVLELSSEVQSLWSKAADLFE SEGAKAIEVSFSHITYSIVCYHLLCTSEAASSIARSDGLQYGHRCDTDVSTEAMYAATRR EGFNDFVRGRILSGNVFLLK >gi568815584f:35726517_35970474|GENSCAN_predicted_CDS_2|783_bp atgagatctggaaatgcagaaggcgtatttggaccagttaaaaacccctggagttattca aaacaatatagagaaaagaggaagcagaatctccactttgagaatgaagattcagactgg ctgataactggaggaagctcaggtgggagtgcagttgccatatcggcattcacatgctat gcggctttaggatcagatactggagggctgaccagaaatcctgctgcctactgtgggctt gttggtttcaaaccaaactatggcttagtttcccatcatggtctcatttccctggaggat ttgatggaagtgcaaggaatcttaaccagatgtgtggatgatgcagcaattgtcttgggt gtactggctgaacatgaccctacggattctaccacagcacaggatcctgttaaaccattc atgattcccagtttgacaaatgtgagcaaactatgtataggtattccagaggaatatctt gtactggaattatcaagtgaagtacagtctctttggtccaaagctgctgacctctttgag tctgagggggccaaagcaattgaagtatccttttctcacattacttattcaattgtctgc taccatctattgtgcacatcagaagcagcatcgagtatagcaagatctgatgggctacaa tatggtcacagatgtgacactgatgtatctactgaagccatgtatgctgcaaccagacga gaagggttcaatgattttgtaagaggaagaattctctcaggaaatgttttcttgttaaaa taa >gi568815584f:35726517_35970474|GENSCAN_predicted_peptide_3|221_aa MAVFIRLKLAHNHADDAAMRRELMVLRGDAVLDFLEPKARQLLHNGSFPLELLSFPRPGE GSAILELGPKSPPRPLPPLSAAGTRLGAGCGSGSGSGSDRMFSKKPHGDVKKSTQKVLDT KKDALTRLKHLRIVIDLFSSNSLYQWSPNFLTPGTSFVEDSFSTDSGENAESIDLKQFFD QHFSHIYYVFFENFVTIEASLKQKENFTTSSRKNSSAMAVS >gi568815584f:35726517_35970474|GENSCAN_predicted_CDS_3|666_bp atggcggtcttcatccgccttaagctggcccacaaccatgctgatgatgcagctatgcgg cgtgagctgatggtcctgcgcggtgatgctgtgctggattttctggaaccgaaggctaga caacttctccacaatggcagctttcccctggaactgttgtccttcccacggccgggagag ggatccgccatattggagctggggccgaagtctcctccccgtccactccctccactttct gccgccgggacccggctgggagccggctgtggcagtggcagtggcagtggcagcgacagg atgttctccaagaagccgcacggggacgtgaagaagtccacccagaaggtgctagacacc aagaaggacgcactgactcgcctcaagcacctgcgcatcgtcatcgatctcttttcttct aactctttatatcagtggtccccaaactttttgacaccagggaccagtttcgtagaagac agtttttctacagacagtggggagaatgcagaatctattgatcttaaacagtttttcgac caacatttttcacatatatactatgtgttctttgaaaattttgtgactattgaagctagt cttaaacagaaagaaaattttacaacttcttccagaaagaattcatcagcgatggcagtt tcatag >gi568815584f:35726517_35970474|GENSCAN_predicted_peptide_4|270_aa MAAGQRRSSLSRLGSQFINCDPECQLSCLVGLEVQKEEVIRAAEGWKRAAPAGLRPTNYN SRQASEGRVVIAGNQSYQSRASPRPLARWKASANGRAPIAEPAAAVAGLGAGSGKRRRGW KMPVHSRGDKKETNHHDEMEVDYAENEGSSSEDEDTESSSVSEDGDSSEMDDEDCERRRM ECLDEMSNLEKQFTDLKDQLYKERLSQVDAKLQEVIAGKAPEYLEPLATLQENMQIRTKV AGIYRELCLESVKNKYECEIQASRQHCLGV >gi568815584f:35726517_35970474|GENSCAN_predicted_CDS_4|813_bp atggcggccgggcagagacgctcctcactttccagactgggcagccagtttataaactgc gatccagagtgtcagctttcgtgccttgtgggacttgaagtccagaaggaagaagtgatc agggctgccgagggatggaagcgtgcagcccctgcaggtctgagacctacgaactacaac tcccggcaagcatcggaaggccgcgtcgtaattgcgggcaaccaatcgtaccagagcagg gcaagcccccgccctctggctcggtggaaggcctccgccaatggcagagctcccatcgca gagcctgcggcggcggtggccgggctgggcgccggtagtggaaagcgacggcgcggctgg aaaatgccagtccattcccgaggggataagaaggagaccaaccatcacgatgagatggag gtggactacgccgaaaatgaggggagcagctccgaggacgaggacactgagagctcgtcg gtctccgaggatggagatagctcagaaatggatgatgaagactgtgaaagaagaagaatg gaatgtttggatgaaatgtccaatcttgaaaaacagtttaccgatctcaaagatcaactt tataaagaacgattaagtcaggtggatgcaaaactacaagaagtcatagctggaaaagca ccagaatacttggaaccgctggcaactttacaggaaaatatgcaaattcgtacaaaggta gcaggaatctatagagagctctgcttagaatctgtaaagaacaaatatgaatgtgaaatt caagcttctcgccagcattgcctcggtgtgtga >gi568815584f:35726517_35970474|GENSCAN_predicted_peptide_5|162_aa MRKNQCKKAENSKNQNASSPPKDHNSSPAREQNWTENELDELAEVGFSRTNDKNHMIISI DAENVFDKIQHHFMLKTLNELGIDGMYLKIIRAIYDKPTANIILNGQKLEAIPLKTGTRQ GCPLSPLVFDIVLEVLARAIRQEKEIKGIQIGRGEVKLSVCR >gi568815584f:35726517_35970474|GENSCAN_predicted_CDS_5|489_bp atgaggaaaaaccagtgcaaaaaggctgaaaattccaaaaaccagaacgcctcttctcct ccaaaggatcacaactcctcaccagcaagggaacaaaactggacagagaatgagcttgac gaattggcagaagtaggcttcagtaggaccaatgacaaaaaccacatgattatctcaata gatgcagaaaatgtcttcgataaaattcaacaccacttcatgctaaaaactctcaatgaa ctaggtattgatggaatgtatctcaaaataataagagctatttatgacaaacccacagca aatatcatactgaatgggcaaaagctggaagcaatccctttgaaaaccggcacaagacaa ggatgtcctctctcaccactcgtattcgacatagtattggaagttctggccagggcaatc cggcaagagaaagaaataaagggtattcaaataggaagaggggaagtcaaattgtctgtt tgcagatga >gi568815584f:35726517_35970474|GENSCAN_predicted_peptide_6|215_aa MVCHQPMTNFVLGISWKVQSEKLLLYDTVQSELEEKIRRLEEDRHSIDITSVYMLQDLDI LEDWTTIRKCFINITVFELIPLAPVKLEKHLHSARSEEGRLYYDGEWYIRGQTICIDKKD ECPTSYDQSSYSQQNTCGKPSSYGQQSSYGQQSSYRKHPPTSYLPQTGSCSQASTLEETS ASSFERLMTPGGSSSGYSCSSVTKGREGPAVYGQA >gi568815584f:35726517_35970474|GENSCAN_predicted_CDS_6|648_bp atggtgtgccaccagccaatgactaattttgtgcttgggatttcttggaaggtgcagagc gaaaagctgttgctatatgatacagtccagagtgaactagaggagaagataagaaggctt gaagaggataggcacagcattgatattacctcagtttatatgctacaagatcttgatatt cttgaagactggacaacaattaggaagtgttttattaatataacagtttttgaactgatc cctttagcacctgtgaaactggaaaaacatctgcacagtgctagatctgaagagggaaga ctatattatgatggtgaatggtatatacgtggacaaacaatatgtattgataaaaaagat gaatgtcctacaagctatgatcagagcagttactctcagcagaacacctgtgggaaaccg agcagttatggacagcagagtagctatggtcaacaaagcagctatagaaagcaccctccc actagttacctaccccaaactggatcctgcagccaggcttctacactggaggagacttct gcatcaagttttgaacgactgatgactcctggtggcagcagcagtggctacagctgcagc agtgtgacaaaggggagagaaggtcctgctgtctatgggcaagcctga >gi568815584f:35726517_35970474|GENSCAN_predicted_peptide_7|277_aa MREALADLSELDKNKNKVQKEIKEQKDGPKWVPAHLPQPRQAVNVKQYHLPGGHSEVTQT TQQLNKAPQTLRFTADTLRHGPEADSRAVLLRTRMDLPSSDGYLDMKLSWKAPPYWVSFT APESAMKTAGGEVVPAVLLDGGPRALDINTQQHRYLLEQSQCGKHGQGQKLINWLLCLLL GKEAICAFALQQAESFYGGKDGTVQLRTYTVLVAHDFRACMSVCVTQQPASGKLKNLGHL THRVSRILDFANRIFFVQPKKFICPLNFRKLAAGSRD >gi568815584f:35726517_35970474|GENSCAN_predicted_CDS_7|834_bp atgagggaagcattggcagacctcagtgagctggataagaacaagaacaaagtccagaaa gaaattaaggagcaaaaagatgggccaaaatgggtgcctgcacacctgccccagcccagg caggcagtgaacgtgaagcaataccacctgccaggaggacactctgaagtcacccaaaca actcagcaattaaataaggccccacaaactctcaggtttaccgctgacacactgagacat ggtcctgaagcagacagccgggctgtgctcctgagaacacgcatggatctgccaagcagt gatggctacctggacatgaagttgagctggaaagcgcccccatattgggtcagttttaca gcaccagagagtgccatgaagactgcgggaggggaggtggtcccagctgtgctccttgat ggaggtccaagagccttagacatcaacacacagcagcaccgatacctgctggagcaatca cagtgtgggaagcatgggcaaggccagaaacttatcaattggctattatgcctgctcctg ggaaaggaagccatatgtgcctttgctcttcagcaagcagagtctttttatggtgggaaa gatggcactgtgcagctccggacttacacagtgcttgtggcacatgacttccgggcctgc atgagtgtctgcgtgacacagcagccagcttccggcaaattaaagaacctgggccatttg acccatagagtgtcccgcattctggattttgctaatcgcatattcttcgtgcaacccaag aagttcatctgtcctctgaatttccgaaaattggcagctggatccagagactga >gi568815584f:35726517_35970474|GENSCAN_predicted_peptide_8|601_aa MESNHRPEDTESNHRPEDTEGNHRPEDTEGNHRPEDMERNHRPEDTEGNHRPEDTEGNHR PEDTERNHRPEDMESNHRLEDMQSNHRPEDMEGNHRLEDMERNHRPEDTEGNHRPEDTEG NHRPEDTEGNHRPEDTERNHRPEDMERNHRPEDMEGNHRPEDMERNHRPEDTERNHRPED MERNHRLEDMERNHRPEDTESNHRLEDMESNHRLEDMESNHRPEDMERNHRPEDMERNHR PEDMEGNHRLEDMERNHRPEDTESNHRLEDMESNHRLEDMESNHRLEDMESNHRLEDMER NHRLEDMERNHRLEDMESNHRLEDMERNHRLEDMERNHRLEDMESNHRPEDTEGNHRLED MESNHRLEDMERNHRLEDMESNHRPEDMESNHRLEDMQSNHRLEDMERNHRLEDMESNHR LEDMERNHRLEDMESNHRLEDMERNHRLEDMERNHRLGEMERNHRLGDMESNHRLEDMER NHRLEDMERNHRLGEMERNHRLGDMESNHRLEDMERNHRLEDMERNHRLGEMERNHRLGD MESNHRLEDMERNHRLEDMERNHKWERWREITGWGIWRVITGWERWTFLLRATVKTVLKD K >gi568815584f:35726517_35970474|GENSCAN_predicted_CDS_8|1806_bp atggagagtaatcacaggccagaggacacggagagtaatcacaggccggaggacacggag ggtaatcacaggccggaggacacggagggtaatcacaggccggaggacatggagagaaat cacaggccggaggacacggagggtaatcacaggccggaggacacggagggtaatcacagg ccggaggacacggagagaaatcacaggccggaggatatggagagtaatcacaggctggag gatatgcagagtaatcacaggccagaggatatggagggtaatcacaggctggaggatatg gagagaaatcacaggccggaggacacggagggtaatcacaggccggaggacacggagggt aatcacaggccggaggacacggagggtaatcacaggccggaggacacggagagaaatcac aggccggaggatatggagagaaatcacaggccggaggatatggagggtaatcacaggccg gaggatatggagagaaatcacaggccggaggacacggagagaaatcacaggccggaggat atggagagaaatcacaggctggaggatatggagagaaatcacaggccggaggacacggag agtaatcacaggctggaggatatggagagtaatcacaggctggaggatatggagagtaat cacaggccggaggatatggagagaaatcacaggccggaggacatggagagaaatcacagg ccggaggatatggagggtaatcacaggctggaggatatggagagaaatcacaggccggag gacacggagagtaatcacaggctggaggatatggagagtaatcacaggctggaggatatg gagagtaatcacaggctggaggatatggagagtaatcacaggctggaggatatggagaga aatcacaggctggaggatatggagagaaatcacaggctggaggatatggagagtaatcac aggctggaggatatggagagaaatcacaggctggaggatatggagagaaatcacaggctg gaggatatggagagtaatcacaggccggaggacacggagggtaatcacaggctggaggat atggagagtaatcacaggctggaggatatggagagaaatcacaggctggaggatatggag agtaatcacaggccggaggatatggagagtaatcacaggctggaggatatgcagagtaat cacaggctggaggatatggagagaaatcacaggctggaggatatggagagtaatcacagg ctggaggatatggagagaaatcacaggctggaggatatggagagtaatcacaggctggag gatatggagagaaatcacaggctggaggatatggagagaaatcacaggctgggagagatg gagagaaatcacaggctgggggatatggagagtaatcacaggctggaggatatggagaga aatcacaggctggaggatatggagagaaatcacaggctgggagagatggagagaaatcac aggctgggggatatggagagtaatcacaggctggaggatatggagagaaatcacaggctg gaggatatggagagaaatcacaggctgggagagatggagagaaatcacaggctgggggat atggagagtaatcacaggctggaggatatggagagaaatcacaggctggaggatatggag agaaatcacaagtgggagagatggagagaaatcacaggctgggggatatggagagtaatc acaggctgggagagatggacatttttattaagagcaactgtgaaaacagttctgaaagat aagtag