GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:11:05

Sequence gi568815593r:147294038_147553587 : 259550 bp : 39.96% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1990   2071   82  1  1  128   71    41 0.197   5.12
 1.02 Intr +   4960   5036   77  0  2   64   61    70 0.051  -0.71
 1.03 Intr +  29861  30065  205  2  1   79   44   273 0.045  20.28
 1.04 Intr +  32238  32303   66  2  0   59   86    54 0.036   0.48
 1.05 Term +  33787  34017  231  0  0   22   32   176 0.121   0.89
 1.06 PlyA +  34610  34615    6                               1.05

 2.00 Prom +  35115  35154   40                              -7.45
 2.01 Init +  37668  37765   98  2  2   55   72   122 0.873   7.13
 2.02 Intr +  48969  49006   38  0  2  101  115    30 0.486   3.99
 2.03 Intr +  57028  57117   90  2  0  137   47    91 0.930   8.95
 2.04 Intr +  67480  67577   98  2  2   63   51    88 0.334   1.41
 2.05 Intr +  76617  76733  117  2  0   64  116   104 0.913  10.54
 2.06 Intr +  79132  79257  126  0  0   63   82    84 0.967   5.26
 2.07 Intr +  79720  79816   97  0  1  109   99    17 0.262   3.56
 2.08 Intr +  97282  97423  142  2  1   82   31   204 0.753  12.59
 2.09 Term +  97764  98052  289  0  1   59   38   155 0.647   1.46
 2.10 PlyA +  98911  98916    6                               1.05

 3.17 PlyA -  99354  99349    6                              -1.95
 3.16 Term - 100086  99998   89  1  2   33   47    98 0.418  -2.96
 3.15 Intr - 101684 101522  163  2  1   95  109   155 0.795  16.93
 3.14 Intr - 102656 102369  288  2  0   16   67   162 0.910   3.42
 3.13 Intr - 103808 103629  180  2  0   83   93   181 0.985  17.24
 3.12 Intr - 105215 105045  171  2  0   28   56   253 0.602  15.32
 3.11 Intr - 106796 106655  142  1  1   64   79   162 0.928  12.33
 3.10 Intr - 107659 107503  157  2  1  127   77   114 0.999  12.35
 3.09 Intr - 111764 111573  192  0  0   53   55    89 0.209   0.64
 3.08 Intr - 114759 114691   69  1  0   79  107    23 0.271   1.54
 3.07 Intr - 118651 118210  442  1  1   52   75   223 0.268   9.50
 3.06 Intr - 119620 119559   62  2  2  124   77    13 0.401   1.33
 3.05 Intr - 121836 121672  165  1  0   59  105   178 0.998  15.61
 3.04 Intr - 124594 124410  185  0  2   68   89   237 0.975  20.31
 3.03 Intr - 130926 130838   89  0  2  124   53    62 0.316   4.15
 3.02 Intr - 133189 133095   95  2  2   39   60    75 0.196  -1.44
 3.01 Init - 135264 135192   73  0  1   45   82    42 0.153  -0.83
 3.00 Prom - 135891 135852   40                              -7.05

 4.00 Prom + 137478 137517   40                              -3.85
 4.01 Init + 139246 139350  105  0  0   88  106    28 0.470   3.69
 4.02 Intr + 149994 150031   38  2  2   93  100    52 0.047   3.04
 4.03 Intr + 152067 152230  164  0  2  130   38    62 0.279   4.10
 4.04 Intr + 158178 158300  123  1  0   34  103   111 0.430   6.84
 4.05 Term + 170216 170334  119  0  2   52   39   123 0.137   1.52
 4.06 PlyA + 171164 171169    6                               1.05

 5.00 Prom + 188595 188634   40                              -4.45
 5.01 Init + 201773 202716  944  1  2   86   53   194 0.370   8.86
 5.02 Intr + 210165 210399  235  0  1  -29  115   124 0.167   0.17
 5.03 Term + 210669 210776  108  2  0   77   44    73 0.571  -0.67
 5.04 PlyA + 210789 210794    6                               1.05

 6.03 PlyA - 211518 211513    6                               1.05
 6.02 Term - 214550 214464   87  2  0   83   38    69 0.767  -1.92
 6.01 Init - 215821 215441  381  1  0   96  116   347 0.988  35.22
 6.00 Prom - 221032 220993   40                              -4.95

 7.03 PlyA - 221242 221237    6                               1.05
 7.02 Term - 221642 221501  142  1  1   94   36   110 0.049   2.82
 7.01 Init - 244578 244535   44  0  2   78   77    72 0.344   5.04
 7.00 Prom - 256145 256106   40                              -3.15

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:147294038_147553587|GENSCAN_predicted_peptide_1|220_aa
XCTEAETLTNGDFLYQCKFQYSQLNASKESSPDPYPKRGFLDLTQERIQGKFAEYSFQDE
EDMFMVVDLLLGGDLRYHLQQNVHFKEETVKLFICELVMALDYLQNQRIIHRSVKSKEMA
MNLLLSARSQKGLFVLKCLLTMVKMLSMAAALRGARDTYAWPPASGTHRRKWYMWNGQTE
NSLKLEAKLCEKDKYFREDSVESQNKFHESSYRKQPKTRS

>gi568815593r:147294038_147553587|GENSCAN_predicted_CDS_1|663_bp
nngtgcacagaggcagaaacccttacaaatggagatttcctttatcaatgtaaatttcaa
tatagccagctaaatgccagcaaggaaagcagtcctgatccataccccaagagagggttc
ttggatctcacgcaagaaagaattcagggcaagtttgcagagtattccttccaagatgag
gaagacatgttcatggtggtggacctcctgctgggtggagacctgcgttatcacctgcaa
cagaacgtccacttcaaggaagaaacagtgaagctcttcatctgtgagctggtcatggcc
ctggactacctgcagaaccagcgcatcattcacaggtcagtcaagtccaaggagatggcc
atgaacttattacttagtgcccgttctcagaagggcctttttgtactgaaatgtctcctc
accatggtaaagatgctcagcatggctgcagctctgaggggagcacgggacacctatgca
tggccacctgcctcaggcacccacagacgaaagtggtacatgtggaacggacagacagag
aacagcctaaaattggaagctaaattgtgtgagaaagacaagtacttcagagaagatagt
gtggagtcgcaaaataagtttcatgagagctcatacagaaaacagcctaaaactagaagc
taa

>gi568815593r:147294038_147553587|GENSCAN_predicted_peptide_2|364_aa
MTGVLMKRGTLDTEEHTKSEDDVDVERHREDDRDMKPDNILLDEHGHVHITDFNIAAMLP
RETQITTMAGTKPYMAPEMFSSRKGAGYSFAVDWWSLGVTAYELLRGRRPYHIRSSTSSK
EIVHTFETTVVTYPSAWSQEMVSLLKKLLEPNPDQRFSQLSDVQNFPYMNDINWDAVFQK
RLIPGFIPNGFIFVRTDNQRHIIPIGWIAKPMDFCGLLHYAENLEVQEIALLRVTDLTIS
QTPFPLPKPVALRENRDGLPSGFRGNMRRAFGGCSQSKEILCRIKGSGGMVHFGSPKGSA
DDRFGHGRLIFMAQRGCAEMSQTQGLSNSRAESAEVDAQKRVYVCLFLYLLAVAGTATYP
KVTA

>gi568815593r:147294038_147553587|GENSCAN_predicted_CDS_2|1095_bp
atgactggtgtccttatgaaaagaggaactttggacacagaggaacatacaaagagtgaa
gatgatgtggatgtagagagacacagggaggatgacagggatatgaagcctgacaatatt
ttacttgacgaacatgggcacgtgcacatcacagatttcaacattgctgcgatgctgccc
agggagacacagattaccaccatggctggcaccaagccttacatggcacctgagatgttc
agctccagaaaaggagcaggctattcctttgctgttgactggtggtccctgggagtgacg
gcatatgaactgctgagaggccggagaccgtatcatattcgctccagtacttccagcaag
gaaattgtacacacgtttgagacgactgttgtaacttacccttctgcctggtcacaggaa
atggtgtcacttcttaaaaagctactcgaacctaatccagaccaacgattttctcagtta
tctgatgtccagaacttcccgtatatgaatgatataaactgggatgcagtttttcagaag
aggctcattccaggtttcattcctaatggatttatatttgtaaggactgataaccaaaga
catataattcccattggatggatagccaaaccaatggacttctgtggtctactgcattat
gctgaaaacttagaagtacaagaaatagctctactacgggtaactgatttaacaatttcc
caaacaccctttccactacccaagcccgtggccctcagagagaaccgggatggattgcca
tctgggttcagaggcaatatgaggagggcatttgggggttgttcccagagcaaggaaatc
ttgtgcagaatcaaaggttctggtggaatggttcacttcggaagtcccaagggcagtgcg
gatgaccgatttggccatggaagacttatcttcatggcacagagaggttgtgcagagatg
agtcagactcaggggctgagtaacagcagagcagagagtgcagaagtggacgctcagaag
cgagtttatgtgtgtcttttcctctatctgctggctgtggctggtactgcaacctatccc
aaagtaacagcctag

>gi568815593r:147294038_147553587|GENSCAN_predicted_peptide_3|853_aa
MPRTLLGCKTVTHVALGSCAGPKAVGAGEICWKVRSHIVEEKPFGALTKEMTARCQSDRL
LIKGGRIVNDDQSFYADIYMEDGLIKQIGDNLIVPGGVKTIEANGKMVIPGGIDVHTHFQ
MPYKGMTTVDDFFQGTKAALAGGTTMIIDHVVPEPESSLTEAYEKWREWADGKSCCDYAL
HVDITHWNDSVKQEVQNLIKDKGVNSFMVYMAYKDLYQVSNTELYEIFTCLGELGAIAQV
HAENGDIIAQVTPCSSLERALGLHSIFFFISSVLLELSICARHHHKYLAKDRTCSKVPHN
LGGRNRGECRQAMKLEDEVKQKEGWPLLENQKRHPGRGGFQIGTFNGRAFADSLLKMCQS
YKTFSVIKKDRANPHVGNGDNWPRRPCTEQARRVRGGGSHQFERFMFLLTPGDFFLQLEA
EAVFRAITIASQTNCPLYVTKVMSKSAADLISQARKKGNVVFGEPITASLGIDGTHYWSK
NWAKAAAFVTSPPLSPDPTTPDYINSLLASGDLQLSGSAHCTFSTAQKAIGKDNFTAIPE
GTNGVEERMSVIWDKAVATGKMDENQFVAVTSTNAAKIFNLYPRKGRISVGSDSDLVIWD
PDAVKIVSAKNHQSAAEYNIFEGMELRGAPLVVICQGKIMLEDGNLHVTQGAGRFIPCSP
FSDYVYKRIKARRKGSVDDVEFLVLGSTDTVITSLESLASAHLMGSASVLAFESLTGKEA
LTGLLVMIHPARSGSSGSASSLPPLWPKLRNGWRLCAAAMGGLAVFVQLTMADLHAVPRG
MYDGPVFDLTTTPKGGTPAGSARGSPTRPNPPVRNLHQSGFSLSGTQVDEGVRSASKRIV
APPGGRSNITSLS

>gi568815593r:147294038_147553587|GENSCAN_predicted_CDS_3|2562_bp
atgcctagaaccctgctgggctgcaagacagtgacacacgtagctcttggtagctgtgca
gggcctaaggcagtgggagcaggtgaaatttgctggaaggtcaggtcacacatcgtggag
gaaaagccctttggagccttgacaaaggagatgactgctagatgccagagtgaccgtctc
cttatcaagggaggcagaatcgtcaatgatgatcagtccttttatgctgatatttacatg
gaagatggcttaataaaacaaattggagacaatctgattgttcctggaggagtgaagacc
attgaagccaatgggaagatggtgatccctggaggcatcgatgtccatactcacttccag
atgccatataagggaatgaccacagtagatgacttcttccaagggacaaaggcggcctta
gcaggtggcaccaccatgatcattgaccatgtggtgcctgagcctgagtccagcctgact
gaggcctatgagaaatggagagagtgggctgatgggaagagttgctgtgactatgccctg
catgtggacatcacccactggaatgacagcgtcaagcaggaagtgcagaacctcatcaag
gacaaaggggttaactccttcatggtttatatggcttataaggatttgtatcaagtatct
aacacagagctctatgagatcttcacctgcctgggagagctgggggccattgctcaagtt
catgctgagaatggggatatcattgcccaggtaacaccgtgcagctccctggagcgtgct
ttgggtctgcattccattttcttcttcatttcttcagtgctgcttgagctctcaatatgt
gcaaggcaccatcataagtaccttgctaaagacagaacctgctccaaggtacctcacaat
ttaggtggaagaaacagaggggaatgtagacaagcaatgaaactagaagatgaggtaaaa
cagaaagagggctggccattgttggagaatcagaaaaggcatcctggaagaggaggattt
caaattggaacatttaatgggagagcatttgctgattctcttctgaaaatgtgtcaaagc
tacaagacattctctgtcataaagaaagaccgagcaaacccgcatgttggaaatggggat
aactggcccagaaggccatgtactgagcaggccagaagagttcgtggaggtgggagccac
cagtttgagaggttcatgtttctattgactcctggagactttttcttgcagctggaagct
gaggctgtgttccgtgccatcaccattgccagccaaaccaattgccctctctacgtcaca
aaggtcatgagcaagagtgcagctgacctcatctcacaagccaggaaaaaaggaaatgta
gtctttggtgagcccatcactgccagcctcggcatagatggaacccattattggagcaag
aactgggccaaggcggctgcatttgtgacatccccacccctgagccctgacccaactact
ccggactacatcaactccttgctggccagcggggatctgcagctatctgggagtgcccac
tgcaccttcagcactgcccagaaagcaattgggaaggacaacttcacagccattcctgag
ggcaccaatggtgtggaggagcggatgtctgtcatctgggacaaggctgtggccacaggg
aaaatggacgaaaaccagttcgtggctgtgacaagcacaaacgctgccaagatcttcaac
ctgtatccccgcaagggaagaatatctgtgggttctgacagcgacctcgtcatctgggat
ccagatgctgtgaagatcgtctctgccaagaaccaccagtctgcggcagagtacaacatc
tttgaagggatggagctgcgcggggctcctctggttgtcatctgccagggcaagatcatg
ctggaagatggcaacctgcacgtgacccagggggctggccgcttcataccctgcagcccg
ttctccgactatgtctacaagcgcattaaagcacggaggaagggaagtgttgatgatgta
gagttcttggttttggggagtacagatacagtcatcacatcactagaatcattagcatct
gcacatttgatgggctcggcttctgtgctggcatttgagtcactaacagggaaggaggct
ctgacaggacttcttgtgatgatccatcctgcacggtcaggctcctctggcagcgcgtcc
tcattgcctcccctctggccaaagttaagaaatgggtggaggttgtgtgccgctgccatg
ggtggtcttgcagtgtttgtccagttgactatggcagacctgcatgccgtcccaaggggc
atgtacgatgggcctgtgtttgacctgaccaccacccccaaaggtggcacccccgcaggc
tctgctcggggctctcctactcggccgaacccacctgtgaggaatcttcatcagtcggga
tttagcctgtcaggcacccaagtggatgagggggttcgctcagccagcaagcgcatcgtg
gcgcccccaggcggccgttctaatatcacatctctgagttaa

>gi568815593r:147294038_147553587|GENSCAN_predicted_peptide_4|182_aa
MLWVRSLGGWGKRQGGVGSGVGTKMPYQKTGNQIWETGAEKDLPLDLKKALELLSVNLEQ
SEEGLAHRVICTGRDLKDHLVQCPHFTDAESERDNKGLPKVTKGKVLHLNRKRTGCSKPK
GTSANRLMPTTKPGGPCKSTTVAGRIHECKNQGEEAGVIPFTFTPRDPTRNTRFPLLRFW
VI

>gi568815593r:147294038_147553587|GENSCAN_predicted_CDS_4|549_bp
atgctctgggtccgaagcttgggtggatggggaaagaggcagggtggggtagggtcaggt
gtaggaacaaagatgccttatcagaaaacaggaaaccagatatgggaaacaggtgcagag
aaggacttacccctggacttgaagaaggccttagaactcctctctgtgaatttggaacaa
agtgaggaggggctggctcacagggtcatttgcactggcagggacctcaaggatcatctg
gttcaatgccctcattttactgatgcagaatctgagagggataataagggtttgcccaaa
gtgacaaaggggaaggttctgcatctcaatcgcaaacgcacaggatgttccaagcctaag
ggaacaagtgcaaacagactcatgcccacaacaaaacctggcggaccctgcaaatcaact
actgtggcaggaagaatacatgaatgcaagaaccaaggggaagaagcaggagtgatccct
tttaccttcactcccagggacccaactcgaaatacacgctttccacttctgcggttctgg
gttatttag

>gi568815593r:147294038_147553587|GENSCAN_predicted_peptide_5|428_aa
MAILPKVIYRFNGILIKLSMTFFTELEKTTFKFIWNQKRARIAKSIRRQKNKAGGIMLPD
FKLYYKATVTKIAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPDKNKQWGKDSLFNK
WCWENWLAICRKLKLDPFLTSYTKINSRWIKDLHVRPKTIKALEENLHNTIQDIRMGKDF
MSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRIY
NELKQIYKKKTNNPIKKWVKDMNRHFSKEDIYAAKKHMKKCSSSLAIRKMQIKTTMRYHL
TPVRMAIIKKSGNNRDSVLAALNITYKSCQQNPVSRDHYFEGTLPIAALLRRRIFYVPQM
PTVKLNQSSERSQTLPSLGKETYSQIQVWPEGQIVPSAPRALANFMKETSCTKEAHMGRR
KLEMTQDS

>gi568815593r:147294038_147553587|GENSCAN_predicted_CDS_5|1287_bp
atggccatactgcccaaggtaatttatagattcaatggcatcctcatcaagctatcaatg
actttcttcacagaattggaaaaaactactttcaagttcatatggaaccaaaaaagagcc
cgcattgccaagtcaatccgaagacaaaagaacaaagctggaggcatcatgctacctgac
ttcaaactatactacaaggctacagtaaccaaaatagcatggtactggtaccaaaacaga
gatatagaccaatggaacagaacagagccctcagaaataatgccacatatctacaactat
ctgatctttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca
tcttatacaaaaattaattcaagatggattaaagacttacatgttagacctaaaaccata
aaagccctagaagaaaacctacacaataccattcaggacatacgcatgggcaaggacttc
atgtctaaaacaccaaaagcaatggcaacaaaagacaaaatagacaaatgggatctaatt
aaactaaagagcttctgcactgcaaaagaaactaccatcagagtgaacaggcaacctaca
aaatgggagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatctac
aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggtgaag
gatatgaacagacatttctcaaaagaagacatttatgcagccaaaaaacacatgaaaaaa
tgctcatcatcactggccatcagaaaaatgcaaatcaaaaccacaatgagataccatctc
acaccagttagaatggcgatcattaaaaagtcaggaaacaacagagacagtgttctggct
gccttaaacattacttacaagagctgtcagcaaaaccctgtctccagggaccattacttt
gagggaactctgccaatcgcagcattattgagaaggcgtattttttatgttcctcaaatg
cctactgtcaaactaaaccaaagttctgaaaggtcacaaacacttccaagtttggggaag
gaaacctactcccagattcaagtgtggcctgaaggccagattgttccctctgcacccaga
gcccttgccaacttcatgaaggaaacatcttgcacaaaagaggcccacatgggaagaagg
aaactagaaatgacccaagactcctga

>gi568815593r:147294038_147553587|GENSCAN_predicted_peptide_6|155_aa
MASGRRGWDSSHEDDLPVYLARPGTTDQVPRQKYGGMFCNVEGAFESKTLDFDALSVGQR
GAKTPRSGQGSDRGSGSRPGIEGDTPRRGQGREESREPAPASPAPAGVEIRSATGKEVLQ
NLGPKDKRLRDVVQKMQGLAGLETSEFKLKLSVGE

>gi568815593r:147294038_147553587|GENSCAN_predicted_CDS_6|468_bp
atggcctcgggccggaggggctgggacagctcccacgaagacgatctgcccgtgtacctg
gccaggccgggcaccacggaccaggtcccgcggcagaaatacggcggcatgttctgcaac
gtggagggcgccttcgagagcaagacgctggatttcgatgccctcagcgtggggcagcgg
ggcgcgaagactcctcggagcggccagggcagcgaccgaggatcggggagtcggcccggg
atcgagggggacaccccgcgcaggggccaaggccgggaagagagcagggagcccgcgccc
gcctcccccgcccccgccggggtagagatccggagcgccaccggcaaagaggtgttgcag
aacctcggccccaaggacaagagactcagagatgtagttcagaaaatgcagggcctggca
gggctggagacttcagagttcaaactgaagctttcagttggggaataa

>gi568815593r:147294038_147553587|GENSCAN_predicted_peptide_7|61_aa
MEVIDDMIEAASMEWAKENSVFGTGFRTFENVKIKERCSVCKIGKETPTTPHQEGDYVMS
Y

>gi568815593r:147294038_147553587|GENSCAN_predicted_CDS_7|186_bp
atggaggtcattgatgacatgatcgaagcagcttcaatggagtgggccaaagaaaattct
gtttttggaactggctttcgaacatttgagaatgttaaaatcaaagagagatgctcagtg
tgtaaaattgggaaggaaaccccaacaactccccaccaggaaggtgattatgtcatgtcc
tattaa