GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:36:42

Sequence gi568815578r:1210444_1413468 : 203025 bp : 44.56% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   7665   8037  373  2  1   24   58   220 0.248   9.73
 1.02 Intr +  10412  10663  252  0  0   67   35   132 0.300   3.21
 1.03 Term +  11348  11610  263  0  2   70   55   152 0.539   5.79
 1.04 PlyA +  11629  11634    6                               1.05

 2.00 Prom +  12391  12430   40                              -2.46
 2.01 Init +  14973  15063   91  2  1   69   78    32 0.063  -0.99
 2.02 Intr +  21083  21176   94  0  1   96   22    65 0.054  -0.28
 2.03 Intr +  23642  23748  107  2  2  111   77    27 0.306   3.76
 2.04 Intr +  35248  35316   69  2  0   85   62    54 0.254   1.65
 2.05 Intr +  55853  56039  187  0  1  107   23    83 0.409   2.45
 2.06 Intr +  56211  56369  159  0  0   98   75   104 0.541   9.20
 2.07 Intr +  59121  59165   45  0  0  103   93    40 0.679   3.52
 2.08 Term +  71161  71374  214  1  1   90   47   106 0.465   3.40
 2.09 PlyA +  73713  73718    6                               1.05

 3.04 PlyA -  73923  73918    6                               1.05
 3.03 Term -  74181  74141   41  1  2  114   47    27 0.453  -1.45
 3.02 Intr -  75964  75820  145  1  1  111   59   140 0.835  13.26
 3.01 Init -  77322  77311   12  0  0   64   81     8 0.141  -2.05
 3.00 Prom -  78550  78511   40                              -2.96

 4.00 Prom +  79057  79096   40                              -7.46
 4.01 Init +  85136  85268  133  1  1   94   62   110 0.485   8.89
 4.02 Intr +  85882  85978   97  2  1  110  109    23 0.897   5.67
 4.03 Intr +  86603  86809  207  2  0   39   86   260 0.667  19.09
 4.04 Intr +  90119  90268  150  2  0  103  105   278 0.998  30.28
 4.05 Term +  94435  95611 1177  1  1   79   55  1955 0.993 182.57
 4.06 PlyA +  97603  97608    6                              -3.44

 5.12 PlyA -  97638  97633    6                              -3.84
 5.11 Term -  98136  97836  301  2  1  114   42   122 0.904   4.89
 5.10 Intr - 100448 100357   92  2  2  102   78   176 0.966  16.79
 5.09 Intr - 102065 101894  172  1  1   75  102   322 0.999  32.25
 5.08 Intr - 102319 102144  176  1  2  114  109   224 0.998  25.84
 5.07 Intr - 103055 102897  159  2  0  110   91   167 0.951  19.38
 5.06 Intr - 107975 107875  101  0  2   96   31    48 0.778  -0.27
 5.05 Intr - 108518 108469   50  1  2   60   81    29 0.103  -2.28
 5.04 Intr - 109216 109147   70  2  1  121  109    26 0.130   6.24
 5.03 Intr - 109948 109920   29  0  2   72  111     4 0.020  -1.14
 5.02 Intr - 118743 118642  102  2  0   35  115    86 0.205   5.29
 5.01 Init - 153279 153197   83  0  2   65   70    83 0.300   4.64
 5.00 Prom - 157144 157105   40                              -4.86

 6.07 PlyA - 158564 158559    6                               1.05
 6.06 Term - 161797 161669  129  1  0  114   48   134 0.959  10.18
 6.05 Intr - 165160 165048  113  2  2   97   85    32 0.133   3.90
 6.04 Intr - 182438 182391   48  0  0  102  117    60 0.002   8.95
 6.03 Intr - 182658 182519  140  2  2   30  113    66 0.002   3.41
 6.02 Intr - 182874 182683  192  2  0   89   41    75 0.001   1.51
 6.01 Intr - 183283 183112  172  2  1   53   77    84 0.002   2.80

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 171147 171247  101  2  2   58  114    35 0.864   2.83
S.002 Term + 175115 175223  109  2  1  107   48    74 0.899   3.28
S.003 Sngl + 182533 182895  363  0  0   78   38   266 0.969  14.69

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578r:1210444_1413468|GENSCAN_predicted_peptide_1|295_aa
MEERVSVIEDQMNEMKREEKFREKRVKRNEQSLQEMWDYVKRPNLHLIGVPESDGENGTK
LENTLQDIIQENFPNLARQANIQIQEIQRTPQRDSSRRATPRHIIVRFTKVEMKEKMLRA
AREKVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIISAQNLLKLISNFS
KVSEYKINVRKSQTFLYTNNRQTESQIMKLEKTTLKFTWNQKRAHIAKTILSQKNKAGGV
TLPNFKLYYKATVTKTAWYWYRNRDIDQWNRTEPSEIIPHIYNHLIFDKPDKNKK

>gi568815578r:1210444_1413468|GENSCAN_predicted_CDS_1|888_bp
atggaagaaagggtatcagtgattgaagatcaaatgaatgaaatgaagcgagaagagaag
tttagagaaaaaagagtaaaaagaaatgaacaaagcctccaagaaatgtgggactacgtg
aaaagaccaaatctacatctgattggtgtacctgaaagtgacggagagaatggaaccaag
ttggaaaacactcttcaggatatcatccaggagaacttccccaacctagcaaggcaggcc
aacattcaaattcaggaaatacagagaacgccacaaagagactcctcgagaagagcaact
ccaagacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggca
gccagagagaaagtgttggaagttctcgccagggcaatcaggcaagagaaagaaataaag
ggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgta
tatttagaaaaccccatcatctcagcccaaaacctccttaagctgataagcaacttcagc
aaagtctcagaatacaaaatcaacgtgcgaaaatcacaaacattcctatacaccaataac
agacaaacagagagccaaatcatgaaattggaaaaaactactttaaagttcacatggaac
caaaaaagagcccacattgccaagacaatcctaagccaaaagaacaaagctggaggcgtc
acgctacctaacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg
taccgaaacagagatatagaccaatggaacagaacagagccctcagaaataataccacac
atctacaaccatctcatctttgacaaacctgacaaaaacaagaaatga

>gi568815578r:1210444_1413468|GENSCAN_predicted_peptide_2|321_aa
MGPRPHPCHAPAQVSQVRAPGQSRRLRTRTGLVDLPKENFEASYNAITLPEEFHDFDTQN
MNAIDVSEHFTQNQSRPEEITLRENFDNDLIFQAESFDRFFFPFARDSSSSGGRNPVMVL
APAPASTQLAASCRGSGEQFGGPCRAAEAMEASAGDDKDPGDLRAHRPAAPAAVQSAECQ
GVGSLIRARRLQQRLQRRCAKPGRSGASRRGCGGPVDSGEEAAAERGAGPALHRGRWEQG
FGEEEAVNNLRMWNAPPQAAPGYLENNYAPVLQSSAQGHQLLSSAEKPLSLLPNPSVTRA
SIMGLLAEEEVAHNRHLVNID

>gi568815578r:1210444_1413468|GENSCAN_predicted_CDS_2|966_bp
atggggcccaggccccatccctgtcatgcccctgctcaggtgtctcaggtccgagcgcca
ggacaaagccggagactgcgcactaggaccggactggttgaccttccaaaagagaatttt
gaagcatcttacaatgctatcacattgccagaagaatttcatgattttgacacccaaaat
atgaatgctattgatgtttcagaacactttactcagaaccaaagcagaccagaagaaatc
actcttagagaaaattttgacaatgatctaattttccaagctgagagctttgacaggttc
ttcttcccttttgcccgtgacagctccagcagtggtggcaggaatcctgtcatggtcctc
gccccagccccggcgagcacccagctagccgcctcctgcaggggctcgggagagcaattc
ggcggcccctgcagggcagctgaagccatggaagcctccgcaggtgatgacaaggacccc
ggggatctccgggcccaccgcccagctgcacccgccgcagtccagagtgccgagtgccag
ggggtggggtcgctgatcagggccaggcggctgcagcagcgactgcagaggcgctgcgcc
aagccgggccggagtggtgcgagccggcggggctgcggagggccagtggactcaggtgag
gaggccgcggcggagcggggagctggccctgcgctgcaccgcggcaggtgggaacagggg
tttggagaagaagaagctgtgaacaatttacgaatgtggaatgctcctccacaggcagct
cctgggtacctggagaacaactatgcacctgtcctccagagctctgctcaagggcaccag
cttctgagttctgcagagaagcccctctctcttctgcccaacccctcagtgacacgtgct
tcgattatgggactattggcagaggaggaagtggcacacaatagacacttggtaaatatt
gactga

>gi568815578r:1210444_1413468|GENSCAN_predicted_peptide_3|65_aa
MRKMLQLPCCPQQLIPVHLIPRGTSHAMLNSFTFNKIQGLNNAQCGSHLLEEGLCSKVTL
SVNLL

>gi568815578r:1210444_1413468|GENSCAN_predicted_CDS_3|198_bp
atgaggaaaatgctgcagctgccctgctgtcctcagcagctgatccctgtccacctgatt
ccccgtggtacttcccatgccatgctgaattctttcaccttcaataagatccagggcctc
aacaatgcccagtgtggctcacacttgctggaagagggtctttgctccaaggtcaccctc
tcagtgaatcttctctga

>gi568815578r:1210444_1413468|GENSCAN_predicted_peptide_4|587_aa
MPVCRMGAGGPGCMSCVMGWLPEHVHGDACGMSWNGRDMVPSRVAPGIPPIPPLTRTHSL
MAMSLPGSRRTSAGSRSGGTLGRSGLAVFAQCPQLPASQNEHLPLLPASRRTSPPVSVRD
AYGTSSLSSSSNSGSYKGSDSSPTPRRSMKYTLCSDNHGIKPPTPEQYLTPLQQKEVCIR
HLKARLKDTQDRLQDRDTEIDDLKTQLSRMQEDWIEEECHRVEAQLALKEARKEIKQLKQ
VIDTVKNNLIDKDKGLQKYFVDINIQNKKLETLLHSMEVAQNGMAKEDGTGESAGGSPAR
SLTRSSTYTKLSDPAVCGDRQPGDPSSGSAEDGADSGFAAADDTLSRTDALEASSLLSSG
VDCGTEETSLHSSFGLGPRFPASNTYEKLLCGMEAGVQASCMQERAIQTDFVQYQPDLDT
ILEKVTQAQVCGTDPESGDRCPELDAHPSGPRDPNSAVVVTVGDELEAPEPITRGPTPQR
PGANPNPGQSVSVVCPMEEEEEAAVAEKEPKSYWSRHYIVDLLAVVVPAVPTVAWLCRSQ
RRQGQPIYNISSLLRGCCTVALHSIRRISCRSLSQPSPSPAGGGSQL

>gi568815578r:1210444_1413468|GENSCAN_predicted_CDS_4|1764_bp
atgcctgtgtgccgcatgggtgcgggaggacctggctgcatgtcctgcgtcatgggctgg
ctgcctgagcacgttcatggagacgcgtgtggcatgtcttggaatggcagggacatggta
ccttctcgtgttgcccccgggataccgcccatcccaccccttactcggacccacagcctc
atggccatgtccctgccaggaagtagacggacctctgctggatcacgcagcgggggcact
ttgggccgcagcggcctggcagtgttcgcccagtgtccgcagctgcccgccagccagaac
gagcacctgcctcttcttcctgcctccaggcgcacctctccacctgtgagcgtgcgggat
gcctacggcacctcttcgctcagcagcagcagcaattctggctcctacaagggcagtgac
agcagtcccacgccaaggcgctccatgaaatacacgctgtgcagtgacaaccatggcatc
aagcccccgaccccggagcagtacctgacccccctgcagcagaaggaggtgtgcatccgg
cacctgaaagcccggctgaaggacacacaggaccggctccaggaccgggacacagagatt
gatgacctgaagacgcagctgtcacgcatgcaggaggactggattgaggaggagtgccac
cgcgtggaggcccagctggccctgaaggaggcccgaaaggagatcaagcagctcaagcag
gtcatcgacactgtcaagaacaacctgattgacaaggacaaggggctgcagaagtacttc
gtggacatcaacatccagaacaagaagctggagacgctgctgcacagcatggaggtggcc
cagaatggcatggccaaggaggatggcactggggagtcagccggtgggtcccctgcccgc
tccctcacccgcagctccacctacaccaagctgagtgacccggctgtctgtggtgaccgc
cagccgggtgatccctccagcggctctgctgaggatggggcagacagtggctttgcagca
gccgatgacacactgagccggacggacgcgctggaagccagcagcctgctgtcgtcgggg
gtggactgtggcaccgaggagacctcgctgcacagctccttcggcctgggcccccgcttc
cctgccagcaacacctatgagaagctgctgtgtggcatggaggctggtgtgcaggccagc
tgcatgcaggagcgtgccatccagacagacttcgtgcagtaccagcctgaccttgacacc
atcctggagaaagtgacccaggcccaggtctgtgggacagaccctgagtcaggggacagg
tgcccagagctggatgcccacccttcagggcccagagaccccaactcagcagtggtggtg
acagtgggtgatgagctagaggccccagagcccatcacccgtggacccaccccacagcgg
cctggtgccaaccccaaccctggccagtcggtgagcgtggtgtgccccatggaagaggag
gaggaggctgccgtggctgagaaggagcccaagagctactggagccgccactacatcgtg
gatctgctggctgtggtggtgccggccgtgcccacggtggcctggctttgccgctcccag
cggcgccagggccagcccatctacaacatcagctccctgctgcggggctgctgcactgtg
gccttgcactccatccgcaggatcagctgccgctcgctgagccagccgagtcccagccca
gcgggcggcggctcccagctctga

>gi568815578r:1210444_1413468|GENSCAN_predicted_peptide_5|444_aa
MGPSEAEFAAAAVLYSMQQNGCQLLLGRSGSQLRDRATFQGGKQIQQRQQLEKVVEQHLA
CRGPKSGPSHSGPGQSLTQDASPASPGNSHFPTTSSGGWKPKIKVLANSVSVLYPNLAEL
ENYMGLSLSSQEVQESLLQIPEGDSTAVSGPGPGQMVAPVTGYSLGVRRAEIKPGVREIH
LCKDERGKTGLRLRKVDQGLFVQLVQANTPASLVGLRFGDQLLQIDGRDCAGWSSHKAHQ
VVKKASGDKIVVVVRDRPFQRTVTMHKDSMGHVGFVIKKGKIVSLVKGSSAARNGLLTNH
YVCEVDGQNVIGLKDKKIMEILATAGNVVTLTIIPSVIYEHMVKKAWHLSSSCRTSTAWS
LTGSLHLCIDAGSEHTHMHSRAGPGPPATTVPGPVPGLSGELLPPLTRRKAHRRPQESPR
KRGPSTLWARALVTHTGGLRPEQE

>gi568815578r:1210444_1413468|GENSCAN_predicted_CDS_5|1335_bp
atgggcccatcagaggctgagtttgcagcagcagctgtcctgtacagcatgcagcaaaac
ggctgtcagctcttattaggaagatcaggaagccagctgagagacagggctacgtttcag
ggagggaaacagattcagcagcggcagcagctggagaaggtcgtggagcagcaccttgcc
tgcagaggacctaaaagtggaccaagccattcaggcccaggtcagagcctcacccaagat
gccagccctgccagtccaggcaacagccatttccccaccaccagctctggaggctggaag
cccaagatcaaggttctggccaattcagtttctgttttgtacccaaacttggcagaactg
gaaaattatatgggtctttccctctccagccaagaagtccaggagagcctgcttcagatt
ccagagggtgacagtacagcggtctcgggccccgggcccggccagatggtggcaccggta
accgggtacagcctgggcgtgcggcgagctgagatcaagcccggggtgcgcgagatccac
ctgtgcaaggacgagcgcggcaagaccgggctgaggctgcggaaggtcgaccaggggctc
tttgtgcagttggtccaggccaacacccctgcatcccttgtggggctgcgctttggggac
cagctcctgcagattgacgggcgtgactgtgctgggtggagctcgcacaaagcccatcag
gtggtgaagaaggcatcaggcgataagattgtcgtggtggttcgggacaggccgttccag
cggactgtcaccatgcacaaggacagcatgggccacgtcggcttcgtgatcaagaagggg
aagattgtctctctggtcaaagggagttctgcggcccgcaacgggctcctcaccaaccac
tacgtgtgtgaggtggacgggcagaatgttatcgggctgaaggacaaaaagatcatggag
attctggccacggctgggaacgttgtcaccctgaccatcatccccagtgtgatctacgag
cacatggtcaaaaaggcatggcacctgagcagcagctgccgcacctccaccgcctggagc
ctcacaggcagcctgcacctttgcatagatgcgggatctgagcacacacacatgcactcc
agagctgggccagggccaccggccaccacagtcccagggcctgtcccagggctgagtggg
gagctgctccctccactgacaagaagaaaagcccatcgaaggccccaggagagcccccga
aagagaggaccctcaacactgtgggccagggccctcgtgacccacaccggtggtctcagg
ccagaacaggagtag

>gi568815578r:1210444_1413468|GENSCAN_predicted_peptide_6|264_aa
XGQQAISAAVSGNIGSWSRRCVPPVLFPWPLDSDSVGHKKSDPIRIINFHMLNKNVCMEM
LNRLEAQRKGRRTKQVTRQTQLYPEASGLPEVSGLLGFRSLFRRPGFPGRRAPCGARTQG
WARRAEVLGRAVEPPPGRCWSTPPVAPPARSASAAAMGVQVETISPGDGRTFPKRGQTCV
VHYTGMLEDGKKFDSSRDRNKPFKFMLGKQEVIRGWEEGVAQMSVGQRAKLTISPDYAYG
ATGHPGIIPPHATLVFDVELLKLE

>gi568815578r:1210444_1413468|GENSCAN_predicted_CDS_6|795_bp
nggggacagcaggccatttccgcagcagttagtggaaatattggttcttggagcaggcgc
tgcgtgcctccagtgcttttcccctggcccctggattctgattccgttgggcataagaag
agtgacccaattcgtattatcaactttcacatgctcaataagaatgtatgcatggagatg
cttaaccggctggaagcacagaggaagggcaggagaacgaagcaagtcacccggcaaaca
cagctgtatccggaggcctccgggcttccggaggtctcggggcttctgggcttccggtcc
ctcttccggaggcctgggtttccgggacgtcgcgcgccgtgtggggcgcgcacgcagggc
tgggcgcgacgcgccgaggtactaggcagagccgtggaaccgccgccaggtcgctgttgg
tccacgccgcccgtcgcgccgcccgcccgctcagcgtccgccgccgccatgggagtgcag
gtggaaaccatctccccaggagacgggcgcaccttccccaagcgcggccagacctgcgtg
gtgcactacaccgggatgcttgaagatggaaagaaatttgattcctcccgggacagaaac
aagccctttaagtttatgctaggcaagcaggaggtgatccgaggctgggaagaaggggtt
gcccagatgagtgtgggtcagagagccaaactgactatatctccagattatgcctatggt
gccactgggcacccaggcatcatcccaccacatgccactctcgtcttcgatgtggagctt
ctaaaactggaatga