GENSCAN 1.0 Date run: 6-Nov-116 Time: 12:08:08 Sequence gi568815596r:65213504_65444896 : 231393 bp : 43.26% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 14407 14454 48 0 0 85 74 67 0.079 5.95 1.02 Intr + 26349 26459 111 2 0 86 85 60 0.778 6.08 1.03 Intr + 33021 33236 216 2 0 29 105 171 0.926 11.60 1.04 Intr + 37524 37596 73 2 1 112 100 31 0.968 5.68 1.05 Intr + 40225 40361 137 2 2 87 115 -31 0.940 -0.21 1.06 Intr + 42042 42191 150 2 0 67 98 144 0.997 13.66 1.07 Intr + 47744 47889 146 1 2 76 88 118 0.999 9.68 1.08 Intr + 51540 51672 133 0 1 79 84 70 0.999 6.35 1.09 Term + 55061 55231 171 1 0 118 37 209 0.981 16.63 1.10 PlyA + 56223 56228 6 1.05 2.00 Prom + 74086 74125 40 -2.46 2.01 Init + 78510 78544 35 2 2 66 64 49 0.093 -0.33 2.02 Intr + 86745 86895 151 0 1 75 95 53 0.765 4.86 2.03 Term + 87971 88006 36 1 0 93 50 50 0.490 -0.96 2.04 PlyA + 89598 89603 6 1.05 3.10 PlyA - 91468 91463 6 1.05 3.09 Term - 100666 99998 669 1 0 68 41 903 0.563 77.39 3.08 Intr - 103380 103231 150 0 0 107 99 61 0.936 9.46 3.07 Intr - 118548 118484 65 1 2 108 98 61 0.974 7.54 3.06 Intr - 121270 121102 169 1 1 106 67 118 0.841 11.02 3.05 Intr - 121529 121384 146 0 2 34 12 158 0.779 2.80 3.04 Intr - 127524 127428 97 0 1 139 89 4 0.949 5.18 3.03 Intr - 131393 131216 178 1 1 109 55 138 0.195 12.52 3.02 Intr - 136679 136642 38 2 2 88 81 -27 0.036 -6.34 3.01 Init - 141532 141443 90 1 0 78 65 136 0.529 8.79 3.00 Prom - 150319 150280 40 -3.36 4.00 Prom + 151804 151843 40 -4.06 4.01 Init + 157556 157596 41 1 2 65 61 64 0.324 0.11 4.02 Intr + 165523 165621 99 1 0 104 37 74 0.417 3.13 4.03 Term + 166809 166974 166 0 1 83 42 91 0.450 1.39 4.04 PlyA + 167083 167088 6 1.05 5.00 Prom + 173191 173230 40 -0.36 5.01 Init + 174721 174852 132 0 0 36 89 61 0.189 1.14 5.02 Intr + 176571 176659 89 2 2 64 57 46 0.080 -2.13 5.03 Intr + 200011 200178 168 1 0 77 83 115 0.744 8.96 5.04 Intr + 200275 200446 172 1 1 32 91 26 0.434 -2.75 5.05 Intr + 203844 204003 160 2 1 105 41 95 0.484 6.06 5.06 Term + 214175 214284 110 0 2 89 47 66 0.470 1.27 5.07 PlyA + 214509 214514 6 1.05 6.06 PlyA - 215127 215122 6 1.05 6.05 Term - 218682 218534 149 1 2 64 49 63 0.334 -1.94 6.04 Intr - 221438 221325 114 0 0 54 106 32 0.537 2.02 6.03 Intr - 223255 223061 195 2 0 59 60 155 0.491 9.19 6.02 Intr - 225000 224982 19 0 1 80 87 -2 0.442 -4.92 6.01 Intr - 228691 228426 266 2 2 59 99 148 0.419 10.23 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:65213504_65444896|GENSCAN_predicted_peptide_1|394_aa MDSQGRKVVVCDNGTGFVKCGYAGSNFPEHIFPALVGRPIIRSTTKVGNIEIKDLMVGDE ASELRSMLEVNYPMENGIVRNWDDMKHLWDYTFGPEKLNIDTRNCKILLTEPPMNPTKNR EKIVEVMFETYQFSGVYVAIQAVLTLYAQGLLTGVVVDSGDGVTHICPVYEGFSLPHLTR RLDIAGRDITRYLIKLLLLRGYAFNHSADFETVRMIKEKLCYVGYNIEQEQKLALETTVL VESYTLPDGRIIKVGGERFEAPEALFQPHLINVEGVGVAELLFNTIQAADIDTRSEFYKH IVLSGGSTMYPGLPSRLERELKQLYLERVLKGDVEKLSKFKIRIEDPPRRKHMVFLGGAV LADIMKDKDNFWMTRQEYQEKGVRVLEKLGVTVR >gi568815596r:65213504_65444896|GENSCAN_predicted_CDS_1|1185_bp atggacagccagggcaggaaggtggtggtgtgcgacaacggcaccgggtttgtgaagtgt ggatatgcaggctctaactttccagaacacatcttcccagctttggttggaagacctatt atcagatcaaccaccaaagtgggaaacattgaaatcaaggatcttatggttggtgatgag gcaagtgaattacgatcaatgttagaagttaactaccctatggaaaatggcatagtacga aattgggatgacatgaaacacctgtgggactacacatttggaccagagaaacttaatata gataccagaaattgtaaaatcttactcacagaacctcctatgaacccaaccaaaaacaga gagaagattgtagaggtaatgtttgaaacttaccagttttccggtgtatatgtagccatc caggcagttctgactttgtacgctcaaggtttattgactggtgtagtggtagactctgga gatggtgtgactcacatttgcccagtatatgaaggcttttctctccctcatcttaccagg agactggatattgctgggagggatataactagatatcttatcaagctacttctgttgcga ggatacgccttcaaccactctgctgattttgaaacggttcgcatgattaaagaaaaactg tgttacgtgggatataatattgagcaagagcagaaactggccttagaaaccacagtatta gttgaatcttatacactcccagatggacgtatcatcaaagttgggggagagagatttgaa gcaccagaagctttatttcagcctcacttgatcaatgttgaaggagttggtgttgctgaa ttgctttttaacacaattcaggcagctgacattgataccagatctgaattctacaaacac attgtgctttctggagggtctactatgtatcctggcctgccatcacggttggaacgagaa cttaaacagctttacttagaacgagttttgaagggtgatgtggaaaaactttctaaattt aagatccgcattgaagacccaccccgcagaaagcacatggtattcctgggtggtgcagtt ctagcggatatcatgaaagacaaagacaacttttggatgacccgacaagagtaccaagaa aagggtgtccgtgtgctagagaaacttggtgtgactgttcgataa >gi568815596r:65213504_65444896|GENSCAN_predicted_peptide_2|73_aa MDVLFREKGIVRPWCFKRKLNWPREATTPTVWSLLQAKLQPGGQPDPRGLPALGRSLIVP AKDGSVTADNVSF >gi568815596r:65213504_65444896|GENSCAN_predicted_CDS_2|222_bp atggatgtcctgttccgtgaaaaggggatcgtgaggccctggtgcttcaagagaaagttg aattggccccgagaggccaccacacccaccgtgtggtccctcctccaggccaagctgcag cctggaggtcagcctgaccctcggggactcccagctctgggcagaagcctcatcgtccct gccaaggatggttctgtgacagccgacaatgtcagcttttag >gi568815596r:65213504_65444896|GENSCAN_predicted_peptide_3|533_aa MPVLPVPPGLTSALCWAFLLLGQVHYSPQEMLFNSFQKEGEGSDSYIVRVKAVVMTRDDS SGGWFPQEGGGISRVGVCKVMHPEGNGRSGFLIHGERQKDKLRYDNSSFGLSSFSNAMLV PTGDHAPTWELCKCARRKRNNRDEESVVKGYNQDSGEQRRSDGQQPSKASFPIGETHRYV KAGVVLECYVRKDLVYTKANPTFHHWKVDNRKFGLTFQSPADARAFDRGVRKAIEDLIEG STTSSSTIHNEAELGDDDVFTTATDSSSNSSQKREQPTRTISSPTSCEHRRIYTLGHLHD SYPTDHYHLDQPMPRPYRQVSFPDDDEEIVRINPREKIWMTGYEDYRHAPVRGKYPDPSE DADSSYVRFAKGEVPKHDYNYPYVDSSDFGLGEDPKGRGGSVIKTQPSRGKSRRRKEDGE RSRCVYCRDMFNHEENRRGHCQDAPDSVRTCIRRVSCMWCADSMLYHCMSDPEGDYTDPC SCDTSDEKFCLRWMALIALSFLAPCMCCYLPLRACYHCGVMCRCCGGKHKAAA >gi568815596r:65213504_65444896|GENSCAN_predicted_CDS_3|1602_bp atgcctgtcctgccagtccccccaggcctgacctcagccctctgctgggctttccttctc ctgggccaggtgcattactcacctcaagagatgttgtttaattctttccagaaagagggg gagggaagtgacagctatattgtgcgtgtcaaggctgtggttatgaccagagatgactcc agcgggggatggttcccacaggaaggaggcgggatcagtcgcgtcggggtctgtaaggtc atgcaccccgaaggcaatggacgaagcggctttctcatccatggtgaacgacagaaagac aaactgagatatgataactcttcctttggcctctcaagcttctctaatgcaatgctagtg cccactggagaccatgcgcccacatgggagctctgcaaatgtgccaggaggaagaggaat aaccgtgatgaagagagtgtagtgaaggggtacaaccaggacagtggggagcagcgaaga tcggatggtcagcagccaagcaaagcctcctttcccattggagagactcacagatatgtg aaggcaggggtggtattggaatgctatgtaagaaaggacttggtctacaccaaagccaat ccaacgtttcatcactggaaggtcgataataggaagtttggacttactttccaaagccct gctgatgcccgagcctttgacaggggagtaaggaaagcaatcgaagaccttatagaaggt tcaacaacgtcatcttccaccatccataatgaagctgagcttggcgatgatgacgttttt acaacagctacagacagttcttctaattcctctcagaagagagagcaacctactcggaca atctcctctcccacatcctgtgagcaccggaggatttataccctgggccacctccacgac tcataccccacagaccactatcacctcgatcagccgatgccaaggccctaccgccaggtg agcttcccggacgacgacgaggagatcgtgcgcatcaacccccgggagaagatctggatg acggggtacgaggattaccggcacgcacccgtcaggggcaagtacccggacccctcggag gacgcggactcctcctacgtgcgcttcgccaagggcgaggtccccaagcatgactacaac tacccctacgtggactcctcagactttggcctaggcgaggaccccaaaggccgcgggggc agcgtgatcaagacgcagccctcccggggcaagtcgcggcggcggaaggaggacggagag cgctcgcggtgcgtgtactgcagggacatgttcaaccacgaggagaaccgccggggccac tgccaggacgcgcccgactccgtgagaacttgcatccgccgggtgagctgcatgtggtgc gcggacagcatgctctatcactgtatgtcggaccccgagggagactatacagacccttgc tcgtgcgatactagcgacgagaagttttgcctccggtggatggctcttattgccttgtct ttcctggccccctgtatgtgctgttacctgccccttcgggcctgctaccactgcggagtg atgtgcaggtgctgtggcgggaagcacaaagcggccgcgtga >gi568815596r:65213504_65444896|GENSCAN_predicted_peptide_4|101_aa MRATATLSLPVRAQASKSGVQAEHPYITGAIKDDTRAIVKTRDLANRLNLIDLSQGCVLI PKPLSLKLLPQETLLAFQRRCQKSKAVWNLSYGTAFEASQS >gi568815596r:65213504_65444896|GENSCAN_predicted_CDS_4|306_bp atgagggctacagccaccctgtcgctgcccgtaagagctcaggcctccaagtctggggtc caggctgagcatccctacattaccggtgcaatcaaagatgacacgagagctatagtaaaa actagggacttggcaaacagactcaaccttatagatcttagtcaaggctgcgtactcatc cccaaacccctgtcattaaagcttttgccccaggagactctactggcgttccagagacgc tgtcagaagtcaaaagctgtctggaacctttcttatggaactgccttcgaggctagtcaa tcatga >gi568815596r:65213504_65444896|GENSCAN_predicted_peptide_5|276_aa MGKRKFNLSESSAYETLNMSPQGRDRVSPIRGRAKDIISAQQMTRPEPSRKLVTPPEMFD GLGRGAKITFLRGRAVKTTENWLSLLQRAVVPQKNLLTIRAAYPSHRASGTQAFHIKHTS KHKLQGQPVCPNSKAYQSGSSRHLFFLLNIQTAKCGDLSIPKARKRQIYKQFMNEYKFST IKNEYLLDSELKALPCLHSQCSCTWNTDMRKPKLKTPTYHVQESYTRRASGKLIYLHLMQ AFSGKISEVRKMMAMTDCHQFRRIMNKHDWQKGCEL >gi568815596r:65213504_65444896|GENSCAN_predicted_CDS_5|831_bp atggggaaaagaaagttcaacctgagtgaatccagtgcttatgaaactctgaatatgagc cctcaaggtagagatcgcgtgtctcctattaggggccgggccaaagacattatcagtgcc caacaaatgacgagaccagaaccctcacggaagcttgtgacacctccagaaatgtttgat ggtctgggcagaggagccaaaatcacctttctgagaggaagggcggtcaaaacaacagag aattggttatcgctgctgcagagggcagtggttccacagaagaacttgctgacgattaga gctgcatacccaagtcaccgtgcaagtgggacccaggcattccacatcaagcacacatct aagcataagctgcagggccagccagtgtgccctaactccaaagcttatcagagtggttca tcaagacatttgttttttctactgaacatacagactgccaaatgtggtgacttaagtatt cccaaggccagaaagcgtcaaatctataaacagtttatgaatgagtataaattttcaaca atcaagaatgagtacttgttggatagtgagctgaaggcccttccctgcctccactcccag tgctcctgtacctggaacacagacatgcggaaaccaaagcttaagacccctacctaccat gttcaggagagctacacaagaagagcatctgggaagctcatctacctccacctcatgcag gcattttctgggaagataagtgaagtgagaaaaatgatggcaatgacagattgtcaccag ttcagaagaataatgaacaagcatgattggcagaaaggctgcgagctctga >gi568815596r:65213504_65444896|GENSCAN_predicted_peptide_6|247_aa XICIACNLVDGPQGNHCSKLKVSISDPSTTIISKITVTISTTITFTTITTIILILPITIS TAISQHHHHHHHHHHHHEHHLCRHQKSTQVEYSVFAERENDLGLNGTGDEALHDNGTTER VSGPPAGYLYRADYAIGLQPFMPRLGRRTPGAGGPHSRAPELAYLQKLITSPATLPFVYL ETGGWSPRDAHRVDNFDMACLCTPSPKPYPAPCFPFCCGALFISSRPPSPNQAISGDVKK GGERTGR >gi568815596r:65213504_65444896|GENSCAN_predicted_CDS_6|744_bp nctatctgtatagcctgcaatctggtagatggaccacagggaaaccactgcagcaaacta aaagtgtctatctcagatccttccaccaccatcatttccaaaatcaccgtcactatctcc accaccatcacctttaccaccatcaccactatcatcctcatccttcccatcaccatctcc actgctatctctcaacaccatcatcatcaccaccaccaccatcatcaccatgagcaccac ctctgtcgtcatcaaaaaagcacacaggtagaatacagtgtctttgctgaaagggaaaac gacctgggattgaacggcaccggggatgaagcattgcacgataacggcacaaccgagcgc gtcagcggcccgcccgccggctatttataccgcgcggattatgcaatcgggctccagcca ttcatgccccgactgggccgccgcacgcccggggcgggggggcctcattcccgggcgccg gagctggcctacctgcagaaattaatcacctcacctgccaccttgccatttgtgtacctt gaaactggagggtggagcccaagagatgcccacagggtggataattttgacatggcctgc ctttgcaccccttcccccaaaccctatcccgcgccctgcttccccttctgctgcggcgcc ctcttcatctctagccgccccccctccccaaatcaggcgatctccggagatgtgaagaag gggggcgagcggacaggaagatga