GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:54:18 Sequence gi568815590r:95147620_95369189 : 221570 bp : 42.19% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 6426 7175 750 2 0 74 49 530 0.999 43.62 1.02 PlyA + 7848 7853 6 1.05 2.00 Prom + 15515 15554 40 -5.85 2.01 Init + 16886 16940 55 1 1 53 54 57 0.125 0.30 2.02 Term + 26914 27029 116 2 2 5 37 231 0.440 7.45 2.03 PlyA + 27130 27135 6 1.05 3.00 Prom + 41191 41230 40 -3.65 3.01 Init + 43182 43387 206 2 2 85 38 227 0.551 15.86 3.02 Intr + 45002 45109 108 2 0 61 94 65 0.853 2.88 3.03 Intr + 45785 45915 131 2 2 49 36 90 0.598 -0.68 3.04 Term + 50032 50156 125 2 2 125 43 280 0.992 24.77 3.05 PlyA + 51564 51569 6 1.05 4.13 PlyA - 51634 51629 6 1.05 4.12 Term - 51771 51752 20 1 2 81 51 24 0.322 -4.50 4.11 Intr - 52743 52635 109 0 1 81 93 36 0.595 2.34 4.10 Intr - 53162 52995 168 2 0 115 72 130 0.884 13.32 4.09 Intr - 53844 53589 256 2 1 -45 70 263 0.473 7.72 4.08 Intr - 58096 57972 125 1 2 74 45 67 0.243 -0.54 4.07 Intr - 59141 58901 241 1 1 41 75 143 0.575 4.93 4.06 Intr - 59689 59617 73 2 1 117 33 45 0.343 -0.55 4.05 Intr - 60332 60137 196 2 1 68 64 151 0.559 8.77 4.04 Intr - 60959 60800 160 1 1 47 94 79 0.335 3.47 4.03 Intr - 62440 62218 223 2 1 94 0 167 0.273 4.76 4.02 Intr - 65647 65477 171 2 0 89 37 140 0.621 7.99 4.01 Init - 73844 73652 193 2 1 78 84 116 0.123 9.38 4.00 Prom - 80929 80890 40 -3.45 5.09 PlyA - 82021 82016 6 1.05 5.08 Term - 82608 82476 133 2 1 59 42 106 0.068 -0.32 5.07 Intr - 83355 83230 126 2 0 104 80 26 0.078 2.27 5.06 Intr - 84197 84054 144 1 0 77 116 51 0.127 5.28 5.05 Intr - 95230 95153 78 0 0 102 87 31 0.012 2.15 5.04 Intr - 104664 104569 96 2 0 75 91 36 0.272 0.81 5.03 Intr - 112913 112849 65 2 2 96 115 45 0.931 4.60 5.02 Intr - 116155 116068 88 0 1 106 84 88 0.990 9.25 5.01 Init - 121909 121416 494 1 2 61 36 358 0.583 20.57 5.00 Prom - 132828 132789 40 -4.55 6.03 PlyA - 133540 133535 6 1.05 6.02 Term - 138447 138276 172 2 1 75 44 145 0.692 5.12 6.01 Init - 139043 138979 65 2 2 70 47 56 0.734 0.37 6.00 Prom - 140256 140217 40 -2.55 7.00 Prom + 142096 142135 40 -6.15 7.01 Init + 153186 153264 79 2 1 34 106 117 0.793 9.37 7.02 Intr + 153503 153573 71 0 2 -53 94 101 0.574 -5.42 7.03 Intr + 154097 154202 106 1 1 90 73 134 0.833 10.97 7.04 Term + 156316 156497 182 2 2 66 54 112 0.704 2.39 7.05 PlyA + 157372 157377 6 1.05 8.00 Prom + 164083 164122 40 -4.05 8.01 Sngl + 165525 165716 192 2 0 63 48 171 0.501 5.49 8.02 PlyA + 166258 166263 6 1.05 9.00 Prom + 168784 168823 40 -5.75 9.01 Init + 171778 171838 61 0 1 70 107 47 0.646 6.26 9.02 Term + 173237 173415 179 0 2 -15 47 202 0.738 2.67 9.03 PlyA + 174387 174392 6 1.05 10.03 PlyA - 174795 174790 6 1.05 10.02 Term - 185189 185119 71 0 2 126 42 56 0.786 1.92 10.01 Init - 187592 187424 169 2 1 72 94 37 0.424 2.64 10.00 Prom - 187908 187869 40 -4.45 11.04 PlyA - 188366 188361 6 1.05 11.03 Term - 189266 189192 75 2 0 99 46 57 0.841 -0.54 11.02 Intr - 189555 189467 89 1 2 85 94 75 0.883 6.57 11.01 Init - 195656 195500 157 2 1 76 111 77 0.468 8.92 11.00 Prom - 200633 200594 40 -7.45 12.06 PlyA - 201382 201377 6 1.05 12.05 Term - 204689 204558 132 2 0 85 48 112 0.947 4.01 12.04 Intr - 207016 206859 158 2 2 48 90 67 0.138 1.71 12.03 Intr - 210137 210038 100 2 1 82 87 18 0.294 -0.14 12.02 Intr - 211663 211553 111 1 0 91 82 45 0.373 3.86 12.01 Init - 217599 217288 312 0 0 63 50 163 0.489 7.37 12.00 Prom - 221207 221168 40 -3.65 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 129637 129485 153 1 0 77 38 118 0.825 2.64 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_1|249_aa MVDRLANSEANTRRISIVENCFGAAGQPLTIPGRVLIGEGVLTKLCRKKPKARQFFLFND ILVYGNIVIQKKKYNKQHIIPLENVTIDSIKDEGDLRNGWLIKTPTKSFAVYAATATEKS EWMNHINKCVTDLLSKSGKTPSNEHAAVWVPDSEATVCMRCQKAKFTPVNRRHHCRKCGF VVCGPCSEKRFLLPSQSSKPVRICDFCYDLLSAGDMATCQPARSDSYSQSLKSPLNDMSD DDDDDDSSD >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_1|750_bp atggtggatcgcttggcaaacagtgaagcaaatactagacgtataagtatagtggaaaac tgttttggagcagctggtcaacctttaactatacctggacgagttcttattggagaagga gtattgactaagttgtgcaggaaaaagcccaaagcaaggcagtttttcttgtttaatgat attcttgtatatggcaatattgtcatccagaagaaaaaatataacaaacaacatattatt cccctggaaaatgtcactattgattccatcaaagatgagggagacttaaggaatggatgg ctaatcaagacaccaactaaatcttttgcagtttatgctgccactgctacggagaaatca gaatggatgaatcatataaataaatgtgttactgatttactctccaaaagtgggaagaca cccagtaatgaacatgctgctgtctgggttcctgactctgaggcaactgtatgtatgcgt tgtcagaaagcaaaattcacacctgttaatcgtcgccaccattgccgcaaatgtggtttt gttgtctgtgggccctgctctgaaaagagatttcttcttcccagccagtcctctaagcct gtgcggatttgtgacttctgctatgacctgctttctgctggggacatggccacatgccag cctgctagatcagactcttacagccagtcattgaagtctcctttaaatgatatgtctgat gatgatgacgatgatgatagcagtgactaa >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_2|56_aa MAVRILKEKRDHSNLWVGGPQNAELQQEPDNGVGDKASFRDASVYAEGTWAESADK >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_2|171_bp atggcagtaagaatcttgaaagaaaaacgggaccattccaatttatgggttggtggacct caaaatgcagaactgcagcaggaacccgacaatggggtgggagacaaagccagcttcaga gatgccagtgtttacgctgaagggacgtgggcagaatccgccgacaaataa >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_3|189_aa MAFWGLQNYKGRGQLAALAHTFRCALGAASAVSPVSPEQGEGLQKPALSTRYLLDPAGLI LWLWSPPAGSLHGLRVGMQTKPVGHPSTIHAAFHLALLGSLCQLRPQGLESTPLKISVCR GEETPAERAPVRGTYKQELRNCRKSAGEGHPGPWMGLDAKSCSVRTFSDDDDDDDDDDDD SGESKDHMA >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_3|570_bp atggctttctggggccttcagaactataaagggagagggcagctggcagccctggcccac accttcagatgtgccctgggtgctgcctctgctgtgtctcctgtgtctcctgagcaggga gagggactgcagaaacctgcactgtccactcgctacctcctggatcctgccggtctcatt ctgtggctgtggtccccgccggctggcagcctgcatggactaagagtgggcatgcaaacc aagcccgtgggccatcccagcaccattcatgctgccttccacctggcattgttgggctct ttgtgtcagctcaggccccagggacttgaaagcactccactcaagatttctgtctgccga ggagaggagaccccagcagaaagagccccggtgaggggtacctacaagcaggagctgagg aattgtaggaagtcagctggagaggggcatcctggtccgtggatgggcttggatgccaag agctgctctgtaagaaccttctcagatgatgatgatgatgatgatgatgatgatgatgat tctggagaaagcaaagaccacatggcctga >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_4|644_aa MGPAKRRAMGCGNAGKTAFPTHPDERGWPGRQQSGETVLPGKLEEQRIAKGWGVADGVSC PRGVCLLSSHEDSRGCGGPRGTEILLHQKDDKMSTKPAHRAEYAQGCARVRTQCMHGSQG HEDPGGGGHVLSAPGLSLLQTGQPCCGHLSIEPDLEIQFLRGTRRGAPTVFSQAIPEKWK KRAFHRPSGKHSTIRVPPPSSLSSVSSMRLKKNAVVVPCLLTQRVKQEFSAPVCHGGPWE EVTYHSHPQGQQDGLGNSCTPTAASPAQYLDNMNPPPSSISVFQSCQLPLGKALSTRSLR PKADSGLIITVFPAGTGALRTLLDLLDQKGRGNAGQKSSQSLARRMERPPLASIMRIRSL GWEHPYTYVKENRYLLGRRKEGQALGRQPKDPAAVSVTTSLPGASATLIIFSWCPLSLKQ PLTGHFTQPPEQYFQTTGHIMPLLTHIPQLLKTSTGPSMSRRGEGDVTTGAETATTAKKR CSRQRLEDMDSPLEPPEGAQPASTLISASDTDCGLPAFGTVREEMPAVISHQVYSNMLQE PQETKSGVQPPAEEPGTAIAIAAAAPQKWPNPPGKAPKGKRPERVAFQCLHRDCMEFTPN FRDSLCLNNLGGGVRVWQGEGSSQDPPHAAPAPSSHDPDNWHQK >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_4|1935_bp atgggccctgcgaagagaagagccatgggctgcgggaacgctgggaaaacagcattccca acacacccagatgaaaggggctggcctgggagacagcagagtggagaaacagtcctgcca ggaaaactagaggagcagaggattgccaaggggtggggtgtggctgatggtgtctcctgc cccaggggagtctgcctcctctccagccatgaggacagcaggggatgtggagggccaaga gggacagaaatcctcctgcaccaaaaggatgataaaatgagcaccaaaccagctcaccga gctgagtatgcccagggttgtgccagagtcagaacccagtgcatgcacgggagccaaggc catgaagacccaggtggaggtggccacgtgctctcagcacctggactatcacttctgcag actggacaaccatgctgtgggcacttgtccatagagccagacctggaaatacagtttctc agaggtacccggagaggggcaccaactgtgttctcccaggccatcccagaaaagtggaag aagagagcatttcacagaccatcaggaaaacactcaaccataagggtaccccctccctct tccctgagctctgtttcttccatgaggttgaagaaaaatgccgtcgtggtcccctgcctg cttactcagcgtgtcaagcaggagttctcagctccagtgtgccatggagggccttgggaa gaggtcacataccattcccatccccagggccagcaggatgggctggggaattcctgcacc ccaacagctgcgtccccagctcagtacctggacaacatgaatccacccccatcgtcaata agcgtctttcagagctgccagctccctctggggaaagccctttccactcgttctttgagg cccaaagctgattctggtctaattataactgtattcccagcaggcacaggagcactgagg acccttcttgacctcctggatcagaaaggcagaggaaatgctgggcagaagtcaagccag tccctggcaagaagaatggaacgaccacctttggcttccattatgaggattcgctctctg gggtgggaacatccatacacttatgtcaaggagaacagatacctgttgggaaggaggaaa gagggacaagcgttgggcaggcaacccaaggaccctgcagcagtctccgtgacaacttct cttcctggtgcctctgcgactctgatcattttttcttggtgtcctctttctctgaagcag cctctcaccggccactttacacagccaccagaacaatatttccaaaccacaggtcacatt atgcccctccttacccacatcccccagcttctgaagacatccactggaccttccatgagc aggagaggagaaggtgatgtgaccacgggggcagagactgcgaccacagccaagaaacgc tgcagccgtcagaggctggaagacatggattctcccctagagcctccagagggagcgcag cctgccagcaccttgatttcagccagtgacactgattgtggacttccagccttcggaact gtaagagaagaaatgcctgctgttataagccaccaggtttacagtaatatgttacaggag ccacaggaaactaagtcaggggttcagcctcctgcagaggaaccaggtacagccattgcc attgcagctgcagccccacagaaatggcctaaccccccagggaaggcccctaaaggaaag cgccctgaaagagtggctttccaatgtttacacagagattgcatggaatttacaccaaac ttcagggactcgctctgtctgaacaacctgggaggaggagtcagggtgtggcagggtgag ggaagtagccaggaccctccacatgctgcccctgctccctcatcccatgaccctgataat tggcatcagaagtga >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_5|407_aa MKGRECVLVVTTWAREGETGGTTPDMLKGRNAKHRESHRHTAETAPNSVPAYARPSRTHR VLCREPSWAPGCHGKRCACAGTPNGAGARQVPSLLLETTGALAIGGEKARRFKMAEDLDE LLDEVESKFCTPDLLRRGMVEQPKGCGGGTHSSDRNQAKAKETLRSTETFKKEDDLDSLI NEILEEPNLDKKPSKLKSKSSGNTSVRASIEGLGKRACDHLRCIACDFLVVSYDDYMWDK SCDYLFFRYTPSSYPFKIQLIFQSQYLVQQQFSNPTDQPGCSTITRTRSMSFQPSSNAAM PPLLPPGHCLTTTEAVTTLFLSDLPSPGLSRWSPGFCIPLSPPFSSTKTNHSCPVYQGLP TETRHQALSQVLETQIQPLCSWQKEDNIGRGDCSVVTHQIQLRQVYI >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_5|1224_bp atgaaggggcgagaatgcgtgcttgtggttacaacgtgggcccgggagggggagacggga ggaaccacgccagacatgctgaaaggaagaaacgcaaagcatcgcgagtcacacagacac actgcagagacagctccgaactctgtccctgcatacgcccgcccctcacgaacccaccgc gttctctgccgcgagccctcgtgggcgccgggttgccacgggaagcggtgcgcatgcgcg gggacgcctaacggggcaggagcccgccaggtcccctcgttactcctggaaaccaccgga gccttggcgattggaggggaaaaggccaggcgattcaagatggcggaggacctggacgag ctcttggatgaagtcgagtccaagttttgcacacctgaccttctaagacggggtatggtc gagcagcccaaaggctgcggcggcggcacccacagtagcgaccggaaccaagccaaggcg aaagagacgctcagatcaacagaaacatttaaaaaagaagatgatcttgacagtcttatt aatgaaatacttgaagagcccaacttggacaaaaaaccctctaaattaaaatctaaatct tcaggtaacacatctgtcagagcttccattgaaggccttggtaaaagagcatgtgaccat ctgcgttgtatagcctgtgatttcttggtagtcagctatgatgactatatgtgggacaaa tcgtgtgattatctgtttttcagatatactccctcatcttatcctttcaaaatccaactc atttttcaaagccagtatcttgtacaacagcagttctcaaaccccacagaccagcctggc tgctccaccatcacccgcaccaggtctatgtccttccagccatcctcaaatgctgccatg ccccctttgctgcctcctggccactgcctgacaaccacagaagctgtcaccactttgttc ctaagcgacctgcccagtccaggcctctccagatggtcccctggcttctgcatccctctt tcacctcctttttcctcaaccaagacaaaccactcctgtccagtctatcagggcctcccc actgaaaccaggcaccaggcattgagccaggtcctggagacacaaatacagcccctgtgc tcatggcagaaggaagacaacataggtcgaggagactgttcagttgtgactcaccagatc caactcaggcaggtctatatttaa >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_6|78_aa MKFQEDDKYYQQRQTRNKKHYLTLLTSAWSKLDYYGVPTSSSGGEHGGDPFIALRPTPRN ATQYICSECGHRNTSNAK >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_6|237_bp atgaagtttcaagaagacgacaagtactatcagcagaggcagacaagaaacaaaaagcat tatctgacattgctcacatctgcctggtcaaaactggattactatggggttccaaccagt agcagtggaggagagcatggaggagatccttttattgccctgaggcccacaccgagaaat gccacacagtatatctgctctgaatgtggtcacaggaacacatctaatgcaaagtag >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_7|145_aa MGVFEFDQGVDTNYGEKLRFPCYESMGRGRTCRWRDHRGIYGRTTATAGEGQCSTGRPEG SDVPNRQGMNLRVGDLRIPYLGTHSAQALWAEAYVELGKGMAGASFPNLFYFCTTHSAAL DGGGGEREGEGAGKVLFGRTVVTCH >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_7|438_bp atgggggtgtttgagtttgatcagggagttgacaccaactatggggaaaagctgaggttc ccgtgctacgagagcatgggccgtggaagaacttgccgatggagggaccacagagggatc tatgggagaacaacggcaactgcaggagagggccaatgttccacgggacggcctgagggc tcagatgttcctaacagacaaggaatgaatctccgggttggtgacttacggattccctat ctcggaacacattcggcgcaggctctctgggctgaagcctatgtggaactggggaaaggc atggctggggcctctttccctaacttgttctacttctgcaccactcactctgctgccctg gatggaggtggaggtgaaagggaaggtgaaggggctgggaaagtcctgtttggcaggaca gtagtaacctgccactga >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_8|63_aa MDEIVRICGRIREKPKTDLRIIGIKGTLRKRSQKRGVQGARRERGARREPGEESCRDMKE EEM >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_8|192_bp atggatgaaattgtcagaatatgtggaagaataagagaaaagccaaagacagatctcagg atcattggcattaagggtacattgaggaagaggagccagaagaggggagtccagggagca agaagagaacggggagcaagaagagaaccaggagaagagagctgcagggacatgaaggag gaggaaatgtga >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_9|79_aa MNWLFDQDKLFNVTGSQVLHALTPNVTIFGDRAFKEVIKVKCGHQGQGPNPIGLVSSEEE ETPGMQSTEKRPREDTVKS >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_9|240_bp atgaactggctgtttgaccaggacaagttatttaatgttactgggtctcaggttcttcac gccctaacccccaatgtgactatatttggagatagggcctttaaggaggtaattaaggtt aaatgtggccatcaggggcaaggtcctaatccaatagggctggtgtcctcagaagaagag gagacaccaggtatgcagagcacagagaaaaggccacgtgaggacacggtgaaaagttga >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_10|79_aa MKLSLLLTPCPPASTSNNTMRTCLGSPAKRYKTPSTSQVAPPKVILGHWTASCPQTDTFW VATSPFYLEISEPYDHAKE >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_10|240_bp atgaagctttccctcctgctcacaccttgccctcctgcttccacctctaacaacacaatg agaacatgtctgggctcacctgctaaaagatacaagacacctagcacaagccaagttgcc ccacctaaggtcatcttaggtcactggacagccagctgtccccaaacagacacattctgg gtggcaactagccctttttacttagaaatctcagaaccttacgatcatgccaaagaataa >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_11|106_aa MPYLDSKYQDDSTSSISHSGHTSDPNHALISHSPPNPANLLLLWVITQREDLANIGSESK QYCKTMDLIFKRNMESKQGEQEIRKLRPREAKETHRHHTVSWQQVS >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_11|321_bp atgccttacctggacagtaaataccaggatgactccaccagcagcatcagtcactctggc cacacctctgaccccaatcatgctctcatttctcactcacctccaaacccagccaactta ctactattgtgggttatcacacagagagaggaccttgcaaatattggcagtgaatcaaag cagtactgtaagaccatggatctcattttcaaacgaaacatggaaagtaagcagggagag caggagataaggaaactgagacccagagaagctaaagaaactcacagacatcacacagtt agttggcagcaggtctcctaa >gi568815590r:95147620_95369189|GENSCAN_predicted_peptide_12|270_aa MSGKEPDKMLSAGGFLFSVNSHEACGLVLREANHIQAHAKEQVQPHSAGRATAPTLPPPP CSSRAGAGRQHHTSQVTLKGGGATTAAADVVQCTTIHLPCLFQDILAGCTSWDYERPGDQ KAHWVPNVDSGEIIAHGAWPLMGAEGDTKHEQCLPCSPLCTEQDEPGPGVLPGEETGEPH PPRTHLGRSRCLRRWKAQSMTQQVASNGAEQMNARASRFSKILAAPRPDEAPSFGKTTIT NKSAVLLEVRRTLAKSNYGSPSSATARPGK >gi568815590r:95147620_95369189|GENSCAN_predicted_CDS_12|813_bp atgtctggaaaagagccagacaagatgctttctgctggtggtttcttattttctgtcaat tcacacgaagcctgcggtctggtactgagggaagcaaatcacatccaggcacacgccaag gaacaggtccagccccactccgctggcagagctacagcccccaccctgcctcctcctccc tgctcaagcagggctggagcaggccgccagcaccacacttctcaggttacactgaagggt ggaggggctaccactgcagcggcagatgttgtccagtgcaccaccatccatcttccctgc ctcttccaggatatactggctggatgcactagttgggactatgagaggcctggagaccag aaagcacattgggtgcccaatgttgacagcggggaaatcattgctcatggagcctggccc ctgatgggagccgaaggagacacaaaacatgagcagtgccttccctgctctccattgtgc acagagcaagatgagcccgggccaggagtactacctggagaagagaccggggagccacat ccacccaggacccacttgggtagaagccggtgcctaagaaggtggaaagcgcaatctatg acacaacaggtggcaagcaacggtgctgagcaaatgaatgcacgggcatcaaggttttcc aagatacttgctgcgccacggcctgatgaagccccaagttttggaaaaacaaccattaca aacaaatcagctgtcctcctggaagtcagaaggaccctggctaaatccaactatggttca ccttcctctgccacagccaggcctgggaagtga