GENSCAN 1.0 Date run: 6-Nov-116 Time: 22:09:33 Sequence gi568815591r:131404314_131656359 : 252046 bp : 44.62% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 6993 7070 78 1 0 89 121 37 0.627 6.62 1.02 Intr + 10311 10397 87 1 0 51 109 27 0.205 1.04 1.03 Intr + 31207 31310 104 2 2 51 111 -11 0.164 -2.61 1.04 Intr + 33472 33684 213 0 0 115 36 129 0.128 9.31 1.05 Intr + 39168 39389 222 2 0 68 52 75 0.001 0.22 1.06 Intr + 41461 41590 130 0 1 130 82 69 0.002 10.77 1.07 Intr + 58904 59051 148 0 1 80 87 77 0.499 6.09 1.08 Intr + 66529 66631 103 1 1 91 88 35 0.718 3.98 1.09 Intr + 74310 74364 55 2 1 79 106 -3 0.024 -0.85 1.10 Term + 83294 83415 122 0 2 65 36 100 0.061 1.14 1.11 PlyA + 83620 83625 6 1.05 2.00 Prom + 83697 83736 40 -1.96 2.01 Init + 93197 93251 55 1 1 71 92 27 0.506 2.85 2.02 Intr + 93766 93919 154 2 1 83 88 49 0.358 3.53 2.03 Term + 98123 98147 25 2 1 139 55 30 0.282 2.50 2.04 PlyA + 98497 98502 6 -3.44 3.20 PlyA - 98903 98898 6 -1.95 3.19 Term - 100195 99998 198 1 0 81 37 352 0.863 26.80 3.18 Intr - 101722 101555 168 1 0 93 115 311 0.999 34.44 3.17 Intr - 102008 101947 62 0 2 99 99 136 0.995 14.25 3.16 Intr - 102413 102266 148 2 1 75 79 213 0.999 18.91 3.15 Intr - 104781 104638 144 0 0 43 56 113 0.809 4.08 3.14 Intr - 105272 105052 221 0 2 50 28 100 0.688 -1.78 3.13 Intr - 107120 106515 606 0 0 77 102 453 0.057 38.02 3.12 Intr - 129903 129754 150 1 0 114 90 4 0.294 3.33 3.11 Intr - 132223 132135 89 0 2 76 50 61 0.222 0.71 3.10 Intr - 133797 133506 292 1 1 120 58 112 0.342 7.59 3.09 Intr - 137075 136953 123 0 0 68 76 56 0.756 2.96 3.08 Intr - 139764 139635 130 0 1 90 57 84 0.662 5.77 3.07 Intr - 145540 145447 94 0 1 118 39 -8 0.141 -2.73 3.06 Intr - 149693 149564 130 0 1 81 115 -7 0.684 1.05 3.05 Intr - 152142 151947 196 0 1 35 101 223 0.007 17.29 3.04 Intr - 157976 157852 125 0 2 117 59 85 0.382 8.80 3.03 Intr - 170475 170427 49 0 1 60 103 50 0.241 1.95 3.02 Intr - 173539 173434 106 0 1 40 86 86 0.126 3.82 3.01 Init - 177664 177645 20 1 2 105 64 40 0.203 1.37 3.00 Prom - 183692 183653 40 -4.66 4.00 Prom + 185328 185367 40 -6.06 4.01 Init + 196609 196720 112 0 1 43 60 56 0.383 -1.32 4.02 Term + 198117 198424 308 1 2 103 38 192 0.657 10.98 4.03 PlyA + 199560 199565 6 -0.45 5.03 PlyA - 200417 200412 6 1.05 5.02 Term - 201065 200889 177 2 0 19 51 145 0.801 1.59 5.01 Init - 203716 203636 81 1 0 67 53 121 0.372 5.65 5.00 Prom - 208344 208305 40 -4.06 6.07 PlyA - 211055 211050 6 1.05 6.06 Term - 215088 214976 113 1 2 111 55 25 0.105 0.22 6.05 Intr - 216070 215967 104 0 2 81 48 54 0.106 0.52 6.04 Intr - 225657 225607 51 2 0 118 44 69 0.127 3.52 6.03 Intr - 233148 233041 108 2 0 94 53 58 0.258 2.30 6.02 Intr - 245744 245645 100 0 1 72 62 67 0.185 1.67 6.01 Intr - 251631 251442 190 0 1 81 72 52 0.251 2.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 107055 106515 541 0 1 98 102 412 0.940 38.64 S.002 Init - 152046 151947 100 0 1 82 101 283 0.849 27.42 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:131404314_131656359|GENSCAN_predicted_peptide_1|420_aa XGLFNQYISQQEYKPRWSQIIPKSTKETIISSKGDGEDNRPGMRGGHQMVIDVQTVLDII YQCRYVGNYEFSFSVCVIPSDFPLSSHVILNGPSARSCHKMCIDIQRRQIYTLGRYLDSS VRNSKSLKSDFYRYDIDTNTWMLLSEDTAADGGPKLVFDHQMCMDSEKHMIYTFGGRILT CNGSVDDSRASEPQFSGLFAFNCQCQTWKLLREDSCNAGPEDIQSRIGHCMLFHSKNRCL YVFGGQRSKTYLNDFFSYDVDSDHVDIISDGTKKDSGMVPMTGFTQRATIDPELNEIHVL SGLSKDKEKREENVRNSFWIYDIVRNSWFEEKAQVDPLSALKYLQNDLYITVDHSDPEET KEFQLLASALFKSGSDFTALGFSDVDHTYAQRTQLFDTLVNFFPDSMTPPKGNLVDLITL >gi568815591r:131404314_131656359|GENSCAN_predicted_CDS_1|1263_bp natggcttgttcaatcagtatatcagtcaacaggaatataagccacgatggagtcaaatc attcccaaaagtaccaaagaaactattatttcttctaaaggtgatggggaagataaccgt ccaggaatgagaggaggccatcagatggttattgatgttcaaacagtgcttgatataatt taccagtgtagatatgtgggcaattatgaattcagtttttcagtatgtgtaattccttca gattttcctctttcatctcatgtcatcttgaatggtcctagtgccagatcgtgtcataaa atgtgcattgatattcaacggaggcaaatctacacattggggcgttacttggattcctct gtgaggaacagcaaatctctgaaaagtgacttctatcgttatgacattgatacaaacaca tggatgttactaagtgaggatactgctgctgatggagggccgaaattggtgtttgatcat cagatgtgtatggactcagaaaaacatatgatctacacttttggtggtagaattttgact tgtaatggcagcgtagatgacagcagagccagtgaaccacaattcagtggcttgtttgct ttcaactgtcaatgtcaaacctggaaacttcttcgagaggactcctgtaatgctgggcct gaggacatccagtctcgaataggacactgcatgttattccactcaaaaaatcgttgctta tatgtatttggtggccagcgatcaaagacctatttgaatgatttctttagttatgatgtg gactctgatcatgtagacataatatcagatggcaccaagaaagactctgggatggttcca atgacaggatttacacagagagcaactattgatccagaactgaatgaaatacacgtctta tctggactcagcaaagataaggaaaagagggaagaaaatgttagaaattcattctggatt tatgacattgtgaggaatagttggtttgaagaaaaggcccaagtggatccccttagtgct ctgaaatatttacaaaatgatctttatataactgtggatcattcagacccagaagagaca aaagagtttcagctcctggcatcagctctattcaaatctggttcagattttacagctctg ggcttttctgatgtggatcacacctatgctcaaagaactcagctctttgacaccttagta aatttctttcctgacagcatgactcctcctaaaggcaacctggtagacctcatcacactg taa >gi568815591r:131404314_131656359|GENSCAN_predicted_peptide_2|77_aa MQDTTLDIFVTTFWWEDSTSLLSYGDFFPVGFHPAKAAAVNHILAREILNIGETVIRRNY VEVACLALRSISDGYRL >gi568815591r:131404314_131656359|GENSCAN_predicted_CDS_2|234_bp atgcaggacactacccttgatatatttgtcactaccttctggtgggaagacagcacttca ctattgtcttacggggacttttttcctgtgggctttcacccagcaaaagcagcagccgtc aatcatatccttgccagagagattcttaatataggagaaactgtaataaggagaaactac gtagaagtagcctgcctagcactacgttcgatatcagatggctacagactgtga >gi568815591r:131404314_131656359|GENSCAN_predicted_peptide_3|1016_aa MAAQLLGRMITATIKQQQQQHNGSSDNTATAATILALILLHKAFELSMQLHAGKGLYAEY SWPIIYSHGYNPICLVMVLGSPFPVLTLGLSFTVTDKTSRASPGGAHAPTARTRGSSAGT AATCSRPRGDDTMRCALALSALLLLLSTPPLLPSSPSPSPSPSQNAYLPIYGQIMQPTAK KAFTLRVGQFIRPPFSAFWFDNHLAEPQCLIPLGNKPPHTSEDQGVGWHLVHQLSWPEGY ASVSSSVNEELAQWLSGAFLFSISHGLKLGDFGPRTAGIMGLMATPSHPQVTPNPTSTPL PTVQKIQAPLSKTTKEIRIVCSVGGPQWALCSLATDVFEVRTHQFPKDRDCDGHGSPIHN GPKLEAAQMPISSRMDKYIVACAHNRVLHCNGGEQSATAHHEKDELHKAKLSEGSQTQES RRPDTSCCWPEGVSTILEHTYQVFVGTGFCADQPCLTEKEEPHGSGQTCTHSHLGKACAG VSGSRGSGGGVEEEVGCAALTATQTTTDSSNKTAPTPASSVTIMATDTAQQSTVPTSKAN EILASVKATTLGVSSDSPGTTTLAQQVSGPVNTTVARGGGSGNPTTTIESPKSTKSADTT TVATSTATAKPNTTSSQNGAEDTTNSGGKSSHSVTTDLTSTKAEHLTTPHPTSPLSPRQP TSTHPVATPTSSGHDHLMKISSSSSTVAIPGYTFTSPGMTTTLPSSVISQRTQQTSSQMP ASSTAPSSQETVQPTSPATALRTPTLPETMSSSPTAASTTHRYPKTPSPTVAHESNWLGG NVKDSMTISPNLNLSGFFQAKCEDLETQTQSEKQLVLNLTGNTLCAGGASDEKLISLICR AVKATFNPAQDKCGIRLASVPGSQTVVVKEITIHTKLPAKDVYERLKDKWDELKEAGVSD MKLGDQGPPEEAEDRFSMPLIITIVCMASFLLLVAALYGCCHQRLSQRKDQQRLTEELQT VENGYHDNPTLEVMETSSEMQEKKVVSLNGELGDSWIVPLDNLTKDDLDEEEDTHL >gi568815591r:131404314_131656359|GENSCAN_predicted_CDS_3|3051_bp atggcggcacagctcctggggagaatgataacagcaaccataaaacaacaacaacaacaa cacaacggcagcagcgacaatacagccacagcagcaacaatacttgctctcattcttctt cataaggcttttgaactctccatgcagctccatgcaggtaaagggctctatgcagaatat tcctggcccatcatctacagtcatggctacaaccccatctgcttggttatggttctcgga tctccatttccagtcctgactttaggtctgagctttactgtcacagataaaacttctagg gcctccccgggcggcgcccacgctcctaccgcccggacgcgcggatcctccgccggcacc gcagccacctgctcccggcccagaggcgacgacacgatgcgctgcgcgctggcgctctcg gcgctgctgctactgttgtcaacgccgccgctgctgccgtcgtcgccgtcgccgtcgccg tcgccctcccagaatgcttaccttccaatatatgggcagattatgcagcctacagcaaag aaagcattcaccttaagggttggacaattcatcagaccccctttcagtgctttttggttt gataaccatctggcagaaccacaatgccttattccccttgggaataagccacctcacact tccgaggaccagggtgtagggtggcacctcgtccaccaattgtcatggccagaaggctat gcctcagtttcctcatctgtgaatgaggagctggcccagtggctctcaggggccttcctg tttagtatttcccatggtctaaagctgggtgatttcggccccaggactgctggcatcatg ggactgatggcaacaccttcccacccccaagtcacccccaaccccacttccactccactc cctactgtccaaaagatacaggctccactgtcaaaaaccaccaaagaaattcggatcgtg tgctctgtgggagggccacagtgggccctgtgcagccttgccacagatgtgtttgaagtg cgtacacatcagtttcccaaagacagggactgcgatggtcacggaagccctattcacaat ggccctaaactggaagccgcccaaatgcccatcagcagcagaatggacaaatacatagtg gcatgtgcacacaatagagtcctccactgtaatgggggcgaacagtctgcgaccgcacac cacgagaaggatgagttacacaaagcaaagctgagtgagggaagccagacccaagagtca cgacgccctgacaccagctgctgctggcccgagggtgtaagcaccatcctggaacacact tatcaggtgtttgttgggactgggttctgtgcagatcagccatgcctcactgagaaggaa gagccacatgggtctgggcagacctgcacccactctcaccttggcaaggcctgtgcaggt gtgagtggctcgagagggagtggaggaggagtggaggaggaggtgggctgcgcggctcta acggcaacccagactactacggactcatctaacaaaacagcaccgactccagcatccagt gtcaccatcatggctacagatacagcccagcagagcacagtccccacttccaaggccaac gaaatcttggcctcggtcaaggcgaccacccttggtgtatccagtgactcaccggggact acaaccctggctcagcaagtctcaggcccagtcaacactaccgtggctagaggaggcggc tcaggcaaccctactaccaccatcgagagccccaagagcacaaaaagtgcagacaccact acagttgcaacctccacagccacagctaaacctaacaccacaagcagccagaatggagca gaagatacaacaaactctggggggaaaagcagccacagtgtgaccacagacctcacatcc actaaggcagaacatctgacgacccctcaccctacaagtccacttagcccccgacaaccc acttcgacgcatcctgtggccaccccaacaagctcgggacatgaccatcttatgaaaatt tcaagcagttcaagcactgtggctatccctggctacaccttcacaagcccggggatgacc accaccctaccgtcatcggttatctcgcaaagaactcaacagacctccagtcagatgcca gccagctctacggccccttcctcccaggagacagtgcagcccacgagcccggcaacggca ttgagaacacctaccctgccagagaccatgagctccagccccacagcagcatcaactacc caccgataccccaaaacaccttctcccactgtggctcatgagagtaactggttgggggga aatgtcaaagattccatgactattagcccaaaccttaatctcagtggcttcttccaggca aagtgtgaggatcttgagacacagacacagagtgagaagcagctcgtcctgaacctcaca ggaaacaccctctgtgcagggggcgcttcggatgagaaattgatctcactgatatgccga gcagtcaaagccaccttcaacccggcccaagataagtgcggcatacggctggcatctgtt ccaggaagtcagaccgtggtcgtcaaagaaatcactattcacactaagctccctgccaag gatgtgtacgagcggctgaaggacaaatgggatgaactaaaggaggcaggggtcagtgac atgaagctaggggaccaggggccaccggaggaggccgaggaccgcttcagcatgcccctc atcatcaccatcgtctgcatggcatcattcctgctcctcgtggcggccctctatggctgc tgccaccagcgcctctcccagaggaaggaccagcagcggctaacagaggagctgcagaca gtggagaatggttaccatgacaacccaacactggaagtgatggagacctcttctgagatg caggagaagaaggtggtcagcctcaacggggagctgggggacagctggatcgtccctctg gacaacctgaccaaggacgacctggatgaggaggaagacacacacctctag >gi568815591r:131404314_131656359|GENSCAN_predicted_peptide_4|139_aa MGPNKVAGALWMVHAIQRQRDPGFLDWVLGEELSSRHGPFCFSANHHMGPPGHWLQENLI GSGNHQLLDTGQPIERQHPGQVPTLGPISCGPTAVGPMVQVVFLKGAAGEPGLAVSTTRK KRPFQKNSNERAMSSTRDV >gi568815591r:131404314_131656359|GENSCAN_predicted_CDS_4|420_bp atgggacccaacaaggtagcaggtgctctgtggatggtccatgctattcaaagacaaaga gatccaggatttctcgactgggtactaggggaagaactgtcctcacgccatgggccattc tgcttctctgccaaccaccacatgggtccccctgggcattggctgcaggagaacctgatt ggctcaggtaatcaccagctcttggacactggccaaccaatagagcggcagcatccaggt caggtgcccacccttggtccaatcagctgtggcccaactgccgtagggcccatggtgcag gtggtgtttttaaagggggctgcaggtgagccaggtcttgcggtgtcaactacacgaaaa aagcgaccctttcagaagaattctaatgagagagccatgtcgtctacccgggacgtgtga >gi568815591r:131404314_131656359|GENSCAN_predicted_peptide_5|85_aa MLFYRCHASTRPHACPLATVTLNAASVSSSPRLPGFSGECDSEPGHLGSEDSAGPQANHR ELRSRTSPDVEKSLAASSDFASREQ >gi568815591r:131404314_131656359|GENSCAN_predicted_CDS_5|258_bp atgttgttctaccgatgccacgcatccaccagaccccacgcctgcccgctggccacagtc acgctcaatgccgcctctgttagcagctctcccagactccctggattctcgggtgaatgt gacagcgagccaggacacctgggttctgaggactctgcaggacctcaggcaaaccaccgt gaacttagaagccgcacttctcccgacgtggagaagagtttggctgcatcctctgacttc gcatcccgggagcagtga >gi568815591r:131404314_131656359|GENSCAN_predicted_peptide_6|221_aa SCSRKGATHLASGSFIDRLHQNLRASECEFHTDESEKTGVCIPETLTLKGPHVRRFGDNS LGSGLKHTQPGIPGRIPVPESSVASTQGAAKTASIPRRSHVHSNGNFPETVALLWLSTLE DTSPGTKRAMACHLQEEQLQDTVMVSPWRLPSWTPQDKPAEVEGLEGILQDDLAGCKTQQ QHWVEGEPVWYQKDGGIRTPGSCPFHSPHQLPMREPELSTH >gi568815591r:131404314_131656359|GENSCAN_predicted_CDS_6|666_bp tcatgttctagaaaaggggctacacacctggcttccgggagcttcatagaccgattgcat cagaatctcagagcatcagaatgtgagtttcacacagatgagtctgaaaagaccggtgtg tgtattcctgaaacactaactcttaaaggcccccatgtcagacgctttggggacaacagc cttggctcaggcctaaagcacacacagccagggatccccggcaggattccagttccagag tcctcggtagcatctacccagggagcagccaagacagccagcatccccaggcgctcacat gttcattccaatggcaactttccagagacagtggccttgctctggttgtccaccttggag gacacaagtcctggtaccaagagggccatggcctgccacctgcaggaagagcagctgcag gacacagtgatggtgtctccgtggcgcttgccaagctggactccacaggataaaccagca gaggtggaagggctggaaggaatcctacaagacgacttggcaggatgcaagacccaacaa caacactgggttgaaggagagcctgtatggtaccagaaagatgggggaatcagaactcct gggtcttgcccatttcactctccacatcagctgcccatgcgggaacctgagttgagtacc cactga