GENSCAN 1.0 Date run: 2-Nov-116 Time: 18:20:27 Sequence gi568815592r:136161328_136379866 : 218539 bp : 39.94% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 12214 12252 39 0 0 55 84 58 0.574 2.34 1.02 Intr + 12470 12561 92 1 2 140 105 86 0.992 13.37 1.03 Intr + 17670 17814 145 0 1 61 97 105 0.997 8.06 1.04 Intr + 19900 19996 97 0 1 94 96 26 0.939 2.66 1.05 Intr + 25709 25789 81 0 0 120 97 30 0.631 5.79 1.06 Term + 30287 30513 227 0 2 74 50 269 0.417 17.66 1.07 PlyA + 31023 31028 6 1.05 2.00 Prom + 35053 35092 40 -4.05 2.01 Init + 45445 45546 102 0 0 68 94 133 0.758 12.19 2.02 Term + 50810 50968 159 1 0 34 42 188 0.953 5.76 2.03 PlyA + 51460 51465 6 1.05 3.00 Prom + 53316 53355 40 -8.25 3.01 Init + 53476 53617 142 0 1 67 27 206 0.874 12.74 3.02 Intr + 57603 57740 138 1 0 87 115 59 0.509 8.01 3.03 Term + 75198 75379 182 1 2 38 52 142 0.232 2.39 3.04 PlyA + 76501 76506 6 1.05 4.13 PlyA - 77120 77115 6 -0.45 4.12 Term - 78493 78120 374 2 2 75 48 297 0.831 18.27 4.11 Intr - 80349 80117 233 2 2 105 94 222 0.997 20.99 4.10 Intr - 81646 81534 113 1 2 96 16 23 0.206 -5.84 4.09 Intr - 83542 83438 105 1 0 51 107 78 0.069 5.49 4.08 Intr - 100150 99938 213 1 0 56 70 248 0.786 17.79 4.07 Intr - 107012 106835 178 1 1 90 78 167 0.930 14.90 4.06 Intr - 108285 108110 176 0 2 98 91 203 0.999 19.42 4.05 Intr - 110752 110668 85 0 1 44 95 73 0.996 2.50 4.04 Intr - 111860 111755 106 0 1 37 91 104 0.896 3.95 4.03 Intr - 114374 114205 170 1 2 45 91 55 0.829 0.17 4.02 Intr - 115181 114516 666 1 0 97 91 593 0.930 50.42 4.01 Init - 117355 116538 818 1 2 40 93 572 0.330 47.13 4.00 Prom - 120238 120199 40 -8.05 5.00 Prom + 120527 120566 40 -4.35 5.01 Init + 123530 123540 11 1 2 77 64 -2 0.052 -3.59 5.02 Intr + 127919 128057 139 2 1 64 80 125 0.168 8.85 5.03 Term + 144188 144343 156 1 0 90 48 198 0.979 12.95 5.04 PlyA + 144948 144953 6 1.05 6.00 Prom + 148263 148302 40 -6.25 6.01 Init + 150714 150844 131 2 2 60 -9 143 0.914 1.47 6.02 Term + 156741 156909 169 0 1 62 42 251 0.952 14.27 6.03 PlyA + 157461 157466 6 1.05 7.00 Prom + 161359 161398 40 -7.05 7.01 Init + 164704 164839 136 0 1 51 96 86 0.355 6.05 7.02 Intr + 169415 169480 66 0 0 73 95 43 0.695 1.46 7.03 Term + 173720 173946 227 0 2 118 44 165 0.951 11.06 7.04 PlyA + 175243 175248 6 1.05 8.08 PlyA - 176157 176152 6 1.05 8.07 Term - 184319 184213 107 0 2 54 43 83 0.570 -2.01 8.06 Intr - 184752 184529 224 2 2 48 115 268 0.882 22.35 8.05 Intr - 195467 195365 103 0 1 49 105 61 0.337 2.21 8.04 Intr - 198550 198493 58 1 1 107 68 73 0.948 4.74 8.03 Intr - 199471 199370 102 1 0 51 65 220 0.611 15.45 8.02 Intr - 199852 199678 175 0 1 83 80 164 0.757 14.12 8.01 Init - 201364 201123 242 1 2 104 94 289 0.943 26.49 8.00 Prom - 201628 201589 40 -8.25 9.07 PlyA - 201733 201728 6 1.05 9.06 Term - 203056 202758 299 2 2 114 33 226 0.548 14.14 9.05 Intr - 204691 204408 284 0 2 69 99 285 0.360 23.84 9.04 Intr - 205112 205000 113 2 2 35 111 44 0.114 -0.34 9.03 Intr - 211298 211174 125 0 2 61 101 99 0.014 7.88 9.02 Intr - 216541 216428 114 2 0 26 98 95 0.372 3.80 9.01 Intr - 216651 216613 39 1 0 110 68 40 0.293 1.48 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 49988 50095 108 1 0 70 51 98 0.832 4.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:136161328_136379866|GENSCAN_predicted_peptide_1|226_aa MLEIDPHSTLQQRNMSVLENHHWRSTIGMLRESRLLAHLPKEMTQDIEQQLGSLILATDI NRQNEFLTRLKAHLHNKDLRLEDAQDRHFMLQIALKCADICNPCRIWEMSKQWSERVCEE FYRQGELEQKFELEISPLCNQQKDSIPSIQIGFMSYIVEPLFREWAHFTGNSTLSENMLG HLAHNKAQWKSLLPRQHRSRGSSGSGPDHDHAGQGTESEEQEGDSP >gi568815592r:136161328_136379866|GENSCAN_predicted_CDS_1|681_bp atgctggaaatagatcctcattcaaccctgcagcagaggaatatgtctgtgctggagaat catcactggcgatctacaattggcatgcttcgagaatcaaggcttcttgctcatttgcca aaggaaatgacacaggatattgaacagcagctgggctccttgatcttggcaacagacatc aacaggcagaatgaatttttgaccagattgaaagctcacctccacaataaagacttaaga ctggaggatgcacaggacaggcactttatgcttcagatcgccttgaagtgtgctgacatt tgcaatccttgtagaatctgggagatgagcaagcagtggagtgaaagggtctgtgaagaa ttctacaggcaaggtgaacttgaacagaaatttgaactggaaatcagtcctctttgtaat caacagaaagattccatccctagtatacaaattggtttcatgagctacatcgtggagccg ctcttccgggaatgggcccatttcacgggtaacagcaccctgtcggagaacatgctgggc cacctcgcacacaacaaggcccagtggaagagcctgttgcccaggcagcacagaagcagg ggcagcagtggcagcgggcctgaccacgaccacgcaggccaagggactgagagcgaggag caggaaggcgacagcccctag >gi568815592r:136161328_136379866|GENSCAN_predicted_peptide_2|86_aa MVEEVSKAGEIMVEERAVVEEAMEIHMEEEEVVKYGDGVSELQYGDGVSELQYGDGVSEL QYGDGVSELQYGDGVSELQYGDGVGH >gi568815592r:136161328_136379866|GENSCAN_predicted_CDS_2|261_bp atggtagaggaggtttccaaggcaggggagattatggtggaagagagggctgtggtggaa gaggctatggagatccatatggaggaggaggaggtggtgaagtatggagatggagtctct gagttgcagtatggagatggagtctctgagttgcagtatggagatggagtctctgagttg cagtatggagatggagtctctgagttgcagtatggagatggagtctctgagttgcagtat ggagatggagttggtcattag >gi568815592r:136161328_136379866|GENSCAN_predicted_peptide_3|153_aa MNYHAAAIKKKVVRTHPRTEFAQTISGRTCGKLMTAFASEEETWRTGAWYLEDSKHFGIR PCFNARDTPVNKADTILYLHTAYTCTHKYVYGDVMVTVTELPLDTKPDNTSTRSSWDQAC TYSHEILLNKLPHALPVVTGFPSLSSSQGFHLK >gi568815592r:136161328_136379866|GENSCAN_predicted_CDS_3|462_bp atgaattaccatgcagcagcaataaaaaagaaagtggtgcgcacacatccacgcacagag tttgcgcagaccatctctggaaggacctgtgggaaactgatgaccgcatttgcctctgaa gaggagacctggagaactggggcatggtacctggaagacagtaagcattttggaatcagg ccttgtttcaatgccagagacacacctgtgaataaagcagacacaattctctatctacac acagcttacacctgcacacataagtatgtatatggtgatgtgatggtaacagtcacagag ttgccacttgatactaagcccgacaacacaagtacaagatctagttgggaccaagcatgc acctacagccatgagattcttctcaacaagctgcctcatgctcttcctgtggttactggt ttcccttctctttcctcttcacaaggatttcacttaaaatga >gi568815592r:136161328_136379866|GENSCAN_predicted_peptide_4|1078_aa MRRPYGYRGRGRGYYQGGGGRYHRGGYRPVWNRRHSRSPRRGRSRSRSPKRRSVSSQRSR SRSRRSYRSSRSPRSSSSRSSSPYSKSPVSKRRGSQEKQTKKAEGEPQEESPLKSKSQEE PKDTFEHDPSESIDEFNKSSATSGDIWPGLSAYDNSPRSPHSPSPIATPPSQSSSCSDAP MLSTVHSAKNTPSQHSHSIQHSPERSGSGSVGNGSSRYSPSQNSPIHHIPSRRSPAKTIA PQNAPRDESRGRSSFYPDGGDQETAKTGKFLKRFTDEESRVFLLDRGNTRDKEASKEKGS EKGRAEGEWEDQEALDYFSDKESGKQKFNDSEGDDTEETEDYRQFRKSVLADQGKSFATA SHRNTEEEGLKYKSKVSLKGNRESDGFREEKNYKLKETGYVVERPSTTKDKHKEEDKNSE RITVKKETQSPEQVKSEKLKDLFDYSPPLHKNLDAREKSTFREESPLRIKMIASDSHRPE VKLKMAPVPLDDSNRPASLTKDRLLASTLVHSVKKEQEFRSIFDHIKLPQASKSTSESFI QHIVSLVHHVKEQYFKSAAMTLNERFTSYQKATEEHSTRQKSPEIHRRIDISPSTLRKHT RLAGEERVFKEENQKGDKKLRCDSADLRHDIDRRRKERSKERGDSKGSRESSGSRKQEKT PKDYKEYKSYKDDSKHKREQDHSRSSSSSASPSSPSSREEKESKKEREEEFKTHHEMKEY SGFAGVSRPRGTFHDDRDDGVDYWAKRGRGRGTFQRGRGRFNFKKSGSSPKWTHDKYQGD GIVEDEEETMENNEEKKDRRKEEKVLLIWENKDYGSTRSIVRIIGKMLPLEPCRRPNFEL IPLLNSVDSDNCGSMVPSFADILYVANDEEASYLRFRNSIWKNEEEKVEIFHPLRLVRDP LSPAVRQKETVKNDLPVNEAAIRKIAALENELTFLRSQIAAIVEMQELKNSTNSSSFGLS DERISLGQLSSSRAAHLSVDPDQLPGSVLSPPPPPPLPPQFSSLQPPCFPPVQPGSNNIC DSDNPATEMSKQNPAANKTNYSHHSKSQRNKDIPNMLDVLKDMNKVKLRAIERYVVVI >gi568815592r:136161328_136379866|GENSCAN_predicted_CDS_4|3237_bp atgagacgaccttatgggtacagaggaaggggtagagggtattatcaaggaggaggaggt agatatcatcgaggtggttatagacctgtctggaatagaaggcactctaggagtcctaga cgaggtcgttcacgttccaggagtccaaaaagaagatccgtttcttctcaaagatccaga agcagatctcgccggtcatatagatcttctaggtctccaagatcatcctcttctcgttct tcatccccatatagcaaatctcctgtttctaaaagacgagggtctcaggaaaaacaaacc aaaaaagctgaaggggaaccccaagaagagagtccgttgaaaagtaaatcacaggaggaa ccgaaagatacatttgaacatgacccatctgagtctatcgatgaatttaataagtcatca gccacatccggtgatatttggcctggcctttcagcttatgataatagtcctagatcaccc catagtccttcacctattgctacaccacctagtcagagttcatcttgctctgatgctccc atgctcagtacagttcactctgcaaaaaatactccttctcagcattcacattccattcag catagtcctgaaaggtctgggtctggttctgttggaaatggatctagtcgatacagtcct tctcagaatagtccaattcatcacatcccttcacgaagaagtcctgcaaagacaatcgca ccacagaatgctccaagagatgagtctaggggccgttcctcgttttatcctgatggtgga gatcaggaaactgcaaagactgggaagttcttaaaaaggttcacagatgaagagtctaga gtattcctgcttgataggggtaataccagggataaagaggcttcaaaagagaaaggatca gagaaagggagggcagagggagaatgggaagatcaggaagctctagattacttcagtgat aaagagtctggaaaacaaaagtttaatgattcagaaggggatgacacagaggagacagag gattatagacagttcaggaagtcagtcctcgcagatcagggtaaaagttttgctactgca tctcaccggaatactgaggaggaaggactcaagtacaagtccaaagtttcactgaaaggc aatagagaaagtgatggatttagagaagaaaaaaattataaacttaaagagactggatat gtagtggaaaggcctagcactacaaaagataagcacaaagaagaagacaaaaattctgaa agaataacagtaaagaaagaaactcagtcacctgagcaggtaaagtctgaaaagctcaaa gacctctttgattacagtccccctctacacaagaatctggatgcacgagaaaagtctacc ttcagagaggaaagcccacttaggatcaaaatgatagcgagtgattctcaccgtcctgaa gtcaaactcaaaatggcacctgttcctcttgatgattctaacagacctgcttccttgact aaagacaggctgcttgctagtacacttgtccattctgtcaagaaggagcaagaattccga tccatctttgaccacattaagttgccacaggccagcaaaagcacttcagagtcatttatt caacacattgtgtccttggttcatcatgttaaagagcaatacttcaagtcagctgcaatg accctaaacgagcggttcacttcgtatcagaaagccactgaagaacatagtactcggcaa aagagccctgaaatacacaggagaattgacatctcaccaagtaccctgaggaagcatacc cgtttagcaggggaagagagagtttttaaagaagaaaatcaaaagggagataaaaaatta aggtgtgactctgctgaccttcggcatgacattgatcgccgtagaaaagaaagaagtaaa gaacggggagattccaagggctccagggaatccagtggatcaagaaagcaggaaaaaact ccaaaagattacaaggaatacaaatcttacaaagatgacagtaaacataaaagagagcaa gatcattctcgatcttcatcctcttcagcatcaccttcttctcccagttctcgagaagaa aaggagagtaagaaggaaagagaagaagaatttaaaactcaccatgaaatgaaagaatac tcaggctttgcaggagttagccgaccacgaggaacctttcatgacgacagagatgatggt gtggattattgggccaaaagaggaagaggtcgtggtacttttcaacgtggcagagggcgc tttaacttcaaaaaatcaggtagcagtcctaaatggactcatgacaaataccaaggggat gggattgttgaagatgaagaagagaccatggaaaataatgaagaaaagaaggacagacgc aaggaagaaaaggttttgctgatttgggaaaataaagactatggatcaactaggagtatt gttcgtattattgggaaaatgcttccactggaaccttgtcgaagacctaattttgagttg atcccgctcttgaactctgtagactctgataattgtggatctatggttccatcttttgct gatattttgtatgtggcaaatgatgaagaagccagttatctcagatttcgaaatagtata tggaaaaatgaagaagagaaagtggaaatttttcatcctttgcgactagttcgggatcca ctgtcacctgctgtaagacagaaagaaactgtgaaaaatgacctgcctgtaaatgaagct gcaattagaaaaatagctgcccttgaaaatgagctgacttttcttcgctctcagattgca gcaattgtggaaatgcaggaactgaaaaatagtacaaattctagttcctttggcttgagt gacgagcgcattagtttgggtcagctgtcatcatcgcgggctgcccatctgagtgtggac ccagatcagcttccaggttcagtgctttctcctcctcctcctccaccacttcctcctcag ttttcatctctccagccaccgtgttttcctcccgtacaaccaggatctaataatatttgt gactcagataatccagcaactgaaatgagcaaacagaacccggctgctaataagaccaat tatagtcatcattcaaaaagccagagaaataaagatattccaaacatgttggacgttcta aaggatatgaataaggttaagcttcgtgcaattgagcggtatgttgttgtaatttga >gi568815592r:136161328_136379866|GENSCAN_predicted_peptide_5|101_aa MQRRPRRKVAVPANERRTQVAPPAAVGAIGGGEKNERARVRGAFSRARAEERKGIPVHPS NANRGNGPRGNDAAATASPGDLLATEVLRLRHRTTESEILE >gi568815592r:136161328_136379866|GENSCAN_predicted_CDS_5|306_bp atgcagaggaggccacgaagaaaggtggcagtcccagccaatgagaggcggacacaagtt gcacctccagcggccgtcggcgccataggcgggggagagaaaaatgaacgagcgcgcgtg cgcggagcgttcagtcgggcgcgcgcagaggagaggaagggtattcccgttcatcccagt aatgctaatcgaggaaatggtcctcgaggaaatgacgctgcagcaacagcatcacctggg gacttgttagcaacagaagttctcagactccgccatagaactactgaatcagaaattctg gaataa >gi568815592r:136161328_136379866|GENSCAN_predicted_peptide_6|99_aa MTVCCWCEDRQINEQNKQPGDDTVTKTTKWGKERLFERMVLNNCKLTCAESFSDYSPPSC FAVHDVRHTVAVGIIEAVPKKPARAGKVTESARKAQQAK >gi568815592r:136161328_136379866|GENSCAN_predicted_CDS_6|300_bp atgacggtgtgctgttggtgtgaggacagacagatcaatgaacagaataaacagcctgga gatgatacagtgaccaagacaactaaatggggaaaagaacgtctttttgaacgaatggtg ctgaacaactgcaagctcacgtgtgctgagagcttctctgactattctcctccgagttgt tttgctgttcatgatgtgagacacacagttgctgtgggtatcatcgaagcagtgcccaag aagcctgccagagctggcaaagtcactgagtctgcccggaaagctcagcaggctaaatga >gi568815592r:136161328_136379866|GENSCAN_predicted_peptide_7|142_aa MSDWPISSAMRAKDSEDTHTRVQKPLPASKACLFCHDNPCHFHLEGLHELKAMGKQQKVL SRVHALDGVLSSGAPKKRATLPSHTLQGGQGDISCFIGRALLPLRLWVEPLLATSQLLVV AVDPWHSLACNCIPPASESVTQ >gi568815592r:136161328_136379866|GENSCAN_predicted_CDS_7|429_bp atgtcagactggccaatttcatcagccatgagagccaaagactcagaggatacacacact cgtgtacaaaagcctctgccagccagcaaagcctgtttgttttgtcatgacaacccttgt cacttccatctggagggcctccatgagctgaaagctatggggaaacaacaaaaggttttg agcagggtccatgcattagatggggttttgagcagcggggcaccgaagaagcgagccaca ctcccatcacacaccctgcaagggggacaaggggacatttcctgtttcattggtagggcc ctgctccctctgaggctctgggtagaacccttgcttgccacttctcagcttctggtggtg gccgtggatccctggcattccttggcttgcaactgcatccctccagcctctgaatctgtc acccaatga >gi568815592r:136161328_136379866|GENSCAN_predicted_peptide_8|336_aa MAPAPASAPAPASAPAPAPVPTPAMVSAPSSTVNASASVKTSAGTTDPEEATRLLAEKRR LAREQREKEERERREQEELERQKREELAQRVAEERTTRREEESRRLEAEQAREKEEQLQR QAEERALREREEAERAQRQKEEEARVREEAERVRQEREKHFQREEQERLERKKKTSDQRN GDIAKGALTGGTEVSALPCTTNAPGNGKPVGSPHVVTSHQSKVTVESTPDLEKQPNENGV SVQNENFEEIINLPIGSKPSRLDVTNSESPEIPLNPILAFDDEGTLGPLPQVDGVQTQQT AVSAEVQSAAPSGCLSASLHRAAPITRHHACSEYAL >gi568815592r:136161328_136379866|GENSCAN_predicted_CDS_8|1011_bp atggccccagctccagcctcggccccagctccagcctcggccccagctccagccccggtc cccaccccagccatggtctcagccccgtcatccactgtgaatgccagtgcttctgttaag acttctgcaggcaccaccgacccagaggaggccacaaggcttctagctgagaagaggcgg ctggcccgagagcagagagaaaaggaagaaagggagaggagggagcaggaagagcttgaa agacaaaagagagaggaattggctcaacgtgtggctgaagagaggacgactcgccgtgag gaggagtcgcgcaggctggaagccgagcaggcccgggagaaggaggagcagctgcagcgg caggcggaggagcgggcgctgcgcgagcgggaggaggcagagcgcgcccagaggcagaaa gaagaagaagctcgcgttcgtgaagaagcagagagggtccggcaggaacgagagaagcat ttccagagagaagagcaagagcgcctggagagaaagaagaaaaccagtgatcagagaaac ggtgatatagccaagggagctctcactggaggaacagaggtgtctgcacttccatgtaca acaaacgctccgggaaatggaaagccagttggcagcccacatgtggttacctcacaccag tcaaaagtgacagtggagagcactcccgatttggaaaaacaaccaaatgaaaatggtgta tctgttcagaatgaaaattttgaagaaattataaacttacccattggatctaaaccatcc agattagatgtcaccaacagtgagagcccagaaattcctttgaatccaattttggccttt gatgatgaagggacacttgggcccctgcctcaggtagatggtgttcagacacagcagact gcagtcagtgcagaagtccagtccgcagcacccagtgggtgtctgtcagcctctctgcac agagcagcacccattacccgacatcatgcatgctcagaatatgctttgtag >gi568815592r:136161328_136379866|GENSCAN_predicted_peptide_9|324_aa XPGPVKAQRMEAIARRLQLSPWESSVVNRLLTPTHSFLARSKSTAALSGEAASCSPIIMP YKAAHSRNSMDRPKLFVTPPEGSSRRRIIHGTASYKKERERENVLFLTSGTRRAVSPSNP KARQPARSRLWLPSKSLPHLPGTPRPTSSLPPGSVKAAPAQVRPPSPGNIRPVKREVKVE PEKKDPEKEPQKVANEPSLKGRAPLVKVEEATVEERTPAEPEVGPAGTALVEGAGPSAGA GPRAPPLQMRLPMSTLAKAFANKPVQKGSTFTPAALMRALILSSLTVTSPSCRMRAEPYA ECRRAQKQRPWCEQVWGWHCQARC >gi568815592r:136161328_136379866|GENSCAN_predicted_CDS_9|975_bp nagcctggtcctgtgaaggcccagcgtatggaagcaatagctcgccgcctgcagctcagc ccatgggagagcagcgttgttaacagactcctgacgcccacacattcgttcctggccaga agtaaaagcacagctgccttgtctggagaagcagcatcttgcagccccatcatcatgccc tacaaagctgcacactctagaaattcgatggatcgaccaaaactctttgtaacaccacct gagggctcttctcgcaggaggatcattcatggcacagcgagctataaaaaagaaagagag agagaaaatgtactcttcctcacatctggcacccgaagggctgtatctccatctaatccc aaagcaagacaaccagctcgctcccgactttggcttccgtccaagtctcttcctcatttg cctggcacacccagaccgacatcctccttgccacccggctcagtcaaagctgctcctgct caggtccggcccccatcccccggcaacatccgccctgtcaagagggaagtcaaagtggag cctgagaagaaagatcctgagaaggaacctcagaaagttgccaatgagccctcactaaag ggcagagcacctttagtgaaggtagaagaagccacagttgaagagcggacacctgctgaa ccagaagttggccctgctggaacagcattggtggagggggcaggaccttctgctggtgca ggtccacgagctcctccattgcagatgaggctccccatgtcaacattggccaaggctttt gcaaacaagccagtccaaaaaggttcaacatttacaccagctgctttaatgagggcattg atcttatcctccttgacggtcacctcaccctcatgcagaatgagggccgagccttatgca gaatgcaggcgagctcagaaacagaggccatggtgtgagcaagtgtggggctggcactgc caggcaaggtgctag