GENSCAN 1.0    Date run: 6-Nov-116    Time: 09:00:56

Sequence gi568815587f:66956903_67171599 : 214697 bp : 50.07% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +   1120   1473  354  0  0   74   48   183 0.870   6.96
 1.02 PlyA +   4294   4299    6                               1.05

 2.00 Prom +   6010   6049   40                              -5.16
 2.01 Init +   9107   9159   53  1  2  106   54    17 0.486   0.77
 2.02 Intr +   9294   9347   54  0  0   92   50    64 0.349   1.09
 2.03 Term +  12992  13199  208  2  1   32   46   182 0.577   5.21
 2.04 PlyA +  13236  13241    6                               1.05

 3.00 Prom +  14580  14619   40                              -6.26
 3.01 Init +  18461  18736  276  1  0   90   80   104 0.457   6.80
 3.02 Intr +  26111  26282  172  1  1  104   96    -2 0.321   1.72
 3.03 Intr +  26525  26606   82  0  1   94   93    42 0.860   4.00
 3.04 Intr +  27125  27188   64  2  1   67   94    48 0.725   2.02
 3.05 Intr +  27290  27398  109  1  1   72   85    -6 0.327  -2.64
 3.06 Intr +  32203  32334  132  2  0   54   40   134 0.080   5.82
 3.07 Intr +  42900  42974   75  1  0  130   66     6 0.061   2.09
 3.08 Intr +  53646  53749  104  1  2   79   53    77 0.141   3.19
 3.09 Intr +  55834  55891   58  0  1   63  116   -17 0.130  -2.94
 3.10 Intr +  58494  58553   60  1  0  109   46    36 0.115   0.21
 3.11 Intr +  64967  65011   45  0  0  148  116    27 0.960   9.88
 3.12 Intr +  68669  68737   69  0  0   88   94    45 0.851   4.25
 3.13 Intr +  73220  73276   57  0  0  125  101    70 0.892  10.86
 3.14 Intr +  77743  77936  194  2  2   52  100   159 0.146  12.71
 3.15 Intr +  82909  83301  393  0  0  120  114   659 0.995  66.55
 3.16 Intr +  86736  86951  216  2  0  118  115   352 0.997  39.60
 3.17 Intr +  87691  87811  121  0  1   89   90   218 0.999  22.17
 3.18 Intr +  88842  88975  134  1  2  107   94   300 0.999  32.86
 3.19 Term +  91682  91855  174  1  0  111   43   260 0.923  21.56
 3.20 PlyA +  93884  93889    6                               1.05

 4.00 Prom +  94562  94601   40                              -7.56
 4.01 Init +  95024  95141  118  1  1   77  111    21 0.389   3.12
 4.02 Intr +  99180  99311  132  1  0   88    2    96 0.449   1.62
 4.03 Intr +  99987 100132  146  1  2   62  105   259 0.670  25.00
 4.04 Intr + 108994 109081   88  0  1  120   94   141 0.994  17.44
 4.05 Intr + 109836 109945  110  1  2   61   77   236 0.913  19.80
 4.06 Intr + 113523 113657  135  2  0   99   53   239 0.997  22.26
 4.07 Term + 114533 114700  168  1  0  103   53   200 0.993  15.88
 4.08 PlyA + 116992 116997    6                               1.05

 5.04 PlyA - 119314 119309    6                               1.05
 5.03 Term - 120459 120287  173  1  2   83   31   116 0.424   3.39
 5.02 Intr - 138254 138036  219  0  0   94   75    36 0.047   1.17
 5.01 Init - 139998 139854  145  0  1   91  109    -4 0.447   2.28
 5.00 Prom - 154853 154814   40                              -4.26

 6.07 PlyA - 155794 155789    6                               1.05
 6.06 Term - 156329 156270   60  2  0   98   54    63 0.783   1.70
 6.05 Intr - 161560 160944  617  2  2   68   65   214 0.000   9.17
 6.04 Intr - 162626 162433  194  1  2  121   -7   204 0.000  13.34
 6.03 Intr - 177150 177120   31  1  1   53   82    69 0.007  -0.01
 6.02 Intr - 194377 194188  190  1  1   44   70    55 0.449  -1.54
 6.01 Init - 201709 201650   60  1  0   37   88    97 0.862   5.85
 6.00 Prom - 207164 207125   40                              -2.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term - 162883 162380  504  1  0   69   46   286 0.875  16.94

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:66956903_67171599|GENSCAN_predicted_peptide_1|117_aa
MRQPAGGRCGGALGGCSRGRVRGRVQPAEPPREGEGRRPLRRVAHRAQPVAVSSSRHSRS
PRRLCARDPYISVPRSGAGARPPAPTRRGKRTVRSAADTHLLAVASPRRRLYRCCLH

>gi568815587f:66956903_67171599|GENSCAN_predicted_CDS_1|354_bp
atgaggcagcccgcaggcggacgctgcggaggggcgctcgggggctgcagccgggggcgg
gtccgggggcgggtccagccggccgagcccccgcgcgagggggaggggcgccgtccgctg
cgtcgcgtcgcgcaccgcgcacagcccgtcgctgtctcctccagccgtcactcgcggagc
ccgcgccgcctttgcgcgcgcgacccttacataagcgtccctcgcagcggtgccggcgca
cgtccacccgctccgacccgccgcgggaagaggaccgtgcggagcgcggcggacactcac
ctcctcgccgtcgccagtcctcgccgccgcctctaccgctgctgcctccactga

>gi568815587f:66956903_67171599|GENSCAN_predicted_peptide_2|104_aa
MLSELAGMLSSLHPNGMRLNPAIVSFMKLSKATLMVQLPHSEMVFGEAQVAAINHQQTEA
LSPTIRKELNYANNHKNELGSRFTPEGPGEELTVASERSEPETC

>gi568815587f:66956903_67171599|GENSCAN_predicted_CDS_2|315_bp
atgctgtctgaacttgctggtatgctgtcctccctacaccccaatggtatgagacttaac
ccagccattgtctccttcatgaagctctccaaggccactctcatggtccagctgccacat
agtgagatggtctttggagaggcccaagtagcagcaatcaaccatcaacaaacagaagcc
ctcagtcccacaatccgaaaagaattgaattatgcaaacaaccacaagaatgagcttgga
agcagattcaccccagaaggacctggagaagagttgactgtagcctccgagagatccgag
ccagagacctgctaa

>gi568815587f:66956903_67171599|GENSCAN_predicted_peptide_3|844_aa
MGTGLRSQSLREPRPSYGKLQEPWGRPQEGQLRRALSLRQGQEKSRSQGLERGTEGPDAT
AQERVPGSLGDTEQLIQAQRRGSRWWLRRYQQGPRHLDIGCGGGSMVAPKRCVHVLIPEP
VNVTLFGKKVFAGVIKLRSWEEEIIVDYPELRENPFLLLSATGFGVICYSSHRKLTRLLQ
VLPILVLSAHTNFTVMGQGDDKFPCQLLGQGEVTGGGVDRTELGQSHGRAGFKPASPTST
TPCSKAPSPIDHPRAEECERTARDWQAAPPAAPVRDPLGLFFSLLDTVPSLLASIGVRHD
RTVGLFQLHLNWLPSTLLVTPATTSKCLLWLPYILLMPEVGTVINYILQLRKLRHREGCS
PMICAMDSCFQGPHLYPGPGNEMKTKAPPVGEAKDTVLLPGRPPGCQREPTEVVPVTVTA
ADIMAVDVAEYHLSVIKSPPGWEVGVYAAGALALLGIAAVSLWKLWTSGSFPSPSPFPNY
DYRYLQQKYGESCAEAREKRVPAWNAQRASTRGPPSRKGSLSIEDTFESISELGPLELMG
RELDLAPYGTLRKSQSADSLNSISSVSNTFGQDFTLGQVEVSMEYDTASHTLNVAVMQGK
DLLEREEASFESCFMRVSLLPDEQIVGISRIQRNAYSIFFDEKFSIPLDPTALEEKSLRF
SVFGIDEDERNVSTGVVELKLSVLDLPLQPFSGWLYLQDQNKAADAVGEILLSLSYLPTA
ERLTVVVVKAKNLIWTNDKTTADPFVKVYLLQDGRKMSKKKTAVKRDDPNPVFNEAMIFS
VPAIVLQDLSLRVTVAESSSDGRGDNVGHVIIGPSASGMGTTHWNQMLATLRRPVSMWHA
VRRN

>gi568815587f:66956903_67171599|GENSCAN_predicted_CDS_3|2535_bp
atgggaacagggctgcgaagtcagtccttgcgagagccccgaccctcctatggaaagctg
caggagccctgggggaggccccaggagggccaactccgcagggcgctaagcctcagacag
gggcaagagaagtccaggtcccagggcctcgagagaggcacagaagggccagatgccact
gcccaggagcgggtgccggggagcctgggggacacagagcagctgatccaagcccagcga
agaggcagccggtggtggctgaggcggtaccaacagggccccaggcacttagacatcggg
tgtggagggggttcaatggtggccccaaagagatgtgtccatgttctaatccctgaacct
gtgaatgtgaccttatttggaaaaaaggtctttgcaggtgtgattaagttaagatcttgg
gaggaagagatcattgtggattacccagaactgcgagagaacccatttctgttgctttca
gccactggatttggtgtcatttgttacagcagccacaggaaactcacaagacttcttcag
gtcttgcccatcctggttttgtctgcacatacgaacttcacggtcatggggcagggagat
gataaatttccttgccagctcttagggcagggtgaggtgactggaggaggcgtggaccgc
actgagctgggccagagccatggcagggctgggttcaaacctgcctccccgacgagcacc
accccctgctccaaggcgcccagtcccatcgaccacccaagggctgaggaatgcgagcgc
acggcgcgggactggcaggcagctccacctgcagccccggtgcgggatccactaggtctc
ttcttcagcctcttggacacagtgccatcgctattggcatcaataggggtcaggcacgac
aggacagtaggcctgttccagctccatcttaactggctgcccagtaccctgctggtgaca
cctgccaccacctccaaatgccttctgtggctgccctacatcctgcttatgccagaggtt
ggcacggttattaactacattttgcagttgaggaaactgaggcacagagaaggctgctct
ccaatgatctgtgctatggacagctgcttccaggggccacatctttaccctggtccagga
aatgaaatgaagacaaaagcgccccctgtgggtgaagccaaagacacagtccttctgcct
ggcagacctccaggctgccagcgggagcccactgaggtggtcccagtcacagtcactgca
gcagacatcatggctgtggatgtggcagaataccatctgagcgtcatcaagagcccccct
ggctgggaggtgggtgtctatgctgcaggggccctggccctgctgggaatcgcagctgtg
agcctgtggaagctctggacgtcggggagcttccccagcccctctccgttccccaattac
gactacaggtaccttcagcagaagtacggcgagagctgcgcagaggccagggagaagaga
gtgcctgcctggaatgcccagcgggccagcacgcggggaccacccagccgcaaaggcagt
ctcagcattgaggacacctttgagagcatcagtgaactggggcctctggagctgatgggc
cgggagttggacctggccccctatgggaccctccggaagtcccagtcggccgactccctg
aactccatctcctccgtgagcaacacctttgggcaggacttcacactgggccaggtggag
gtgagcatggagtacgacactgcctcccacacgctgaacgtggcggtgatgcagggcaag
gacctcctggagcgggaggaggccagcttcgagtcctgcttcatgcgcgtcagcctgctg
ccggacgagcagatcgtgggcatttctcggatccagagaaatgcctactccatcttcttt
gatgagaagttctccatccccctggatcccacagccctggaggagaagagcctgcggttt
tctgtatttggcatcgatgaggatgagcgcaacgtcagcacgggggtggtggagctgaag
ctttctgtgcttgacctcccgctgcagcccttcagtggctggctctatttacaggaccag
aacaaggccgccgatgctgtgggggagatcctgctctccctcagctacctccccacagcc
gagcgcctcaccgtggtcgtggttaaggccaagaacctcatctggaccaacgacaagacc
acagcggaccccttcgtcaaggtgtacctgctgcaggatgggaggaagatgagcaaaaag
aagacagccgtgaagagggatgaccccaacccggtgttcaacgaagccatgatcttctcg
gtgccagccattgtgctccaggacctgtctctccgcgtgacggtggctgagagcagcagc
gacggccgtggggacaacgtgggccatgtcatcattgggccgtcagccagtggcatggga
accacacattggaaccagatgttggccacgctgcgcaggcccgtgtccatgtggcacgct
gtccggcgaaactag

>gi568815587f:66956903_67171599|GENSCAN_predicted_peptide_4|298_aa
MDKDELMENSQQLLGGLSTTGTDSEELLRQPMRRAVAGADSERFPDSLNVGEELFSPTPV
PLPVPTPVVVLITLMSCVSTILSPPGPGMTAAQAAGEEAPPGVRSVKVVLVGDGGCGKTS
LLMVFADGAFPESYTPTVFERYMVNLQVKGKPVHLHIWDTAGQDDYDRLRPLFYPDASVL
LLCFDVTSPNSFDNIFNRWYPEVNHFCKKVPIIVVGCKTDLRKDKSLVNKLRRNGLEPVT
YHRGQEMARSVGAVAYLECSARLHDNVHAVFQEAAEVALSSRGRNFWRRITQGFCVVT

>gi568815587f:66956903_67171599|GENSCAN_predicted_CDS_4|897_bp
atggataaggatgagctgatggagaattcccagcagctcctgggaggcctgtccaccaca
gggactgactccgaagagcttctgaggcaaccaatgagacgggctgtggcaggagccgac
tctgaaaggtttcctgactccctcaacgtgggcgaggagctcttttcccctactcccgtc
ccgctgcctgtccctactcccgttgtagttctgatcacactgatgtcctgcgtgtctacc
attctaagcccgcccggccccgggatgacggcggcccaggccgcgggtgaggaggcgcca
ccaggcgtgcggtccgtcaaggtggtcctggtgggcgacggcggctgcgggaagacgtcg
ctgctgatggtcttcgccgatggggccttccccgagagctacacccccacggtgtttgag
cggtacatggtcaacctgcaagtgaaaggcaaacctgtgcacctccacatctgggacaca
gcagggcaagatgactatgaccgcctgcggcccctgttctaccctgacgccagcgtcctg
ctgctttgcttcgatgtcaccagcccgaacagctttgacaacatctttaaccggtggtac
ccagaagtgaatcatttctgcaagaaggtacccatcatcgtcgtgggctgcaagactgac
ctgcgcaaggacaaatcactggtgaacaagctccgaagaaacggattggagcctgtgacc
taccacaggggccaggagatggcgaggtccgtgggcgcggtggcctacctcgagtgctcg
gctcggctccatgacaacgtccacgccgtcttccaggaggccgccgaggtggccctcagc
agccgcggtcgcaacttctggcggcggattacccagggcttttgcgtggtgacctga

>gi568815587f:66956903_67171599|GENSCAN_predicted_peptide_5|178_aa
MGIEFELADVGRKARWKQKPPVPHPTARASLLGGSLLGSKAELGGQEAGAWASSMLAWQS
YPTPPPPPGGLLIPGIISSADAAGFKVGRGASELGEAWELLCLQPRDPGRVGKGNLSEQA
PEGAAEGFCAGTHLCFGGWLCQGEEDGGGREPAGWETYVMVQARMDEGLNGARIHGIH

>gi568815587f:66956903_67171599|GENSCAN_predicted_CDS_5|537_bp
atgggcattgagtttgaacttgcagatgtgggaaggaaagcccgctggaaacaaaaacct
cctgtgccacatcccactgcaagagccagcctgctgggaggcagcctgctgggctctaag
gcagagttgggagggcaggaggcaggtgcctgggcctctagcatgctggcctggcaatcc
taccccaccccacccccgccccctggcgggcttttaatacctggcataatctcttctgct
gatgcagctggttttaaggtgggaagaggggccagtgaacttggagaagcttgggagctg
ctctgcctgcagcccagagacccaggacgggtgggcaaaggaaacttgagcgaacaggca
ccagagggggctgctgaaggcttctgtgcagggacgcacttatgctttggaggatggctc
tgccagggtgaggaggatggaggaggccgagagcctgcaggctgggagacttatgtgatg
gtccaagcccgaatggatgagggtctgaatggagcacggattcatggcattcattaa

>gi568815587f:66956903_67171599|GENSCAN_predicted_peptide_6|383_aa
MRYHYTTVKIENTTPNAGKEQPVASGYHSVQHNSRLLEAAAEPKLKKSATKRVIQYMYVY
LLKQRMLKQFCIFSIIGQTYSYQESTLYNFNKDRSGSALRDEPPGYSPVAEEGFRGMGPL
ALPLLPERPLPRKDRAATGPAAGKVLREPPPPPPLPGRGPGAAATGPVCPGPGGSSVQRP
GPASPTARRSEGAGNRSSMSVLGSAPRGCCRRLTRASTGTRRRSGLRDGPGAVAAGAPTA
GRRRPARWPRKPPLLGHAQLSAAPVPPAARVGRDPAPGESGRPPRRGARAAPGPRPDPAA
LGRGGRGRPGREDVSCWCAQLRAGVSDGAAAARAAPALPTDRLPRPDSCPTGQQAPNVDG
ARPEGIRYLSSVSAKAGVPSATL

>gi568815587f:66956903_67171599|GENSCAN_predicted_CDS_6|1152_bp
atgagataccactacacaacggtcaaaattgaaaatacaacaccaaatgctggcaaggag
caacctgtggctagtggctaccatagcgtacaacacaactctaggctgttggaagcagca
gcagaaccaaagctgaaaaaatcagccaccaagagagtgattcaatatatgtatgtgtat
ctcctaaagcaacgtatgctgaaacagttctgtattttcagcataattggtcaaacatac
agttaccaagaaagcacactgtacaacttcaacaaggacaggtctggctccgcgctgcgg
gacgagccccccggctactccccagtcgcagaagagggattccgggggatgggccccctg
gcgttgcccctcctcccggagcgtcccctgcccaggaaagacagagcggccacgggccca
gccgctgggaaggtgctccgagagccgccgccgccgccgccgctgccggggaggggcccc
ggcgccgccgccacggggcccgtctgccctgggcccgggggctcctcggtgcagcggcct
ggccccgcctcacccactgctcgtcgctccgaaggggcaggaaacaggagtagcatgtct
gtcctgggctcggctccccgggggtgctgcaggcgcttaacaagggcgtccacaggtacc
aggcgcagaagcggtctccgcgacggcccgggggcggtggcggccggcgcgcccaccgcc
ggccggaggcgcccggcgaggtggcccaggaagcccccgctcctcggccacgctcagctt
tccgccgcgccggtgcccccggcggcccgggtgggacgtgaccccgcccccggcgagtcg
gggcgccccccacggaggggtgcgcgggccgcccccggcccccggcccgaccctgccgcc
ctgggccgtggcgggcgcgggaggcccggccgggaagatgtctcctgttggtgcgcgcag
cttcgcgcgggggtcagcgacggggcagcagccgcccgagctgcgccggccctgcccacg
gaccgactcccgcggcccgacagctgcccgacgggccagcaggctccgaacgtcgacgga
gcccggccggagggcattcgctacctttcatcagtttctgctaaagctggtgttccttca
gccactttgtga