GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:18:48 Sequence gi568815579f:34072643_34321738 : 249096 bp : 44.04% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 3193 3188 6 1.05 1.01 Sngl - 20701 19880 822 1 0 53 44 586 0.653 46.45 1.00 Prom - 21147 21108 40 -6.16 2.00 Prom + 24452 24491 40 -4.36 2.01 Init + 30257 30316 60 1 0 70 64 53 0.212 2.35 2.02 Intr + 38413 38482 70 0 1 123 78 21 0.418 3.35 2.03 Intr + 54457 54605 149 2 2 58 67 153 0.047 10.15 2.04 Intr + 70544 70606 63 1 0 64 86 41 0.015 0.41 2.05 Intr + 92786 92902 117 1 0 66 31 118 0.171 4.56 2.06 Term + 95480 95569 90 1 0 68 35 51 0.078 -4.48 2.07 PlyA + 95589 95594 6 1.05 3.00 Prom + 96529 96568 40 -7.26 3.01 Init + 99784 99859 76 0 1 90 36 190 0.915 13.25 3.02 Intr + 99978 100121 144 1 0 34 107 398 0.972 36.55 3.03 Intr + 124002 124121 120 1 0 75 76 97 0.476 7.67 3.04 Intr + 136376 136409 34 0 1 39 121 13 0.067 -3.02 3.05 Intr + 142482 142658 177 0 0 60 60 120 0.088 5.43 3.06 Intr + 146804 146931 128 2 2 -26 121 83 0.080 0.52 3.07 Intr + 147064 147235 172 2 1 85 60 186 0.633 14.50 3.08 Intr + 148865 149096 232 2 1 50 110 48 0.198 0.98 3.09 Term + 153766 153789 24 0 0 113 42 26 0.221 -1.28 3.10 PlyA + 154071 154076 6 1.05 4.04 PlyA - 154150 154145 6 1.05 4.03 Term - 164933 164863 71 0 2 95 48 75 0.121 2.30 4.02 Intr - 182382 182228 155 2 2 46 36 96 0.211 -0.08 4.01 Init - 182530 182466 65 1 2 56 91 147 0.460 10.43 4.00 Prom - 219684 219645 40 -2.86 5.02 PlyA - 219723 219718 6 1.05 5.01 Sngl - 223972 223634 339 1 0 79 37 403 0.800 30.33 5.00 Prom - 224799 224760 40 -9.46 6.00 Prom + 227193 227232 40 -8.16 6.01 Init + 227862 228374 513 2 0 62 115 526 0.577 47.60 6.02 Intr + 247265 247474 210 1 0 107 76 87 0.010 8.51 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579f:34072643_34321738|GENSCAN_predicted_peptide_1|273_aa MGGPLSVPITAVASGPKQHLKRVAAPQRWMLDKLTGVFAARPSTGPQKLRECLPHIFLRN RLKYALTGDEVKKICMQWFIKVDGKVRTDITYPTGFMDVISIDKTGEGFRLIYGINCRFA VHRVTPEEVKYKLCKMRKIFVCTKGTPHLVIHDAHTIRHPDSLIKVNDTIQIDLETGNIT ALKFDTGNLCLVTGGANWGRIGVITNRERHPGSFDTIHVKDANSNSFSIIFVIDKGNKPC ISLPRGKGIRFTIAEGRDKRLQPNRAVDEMVCG >gi568815579f:34072643_34321738|GENSCAN_predicted_CDS_1|822_bp atgggaggtcctctttctgtgcctattacagctgtggcttccggtcccaagcagcatctg aagcgggtagcagctccacaacgttggatgctggataaattgactggtgtgtttgctgct cgtccatccactggtccccaaaagctgagagagtgtctccctcatattttcctgaggaac agacttaaatatgccctgacaggagatgaagtgaagaagatttgcatgcagtggttcatt aaggtagatggcaaggttcgaactgatataacctaccctactggatttatggatgtcatc agcattgacaagacgggagagggtttccgtctgatctatggcatcaattgtcgctttgct gtacatcgtgttacacctgaggaagtcaagtacaagttatgcaaaatgagaaaaatcttt gtgtgcacaaaaggaacccctcacctggtgattcacgatgctcacactatccgccaccct gattccctcattaaagtgaacgacaccattcagattgatttggagactggcaacatcact gctttgaagttcgacactggtaacctgtgtttggtgactggaggtgctaactggggaaga attggtgtgatcaccaatagagagaggcaccctggatcttttgacacgattcacgtgaaa gatgccaacagcaacagcttttccatcatttttgttattgacaagggcaacaaaccatgt atttctcttccccgaggaaagggtatccgcttcaccattgctgaagggagagacaagaga ctgcagccaaacagagcagtggatgaaatggtctgtgggtga >gi568815579f:34072643_34321738|GENSCAN_predicted_peptide_2|182_aa MSDQDSSNDKVIKNRDSITQISKAHSKGWLGPQSTLSLAENNAVGSPAAAPQQPGLRAQM ATTAADVAVGSAVGHTLGHAVTGGFSGGSNAEPAPKLIAQDVTVLNTVGNCYTVKELIVL FQEVKLANRIYFLFQSKATLLPDNPSQRQNTTGGHTNYEIDAIAWCPNRHFQLNKCKIIC IY >gi568815579f:34072643_34321738|GENSCAN_predicted_CDS_2|549_bp atgtctgaccaggactcctcaaatgacaaagtcatcaaaaacagggacagtattactcag atttctaaggcccactccaaaggctggcttggtcctcagtccacactctctcttgcagaa aacaatgcagttggctctcctgctgctgctccccagcagccaggtctgagggcccagatg gcaaccactgcagctgacgtggctgtgggctctgctgtggggcacacactgggccacgcc gtcaccgggggcttcagtggaggaagtaatgctgagcctgcccctaaattaattgcacag gatgttactgtactgaacactgtgggcaactgttacacagtaaaagaactaatcgtgctg ttccaagaagtgaagttagccaacaggatctacttcctctttcaatccaaggccacgctc ctcccagacaaccccagccaacgacagaacacaacagggggtcataccaattatgaaata gacgcaattgcttggtgtcccaacaggcacttccagctcaacaagtgcaaaattatttgc atctattag >gi568815579f:34072643_34321738|GENSCAN_predicted_peptide_3|368_aa MPRAAAAAAAAAAAAPRLEGGRLGEAGSGGGGAMSGGTPYIGSKISLISKAEIRYEGILY TIDTENSTVALAKGSSTSSFQSMGSYGPFGRMPTYSQFSPSSLVGQQFGAVGVGYKISKN TVISRSLKPSVRPFEKKPNHGTSSADRLSPLTCSSSCWEKESCINQAFAICQPKGRRESG AQARGRFGIRRDGPMKFEKDFDFESANAQFNKEEIDREFHNKLKLKEDKLEKQEKPVNGE DKGDSGVDTQNSEGNADEEDPLGPNCYYDKTKSFFDNISCDDNRERRPTWAEERRLNAET FGIPLRPNRGRGGYRGRGGLGFRGGRGRGGGRGGTFTAPRGFRGGFRGGRGGREFADFEY RKTTAFGP >gi568815579f:34072643_34321738|GENSCAN_predicted_CDS_3|1107_bp atgccccgcgccgccgccgctgccgccgccgccgccgccgccgcgccgcggcttgagggc gggaggctgggggaggctgggagcggcggcggcggcgccatgagcgggggcaccccttac atcggcagcaagatcagcctcatctccaaggcggagatccgctacgagggcatcctctac accatcgacaccgaaaactccaccgtagcccttgccaaaggctcatcgacttcttcattc cagtccatgggttcttatggacctttcggcaggatgcccacatacagtcagttcagtccg agttccttagttgggcagcagtttggtgctgttggtgttggatacaagatctctaaaaac acagttatctcaaggtcgctcaagccctcagttagaccctttgagaaaaagcccaaccat ggaacaagcagtgcagaccgcctcagcccacttacctgctccagcagctgttgggagaag gagtcctgtatcaaccaggcctttgccatctgccagccaaaaggcaggagagaatcagga gcacaggcgaggggaagatttggtattcggcgagatgggccaatgaaatttgagaaagac tttgactttgaaagtgcaaatgcacaattcaacaaggaagagattgacagagagtttcat aataaacttaaattaaaagaagataaacttgagaaacaggagaagcctgtaaatggtgaa gataaaggagactcaggagttgatacccaaaacagtgaaggaaatgccgatgaagaagat ccacttggacctaattgctattatgacaaaactaaatccttctttgataatatttcttgt gatgacaatagagaacggagaccaacctgggctgaagaaagaagattaaatgctgaaaca tttggaatcccacttcgtccaaaccgtggccgtgggggatacagaggcagaggaggtctt ggtttccgtggtggcagagggcgtggtggtggcagaggtggtaccttcactgcccctcga ggatttcgcggtggattcagaggaggtcgtgggggccgggagtttgcggattttgaatat aggaaaaccacagcttttggaccctaa >gi568815579f:34072643_34321738|GENSCAN_predicted_peptide_4|96_aa MRQLAPQGRKSAFLGSLAPPGRPPSLRGLPGAGPGASATSRAPANRGSVPFPRLTGADPP LRCRGRRVSDAPPGLNPLGEDIDNVDEWMLLGLKFL >gi568815579f:34072643_34321738|GENSCAN_predicted_CDS_4|291_bp atgcgccagctggctccgcagggccggaaaagcgcgttcctcggcagcctggcgccgccg ggaaggccgccctccctccgcgggcttccgggagcgggtcctggcgccagcgcgacctcc cgggccccggcgaaccgagggagcgtcccctttccccgcctgacgggcgccgacccgccc ctgcggtgccgcggccgccgagtctccgacgcgcccccaggactcaaccccctgggtgaa gacattgacaatgtggatgaatggatgcttttagggttgaagtttctctag >gi568815579f:34072643_34321738|GENSCAN_predicted_peptide_5|112_aa MHFAKKHKKGLKKMQANNAKAMSARAEAIKALVKPKEVKPKIPKGVSHKLDRLAYIAHPK LGKCARARTAKGLRLCRLKAKAKDQISAQTAAPASVPAQAPKGAQAPTKASE >gi568815579f:34072643_34321738|GENSCAN_predicted_CDS_5|339_bp atgcactttgccaagaagcacaagaagggcctcaagaagatgcaggccaacaatgccaag gccatgagtgcacgtgccgaggctatcaaggccctcgtaaagcccaaggaggttaagccc aagatcccaaagggtgtcagccacaagctcgatcgacttgcctacattgcccaccccaag cttgggaagtgtgctcgtgcccgcactgccaagggactcaggctgtgccggctaaaggcc aaggccaaggatcaaatcagcgcccagactgcagctccagcttcagttccagctcaagct cccaaaggtgcccaggcccctacaaaggcttcagagtag >gi568815579f:34072643_34321738|GENSCAN_predicted_peptide_6|241_aa MDYKRRFLLGGSKQKVQQHQQYPMPELGRALSAPLASTATTAPLGSLTAAGSCHHAMPHT TPIADIQQGISKYLDALNVFCRASTFLTDLFSTVFRNSHYSKAATQLKDVQEHVMEAASR LTSAIKPEIAKMLMELSAGAANFTDQKEFSLQDIEVEYLLCHTCVGKYTTKVLGRCFLTV VQVHFQFLTHALQKVQPVAHSCFAEVIVPEKKNSGSGGGLSGMGHTPEVEEAVRSWRGAA E >gi568815579f:34072643_34321738|GENSCAN_predicted_CDS_6|723_bp atggactacaagcggcgcttcctgcttggcgggtccaagcagaaggtgcagcagcaccag caatacccgatgcctgagctgggccgagcactgagtgctcccctggcatccacggccacc actgcccccctgggcagtctgaccgctgcaggcagctgccaccatgccatgccccacact actcctatcgccgacatccagcagggcatctccaagtatctggatgccctgaacgtcttc tgccgtgccagtactttcctcacagatctcttcagcactgtgttcaggaactctcactac tcaaaggcagccacacagctcaaagatgtgcaggagcatgtcatggaagcagccagtcgg ctgacctcggccataaagcctgagatcgccaagatgctaatggaacttagtgctggggct gcaaattttacggatcagaaggaattcagtctccaggacattgaggtagagtatcttttg tgtcatacttgtgtaggaaaatatactacaaaggtgttggggcgatgtttcctgactgtg gtgcaagtccatttccagtttttgactcatgcgttacagaaggtccagccggtggctcac tcttgctttgctgaggtcatcgtgccagaaaaaaagaacagcggcagtggcggcggctta tctggcatgggccacacacctgaagtagaggaagctgtgcggtcctggcggggggctgct gag