GENSCAN 1.0    Date run: 6-Nov-116    Time: 23:38:07

Sequence gi568815587f:114196557_114412474 : 215918 bp : 43.19% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   2531   2902  372  1  0   90   -3   263 0.035  14.30
 1.02 Intr +   8309   8348   40  1  1   83   91    19 0.037  -0.50
 1.03 Intr +  21566  21659   94  0  1   76   66    68 0.018   2.42
 1.04 Intr +  25850  25958  109  2  1   85   59    64 0.029   3.49
 1.05 Intr +  41641  41727   87  0  0   93   71    49 0.087   3.87
 1.06 Intr +  43524  43581   58  2  1   70   85    -4 0.038  -4.04
 1.07 Intr +  45611  45781  171  0  0  127   96   197 0.935  24.31
 1.08 Intr +  50642  50809  168  0  0  107   86   356 0.988  37.22
 1.09 Term +  53770  53999  230  2  2  133   49   412 0.985  38.59
 1.10 PlyA +  54078  54083    6                               1.05

 2.04 PlyA -  56076  56071    6                               1.05
 2.03 Term -  66330  66262   69  0  0  121   44    44 0.458   1.24
 2.02 Intr -  75152  74997  156  2  0   82   48    84 0.016   4.01
 2.01 Init -  84424  84173  252  1  0   64   51   157 0.076   4.94
 2.00 Prom -  92698  92659   40                              -4.66

 3.00 Prom +  96126  96165   40                              -6.66
 3.01 Init + 100001 100154  154  1  1   66   97    35 0.869   2.35
 3.02 Term + 101395 101606  212  2  2   67   44   283 0.487  19.26
 3.03 PlyA + 102515 102520    6                               1.05

 4.00 Prom + 106293 106332   40                              -6.06
 4.01 Sngl + 115445 115921  477  1  0   57   44   430 0.317  31.60
 4.02 PlyA + 116931 116936    6                               1.05

 5.08 PlyA - 117606 117601    6                               1.05
 5.07 Term - 120980 120876  105  2  0  105   42    72 0.641   2.71
 5.06 Intr - 123949 123876   74  2  2   57   92    24 0.122  -1.17
 5.05 Intr - 144583 144476  108  2  0   49   48   125 0.094   4.86
 5.04 Intr - 149196 149139   58  0  1  103  103    -4 0.093   1.06
 5.03 Intr - 169637 169444  194  0  2  120   66    49 0.066   5.11
 5.02 Intr - 175025 174948   78  0  0   45   72    90 0.127   2.62
 5.01 Init - 181010 180998   13  2  1   71  119     0 0.100   0.80
 5.00 Prom - 191635 191596   40                              -4.16

 6.00 Prom + 202094 202133   40                              -4.56
 6.01 Init + 204116 204211   96  1  0   83  115    96 0.978  12.33
 6.02 Intr + 205142 205304  163  1  1   88   93    50 0.905   5.05
 6.03 Term + 211079 211251  173  0  2   37   41    95 0.228  -2.31
 6.04 PlyA + 213831 213836    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:114196557_114412474|GENSCAN_predicted_peptide_1|442_aa
MDAGSWDRDFGHVCWDLGPGCQTRMLGPGTGMPDMDAGPRDGDDGHGCRWAQGRGCRTWM
PLGPGTGMPDMDAAGPRDGDAGHGCRWAPGRGCWTWMPLGPGTGMLDMDAGPRDGDASHG
CRWAACKWSLALSWIWAGGLDGQSSRLLLEISQRRTCCAARTFLEVQPRGSLREGESGSV
GVSKVILGRNLNGYDKDEGSFVRELWPISPLAASNRLLLLLVPMCAQSHWLQTQEQNPVL
AVFLLPCSAFLTAGTDMAVFCLLCGKRFQAQSALQQHMEVHAGVRSYICSECNRTFPSHT
ALKRHLRSHTGDHPYECEFCGSCFRDESTLKSHKRIHTGEKPYECNGCGKKFSLKHQLET
HYRVHTGEKPFECKLCHQRSRDYSAMIKHLRTHNGASPYQCTICTEYCPSLSSMQKHMKG
HKPEEIPPDWRIEKTYLYLCYV

>gi568815587f:114196557_114412474|GENSCAN_predicted_CDS_1|1329_bp
atggatgctgggtcctgggatagggattttggacatgtatgctgggacctgggaccggga
tgccagacacggatgctgggccccgggacggggatgccggacatggatgccgggccccgg
gacggggatgacggacatggatgccgctgggcccagggacggggatgccggacatggatg
ccgctgggcccagggacggggatgccggacatggatgccgctgggccccgggatggggat
gccggacatggatgccgctgggccccgggacggggctgctggacatggatgccgctgggc
cccgggacggggatgctggacatggatgctgggccccgggatggggatgccagccatgga
tgccgctgggcagcttgcaagtggtcactcgccctgagctggatttgggcagggggattg
gatgggcagtcctccaggctgctgttggaaatttctcagagaaggacctgctgtgctgca
cggaccttcctggaagtccaacccaggggctctttgagagagggggaatcaggaagtgta
ggagtgagcaaagtgattctgggcagaaacctgaatggatatgacaaggacgagggatcc
tttgtccgggagctgtggcccatcagccctctcgctgcttctaaccgcttgctgctgctg
ttggtccccatgtgtgcccagagtcactggcttcaaacacaggagcaaaatcctgtgctg
gctgtttttctccttccctgctcagccttcctgactgctggcactgacatggccgtcttc
tgtctgctgtgtgggaagcgcttccaggcgcagagcgcactgcagcagcacatggaggtc
cacgcgggcgtgcgcagctacatctgcagtgagtgcaaccgcaccttccccagccacacg
gctctcaaacgccacctgcgctcacatacaggcgaccacccctacgagtgtgagttctgt
ggcagctgcttccgggatgagagcacactcaagagccacaaacgcatccacacgggtgag
aaaccctacgagtgcaatggctgtggcaagaagttcagcctcaagcatcagctggagacg
cactatagggtgcacacaggtgagaagccctttgagtgtaagctctgccaccagcgctcc
cgggactactcggccatgatcaagcacctgagaacgcacaacggcgcctcgccctaccag
tgcaccatctgcacagagtactgccccagcctctcctccatgcagaagcacatgaagggc
cacaagcccgaggagatcccgcccgactggaggatagagaagacgtacctctacctgtgc
tatgtgtga

>gi568815587f:114196557_114412474|GENSCAN_predicted_peptide_2|158_aa
MAGWRSGHSALAAAFLFLVPSFSGQDLGEQECRLPANEQAGDKQRREEPSHPTGPAAPKL
KTWLPSCAQRVKDKGNNPKTGLGQTTEWMLVIHKVGNTGRRTGLEEGEEDDMMRNEELHL
EHAWLEAKKLNLTQMQLVSLEAHSMLKTTLVGVEGKTK

>gi568815587f:114196557_114412474|GENSCAN_predicted_CDS_2|477_bp
atggccgggtggaggtcagggcactctgcgctggctgcggccttcctcttcctggttccc
tcattcagtggtcaggatctgggagaacaggagtgcaggctgccggcaaatgagcaggcc
ggggacaagcaaaggagggaagagccctcccaccccactggccctgcagcccccaagctc
aagacttggctgccctcttgtgcccagagagtgaaagacaagggaaataatccaaaaaca
ggtcttggtcagaccactgagtggatgctggtaattcataaagtagggaacacaggaaga
agaacaggtttagaggagggtgaagaggatgacatgatgagaaacgaggagttgcatttg
gagcatgcttggcttgaggctaaaaaattgaatctaacccagatgcagctcgtgtcacta
gaagctcactctatgctgaagaccacactggtcggcgtggaggggaaaacaaagtga

>gi568815587f:114196557_114412474|GENSCAN_predicted_peptide_3|121_aa
MESGFTSKDTYLSHFNPRDYLEKYYKFGSRHSAESQILKHLLKNLFKIFCLDGVKGDLLI
DIGSGPTIYQLLSACESFKEIVVTDYSDQNLQELEKWLKKEPEAFDWSPVVTYVCDLEGN
R

>gi568815587f:114196557_114412474|GENSCAN_predicted_CDS_3|366_bp
atggaatcaggcttcacctccaaggacacctatctaagccattttaaccctcgggattac
ctagaaaaatattacaagtttggttctaggcactctgcagaaagccagattcttaagcac
cttctgaaaaatcttttcaagatattctgcctagacggtgtgaagggagacctgctgatt
gacatcggctctggccccactatctatcagctcctctctgcttgtgaatcctttaaggag
atcgtcgtcactgactactcagaccagaacctgcaggagctggagaagtggctgaagaaa
gagccagaggcctttgactggtccccagtggtgacctatgtgtgtgatcttgaagggaac
aggtag

>gi568815587f:114196557_114412474|GENSCAN_predicted_peptide_4|158_aa
MTGVENNVCGFVFFRVKGPEKEEKLRQAVKQVLKCDVTQSQPLGAVPLPPADCVLSTLCL
DAACPDLPTYCRALRNLGSLLKPGGFLVIMDALKSSYYMIGEQKFSSLPLGREAVEAAVK
EAGYTIEWFEVISQSYSSTMANNEGLFSLVARKLSRPL

>gi568815587f:114196557_114412474|GENSCAN_predicted_CDS_4|477_bp
atgactggagtggaaaacaatgtctgtgggtttgtgtttttcagagtcaagggtccagag
aaggaggagaagttgagacaggcggtcaagcaggtgctgaagtgtgatgtgactcagagc
cagccactgggggccgtccccttacccccggctgactgcgtgctcagcacactgtgtctg
gatgccgcctgcccagacctccccacctactgcagggcgctcaggaacctcggcagccta
ctgaagccagggggcttcctggtgatcatggatgcgctcaagagcagctactacatgatt
ggtgagcagaagttctccagcctccccctgggccgggaggcagtagaggctgctgtgaaa
gaggctggctacacaatcgaatggtttgaggtgatctcgcaaagttattcttccaccatg
gccaacaacgaaggacttttctccctggtggcgaggaagctgagcagacccctgtga

>gi568815587f:114196557_114412474|GENSCAN_predicted_peptide_5|209_aa
MTRPGKCENQSRVLSTSPQTEKLEVEGILLGKSHCGEDRKEHNNACHQEEEAPDVNPLHG
LQASGSPGKTQRIKSQALEGTRLYSHSEETRSGLIFLATIILSAPMSLPILDSTCASFLV
SGAGGEKMTTMLNPQFLCHHLLNCCLQLPLGFHNALLKDSDLGNMKLQSVTERSQENKSH
EARDCVIFTVGSSSQAQCLAHNRDARNIF

>gi568815587f:114196557_114412474|GENSCAN_predicted_CDS_5|630_bp
atgaccaggcctggtaagtgtgagaaccagtcgagagtgttaagcaccagtccacagact
gagaaattggaagtagaaggaatactgctgggaaagtcccactgtggggaggataggaaa
gagcataacaatgcctgccaccaggaggaagaggccccagatgtcaacccattgcatggt
ttgcaggcatcgggatcaccgggaaagactcaaagaatcaagtcacaggcactggaggga
acaaggctttactcacatagcgaagagacaagatcaggcttaatattcctggcaaccatc
attctttctgctcctatgagtttgcctattttagattctacatgtgcctctttcttagtt
agtggtgctggtggtgagaagatgaccaccatgctaaacccccagttcctgtgccaccac
ctgctcaactgctgcctgcagctgccactcggatttcataatgctcttctaaaggattca
gacttgggaaacatgaagttacaatcagtaactgagcgaagccaggaaaataagtcccac
gaggccagagactgcgtcatcttcactgtaggatcctcatctcaggcccaatgcctagca
cataacagggatgcaagaaacatcttctga

>gi568815587f:114196557_114412474|GENSCAN_predicted_peptide_6|143_aa
MGAAAAEADRTLFVGNLETKVTEELLFELFHQAGPVIKVKIPKDKDGKPKQFAFVNFKHE
VSVPYAMNLLNGIKLYGRPIKIQFRSDRHYSREQRYTDHGSDHHYRGKRDDFFYEDRNHD
DWSHDYDNRRDSSRDGKWRSSRH

>gi568815587f:114196557_114412474|GENSCAN_predicted_CDS_6|432_bp
atgggggcggcggcggcggaagcggatcgcactctctttgtgggcaaccttgaaacgaaa
gtgaccgaggagctccttttcgagcttttccaccaggctgggccagtaataaaggtgaaa
attccaaaagataaggatggtaaaccaaagcagtttgcgtttgtgaatttcaaacatgaa
gtgtctgttccttatgcaatgaatctacttaatggaatcaaactttatggaaggcctatc
aaaattcaatttagatcagatagacattatagccgggaacagcgttacactgatcatggg
tctgaccatcattacagaggaaagagagatgatttcttctatgaagacaggaatcatgat
gactggagccatgactatgataacagaagagacagtagtagagatggaaaatggcgctca
tctcgacactaa