GENSCAN 1.0 Date run: 6-Nov-116 Time: 19:28:45 Sequence gi568815597r:22020053_22242922 : 222870 bp : 48.86% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5240 5320 81 0 0 95 103 58 0.436 7.61 1.02 Term + 25903 26048 146 2 2 97 41 64 0.431 0.67 1.03 PlyA + 26105 26110 6 1.05 2.00 Prom + 26217 26256 40 -4.66 2.01 Init + 32449 32620 172 0 1 86 96 117 0.659 9.90 2.02 Intr + 32642 32690 49 0 1 55 105 69 0.430 3.04 2.03 Intr + 54946 55039 94 1 1 47 77 33 0.009 -1.93 2.04 Intr + 61670 61742 73 1 1 115 67 96 0.998 9.18 2.05 Intr + 66387 66496 110 1 2 113 121 7 0.993 6.50 2.06 Intr + 66617 66814 198 1 0 29 110 124 0.605 8.25 2.07 Intr + 71376 71449 74 2 2 104 89 94 0.782 9.30 2.08 Intr + 93265 93400 136 1 1 60 69 78 0.263 3.67 2.09 Intr + 94357 94528 172 0 1 33 105 65 0.332 2.22 2.10 Term + 95185 95264 80 2 2 99 44 18 0.173 -3.57 2.11 PlyA + 95977 95982 6 1.05 3.08 PlyA - 98978 98973 6 1.05 3.07 Term - 100465 99998 468 1 0 125 47 721 0.998 66.67 3.06 Intr - 101301 101159 143 1 2 45 67 180 0.842 11.67 3.05 Intr - 101524 101393 132 2 0 142 73 126 0.929 17.12 3.04 Intr - 109799 109564 236 1 2 91 100 496 0.996 48.43 3.03 Intr - 111051 110981 71 0 2 12 84 60 0.299 -4.02 3.02 Intr - 116345 116219 127 1 1 107 100 47 0.712 8.48 3.01 Init - 122870 122794 77 2 2 99 87 141 0.921 13.98 3.00 Prom - 135968 135929 40 -5.46 4.06 PlyA - 136911 136906 6 1.05 4.05 Term - 139098 138801 298 2 1 71 52 139 0.255 3.34 4.04 Intr - 143281 143154 128 1 2 73 15 131 0.481 3.78 4.03 Intr - 156489 156410 80 1 2 98 46 42 0.222 0.27 4.02 Intr - 165716 165663 54 0 0 85 82 65 0.412 4.55 4.01 Init - 169552 169075 478 1 1 93 86 200 0.735 15.90 4.00 Prom - 180077 180038 40 -5.46 5.00 Prom + 181032 181071 40 -2.96 5.01 Init + 181888 182015 128 0 2 82 34 90 0.279 2.74 5.02 Term + 185376 185490 115 0 1 51 55 92 0.222 0.24 5.03 PlyA + 186103 186108 6 1.05 6.00 Prom + 186570 186609 40 -3.56 6.01 Init + 187779 187842 64 2 1 86 23 74 0.837 0.26 6.02 Term + 187923 188254 332 1 2 84 48 161 0.521 6.52 6.03 PlyA + 189357 189362 6 1.05 7.10 PlyA - 189879 189874 6 1.05 7.09 Term - 190391 190221 171 2 0 91 54 40 0.270 -1.27 7.08 Intr - 192306 192199 108 0 0 32 116 35 0.004 1.18 7.07 Intr - 201448 201273 176 2 2 40 87 89 0.065 3.66 7.06 Intr - 206144 206006 139 2 1 88 39 19 0.058 -2.96 7.05 Intr - 212000 211827 174 2 0 136 103 35 0.699 9.94 7.04 Intr - 212620 212566 55 0 1 105 59 30 0.105 0.78 7.03 Intr - 216041 215995 47 2 2 108 77 14 0.103 -0.49 7.02 Intr - 216826 216713 114 1 0 75 13 92 0.131 1.04 7.01 Init - 218702 218544 159 2 0 52 75 103 0.900 5.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 48816 48686 131 1 2 36 42 149 0.931 3.44 S.002 Init - 49884 49818 67 0 1 63 82 55 0.919 1.73 S.003 Init + 58427 58531 105 1 0 69 93 39 0.964 2.72 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:22020053_22242922|GENSCAN_predicted_peptide_1|75_aa XGQTAAQKDKELLRSEKAAKNVSKTPHDFKEKHWLFLCLEPAGLWTGTETSALLVLRPSG LDWSYTIGSPGSPAQ >gi568815597r:22020053_22242922|GENSCAN_predicted_CDS_1|228_bp nttgggcaaactgcggctcagaaagataaagaacttctccggagcgaaaaagctgctaag aatgtatccaaaactccgcacgacttcaaagagaaacactggctcttcctctgtcttgag cctgctggcctttggactggaactgaaacatcagctctcctggttctcaggccttcaggc ttggactggagctacaccattggctctcctgggtctccagctcagtga >gi568815597r:22020053_22242922|GENSCAN_predicted_peptide_2|385_aa MQSIAPNSRAAPARACATGGRSQGSPGTALTRPSLFPFPVPPTSAGTQLCVSCALTSAEE TPRSAANAPVEKLRVKAEGTSLSFQLKIESGLFGKLAVISPVSQKVFDNYAVTVMIGGEP YTLGLFDTAGQEDYDRLRPLSYPQTDVFLVCFSVVSPSSFENVKEKWVPEITHHCPKTPF LLVGTQIDLRDDPSTIEKLAKNKQKPITPETAEKLARDLKAVKYVECSALTQKGLKNVFD EAILAALEPPEPKKSRREAMSVAFSSRSFKEATVPDVATKSIVDPYIWQEACEDLRKKGS PWAWEQWLPFWGSQGTWPECALSGTVSSSAGPGPPLNLEKALGYWCCSWRTTEPLSKQGD FPWTGCSQEEAEKEVYFVTAFSRLS >gi568815597r:22020053_22242922|GENSCAN_predicted_CDS_2|1158_bp atgcagagcatagccccgaactcacgagctgcgccggcccgcgcgtgcgcgacaggcggg cggagccagggcagtccaggcaccgccttgacccgccccagcctcttccccttccctgtt cctcccacttccgcgggcacccaactgtgcgtctcctgcgcgctgacgtcagccgaggag accccgcgcagtgctgccaacgccccggtggagaagctgagggttaaagcagagggaact tcattatcattccaattgaagattgaaagtggcctgtttggtaaactggctgtcatctct cctgtttctcagaaggtttttgacaactatgcagtcacagttatgattggtggagaacca tatactcttggactttttgatactgcagggcaagaggattatgacagattacgaccgctg agttatccacaaacagatgtatttctagtctgtttttcagtggtctctccatcttcattt gaaaacgtgaaagaaaagtgggtgcctgagataactcaccactgtccaaagactcctttc ttgcttgttgggactcaaattgatctcagagatgacccctctactattgagaaacttgcc aagaacaaacagaagcctatcactccagagactgctgaaaagctggcccgtgacctgaag gctgtcaagtatgtggagtgttctgcacttacacagaaaggcctaaagaatgtatttgac gaagcaatattggctgccctggagcctccagaaccgaagaagagccgcagggaggccatg tctgttgctttctcaagtagaagcttcaaggaggcaactgttcctgatgtcgctacaaag tccattgttgacccttatatctggcaggaggcctgtgaggacttaagaaaaaaagggagt ccttgggcttgggaacagtggctgcctttttggggcagccagggcacctggccagagtgt gctctgtcgggcactgtcagctcttcagcaggccccggtcctcctcttaacctagagaag gccctgggttattggtgttgttcttggcggaccaccgagcccctcagcaaacagggagac ttcccatggactggttgttctcaggaagaggctgagaaggaggtctactttgtcactgca ttttcacggctctcctag >gi568815597r:22020053_22242922|GENSCAN_predicted_peptide_3|417_aa MSPRSCLRSLRLLVFAVFSAAASNWLFPQPSREDSSMGQWPQDMKGCGDTGRMGLAKEEL SAGKLGDKAVKPGKCQVQVLLNLTGCVALHMRYLAKLSSVGSISEEETCEKLKGLIQRQV QMCKRNLEVMDSVRRGAQLAIEECQYQFRNRRWNCSTLDSLPVFGKVVTQGTREAAFVYA ISSAGVAFAVTRACSSGELEKCGCDRTVHGVSPQGFQWSGCSDNIAYGVAFSQSFVDVRE RSKGASSSRALMNLHNNEAGRKAILTHMRVECKCHGVSGSCEVKTCWRAVPPFRQVGHAL KEKFDGATEVEPRRVGSSRALVPRNAQFKPHTDEDLVYLEPSPDFCEQDMRSGVLGTRGR TCNKTSKAIDGCELLCCGRGFHTAQVELAERCSCKFHWCCFVKCRQCQRLVELHTCR >gi568815597r:22020053_22242922|GENSCAN_predicted_CDS_3|1254_bp atgagtccccgctcgtgcctgcgttcgctgcgcctcctcgtcttcgccgtcttctcagcc gccgcgagcaactggctgttccctcagcccagcagggaggacagttcaatgggtcagtgg ccacaggacatgaaaggttgtggggacacaggtcggatgggtctagcaaaggaagagctc tctgctggcaaattgggggacaaggctgtgaagccaggcaaatgccaggttcaagtcctg ctcaacctcactggctgtgtagccctgcacatgaggtacctggccaagctgtcgtcggtg gggagcatctcagaggaggagacgtgcgagaaactcaagggcctgatccagaggcaggtg cagatgtgcaagcggaacctggaagtcatggactcggtgcgccgcggtgcccagctggcc attgaggagtgccagtaccagttccggaaccggcgctggaactgctccacactcgactcc ttgcccgtcttcggcaaggtggtgacgcaagggactcgggaggcggccttcgtgtacgcc atctcttcggcaggtgtggcctttgcagtgacgcgggcgtgcagcagtggggagctggag aagtgcggctgtgacaggacagtgcatggggtcagcccacagggcttccagtggtcagga tgctctgacaacatcgcctacggtgtggccttctcacagtcgtttgtggatgtgcgggag agaagcaagggggcctcgtccagcagagccctcatgaacctccacaacaatgaggccggc aggaaggccatcctgacacacatgcgggtggaatgcaagtgccacggggtgtcaggctcc tgtgaggtaaagacgtgctggcgagccgtgccgcccttccgccaggtgggtcacgcactg aaggagaagtttgatggtgccactgaggtggagccacgccgcgtgggctcctccagggca ctggtgccacgcaacgcacagttcaagccgcacacagatgaggacctggtgtacttggag cctagccccgacttctgtgagcaggacatgcgcagcggcgtgctgggcacgaggggccgc acatgcaacaagacgtccaaggccatcgacggctgtgagctgctgtgctgtggccgcggc ttccacacggcgcaggtggagctggctgaacgctgcagctgcaaattccactggtgctgc ttcgtcaagtgccggcagtgccagcggctcgtggagttgcacacgtgccgatga >gi568815597r:22020053_22242922|GENSCAN_predicted_peptide_4|345_aa MATSKMAQVAQSLNSHIRGPPECQGPCWILEHHAAPELIPEVLNQLGLSGRTQVSELEAG TTAQLALVIRKLQSEVEHFWYSNYSPAKLSSASSWDPASKSSVQRTPWQAPAERPALKEE AKVLCHDPTFKEVLCPPHPSWLLPQDNGSKITPTKAPGQGEVNSVQEVKRIAKNLTTGPA KVPLSSCQHLLLTSQGHVLHFARELLDVTHDAVTRQREDDSCSHPGGCSDNETSAVPAWD STLTDRGELNQVTPEQISACTLASIAAKATPQGSPTMGARRAFAKAAQTVSLDLHWCWGS DQHRDLSTVRAGKSMGVGAGRDLGRNLHFILNSLCDLDQSVPQLL >gi568815597r:22020053_22242922|GENSCAN_predicted_CDS_4|1038_bp atggccacctccaagatggcacaggtagctcaatctctcaatagccatatacgagggcct cctgagtgtcagggcccatgctggatactggagcaccacgctgcaccagaactcatccca gaagtgctcaaccagttgggactcagtggaagaacccaggtctctgagttggaagctggg accacagcccagctggccctggtgatacggaagctgcagtccgaggtcgagcatttttgg tattccaattactctccagccaagctcagctctgcaagctcatgggacccagccagcaaa tcctctgtgcaacgcacaccttggcaagcacctgccgagcgcccagcactcaaggaagaa gccaaggtgctgtgccatgatcccacctttaaggaagtcctctgtcctccacatccctcc tggctccttccccaagacaatggaagcaagataacacccacgaaagccccaggccaaggt gaggtgaatagtgttcaagaggtaaagcggattgccaaaaacctcaccactgggcctgcc aaagtcccactctccagctgccagcacctgctcctcacctcacagggccacgtccttcac tttgctcgggagttactggacgttactcatgacgcagtgacgcggcagcgtgaggatgac agctgtagccaccccgggggctgctctgacaatgaaactagcgcagtgccggcatgggac tcaacactgactgaccgaggggagctcaaccaagtcaccccagagcagatttcagcctgc actctggccagcatagccgccaaagccacgccccagggctcccctactatgggagcaagg agagcctttgccaaagctgctcagactgtctcactggacctccactggtgctggggaagt gaccagcacagagacctgagcactgtccgtgcaggaaagagcatgggagttggggcaggc agggacctaggccggaatctgcacttcatccttaattcactctgtgacctcgaccagtca gtccctcagcttctctga >gi568815597r:22020053_22242922|GENSCAN_predicted_peptide_5|80_aa MQLVGNLTIRAALEPVQEVVCPLSLEVFKQKLQVWLDDPEASLSWHVGDDQEIRECTNEW DGASFSSGRNWMCSRGHISG >gi568815597r:22020053_22242922|GENSCAN_predicted_CDS_5|243_bp atgcagctggtgggtaatctgaccattagggctgccctagagcctgtccaggaggtagtg tgccctctgtcactggaggtgtttaagcagaagctgcaggtttggctggatgatccagaa gcttctctctcctggcacgtgggtgatgatcaagagattcgtgaatgcacgaatgaatgg gatggtgccagcttttcaagtggaaggaactggatgtgtagcagaggccacatcagtgga tga >gi568815597r:22020053_22242922|GENSCAN_predicted_peptide_6|131_aa MGPGQVACPLALEILEAVLAPWCPSLSGHGEAQSRQSPWPGSNTDSVPTVWHLLIREEFA WRTELGQLVCCNRDAAGWHGLGPSDWAAVFASSKSLALLEQRGIQDADPTEDEKATGSQT PHPLTPQGVFG >gi568815597r:22020053_22242922|GENSCAN_predicted_CDS_6|396_bp atggggcccggccaggtggcttgtccattggctttggaaatcctggaggcagtccttgca ccctggtgtccctcactttcaggccatggggaagcgcagagcaggcagagcccgtggcca gggagcaacacagattcggtccccactgtctggcatctcctgatccgggaggaattcgcc tggagaacagagcttgggcagctggtgtgctgtaaccgagacgcagctggttggcacggg ctagggccttctgattgggccgccgtgtttgcaagctccaaatccttggctctccttgaa caacgaggaattcaggatgctgatcccaccgaggatgaaaaggccacgggctcccagaca ccccaccctcttactccccagggggtgtttggctga >gi568815597r:22020053_22242922|GENSCAN_predicted_peptide_7|380_aa MQRPNAESKGKNSVAQCVETYAATLPNPSADFALATPLPPRPLTVTGSRQLRELPHLQNG EEQVHYTHALSFLDFTQRTFIPSSLQRWKELVKSSLPCGEKKRTKERTGVYFSPSTDEKI EQPRKDVSPGQPLSKQVVFLGAKKNSRELGSGSIHGTAPTSKAAQQRLTGCPQTGPGTGT GGQDEKPHAGVGRDPDEFPPRSGDAVGRNVGPSGSLSGQSQILKRLSPTATSRCGLDIDH SGSIYTTGIGKCQIRACFLERAFCEIFASTPLSKTQPNPMCADFSCTKGGLGGSRNSMGV VGKPSWNEGRQNSALKYRSSPVDKQPRWGGTSQKQQQKGDAAGTATGRNGTHWALRGSDH IGDPTLDFKIQQKGWHWGQR >gi568815597r:22020053_22242922|GENSCAN_predicted_CDS_7|1143_bp atgcagcggcccaatgcagagagtaaaggcaagaattcagtggcccaatgcgtggaaaca tacgcagccacgctccccaacccatccgctgattttgccctggccactcccctcccgcca aggcccctgacagttacgggctctaggcagctcagggagttacctcatctgcaaaacggg gaggagcaggttcactacacccacgcactgtccttcctggacttcacacagaggaccttc attccatcctccctgcaacgatggaaggagctggtcaagtccagccttccttgtggggag aagaagaggacaaaggaaaggacaggtgtatatttctccccatctactgatgagaaaatc gagcagcccagaaaggatgtgagcccaggacaacctttgtccaagcaggtggtctttctg ggtgccaagaagaatagtagggaacttggttctggtagcatccatggcactgccccaact tccaaagctgctcagcaaagactcactggatgtcctcagaccggccctggcactgggaca ggagggcaggatgagaagccccatgccggggtggggagagaccctgatgagtttcctcct aggagcggggatgccgtgggcaggaatgtgggcccctctgggagcctttcaggccaaagt cagatcttgaaaaggctcagccccacggcaacttcacgctgcgggctcgacattgaccac agtgggagtatttacaccacgggaattggcaaatgccaaatcagggcttgctttctcgag agggccttttgtgaaatatttgccagcacaccactgagcaagacacagcccaaccctatg tgtgctgatttttcttgtactaaaggaggactggggggctccaggaacagcatgggggtt gtggggaagccttcctggaatgagggacgacagaattctgctttgaagtatagaagttcc ccagtggacaagcagccaaggtggggcggcaccagtcagaagcagcaacagaagggagat gctgcaggaacggcaacaggcagaaacggcacacactgggctctgagaggcagtgatcac atcggagaccccacacttgatttcaagatccagcaaaagggctggcactggggtcagaga tga