GENSCAN 1.0 Date run: 6-Nov-116 Time: 01:04:15 Sequence gi568815579r:42412349_42628374 : 216026 bp : 45.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 6559 6554 6 1.05 1.02 Term - 11235 10925 311 1 2 26 42 189 0.247 3.32 1.01 Init - 14801 13919 883 2 1 78 109 305 0.691 26.22 1.00 Prom - 15897 15858 40 -6.76 2.13 PlyA - 16215 16210 6 1.05 2.12 Term - 20830 20657 174 1 0 96 43 90 0.910 3.06 2.11 Intr - 21508 21428 81 1 0 79 86 47 0.517 3.43 2.10 Intr - 33216 33052 165 0 0 44 70 100 0.023 3.96 2.09 Intr - 76794 76738 57 0 0 106 87 30 0.036 3.78 2.08 Intr - 98572 98541 32 2 2 100 121 30 0.004 5.35 2.07 Intr - 99280 99228 53 0 2 35 98 16 0.003 -4.15 2.06 Intr - 100131 100002 130 1 1 104 80 79 0.005 8.45 2.05 Intr - 106887 106600 288 1 0 86 93 281 0.301 25.62 2.04 Intr - 109140 108919 222 1 0 97 119 142 0.560 16.30 2.03 Intr - 109854 109576 279 1 0 61 76 280 0.998 21.55 2.02 Intr - 115052 114693 360 0 0 110 84 306 0.527 27.59 2.01 Init - 116026 115963 64 1 1 107 94 20 0.742 3.95 2.00 Prom - 132150 132111 40 -4.16 3.11 PlyA - 132232 132227 6 1.05 3.10 Term - 134471 134206 266 0 2 124 40 80 0.216 2.47 3.09 Intr - 136308 136211 98 2 2 103 24 53 0.204 0.05 3.08 Intr - 141527 141486 42 1 0 125 105 38 0.335 6.56 3.07 Intr - 143575 143458 118 2 1 104 74 45 0.180 4.22 3.06 Intr - 160095 160012 84 1 0 112 0 73 0.011 0.69 3.05 Intr - 176690 176436 255 0 0 110 119 84 0.772 11.12 3.04 Intr - 177387 177109 279 1 0 72 65 244 0.941 17.95 3.03 Intr - 178027 177983 45 2 0 116 108 1 0.902 3.28 3.02 Intr - 181552 181193 360 2 0 72 78 358 0.859 28.39 3.01 Init - 182480 182417 64 2 1 98 94 42 0.872 5.16 3.00 Prom - 186319 186280 40 -3.46 4.05 PlyA - 186561 186556 6 1.05 4.04 Term - 189631 189438 194 2 2 75 46 91 0.211 1.18 4.03 Intr - 213703 213449 255 2 0 107 95 205 0.653 20.52 4.02 Intr - 214099 213952 148 1 1 49 69 -17 0.173 -7.69 4.01 Intr - 214382 214147 236 0 2 58 75 218 0.418 14.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 99953 100098 146 1 2 82 94 159 0.969 15.49 S.002 Term + 101314 101338 25 1 1 144 51 32 0.990 2.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:42412349_42628374|GENSCAN_predicted_peptide_1|397_aa MEPGSKSVSRSDWQPEPHQRPITPLEPGPEKTPIAQPESKTLQGSNTQQKPASNQRPLTQ QETPAQHDAESQKEPRAQQKSASQEEFLAPQKPAPQQSPYIQRVLLTQQEAASQQGPGLG KESITQQEPALRQRHVAQPGPGPGEPPPAQQEAESTPAAQAKPGAKREPSAPTESTSQET PEQSDKQTTPVQGAKSKQGSLTELGFLTKLQELSIQRSALEWKALSEWVTDSESESDVGS SSDTDSPATMGGMVAQGVKLGFKGKSGYKVMSGYSGTSPHEKTSARNHRHYQDTGPRGWN WPPREDRGELMELAGGTEPAEAAMAPASKNAKEGSRSHGRRRWRKDKAKGERAETGSSPE GTSYILRGSGDSVDWPTATGSWGGPRTGPLPLAGTTG >gi568815579r:42412349_42628374|GENSCAN_predicted_CDS_1|1194_bp atggagccaggttctaagtcagtgtctaggtcagactggcaacctgaaccacaccagagg cctataaccccgctagagcctgggccagaaaagacacccatagcccagccagaatcgaag actctgcagggatccaatacccaacagaagcctgcttcaaaccaaagacccctcacccag caggagacccctgcacaacatgatgctgaatcccagaaggaacctagagcccaacaaaaa tctgcttcacaagaggaatttcttgccccacagaagcccgcaccacagcaatcaccttac atccaaagggtgctgctcactcaacaggaagctgcctcccagcagggacctgggctagga aaagaatctataactcaacaggagccagcattgagacaaagacatgtagcccagccaggg cctgggccaggagagccacctccagctcaacaagaagctgaatcaacacctgcggcccag gctaaacctggagccaaaagggagccatctgccccgactgaatctacgtcccaagagaca cctgaacagtcagacaagcaaacaacgccagtccagggagccaaatccaagcagggatct ttgacagagctgggatttctaacaaaacttcaggaactatccatacagcgatcagcccta gagtggaaggcactttctgagtgggtcacagattctgagtcagaatcagatgtgggatca tcttcagacacagattctccagccacgatgggtggaatggtggcccagggagtgaagcta ggcttcaaaggaaaatctggttataaagtgatgtcaggatacagtgggacgtcgccacat gagaaaaccagtgctcggaatcacagacactaccaggatacagggcctcgcggctggaat tggcccccgcgggaggaccgcggggaattgatggagttggccgggggaacggagcccgcc gaggccgctatggccccggcctcgaagaatgccaaagagggctcaaggagccacggtcgc cggcggtggcgaaaagacaaggccaaaggtgagcgcgcggagaccggcagcagcccggaa ggaacctcgtacatcctacggggcagtggggacagcgtggactggcccactgccacaggg agttgggggggtccgcggactggcccattgccgcttgcggggacaacgggatga >gi568815579r:42412349_42628374|GENSCAN_predicted_peptide_2|634_aa MGHLSAPLHRVRVPWQGLLLTASLLTFWNPPTTAQLTTESMPFNVAEGKEVLLLVHNLPQ QLFGYSWYKGERVDGNRQIVGYAIGTQQATPGPANSGRETIYPNASLLIQNVTQNDTGFY TLQVIKSDLVNEEATGQFHVYPELPKPSISSNNSNPVEDKDAVAFTCEPETQDTTYLWWI NNQSLPVSPRLQLSNGNRTLTLLSVTRNDTGPYECEIQNPVSANRSDPVTLNVTYTYYRP GANLSLSCYAASNPPAQYSWLINGTFQQSTQELFIPNITVNNSGSYTCHANNSVTGCNRT TVKTIIVTELSPVVAKPQIKASKTTVTGDKDSVNLTCSTNDTGISIRWFFKNQSLPSSER MKLSQGNTTLSINPVKREDAGTYWCEVFNPISKNQSDPIMLNVNYNALPQENGLSPGAIA GIVIGVVALVALIAVALACFLHFGKTGRASDQRDLTEHKPSVSNHTQDHSNDPPNKDTIG DNGSFMKALTGMACQLKVKKRIQISLMVLCAVTAGPGMSSRLIQGAQGSHFTLNVLSPIG AHHVYTCSSQGSPEATGTEARLLGDGSRKAAKNVSAKWKCKAQKMAIKALSYSLLSNVLT LPSKQQCCCSRLVPESPEKKIHDSVWAAKEAVPL >gi568815579r:42412349_42628374|GENSCAN_predicted_CDS_2|1905_bp atggggcacctctcagccccacttcacagagtgcgtgtaccctggcaggggcttctgctc acagcctcacttctaaccttctggaacccgcccaccactgcccagctcactactgaatcc atgccattcaatgttgcagaggggaaggaggttcttctccttgtccacaatctgccccag caactttttggctacagctggtacaaaggggaaagagtggatggcaaccgtcaaattgta ggatatgcaataggaactcaacaagctaccccagggcccgcaaacagcggtcgagagaca atataccccaatgcatccctgctgatccagaacgtcacccagaatgacacaggattctac accctacaagtcataaagtcagatcttgtgaatgaagaagcaactggacagttccatgta tacccggagctgcccaagccctccatctccagcaacaactccaaccctgtggaggacaag gatgctgtggccttcacctgtgaacctgagactcaggacacaacctacctgtggtggata aacaatcagagcctcccggtcagtcccaggctgcagctgtccaatggcaacaggaccctc actctactcagtgtcacaaggaatgacacaggaccctatgagtgtgaaatacagaaccca gtgagtgcgaaccgcagtgacccagtcaccttgaatgtcacctacacctattaccgtcca ggggcaaacctcagcctctcctgctatgcagcctctaacccacctgcacagtactcctgg cttatcaatggaacattccagcaaagcacacaagagctctttatccctaacatcactgtg aataatagtggatcctatacctgccacgccaataactcagtcactggctgcaacaggacc acagtcaagacgatcatagtcactgagctaagtccagtagtagcaaagccccaaatcaaa gccagcaagaccacagtcacaggagataaggactctgtgaacctgacctgctccacaaat gacactggaatctccatccgttggttcttcaaaaaccagagtctcccgtcctcggagagg atgaagctgtcccagggcaacaccaccctcagcataaaccctgtcaagagggaggatgct gggacgtattggtgtgaggtcttcaacccaatcagtaagaaccaaagcgaccccatcatg ctgaacgtaaactataatgctctaccacaagaaaatggcctctcacctggggccattgct ggcattgtgattggagtagtggccctggttgctctgatagcagtagccctggcatgtttt ctgcatttcgggaagaccggcagggcaagcgaccagcgtgatctcacagagcacaaaccc tcagtctccaaccacactcaggaccactccaatgacccacctaacaaggacaccattgga gacaatggttcttttatgaaggctttgactggaatggcatgccagctcaaagtgaaaaag agaatacaaatatccctgatggtcctttgtgcggtcacagctggacccggtatgtcctcc cggttaatccagggtgctcagggctcccatttcactctgaacgtcctctctcctatcgga gctcaccacgtctacacctgcagcagccaggggtcgccagaggccacagggaccgaggcc aggcttctaggagatggctccaggaaggcggccaagaatgtgagtgcaaagtggaaatgc aaggcacagaagatggcgattaaagctctgtcctactccctcctatcaaatgtattaact ctcccatctaagcagcaatgctgttgttccagattggttcctgagagccccgagaagaaa attcatgacagtgtctgggctgccaaagaagcagtgcccctgtga >gi568815579r:42412349_42628374|GENSCAN_predicted_peptide_3|536_aa MGPISAPSCRWRIPWQGLLLTASLFTFWNPPTTAQLTIEAVPSNAAEGKEVLLLVHNLPQ DPRGYNWYKGETVDANRRIIGYVISNQQITPGPAYSNRETIYPNASLLMRNVTRNDTGSY TLQVIKLNLMSEEVTGQFSVHHHSLDSALSLEVTDSTETPKPSISSNNSNPVEDKDAVAF TCEPETQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLLSVTRNDVGPYECEIQNPASANF SDPVTLNVLYGPDAPTISPSDTYYHAGVNLNLSCHAASNPPSQYSWSVNGTFQQYTQKLF IPNITTKNSGSYACHTTNSATGRNRTTVRMITVSGTLVELEDIMLSKLTQKQKNKYCMFS LIKKNAPGLPVGAFTGIVTRVLVGVAPVATLACFLLLVRTGRPRYPTPGQPLPSMRHHSR QHMDGKRLREKLPVLVRSEAAIKNCWRLGAHISLSTCLTAKAISHLLPSTLPNPVHGGDG PSWNTHHSLSAFSGNHRFYRGHLWNQNSWKDEQRELLRTSWGSWHLVHHMDIINST >gi568815579r:42412349_42628374|GENSCAN_predicted_CDS_3|1611_bp atggggcccatctcagccccttcctgcagatggcgcatcccctggcaggggctcctgctc acagcctcacttttcaccttctggaacccgcccaccactgctcagctcactattgaagct gtgccatccaatgctgcagaggggaaggaggttcttctacttgtccacaatctgccccag gaccctcgtggctacaactggtacaaaggggaaacagtggatgccaaccgtcgaattata ggatatgtaatatcaaatcaacagattaccccagggcctgcatacagcaatcgagagaca atataccccaatgcatccctgctgatgcggaacgtcaccagaaatgacacaggatcctac accctacaagtcataaagctaaatcttatgagtgaagaagtaactggccagttcagcgta catcatcattccttggactctgctctatctttagaggtcactgactcaacggagactccc aagccctccatctccagcaacaactccaaccccgtggaggacaaggatgctgtggccttc acctgtgaacctgagactcagaacacaacctacctgtggtgggtaaatggtcagagtctc ccggtcagtcccaggctgcagctgtccaatggcaacaggaccctcactctactcagtgtc acaaggaatgacgtaggaccctatgaatgtgaaatacagaacccagcgagtgcaaacttc agtgacccagtcaccctgaatgtcctctatggcccagatgcccccaccatttccccttca gacacctattaccatgcaggggtaaatctcaacctctcctgccatgcggcctctaatcca ccctcacagtattcttggtctgtcaatggcacattccagcaatacacacaaaagctcttt atccccaacatcactacaaagaacagcggatcctatgcctgccacaccactaactcagcc actggccgcaacaggaccacagtcaggatgatcacagtctctggaacattggtggagcta gaggacattatgctcagcaaactaacgcagaaacagaaaaacaaatactgcatgttctca cttataaaaaaaaatgccccaggccttcctgtgggggccttcactggcatcgtgaccagg gttctggtcggggtggcaccggtggccaccctggcatgtttcctgctcctcgtcaggact ggaaggccccgctacccgaccccaggacagccgctcccatctatgaggcatcacagcagg cagcacatggatggaaagagactgcgggaaaagctgcctgtattagtccgttccgaagct gctataaagaactgctggagactgggagcccacatcagcctgtccacctgccttacagcc aaagccatcagccacttgctcccttccaccctccccaaccctgtacatggaggtgacggc ccctcctggaacacccaccactcactgtcagccttcagtggtaatcacaggttctacagg ggtcacctgtggaaccagaactcttggaaggacgagcagagggagctgctcaggacatct tggggctcctggcatctggtccaccacatggacataatcaatagcacttag >gi568815579r:42412349_42628374|GENSCAN_predicted_peptide_4|277_aa XKLSKPFVTISNSNPVENEDTAALTCETEIQNTTYLWWVNGQSLPVSPRLKLSNDNRTLT LPRVTRKDAGTYECEIRNLYLLFLCSPGCQPKSTWPEARPLSPSQVQVQRPLLLGTQAGH DFLPQANLDGPDDPNISPSYSYYHPGENLNLSCNTASNPPPQYSWQINGKFQKSAQELFI SKITKKNSGIYICFVRNSATDSRCFTIKKITVTGISVQGSEEMAEASEPKDGERGSNPGA APWNESRWGHTENVVQEGIPRNGQERSRGHRLCGSAA >gi568815579r:42412349_42628374|GENSCAN_predicted_CDS_4|834_bp ncgaaactatccaagcccttcgtcaccatcagcaactccaaccccgtggagaatgaggat actgcggccttaacctgtgaaactgagattcagaacacaacttacctgtggtgggtaaat ggtcagagcctcccggtcagtcccaggctgaagctgtccaatgacaacaggaccctcact ctacccagagtcacaaggaaggacgcaggaacatatgaatgtgaaatccggaatctatat cttctgttcctctgtagcccaggctgccagcccaaatccacatggccagaggccaggcct ctcagtccctctcaggtccaagtacagagacctttactcctgggcacccaggctggccat gacttcctgccccaggcaaacctggatggcccagacgaccccaacatttccccctcttac tcctattaccatccaggggaaaacctcaacctctcctgcaacacagcctctaacccaccc ccacagtattcttggcagattaatgggaagttccaaaaatcagcacaagagctctttatc tccaaaatcactaaaaagaatagtggaatctacatctgtttcgttcgcaactcagccact gacagcaggtgtttcacgatcaagaagatcacagtcactgggatcagtgttcagggctca gaagaaatggcagaggccagtgagcccaaagatggtgagaggggcagcaacccaggggca gctccctggaacgagagcaggtggggccacacggagaacgtggtgcaggaaggaataccc aggaatgggcaggagaggagtcgaggacacaggctctgtgggtctgcagcctag