GENSCAN 1.0 Date run: 8-Nov-116 Time: 03:33:12 Sequence gi568815579r:42409112_42628374 : 219263 bp : 45.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 Intr - 1731 1196 536 1 2 79 99 914 0.405 84.24 1.02 Intr - 14472 14308 165 1 0 26 105 103 0.396 5.73 1.01 Init - 18038 17156 883 2 1 78 109 305 0.837 26.22 1.00 Prom - 19134 19095 40 -6.76 2.13 PlyA - 19452 19447 6 1.05 2.12 Term - 24067 23894 174 1 0 96 43 90 0.910 3.06 2.11 Intr - 24745 24665 81 1 0 79 86 47 0.517 3.43 2.10 Intr - 36453 36289 165 0 0 44 70 100 0.023 3.96 2.09 Intr - 80031 79975 57 0 0 106 87 30 0.036 3.78 2.08 Intr - 101809 101778 32 2 2 100 121 30 0.004 5.35 2.07 Intr - 102517 102465 53 0 2 35 98 16 0.003 -4.15 2.06 Intr - 103368 103239 130 1 1 104 80 79 0.005 8.45 2.05 Intr - 110124 109837 288 1 0 86 93 281 0.301 25.62 2.04 Intr - 112377 112156 222 1 0 97 119 142 0.560 16.30 2.03 Intr - 113091 112813 279 1 0 61 76 280 0.998 21.55 2.02 Intr - 118289 117930 360 0 0 110 84 306 0.527 27.59 2.01 Init - 119263 119200 64 1 1 107 94 20 0.742 3.95 2.00 Prom - 135387 135348 40 -4.16 3.11 PlyA - 135469 135464 6 1.05 3.10 Term - 137708 137443 266 0 2 124 40 80 0.216 2.47 3.09 Intr - 139545 139448 98 2 2 103 24 53 0.204 0.05 3.08 Intr - 144764 144723 42 1 0 125 105 38 0.335 6.56 3.07 Intr - 146812 146695 118 2 1 104 74 45 0.180 4.22 3.06 Intr - 163332 163249 84 1 0 112 0 73 0.011 0.69 3.05 Intr - 179927 179673 255 0 0 110 119 84 0.772 11.12 3.04 Intr - 180624 180346 279 1 0 72 65 244 0.941 17.95 3.03 Intr - 181264 181220 45 2 0 116 108 1 0.902 3.28 3.02 Intr - 184789 184430 360 2 0 72 78 358 0.859 28.39 3.01 Init - 185717 185654 64 2 1 98 94 42 0.872 5.16 3.00 Prom - 189556 189517 40 -3.46 4.05 PlyA - 189798 189793 6 1.05 4.04 Term - 192868 192675 194 2 2 75 46 91 0.211 1.18 4.03 Intr - 216940 216686 255 2 0 107 95 205 0.653 20.52 4.02 Intr - 217336 217189 148 1 1 49 69 -17 0.173 -7.69 4.01 Intr - 217619 217384 236 0 2 58 75 218 0.418 14.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 103190 103335 146 1 2 82 94 159 0.969 15.49 S.002 Term + 104551 104575 25 1 1 144 51 32 0.990 2.80 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:42409112_42628374|GENSCAN_predicted_peptide_1|528_aa MEPGSKSVSRSDWQPEPHQRPITPLEPGPEKTPIAQPESKTLQGSNTQQKPASNQRPLTQ QETPAQHDAESQKEPRAQQKSASQEEFLAPQKPAPQQSPYIQRVLLTQQEAASQQGPGLG KESITQQEPALRQRHVAQPGPGPGEPPPAQQEAESTPAAQAKPGAKREPSAPTESTSQET PEQSDKQTTPVQGAKSKQGSLTELGFLTKLQELSIQRSALEWKALSEWVTDSESESDVGS SSDTDSPATMGGMVAQGVKLGFKGKSGYKVMSGYSGTSPHEKTSARNHRHYQDTGPRGWN WPPREDRGELMELAGGTEPAEAAMAPASKNAKEGSRSHGRRRWRKDKAKASRLIHNMDLR TMTQSLVTLAEDNIAFFSSQGPGETAQRLSGVFAGVREQALGLEPALGRLLGVAHLFDLD PETPANGYRSLVHTARCCLAHLLHKSRYVASNRRSIFFRTSHNLAELEAYLAALTQLRAL VYYAQRLLVTNRPGVLFFEGDEGLTADFLREYVTLHKGCFYGRCLGFQ >gi568815579r:42409112_42628374|GENSCAN_predicted_CDS_1|1584_bp atggagccaggttctaagtcagtgtctaggtcagactggcaacctgaaccacaccagagg cctataaccccgctagagcctgggccagaaaagacacccatagcccagccagaatcgaag actctgcagggatccaatacccaacagaagcctgcttcaaaccaaagacccctcacccag caggagacccctgcacaacatgatgctgaatcccagaaggaacctagagcccaacaaaaa tctgcttcacaagaggaatttcttgccccacagaagcccgcaccacagcaatcaccttac atccaaagggtgctgctcactcaacaggaagctgcctcccagcagggacctgggctagga aaagaatctataactcaacaggagccagcattgagacaaagacatgtagcccagccaggg cctgggccaggagagccacctccagctcaacaagaagctgaatcaacacctgcggcccag gctaaacctggagccaaaagggagccatctgccccgactgaatctacgtcccaagagaca cctgaacagtcagacaagcaaacaacgccagtccagggagccaaatccaagcagggatct ttgacagagctgggatttctaacaaaacttcaggaactatccatacagcgatcagcccta gagtggaaggcactttctgagtgggtcacagattctgagtcagaatcagatgtgggatca tcttcagacacagattctccagccacgatgggtggaatggtggcccagggagtgaagcta ggcttcaaaggaaaatctggttataaagtgatgtcaggatacagtgggacgtcgccacat gagaaaaccagtgctcggaatcacagacactaccaggatacagggcctcgcggctggaat tggcccccgcgggaggaccgcggggaattgatggagttggccgggggaacggagcccgcc gaggccgctatggccccggcctcgaagaatgccaaagagggctcaaggagccacggtcgc cggcggtggcgaaaagacaaggccaaagcctcaaggctcatccacaacatggacctgcgc acaatgacacagtcgctggtgactctggcggaggacaacatagccttcttctcgagccag ggtcctggggaaacggcccagcggctgtcaggcgtttttgccggtgtacgggagcaggcg ctggggctggagccggccctgggccgcctgctgggtgtggcgcacctctttgacctggac ccagagacaccggccaacgggtaccgcagcctagtgcacacagcccgctgctgcctggcg cacctcctgcacaaatcccgctatgtggcctccaaccgccgcagcatcttcttccgcacc agccacaacctggccgagctggaggcctacctggctgccctcacccagctccgcgctctg gtctactacgcccagcgcctgctggttaccaatcggccgggggtactcttctttgagggc gacgaggggctcaccgccgacttcctccgggagtatgtcacgctgcataagggatgcttc tatggccgctgcctgggcttccag >gi568815579r:42409112_42628374|GENSCAN_predicted_peptide_2|634_aa MGHLSAPLHRVRVPWQGLLLTASLLTFWNPPTTAQLTTESMPFNVAEGKEVLLLVHNLPQ QLFGYSWYKGERVDGNRQIVGYAIGTQQATPGPANSGRETIYPNASLLIQNVTQNDTGFY TLQVIKSDLVNEEATGQFHVYPELPKPSISSNNSNPVEDKDAVAFTCEPETQDTTYLWWI NNQSLPVSPRLQLSNGNRTLTLLSVTRNDTGPYECEIQNPVSANRSDPVTLNVTYTYYRP GANLSLSCYAASNPPAQYSWLINGTFQQSTQELFIPNITVNNSGSYTCHANNSVTGCNRT TVKTIIVTELSPVVAKPQIKASKTTVTGDKDSVNLTCSTNDTGISIRWFFKNQSLPSSER MKLSQGNTTLSINPVKREDAGTYWCEVFNPISKNQSDPIMLNVNYNALPQENGLSPGAIA GIVIGVVALVALIAVALACFLHFGKTGRASDQRDLTEHKPSVSNHTQDHSNDPPNKDTIG DNGSFMKALTGMACQLKVKKRIQISLMVLCAVTAGPGMSSRLIQGAQGSHFTLNVLSPIG AHHVYTCSSQGSPEATGTEARLLGDGSRKAAKNVSAKWKCKAQKMAIKALSYSLLSNVLT LPSKQQCCCSRLVPESPEKKIHDSVWAAKEAVPL >gi568815579r:42409112_42628374|GENSCAN_predicted_CDS_2|1905_bp atggggcacctctcagccccacttcacagagtgcgtgtaccctggcaggggcttctgctc acagcctcacttctaaccttctggaacccgcccaccactgcccagctcactactgaatcc atgccattcaatgttgcagaggggaaggaggttcttctccttgtccacaatctgccccag caactttttggctacagctggtacaaaggggaaagagtggatggcaaccgtcaaattgta ggatatgcaataggaactcaacaagctaccccagggcccgcaaacagcggtcgagagaca atataccccaatgcatccctgctgatccagaacgtcacccagaatgacacaggattctac accctacaagtcataaagtcagatcttgtgaatgaagaagcaactggacagttccatgta tacccggagctgcccaagccctccatctccagcaacaactccaaccctgtggaggacaag gatgctgtggccttcacctgtgaacctgagactcaggacacaacctacctgtggtggata aacaatcagagcctcccggtcagtcccaggctgcagctgtccaatggcaacaggaccctc actctactcagtgtcacaaggaatgacacaggaccctatgagtgtgaaatacagaaccca gtgagtgcgaaccgcagtgacccagtcaccttgaatgtcacctacacctattaccgtcca ggggcaaacctcagcctctcctgctatgcagcctctaacccacctgcacagtactcctgg cttatcaatggaacattccagcaaagcacacaagagctctttatccctaacatcactgtg aataatagtggatcctatacctgccacgccaataactcagtcactggctgcaacaggacc acagtcaagacgatcatagtcactgagctaagtccagtagtagcaaagccccaaatcaaa gccagcaagaccacagtcacaggagataaggactctgtgaacctgacctgctccacaaat gacactggaatctccatccgttggttcttcaaaaaccagagtctcccgtcctcggagagg atgaagctgtcccagggcaacaccaccctcagcataaaccctgtcaagagggaggatgct gggacgtattggtgtgaggtcttcaacccaatcagtaagaaccaaagcgaccccatcatg ctgaacgtaaactataatgctctaccacaagaaaatggcctctcacctggggccattgct ggcattgtgattggagtagtggccctggttgctctgatagcagtagccctggcatgtttt ctgcatttcgggaagaccggcagggcaagcgaccagcgtgatctcacagagcacaaaccc tcagtctccaaccacactcaggaccactccaatgacccacctaacaaggacaccattgga gacaatggttcttttatgaaggctttgactggaatggcatgccagctcaaagtgaaaaag agaatacaaatatccctgatggtcctttgtgcggtcacagctggacccggtatgtcctcc cggttaatccagggtgctcagggctcccatttcactctgaacgtcctctctcctatcgga gctcaccacgtctacacctgcagcagccaggggtcgccagaggccacagggaccgaggcc aggcttctaggagatggctccaggaaggcggccaagaatgtgagtgcaaagtggaaatgc aaggcacagaagatggcgattaaagctctgtcctactccctcctatcaaatgtattaact ctcccatctaagcagcaatgctgttgttccagattggttcctgagagccccgagaagaaa attcatgacagtgtctgggctgccaaagaagcagtgcccctgtga >gi568815579r:42409112_42628374|GENSCAN_predicted_peptide_3|536_aa MGPISAPSCRWRIPWQGLLLTASLFTFWNPPTTAQLTIEAVPSNAAEGKEVLLLVHNLPQ DPRGYNWYKGETVDANRRIIGYVISNQQITPGPAYSNRETIYPNASLLMRNVTRNDTGSY TLQVIKLNLMSEEVTGQFSVHHHSLDSALSLEVTDSTETPKPSISSNNSNPVEDKDAVAF TCEPETQNTTYLWWVNGQSLPVSPRLQLSNGNRTLTLLSVTRNDVGPYECEIQNPASANF SDPVTLNVLYGPDAPTISPSDTYYHAGVNLNLSCHAASNPPSQYSWSVNGTFQQYTQKLF IPNITTKNSGSYACHTTNSATGRNRTTVRMITVSGTLVELEDIMLSKLTQKQKNKYCMFS LIKKNAPGLPVGAFTGIVTRVLVGVAPVATLACFLLLVRTGRPRYPTPGQPLPSMRHHSR QHMDGKRLREKLPVLVRSEAAIKNCWRLGAHISLSTCLTAKAISHLLPSTLPNPVHGGDG PSWNTHHSLSAFSGNHRFYRGHLWNQNSWKDEQRELLRTSWGSWHLVHHMDIINST >gi568815579r:42409112_42628374|GENSCAN_predicted_CDS_3|1611_bp atggggcccatctcagccccttcctgcagatggcgcatcccctggcaggggctcctgctc acagcctcacttttcaccttctggaacccgcccaccactgctcagctcactattgaagct gtgccatccaatgctgcagaggggaaggaggttcttctacttgtccacaatctgccccag gaccctcgtggctacaactggtacaaaggggaaacagtggatgccaaccgtcgaattata ggatatgtaatatcaaatcaacagattaccccagggcctgcatacagcaatcgagagaca atataccccaatgcatccctgctgatgcggaacgtcaccagaaatgacacaggatcctac accctacaagtcataaagctaaatcttatgagtgaagaagtaactggccagttcagcgta catcatcattccttggactctgctctatctttagaggtcactgactcaacggagactccc aagccctccatctccagcaacaactccaaccccgtggaggacaaggatgctgtggccttc acctgtgaacctgagactcagaacacaacctacctgtggtgggtaaatggtcagagtctc ccggtcagtcccaggctgcagctgtccaatggcaacaggaccctcactctactcagtgtc acaaggaatgacgtaggaccctatgaatgtgaaatacagaacccagcgagtgcaaacttc agtgacccagtcaccctgaatgtcctctatggcccagatgcccccaccatttccccttca gacacctattaccatgcaggggtaaatctcaacctctcctgccatgcggcctctaatcca ccctcacagtattcttggtctgtcaatggcacattccagcaatacacacaaaagctcttt atccccaacatcactacaaagaacagcggatcctatgcctgccacaccactaactcagcc actggccgcaacaggaccacagtcaggatgatcacagtctctggaacattggtggagcta gaggacattatgctcagcaaactaacgcagaaacagaaaaacaaatactgcatgttctca cttataaaaaaaaatgccccaggccttcctgtgggggccttcactggcatcgtgaccagg gttctggtcggggtggcaccggtggccaccctggcatgtttcctgctcctcgtcaggact ggaaggccccgctacccgaccccaggacagccgctcccatctatgaggcatcacagcagg cagcacatggatggaaagagactgcgggaaaagctgcctgtattagtccgttccgaagct gctataaagaactgctggagactgggagcccacatcagcctgtccacctgccttacagcc aaagccatcagccacttgctcccttccaccctccccaaccctgtacatggaggtgacggc ccctcctggaacacccaccactcactgtcagccttcagtggtaatcacaggttctacagg ggtcacctgtggaaccagaactcttggaaggacgagcagagggagctgctcaggacatct tggggctcctggcatctggtccaccacatggacataatcaatagcacttag >gi568815579r:42409112_42628374|GENSCAN_predicted_peptide_4|277_aa XKLSKPFVTISNSNPVENEDTAALTCETEIQNTTYLWWVNGQSLPVSPRLKLSNDNRTLT LPRVTRKDAGTYECEIRNLYLLFLCSPGCQPKSTWPEARPLSPSQVQVQRPLLLGTQAGH DFLPQANLDGPDDPNISPSYSYYHPGENLNLSCNTASNPPPQYSWQINGKFQKSAQELFI SKITKKNSGIYICFVRNSATDSRCFTIKKITVTGISVQGSEEMAEASEPKDGERGSNPGA APWNESRWGHTENVVQEGIPRNGQERSRGHRLCGSAA >gi568815579r:42409112_42628374|GENSCAN_predicted_CDS_4|834_bp ncgaaactatccaagcccttcgtcaccatcagcaactccaaccccgtggagaatgaggat actgcggccttaacctgtgaaactgagattcagaacacaacttacctgtggtgggtaaat ggtcagagcctcccggtcagtcccaggctgaagctgtccaatgacaacaggaccctcact ctacccagagtcacaaggaaggacgcaggaacatatgaatgtgaaatccggaatctatat cttctgttcctctgtagcccaggctgccagcccaaatccacatggccagaggccaggcct ctcagtccctctcaggtccaagtacagagacctttactcctgggcacccaggctggccat gacttcctgccccaggcaaacctggatggcccagacgaccccaacatttccccctcttac tcctattaccatccaggggaaaacctcaacctctcctgcaacacagcctctaacccaccc ccacagtattcttggcagattaatgggaagttccaaaaatcagcacaagagctctttatc tccaaaatcactaaaaagaatagtggaatctacatctgtttcgttcgcaactcagccact gacagcaggtgtttcacgatcaagaagatcacagtcactgggatcagtgttcagggctca gaagaaatggcagaggccagtgagcccaaagatggtgagaggggcagcaacccaggggca gctccctggaacgagagcaggtggggccacacggagaacgtggtgcaggaaggaataccc aggaatgggcaggagaggagtcgaggacacaggctctgtgggtctgcagcctag