GENSCAN 1.0	Date run: 4-Nov-116	Time: 06:31:38

Sequence gi568815579r:42385076_42585714 : 200639 bp : 47.17% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.05 PlyA -   1963   1958    6                               1.05
 1.04 Term -   2167   2078   90  1  0  117   49   111 0.956   7.82
 1.03 Intr -   2401   2265  137  2  2   78   85   266 0.999  25.59
 1.02 Intr -   3964   3851  114  2  0  119   65   161 0.999  17.32
 1.01 Init -   4450   4414   37  1  1   82   81    79 0.404   6.77
 1.00 Prom -   5256   5217   40                              -4.46

 2.14 PlyA -   7677   7672    6                               1.05
 2.13 Term -  12224  11833  392  0  2   66   47   204 0.648   9.25
 2.12 Intr -  16397  16213  185  1  2   60   45    69 0.263  -0.77
 2.11 Intr -  17000  16843  158  2  2  117   30   337 0.640  29.61
 2.10 Intr -  17956  17532  425  2  2   84   77   391 0.995  31.19
 2.09 Intr -  20486  20310  177  0  0   74  109   209 0.999  21.49
 2.08 Intr -  21361  21086  276  2  0  119   81   273 0.904  27.19
 2.07 Intr -  22393  22099  295  1  1  125   85   360 0.999  35.98
 2.06 Intr -  22716  22531  186  0  0   93   91   218 0.999  22.49
 2.05 Intr -  23046  22901  146  1  2  107   85   281 0.999  29.70
 2.04 Intr -  23247  23157   91  0  1   80   70   164 0.982  13.37
 2.03 Intr -  25767  25232  536  1  2   79   99   914 0.670  84.24
 2.02 Intr -  38508  38344  165  1  0   26  105   103 0.385   5.73
 2.01 Init -  42074  41192  883  2  1   78  109   305 0.832  26.22
 2.00 Prom -  43170  43131   40                              -6.76

 3.13 PlyA -  43488  43483    6                               1.05
 3.12 Term -  48103  47930  174  1  0   96   43    90 0.910   3.06
 3.11 Intr -  48781  48701   81  1  0   79   86    47 0.517   3.43
 3.10 Intr -  60489  60325  165  0  0   44   70   100 0.023   3.96
 3.09 Intr - 104067 104011   57  0  0  106   87    30 0.036   3.78
 3.08 Intr - 125845 125814   32  2  2  100  121    30 0.004   5.35
 3.07 Intr - 126553 126501   53  0  2   35   98    16 0.003  -4.15
 3.06 Intr - 127404 127275  130  1  1  104   80    79 0.005   8.45
 3.05 Intr - 134160 133873  288  1  0   86   93   281 0.301  25.62
 3.04 Intr - 136413 136192  222  1  0   97  119   142 0.560  16.30
 3.03 Intr - 137127 136849  279  1  0   61   76   280 0.998  21.55
 3.02 Intr - 142325 141966  360  0  0  110   84   306 0.527  27.59
 3.01 Init - 143299 143236   64  1  1  107   94    20 0.742   3.95
 3.00 Prom - 159423 159384   40                              -4.16

 4.06 PlyA - 159505 159500    6                               1.05
 4.05 Term - 161744 161479  266  0  2  124   40    80 0.216   2.47
 4.04 Intr - 163581 163484   98  2  2  103   24    53 0.204   0.05
 4.03 Intr - 168800 168759   42  1  0  125  105    38 0.334   6.56
 4.02 Intr - 170848 170731  118  2  1  104   74    45 0.179   4.22
 4.01 Intr - 187368 187285   84  1  0  112    0    73 0.005   0.69

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 127226 127371  146  1  2   82   94   159 0.969  15.49
S.002 Term + 128587 128611   25  1  1  144   51    32 0.990   2.80

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815579r:42385076_42585714|GENSCAN_predicted_peptide_1|125_aa
MQFEMHDNVKGKAMSYPVTSQPQCATTSCYQTQLSDWHTGLTDCCNDMPVCLCGTFAPLC
LACRISDDFGECCCAPYLPGGLHSIRTGMRERYHIQGSVGHDWAALTFCLPCALCQMARE
LKIRE

>gi568815579r:42385076_42585714|GENSCAN_predicted_CDS_1|378_bp
atgcagtttgagatgcacgacaacgtgaaaggcaaagctatgtcctaccctgtgaccagt
cagccccagtgcgccaccaccagctgctaccagacccagctcagtgactggcacacaggt
ctcacggactgctgcaacgacatgcctgtctgtctgtgcggcacttttgctcctctgtgc
cttgcctgccgcatctccgacgactttggcgagtgctgctgcgcgccctacctgcccgga
ggcctgcactccatccgcaccggcatgcgggagcgctaccacatccagggctccgtcggg
cacgactgggcggccctcaccttttgtctgccctgcgccctctgccagatggcgcgggaa
ctgaagatccgagagtaa

>gi568815579r:42385076_42585714|GENSCAN_predicted_peptide_2|1304_aa
MEPGSKSVSRSDWQPEPHQRPITPLEPGPEKTPIAQPESKTLQGSNTQQKPASNQRPLTQ
QETPAQHDAESQKEPRAQQKSASQEEFLAPQKPAPQQSPYIQRVLLTQQEAASQQGPGLG
KESITQQEPALRQRHVAQPGPGPGEPPPAQQEAESTPAAQAKPGAKREPSAPTESTSQET
PEQSDKQTTPVQGAKSKQGSLTELGFLTKLQELSIQRSALEWKALSEWVTDSESESDVGS
SSDTDSPATMGGMVAQGVKLGFKGKSGYKVMSGYSGTSPHEKTSARNHRHYQDTGPRGWN
WPPREDRGELMELAGGTEPAEAAMAPASKNAKEGSRSHGRRRWRKDKAKASRLIHNMDLR
TMTQSLVTLAEDNIAFFSSQGPGETAQRLSGVFAGVREQALGLEPALGRLLGVAHLFDLD
PETPANGYRSLVHTARCCLAHLLHKSRYVASNRRSIFFRTSHNLAELEAYLAALTQLRAL
VYYAQRLLVTNRPGVLFFEGDEGLTADFLREYVTLHKGCFYGRCLGFQFTPAIRPFLQTI
SIGLVSFGEHYKRNETGLSVAASSLFTSGRFAIDPELRGAEFERITQNLDVHFWKAFWNI
TEMEVLSSLANMASATVRVSRLLSLPPEAFEMPLTADPTLTVTISPPLAHTGPGPVLVRL
ISYDLREGQDSEELSSLIKSNGQRSLELWPRPQQAPRSRSLIVHFHGGGFVAQTSRSHEP
YLKSWAQELGAPIISIDYSLAPEAPFPRALEECFFAYCWAIKHCALLGLCPCQVVTNHPS
PKPGSTGERICLAGDSAGGNLCFTVALRAAAYGVRVPDGIMAAYPATMLQPAASPSRLLS
LMDPLLPLSVLSKCVSAYAGAKTEDHSNSDQKALGMMGLVRRDTALLLRDFRLGASSWLN
SFLELSGRKSQKMSEPIAEPMRRSVSEAALAQPQGPLGTDSLKNLTLRDLSLRGNSETSS
DTPEMSLSAETLSPSTPSDVNFLLPPEDAGEEAEAKNELSPMDRGLGVRAAFPEGFHPRR
SSQGATQMPLYSSPIVKNPFMSPLLAPDSMLKSLPPVHIVACALDPMLDDSVMLARRLRN
LGQPVTLRVVEDLPHGFLTLAALCRETRQAAELGQHRTQAAGNQETWNEGYGGLDFWDQK
GVDSLVTRTHLPSDFIQVHCQEMLGPQLLPLKTFASLSLTVFHPSYVQNPNTRRAGKGLP
GQTRRTRSLASPLGPLRSGPGFRSLRFRVYWPTDWAWQRLASPKAYRCPSWAVGPAYQAK
GPEWPREVEGRVSIGWKIVTDVMDSFRRQTPKKRLACRTRLRRD

>gi568815579r:42385076_42585714|GENSCAN_predicted_CDS_2|3915_bp
atggagccaggttctaagtcagtgtctaggtcagactggcaacctgaaccacaccagagg
cctataaccccgctagagcctgggccagaaaagacacccatagcccagccagaatcgaag
actctgcagggatccaatacccaacagaagcctgcttcaaaccaaagacccctcacccag
caggagacccctgcacaacatgatgctgaatcccagaaggaacctagagcccaacaaaaa
tctgcttcacaagaggaatttcttgccccacagaagcccgcaccacagcaatcaccttac
atccaaagggtgctgctcactcaacaggaagctgcctcccagcagggacctgggctagga
aaagaatctataactcaacaggagccagcattgagacaaagacatgtagcccagccaggg
cctgggccaggagagccacctccagctcaacaagaagctgaatcaacacctgcggcccag
gctaaacctggagccaaaagggagccatctgccccgactgaatctacgtcccaagagaca
cctgaacagtcagacaagcaaacaacgccagtccagggagccaaatccaagcagggatct
ttgacagagctgggatttctaacaaaacttcaggaactatccatacagcgatcagcccta
gagtggaaggcactttctgagtgggtcacagattctgagtcagaatcagatgtgggatca
tcttcagacacagattctccagccacgatgggtggaatggtggcccagggagtgaagcta
ggcttcaaaggaaaatctggttataaagtgatgtcaggatacagtgggacgtcgccacat
gagaaaaccagtgctcggaatcacagacactaccaggatacagggcctcgcggctggaat
tggcccccgcgggaggaccgcggggaattgatggagttggccgggggaacggagcccgcc
gaggccgctatggccccggcctcgaagaatgccaaagagggctcaaggagccacggtcgc
cggcggtggcgaaaagacaaggccaaagcctcaaggctcatccacaacatggacctgcgc
acaatgacacagtcgctggtgactctggcggaggacaacatagccttcttctcgagccag
ggtcctggggaaacggcccagcggctgtcaggcgtttttgccggtgtacgggagcaggcg
ctggggctggagccggccctgggccgcctgctgggtgtggcgcacctctttgacctggac
ccagagacaccggccaacgggtaccgcagcctagtgcacacagcccgctgctgcctggcg
cacctcctgcacaaatcccgctatgtggcctccaaccgccgcagcatcttcttccgcacc
agccacaacctggccgagctggaggcctacctggctgccctcacccagctccgcgctctg
gtctactacgcccagcgcctgctggttaccaatcggccgggggtactcttctttgagggc
gacgaggggctcaccgccgacttcctccgggagtatgtcacgctgcataagggatgcttc
tatggccgctgcctgggcttccagttcacgcctgccatccggccattcctgcagaccatc
tccattgggctggtgtccttcggggagcactacaaacgcaacgagacaggcctcagtgtg
gccgccagctctctcttcaccagcggccgctttgccatcgaccccgagctgcgtggggct
gagtttgagcggatcacacagaacctggacgtgcacttctggaaagccttctggaacatc
accgagatggaagtgctatcgtctctggccaacatggcatcggccaccgtgagggtaagc
cgcctgctcagcctgccacccgaagcctttgagatgccactgactgccgaccccacgctc
acggtcaccatctcacccccactggcccacacaggccctgggcccgtcctcgtcaggctc
atctcctatgacctgcgtgaaggacaggacagtgaggagctcagcagcctgataaagtcc
aacggccaacggagcctggagctgtggccgcgcccccagcaggcaccccgctcgcggtcc
ctgatagtgcacttccacggcggtggctttgtggcccagacctccagatcccacgagccc
tacctcaagagctgggcccaggagctgggcgcccccatcatctccatcgactactccctg
gcccctgaggcccccttcccccgtgcgctggaggagtgcttcttcgcctactgctgggcc
atcaagcactgcgccctccttggcctctgcccctgccaggttgtcaccaaccacccctct
cccaaaccaggctcaacaggggaacgaatctgccttgcgggggacagtgcaggcgggaac
ctctgcttcaccgtggctcttcgggcagcagcctacggggtgcgggtgccagatggcatc
atggcagcctacccggccacaatgctgcagcctgccgcctctccctcccgcctgctgagc
ctcatggaccccttgctgcccctcagtgtgctctccaagtgtgtcagcgcctatgctggt
gcaaagacggaggaccactccaactcagaccagaaagccctcggcatgatggggctggtg
cggcgggacacagccctgctcctccgagacttccgcctgggtgcctcctcatggctcaac
tccttcctggagttaagtgggcgcaagtcccagaagatgtcggagcccatagcagagccg
atgcgccgcagtgtgtctgaagcagcactggcccagccccagggcccactgggcacggat
tccctcaagaacctgaccctgagggacttgagcctgaggggaaactccgagacgtcgtcg
gacacccccgagatgtcgctgtcagctgagacacttagcccctccacaccctccgatgtc
aacttcttattaccacctgaggatgcaggggaagaggctgaggccaaaaatgagctgagc
cccatggacagaggcctgggcgtccgtgccgccttccccgagggtttccacccccgacgc
tccagccagggtgccacacagatgcccctctactcctcacccatagtcaagaaccccttc
atgtcgccgctgctggcacccgacagcatgctcaagagcctgccacctgtgcacatcgtg
gcgtgcgcgctggaccccatgctggacgactcggtcatgctcgcgcggcgactgcgcaac
ctgggccagccggtgacgctgcgcgtggtggaggacctgccgcacggcttcctgacccta
gcggcgctgtgccgcgagacgcgccaggccgcagagctggggcagcatcggactcaagcg
gctgggaaccaagaaacctggaatgaggggtatgggggactggatttttgggaccagaag
ggagtggactccttggtaacccggacccacctcccctcggacttcatccaggtacactgc
caggagatgctgggtcctcagctcctccccttgaagaccttcgcctcactaagcctcacg
gttttccacccgagctatgtgcagaatcctaatactaggcgtgctggaaagggcttaccc
ggccagactcggagaacccgttccctcgcgagcccgctcgggcctctgcggagtggacca
ggattccgatccctcaggttccgggtctattggccgactgactgggcctggcagagactg
gcgtctccgaaagcctatcgctgcccgagttgggctgtgggccccgcctaccaggcaaag
ggtcctgagtggcccagggaggtagaaggccgagtgtccattggctggaagatagtgact
gacgtcatggatagctttcgccgccagactccaaagaagcgcctcgcgtgtcggacccgc
ctccggagggactga

>gi568815579r:42385076_42585714|GENSCAN_predicted_peptide_3|634_aa
MGHLSAPLHRVRVPWQGLLLTASLLTFWNPPTTAQLTTESMPFNVAEGKEVLLLVHNLPQ
QLFGYSWYKGERVDGNRQIVGYAIGTQQATPGPANSGRETIYPNASLLIQNVTQNDTGFY
TLQVIKSDLVNEEATGQFHVYPELPKPSISSNNSNPVEDKDAVAFTCEPETQDTTYLWWI
NNQSLPVSPRLQLSNGNRTLTLLSVTRNDTGPYECEIQNPVSANRSDPVTLNVTYTYYRP
GANLSLSCYAASNPPAQYSWLINGTFQQSTQELFIPNITVNNSGSYTCHANNSVTGCNRT
TVKTIIVTELSPVVAKPQIKASKTTVTGDKDSVNLTCSTNDTGISIRWFFKNQSLPSSER
MKLSQGNTTLSINPVKREDAGTYWCEVFNPISKNQSDPIMLNVNYNALPQENGLSPGAIA
GIVIGVVALVALIAVALACFLHFGKTGRASDQRDLTEHKPSVSNHTQDHSNDPPNKDTIG
DNGSFMKALTGMACQLKVKKRIQISLMVLCAVTAGPGMSSRLIQGAQGSHFTLNVLSPIG
AHHVYTCSSQGSPEATGTEARLLGDGSRKAAKNVSAKWKCKAQKMAIKALSYSLLSNVLT
LPSKQQCCCSRLVPESPEKKIHDSVWAAKEAVPL

>gi568815579r:42385076_42585714|GENSCAN_predicted_CDS_3|1905_bp
atggggcacctctcagccccacttcacagagtgcgtgtaccctggcaggggcttctgctc
acagcctcacttctaaccttctggaacccgcccaccactgcccagctcactactgaatcc
atgccattcaatgttgcagaggggaaggaggttcttctccttgtccacaatctgccccag
caactttttggctacagctggtacaaaggggaaagagtggatggcaaccgtcaaattgta
ggatatgcaataggaactcaacaagctaccccagggcccgcaaacagcggtcgagagaca
atataccccaatgcatccctgctgatccagaacgtcacccagaatgacacaggattctac
accctacaagtcataaagtcagatcttgtgaatgaagaagcaactggacagttccatgta
tacccggagctgcccaagccctccatctccagcaacaactccaaccctgtggaggacaag
gatgctgtggccttcacctgtgaacctgagactcaggacacaacctacctgtggtggata
aacaatcagagcctcccggtcagtcccaggctgcagctgtccaatggcaacaggaccctc
actctactcagtgtcacaaggaatgacacaggaccctatgagtgtgaaatacagaaccca
gtgagtgcgaaccgcagtgacccagtcaccttgaatgtcacctacacctattaccgtcca
ggggcaaacctcagcctctcctgctatgcagcctctaacccacctgcacagtactcctgg
cttatcaatggaacattccagcaaagcacacaagagctctttatccctaacatcactgtg
aataatagtggatcctatacctgccacgccaataactcagtcactggctgcaacaggacc
acagtcaagacgatcatagtcactgagctaagtccagtagtagcaaagccccaaatcaaa
gccagcaagaccacagtcacaggagataaggactctgtgaacctgacctgctccacaaat
gacactggaatctccatccgttggttcttcaaaaaccagagtctcccgtcctcggagagg
atgaagctgtcccagggcaacaccaccctcagcataaaccctgtcaagagggaggatgct
gggacgtattggtgtgaggtcttcaacccaatcagtaagaaccaaagcgaccccatcatg
ctgaacgtaaactataatgctctaccacaagaaaatggcctctcacctggggccattgct
ggcattgtgattggagtagtggccctggttgctctgatagcagtagccctggcatgtttt
ctgcatttcgggaagaccggcagggcaagcgaccagcgtgatctcacagagcacaaaccc
tcagtctccaaccacactcaggaccactccaatgacccacctaacaaggacaccattgga
gacaatggttcttttatgaaggctttgactggaatggcatgccagctcaaagtgaaaaag
agaatacaaatatccctgatggtcctttgtgcggtcacagctggacccggtatgtcctcc
cggttaatccagggtgctcagggctcccatttcactctgaacgtcctctctcctatcgga
gctcaccacgtctacacctgcagcagccaggggtcgccagaggccacagggaccgaggcc
aggcttctaggagatggctccaggaaggcggccaagaatgtgagtgcaaagtggaaatgc
aaggcacagaagatggcgattaaagctctgtcctactccctcctatcaaatgtattaact
ctcccatctaagcagcaatgctgttgttccagattggttcctgagagccccgagaagaaa
attcatgacagtgtctgggctgccaaagaagcagtgcccctgtga

>gi568815579r:42385076_42585714|GENSCAN_predicted_peptide_4|202_aa
XTLVELEDIMLSKLTQKQKNKYCMFSLIKKNAPGLPVGAFTGIVTRVLVGVAPVATLACF
LLLVRTGRPRYPTPGQPLPSMRHHSRQHMDGKRLREKLPVLVRSEAAIKNCWRLGAHISL
STCLTAKAISHLLPSTLPNPVHGGDGPSWNTHHSLSAFSGNHRFYRGHLWNQNSWKDEQR
ELLRTSWGSWHLVHHMDIINST

>gi568815579r:42385076_42585714|GENSCAN_predicted_CDS_4|609_bp
ngaacattggtggagctagaggacattatgctcagcaaactaacgcagaaacagaaaaac
aaatactgcatgttctcacttataaaaaaaaatgccccaggccttcctgtgggggccttc
actggcatcgtgaccagggttctggtcggggtggcaccggtggccaccctggcatgtttc
ctgctcctcgtcaggactggaaggccccgctacccgaccccaggacagccgctcccatct
atgaggcatcacagcaggcagcacatggatggaaagagactgcgggaaaagctgcctgta
ttagtccgttccgaagctgctataaagaactgctggagactgggagcccacatcagcctg
tccacctgccttacagccaaagccatcagccacttgctcccttccaccctccccaaccct
gtacatggaggtgacggcccctcctggaacacccaccactcactgtcagccttcagtggt
aatcacaggttctacaggggtcacctgtggaaccagaactcttggaaggacgagcagagg
gagctgctcaggacatcttggggctcctggcatctggtccaccacatggacataatcaat
agcacttag