GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:01:43 Sequence gi568815596f:44101200_44330715 : 229516 bp : 40.69% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5024 5107 84 1 0 93 60 62 0.681 4.77 1.02 Term + 8021 8179 159 1 0 51 45 119 0.456 0.86 1.03 PlyA + 9172 9177 6 1.05 2.00 Prom + 17694 17733 40 -3.95 2.01 Init + 21000 21131 132 2 0 45 59 133 0.449 6.29 2.02 Intr + 35915 35955 41 1 2 98 68 44 0.008 -0.50 2.03 Intr + 51626 51832 207 2 0 40 78 90 0.014 0.47 2.04 Intr + 65334 65454 121 0 1 62 0 160 0.652 4.18 2.05 Intr + 67748 67913 166 1 1 60 41 212 0.340 12.31 2.06 Term + 68383 68588 206 2 2 103 40 130 0.488 6.25 2.07 PlyA + 69147 69152 6 1.05 3.04 PlyA - 69318 69313 6 1.05 3.03 Term - 70013 69814 200 0 2 60 47 137 0.189 3.38 3.02 Intr - 82197 82102 96 1 0 85 29 90 0.070 1.86 3.01 Init - 86708 86693 16 2 1 81 95 21 0.666 1.08 3.00 Prom - 93589 93550 40 -5.05 4.00 Prom + 98570 98609 40 -4.65 4.01 Init + 100001 100846 846 1 0 81 95 701 0.982 64.81 4.02 Intr + 108011 108128 118 1 1 85 116 111 0.950 12.72 4.03 Intr + 116768 116879 112 0 1 -1 108 98 0.093 1.42 4.04 Intr + 117281 117338 58 2 1 104 98 23 0.985 2.87 4.05 Intr + 129214 129505 292 0 1 7 12 387 0.172 18.78 4.06 Intr + 147055 147150 96 2 0 6 97 120 0.229 3.76 4.07 Intr + 159558 160006 449 1 2 67 86 173 0.239 7.14 4.08 Term + 169230 169778 549 2 0 8 42 465 0.794 27.21 4.09 PlyA + 169932 169937 6 1.05 5.00 Prom + 172479 172518 40 -7.75 5.01 Init + 174337 174766 430 0 1 85 83 503 0.991 45.96 5.02 Intr + 179517 179696 180 1 0 85 115 174 0.993 18.72 5.03 Intr + 180188 180342 155 0 2 42 106 102 0.903 6.27 5.04 Intr + 184833 185004 172 2 1 84 36 84 0.012 1.39 5.05 Intr + 185290 185454 165 2 0 41 101 166 0.008 12.21 5.06 Intr + 191437 191645 209 2 2 10 95 76 0.008 -1.63 5.07 Intr + 198772 198891 120 0 0 106 60 103 0.464 9.07 5.08 Term + 199804 199968 165 0 0 63 36 174 0.369 6.63 5.09 PlyA + 200864 200869 6 1.05 6.00 Prom + 201123 201162 40 -6.25 6.01 Init + 201374 201438 65 1 2 85 68 62 0.827 4.57 6.02 Intr + 202944 203139 196 0 1 97 115 168 0.985 18.90 6.03 Intr + 204787 204924 138 0 0 64 75 97 0.977 5.74 6.04 Intr + 211387 211554 168 0 0 53 87 239 0.940 19.52 6.05 Intr + 212636 212752 117 1 0 49 95 47 0.536 1.24 6.06 Term + 219000 219440 441 2 0 34 48 207 0.843 5.57 6.07 PlyA + 219605 219610 6 1.05 7.07 PlyA - 219655 219650 6 1.05 7.06 Term - 220246 220157 90 1 0 90 47 137 0.918 6.54 7.05 Intr - 220701 220628 74 1 2 63 95 5 0.738 -3.09 7.04 Intr - 221655 221532 124 0 1 94 97 209 0.896 21.64 7.03 Intr - 222212 222063 150 2 0 80 72 158 0.996 12.84 7.02 Intr - 225729 225513 217 2 1 73 105 150 0.819 12.78 7.01 Intr - 227913 227738 176 0 2 70 109 94 0.555 7.52 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 64895 64965 71 1 2 69 50 78 0.857 2.67 S.002 Init + 116773 116879 107 0 2 61 108 83 0.824 7.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:44101200_44330715|GENSCAN_predicted_peptide_1|80_aa MAFVNRNGTGRSVAVKTTRGYSVAILVLSQYSFMVTPEWPNMNGKSLHVSEFQKQMRTKH MQGRGEIIIMNAEVDNSDSL >gi568815596f:44101200_44330715|GENSCAN_predicted_CDS_1|243_bp atggcatttgtaaaccgtaatggcactggtaggagtgtagcagtgaagacgaccagaggt tactctgtggccatcttggttttgtctcagtattcctttatggtaacaccagaatggcct aacatgaatgggaagtcattacatgtatcagagtttcagaagcagatgagaacaaaacat atgcagggcagaggggagatcatcatcatgaatgcagaagttgataacagtgactcatta tga >gi568815596f:44101200_44330715|GENSCAN_predicted_peptide_2|290_aa MFINITTNLIGKDFKYGEAVKFTVANTGFRKFRFFYLKAPIRSLDELSFSGSCDIDARLG DSETLSQKTKKEKKVQFTPNVTLRTGEPLAIFSTFILGKDQTQVFLPETPGDKFWEVSLR HLSFGVRDLKGLYEDIRKELLISTTELQEMSEYYFDGKGKAFRPIIGQLRGSFLPARLES LILRVRWLEDAPERRGCGGGGGGGRIGNGARVERRRHRRRRRLCKHLFHTKALGLPQALA LLFLVHALFENTIFYHSFPHQLLPYSALHFTSLLNRLSLLSMRSTLAFPP >gi568815596f:44101200_44330715|GENSCAN_predicted_CDS_2|873_bp atgttcattaacatcaccaccaatctcattggaaaggactttaaatacggtgaagctgtc aagttcacagtagcaaacacaggttttcgaaaattcagatttttttacttgaaagctcca attcgatcactggatgagctgagcttttctggcagctgtgatattgatgccaggctgggt gacagtgagactctgtctcaaaaaacaaaaaaagaaaagaaagtccagtttacacccaat gtgaccttgcgaacaggtgaacctttggccattttttctaccttcatacttggcaaggat cagacccaagtctttcttcctgaaactccaggggacaaattctgggaggtctccttgagg cacctttcctttggcgtgagagacttgaaaggtctgtatgaggacattagaaaggaactg ctcatatcaacaacagaacttcaggaaatgtctgagtactactttgatggaaaaggaaaa gcctttcgaccaattattgggcagttgcgggggagtttcctgccggcgcggctggagtct ctgattctcagggttcggtggttggaagatgctccagagagacgaggctgcggcggagga ggtggcggcggccgaatcggcaacggcgctagggtggagagaaggcggcatcggcggcgg cggcggctctgcaagcatctctttcacacgaaagcactggggctaccccaagccttggcg cttctcttcctggtgcatgccctgtttgaaaacactatcttttaccattcgtttcctcat cagcttctgccctactctgcgctccacttcacttccttgttaaatcgtctttctctgctg tctatgcgttctaccctcgccttccctccttag >gi568815596f:44101200_44330715|GENSCAN_predicted_peptide_3|103_aa MVAGTYICFQNHTGRMFKAKCGYSSSKWSDFIECHLAKSSQKSDTQCKKTVLKALLVTSK RTITSTNPEDNVLETGEILQYESSNKNTRANIQQQKEPLQKLL >gi568815596f:44101200_44330715|GENSCAN_predicted_CDS_3|312_bp atggtggcgggcacctatatttgtttccaaaaccacacgggaagaatgttcaaggcaaaa tgtggatattcctcatctaaatggtctgatttcatcgaatgtcacttggcaaagtcttca cagaaatcagacacccagtgtaagaaaacggtgttgaaagctctgctggtcacttcaaag agaacaatcacttcaacaaatccagaggataatgtactagaaactggggaaatcttacag tatgaaagctcaaataaaaatactcgtgctaatatacagcaacaaaaagaaccactgcag aaactgttgtaa >gi568815596f:44101200_44330715|GENSCAN_predicted_peptide_4|839_aa MGAFLDKPKTEKHNAHGAGNGLRYGLSSMQGWRVEMEDAHTAVVGIPHGLEDWSFFAVYD GHAGSRVANYCSTHLLEHITTNEDFRAAGKSGSALELSVENVKNGIRTGFLKIDEYMRNF SDLRNGMDRSGSTAVGVMISPKHIYFINCGDSRAVLYRNGQVCFSTQDHKPCNPREKERI QNAGGSVMIQRVNGSLAVSRALGDYDYKCVDGKGPTEQLVSPEPEVYEILRAEEDEFIIL ACDGIWDVMSNEELCEYVKSRLEVSDDLENVCNWVVDTCLHKGSRDNMSIVLVCFSNAPK VSDEAVKKDSELDKHLESRVEEIMEKSGEEGMPDLAHVMRILSAENIPNLPPGGGLAGKR NVIEAVYSRLNPHRESDGASDEAEESGSQGKLVEALRQMRINHRGNYRQLLEEMLTSYRL AKVEGEESPAEPAATATSSNSDAGNPVTMQESHTESESGLAELDSSNEDAGTKMSAASQL QLSYYGRQEAQCESNSCVCHRQPFSRAVLEVLARAIRQEKEIKGIQLGKEEFKLSLFADD MIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIAS KRIKYLGIQLTRDLKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIVKMAILPKRDV TANCPGSWRFEQRIGQNTQQRKNEATKERKQLSRLSARRRPRCNFLRSSRIRVHPTPAAS TMPPKFHPNEIKVVYFTCTGVEVGATSALAPKIGPAGLSPKEVGDDIAKVTGDWKGLRIT VKLTIQNRQAQIEVVPSASALIIKRNYQETERNRKTLNTGGISLFMRSSTLLDRCSTDP >gi568815596f:44101200_44330715|GENSCAN_predicted_CDS_4|2520_bp atgggtgcatttttggataaacccaaaactgaaaaacataatgctcatggtgctgggaat ggtttacgttatggcctgagcagcatgcaaggatggagagtggaaatggaagatgcacac acagctgttgtaggtattcctcacggcttggaagactggtcattttttgcagtttatgat ggtcatgctggatcccgagtggcaaattactgctcaacacatttattagaacacatcact actaacgaagactttagggcagctggaaaatcaggatctgctcttgagctttcagtggaa aatgttaagaatggtatcagaactggatttttgaaaattgatgaatacatgcgtaacttt tcagacctcagaaacgggatggacaggagtggttcaactgcagtgggagttatgatttca cctaagcatatctactttatcaactgtggtgattcacgtgctgttctgtataggaatgga caagtctgcttttctacccaggatcacaaaccttgcaatccaagggaaaaggagcgaatc caaaatgcaggaggcagcgtgatgatacaacgtgttaatggttcattagcagtatctcgt gctctgggggactatgattacaagtgtgttgatggcaagggcccaacagaacaacttgtt tctccagagcctgaggtttatgaaattttaagagcagaagaggatgaatttatcatcttg gcttgtgatgggatctgggatgttatgagtaatgaggagctctgtgaatatgttaaatct aggcttgaggtatctgatgacctggaaaatgtgtgcaattgggtagtggacacttgttta cacaagggaagtcgagataacatgagtattgtactagtttgcttttcaaatgctcccaag gtctcagatgaagcggtgaaaaaagattcagagttggataagcacttggaatcacgggtt gaagagattatggagaagtctggcgaggaaggaatgcctgatcttgcccatgtcatgcgc atcttgtctgcagaaaatatcccaaatttgcctcctgggggaggtcttgctggcaagcgt aatgttattgaagctgtttatagtagactgaatccacatagagaaagtgatggggcctcc gatgaagcagaggaaagtggatcacagggaaaattggtggaagctctcaggcaaatgaga attaatcataggggaaactaccgacaacttctggaggagatgctgactagttacaggcta gctaaagtagagggagaagaaagccctgctgaaccagctgccacagctacttcttcgaac agtgatgctggaaacccagtgacaatgcaggaaagccatactgaatcagaaagtggtctt gctgaattagacagctctaatgaagatgcagggacaaagatgagtgctgcatcgcagcta caactcagttactatggaagacaggaagcacagtgtgagagcaactcctgtgtctgccac aggcagccattctccagagcagtgttggaagttctggccagggcaattaggcaggagaag gaaataaagggtattcaattaggaaaagaggaattcaaattgtccctgtttgcagatgac atgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatac accaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacctaggaatccaacttacaagggatctgaaggacctcttcaaggag aactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattcca tgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaagagggacgtg actgcaaattgtccaggttcttggcgttttgaacaaagaattggacaaaacacgcagcaa agaaagaatgaagcaacaaaagaacgaaagcagctctctcggctttcggctcggaggagg ccaaggtgcaactttcttcggtcgtctcgaatccgggttcatccgacaccagccgcctcc accatgccgccgaagttccaccccaatgagatcaaagtcgtatacttcacgtgcactgga gttgaagtcggtgccacttctgctctggcccccaagattggccccgcgggtctgtctcca aaagaggttggtgatgacattgccaaggtaacgggtgactggaagggcctgaggattaca gtgaaactgaccattcagaacagacaggcccagattgaagtggtaccttctgcctccgcc ctgatcatcaaaaggaactaccaagagacagaaagaaacagaaaaacattaaacacaggg ggaatatcactttttatgagatcgtcaacattgctcgacagatgcagcactgatccttag >gi568815596f:44101200_44330715|GENSCAN_predicted_peptide_5|531_aa MAEDKSKRDSIEMSMKGCQTNNGFVHNEDILEQTPDPGSSTDNLKHSTRGILGSQEPDFK GVQPYAGMPKEVLFQFSGQARYRIPREILFWLTVASVLVLIAATIAIIALSPKCLDWWQE GPMYQIYPRSFKDSNKDGNGDLKGIQDKLDYITALNIKTVWITSFYKSSLKDFRYGVEDF REVDPIFGTMEDFENLVAAIHDKGLKLIIDFIPNHTSDKHIWFQLSRTRTGKYTDYYIWH DCTHENGKTIPPNNWLSVYGNSSWHFDEVRNQCYFHQFMKEQPDLNFRNPDVQEEIKVSI DTHTDFSINGGLVSCTVPVTVSCAVPVTVSCAVPVTVSCAVPVTVSCAVSVTVSCAVSVT ELCTVCDEEVRYLRNRHELSEEAWVGVFPAEESVDRDTLEVLGVVQYHWDVIYRAGSDPM RLERSLGVRSGAPLKALEILRFWLTKGVDGFSLDAVKFLLEAKHLRDEIQVNKTQIPDTV TQYSELYHDFTTTQVGMHDIVRSFRQTMDQYSTEPGRYRLTTAYALISSQA >gi568815596f:44101200_44330715|GENSCAN_predicted_CDS_5|1596_bp atggctgaagataaaagcaagagagactccatcgagatgagtatgaagggatgccagaca aacaacgggtttgtccataatgaagacattctggagcagaccccggatccaggaagctca acagacaacctgaagcacagcaccaggggcatccttggctcccaggagcccgacttcaag ggcgtccagccctatgcggggatgcccaaggaggtgctgttccagttctctggccaggcc cgctaccgcatacctcgggagatcctcttctggctcacagtggcttctgtgctggtgctc atcgcggccaccatagccatcattgccctctctccaaagtgcctagactggtggcaggag gggcccatgtaccagatctacccaaggtctttcaaggacagtaacaaggatgggaacgga gatctgaaaggtattcaagataaactggactacatcacagctttaaatataaaaactgtt tggattacttcattttataaatcgtcccttaaagatttcagatatggtgttgaagatttc cgggaagttgatcccatttttggaacgatggaagattttgagaatctggttgcagccata catgataaaggtttaaaattaatcatcgatttcataccaaaccacacgagtgataaacat atttggtttcaattgagtcggacacggacaggaaaatatactgattattatatctggcat gactgtacccatgaaaatggcaaaaccattccacccaacaactggttaagtgtgtatgga aactccagttggcactttgacgaagtgcgaaaccaatgttattttcatcagtttatgaaa gagcaacctgatttaaatttccgcaatcctgatgttcaagaagaaataaaagtgagtata gatacccacacagacttctccattaatggaggtttagtgagctgtacggtgcctgtgacg gtgagctgtgcggtgcctgtgacggtgagctgtgcggtgcctgtgacggtgagctgtgcg gtgcctgtgacggtgagctgtgcggtgtctgtgacggtgagctgtgcggtgtctgtgact gagctgtgcactgtctgtgacgaggaggtgagatacctgaggaacaggcatgagctaagt gaagaggcatgggtgggagtgtttccagcagaggaaagcgtggatcgtgacacattagag gtgctgggagttgtccagtatcactgggatgtgatatatagagcagggagtgacccgatg agactagagaggtcactgggggtgaggtcaggagcaccactgaaggccctggaaatttta cggttctggctcacaaagggtgttgatggttttagtttggatgctgttaaattcctccta gaagcaaagcacctgagagatgagatccaagtaaataagacccaaatcccggacacggtc acacaatactcggagctgtaccatgacttcaccaccacgcaggtgggaatgcacgacatt gtccgcagcttccggcagaccatggaccaatacagcacggagcccggcagatacaggttg accacggcatatgctctcatttcttcccaggcttag >gi568815596f:44101200_44330715|GENSCAN_predicted_peptide_6|374_aa MHLSINMLPQIAKASIDRSHFRFMGTEAYAESIDRTVMYYGLPFIQEADFPFNNYLSMLD TVSGNSVYEVITSWMENMPEGKWPNWMTVHWLRAEAGSQHPCTVPTSHGGAQAQGLFVTG AVVIRMQFSDLPKIGGPDSSRLTSRLGNQYVNVMNMLLFTLPGTPITYYGEEIGMGNIVA ANLNESYDINTLRSKSPMQWDNSSNAGFSEASNTWLPTNSDYHTVNVDVQKTQPRSALKL YQDLSLLHANELLLNRGWFCHLRNDSHYVVYTRELDGIDRIFIVVLNFGESTLLNLHNMI SGLPAKMRIRLSTNSADKGSKVDTSGIFLDKGEGLIFEHNTKNLLHRQTAFRDRCFVSNR ACYSSVLNILYTSC >gi568815596f:44101200_44330715|GENSCAN_predicted_CDS_6|1125_bp atgcacttatccataaatatgctgcctcaaatagctaaagccagcattgatcggagtcat ttcaggttcatggggactgaagcctatgcagagagtattgacaggaccgtgatgtactat ggattgccatttatccaagaagctgattttcccttcaacaattacctcagcatgctagac actgtttctgggaacagcgtgtatgaggttatcacatcctggatggaaaacatgccagaa ggaaaatggcctaactggatgacagttcactggttaagggcagaggcaggttcccagcat ccttgcacggtgcccacctcacacggaggtgctcaggctcagggtttgtttgttactggt gctgttgttattcgtatgcagttcagcgacttaccaaagattggtggaccagacagttca cggctgacttcgcgtttggggaatcagtatgtcaacgtgatgaacatgcttcttttcaca ctccctggaactcctataacttactatggagaagaaattggaatgggaaatattgtagcc gcaaatctcaatgaaagctatgatattaatacccttcgctcaaagtcaccaatgcagtgg gacaatagttcaaatgctggtttttctgaagctagtaacacctggttacctaccaattca gattaccacactgtgaatgttgatgtccaaaagactcagcccagatcggctttgaagtta tatcaagatttaagtctacttcatgccaatgagctactcctcaacaggggctggttttgc catttgaggaatgacagccactatgttgtgtacacaagagagctggatggcatcgacaga atctttatcgtggttctgaattttggagaatcaacactgttaaatctacataatatgatt tcgggccttcccgctaaaatgagaataaggttaagtaccaattctgccgacaaaggcagt aaagttgatacaagtggcatttttctggacaagggagagggactcatctttgaacacaac acgaagaatctccttcatcgccaaacagctttcagagatagatgctttgtttccaatcga gcatgctattccagtgtactgaacatactgtatacctcgtgttag >gi568815596f:44101200_44330715|GENSCAN_predicted_peptide_7|276_aa DGKLVPMTVFHKTDSEDLQKKPLLVHVYGAYGMDLKMNFRPERRVLVDDGWILAYCHVRG GGELGLQWHADGRLTKKLNGLADLEACIKTLHGQGFSQPSLTTLTAFSAGGVLAGALCNS NPELVRAVTLEAPFLDVLNTMMDTTLPLTLEELEEWGNPSSDEKHKNYIKRYCPYQNIKP QHYPSIHITAYENDERVPLKGIVSYTEKLKEAIAEHAKDTGEGYQTPNIILDIQPGGNHV IEDSHKKITAQIKFLYEELGLDSTSVFEDLKKYLKF >gi568815596f:44101200_44330715|GENSCAN_predicted_CDS_7|831_bp gatggaaaattagtgccaatgactgttttccacaaaactgactctgaggacttgcagaag aaacctctcttggtacatgtatatggagcttatggaatggatttgaaaatgaatttcagg cctgagaggcgggtcctggtggatgatggatggatattagcatactgccatgttcgaggt ggtggtgagttaggcctccagtggcacgctgatggccgcctaactaaaaaactcaatggc cttgctgatttagaggcttgcattaagacgcttcatggccaaggcttttctcagccaagt ctaacaaccctgactgctttcagtgctggaggggtgcttgcaggagcattgtgtaattct aatccagagctggtgagagcggtgactttggaggcacctttcttggatgttctcaacacc atgatggacactacacttcctctgacattagaagaattagaagaatgggggaatccttca tctgatgaaaaacacaagaactacataaaacgttactgtccctatcaaaatattaaacct cagcattatccttcaattcacataacggcatatgaaaacgatgaacgggtacctctgaaa ggaattgtaagttatactgagaaactcaaggaagccatcgcggagcatgctaaggacaca ggtgaaggctatcagacccctaatattattctagatattcagcctggaggcaatcatgta attgaggattctcacaaaaagattacagcccaaattaaattcctgtacgaggaacttgga cttgacagcaccagtgttttcgaggatcttaagaaatacctgaaattctga