GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:04:42 Sequence gi568815581r:38750135_38953107 : 202973 bp : 48.75% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3016 3200 185 0 2 112 84 312 0.975 31.79 1.02 Intr + 5749 5856 108 1 0 115 113 166 0.999 21.20 1.03 Intr + 10297 10474 178 1 1 110 88 197 0.998 21.82 1.04 Intr + 12277 12371 95 0 2 50 84 91 0.787 3.66 1.05 Term + 13985 14033 49 2 1 98 40 88 0.750 1.78 1.06 PlyA + 14071 14076 6 1.05 2.11 PlyA - 14309 14304 6 1.05 2.10 Term - 19637 19557 81 2 0 87 32 164 0.869 8.39 2.09 Intr - 20405 20302 104 0 2 139 84 126 0.995 17.19 2.08 Intr - 21138 20880 259 0 1 78 106 397 0.990 37.54 2.07 Intr - 27666 27553 114 0 0 108 65 175 0.993 17.84 2.06 Intr - 28238 28200 39 2 0 128 91 31 0.983 5.82 2.05 Intr - 29395 29249 147 1 0 102 98 196 0.997 22.43 2.04 Intr - 30470 30318 153 2 0 65 80 273 0.995 24.47 2.03 Intr - 34205 34109 97 1 1 81 80 103 0.921 8.81 2.02 Intr - 36786 36689 98 0 2 116 97 164 0.995 18.91 2.01 Init - 49290 49132 159 0 0 70 43 236 0.688 17.43 2.00 Prom - 50053 50014 40 -4.46 3.19 PlyA - 50341 50336 6 1.05 3.18 Term - 52072 51958 115 0 1 96 48 80 0.959 2.84 3.17 Intr - 52727 52566 162 1 0 70 81 231 0.943 19.69 3.16 Intr - 56261 56163 99 1 0 79 109 18 0.754 2.23 3.15 Intr - 60461 60334 128 2 2 51 83 150 0.998 10.28 3.14 Intr - 62730 62661 70 2 1 81 111 84 0.999 9.18 3.13 Intr - 64963 64727 237 0 0 55 79 232 0.983 15.73 3.12 Intr - 70939 70767 173 1 2 93 108 313 0.983 32.54 3.11 Intr - 71098 71018 81 1 0 98 59 51 0.891 3.03 3.10 Intr - 75384 75219 166 2 1 28 81 174 0.504 10.66 3.09 Intr - 85212 85137 76 1 1 100 42 60 0.024 1.17 3.08 Intr - 85359 85257 103 0 1 43 64 39 0.286 -3.25 3.07 Intr - 91284 91049 236 1 2 47 80 256 0.428 18.11 3.06 Intr - 100341 100228 114 1 0 129 97 87 0.997 14.12 3.05 Intr - 102598 102470 129 2 0 77 96 154 0.999 15.77 3.04 Intr - 102971 102888 84 0 0 109 94 15 0.932 3.99 3.03 Intr - 103801 103661 141 2 0 47 66 76 0.417 1.62 3.02 Intr - 104632 104596 37 1 1 38 75 4 0.404 -8.06 3.01 Init - 105181 105128 54 1 0 85 101 82 0.614 8.87 3.00 Prom - 107785 107746 40 -8.76 4.00 Prom + 108316 108355 40 -8.16 4.01 Init + 114638 114761 124 1 1 76 99 16 0.818 1.83 4.02 Intr + 119242 119386 145 2 1 89 50 157 0.988 11.34 4.03 Intr + 119902 120124 223 1 1 80 105 148 0.956 13.83 4.04 Intr + 127952 128046 95 1 2 94 77 211 0.879 19.36 4.05 Intr + 137390 137587 198 2 0 68 100 82 0.756 5.87 4.06 Intr + 140286 140370 85 0 1 125 94 104 0.997 14.52 4.07 Intr + 142934 143134 201 1 0 74 0 155 0.291 4.78 4.08 Intr + 148278 148385 108 2 0 153 66 199 0.913 24.68 4.09 Intr + 158603 158656 54 1 0 106 37 48 0.217 0.68 4.10 Intr + 164191 164341 151 0 1 100 100 256 0.998 27.74 4.11 Intr + 164909 165012 104 0 2 78 84 157 0.984 14.19 4.12 Intr + 168377 168424 48 1 0 86 99 17 0.574 1.48 4.13 Term + 168471 168644 174 2 0 77 46 573 0.995 49.76 4.14 PlyA + 171610 171615 6 1.05 5.06 PlyA - 174395 174390 6 1.05 5.05 Term - 175660 175506 155 2 2 41 53 137 0.257 3.58 5.04 Intr - 188598 188439 160 0 1 36 105 69 0.838 2.96 5.03 Intr - 192786 192644 143 1 2 47 111 12 0.506 -0.53 5.02 Intr - 193602 193456 147 1 0 79 98 82 0.973 8.51 5.01 Intr - 201555 201447 109 0 1 84 98 61 0.896 6.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:38750135_38953107|GENSCAN_predicted_peptide_1|204_aa SIMSYNGGAVMAMKGKNCVAIAADRRFGIQAQMVTTDFQKIFPMGDRLYIGLAGLATDVQ TVAQRLKFRLNLYELKEGRQIKPYTLMSMVANLLYEKRFGPYYTEPVIAGLDPKTFKPFI CSLDLIGCPMVTDDFVVSGTCAEQMYGMCESLWEPNMDPDHLFETISQAMLNAVDRDAVS GMGVIVHIIEKDKITTRTLKARMD >gi568815581r:38750135_38953107|GENSCAN_predicted_CDS_1|615_bp tctattatgtcctataacggaggggccgtcatggccatgaaggggaagaactgtgtggcc atcgctgcagacaggcgcttcgggatccaggcccagatggtgaccacggacttccagaag atctttcccatgggtgaccggctgtacatcggtctggccgggctcgccactgacgtccag acagttgcccagcgcctcaagttccggctgaacctgtatgagttgaaggaaggtcggcag atcaaaccttataccctcatgagcatggtggccaacctcttgtatgagaaacggtttggc ccttactacactgagccagtcattgccgggttggacccgaagacctttaagcccttcatt tgctctctagacctcatcggctgccccatggtgactgatgactttgtggtcagtggcacc tgcgccgaacaaatgtacggaatgtgtgagtccctctgggagcccaacatggatccggat cacctgtttgaaaccatctcccaagccatgctgaatgctgtggaccgggatgcagtgtca ggcatgggagtcattgtccacatcatcgagaaggacaaaatcaccaccaggacactgaag gcccgaatggactaa >gi568815581r:38750135_38953107|GENSCAN_predicted_peptide_2|416_aa MSSNCTSTTAVAVAPLSASKTKTKKKHFVCQKVKLFRASEPILSVLMWGVNHTINELSNV PVPVMLMPDDFKAYSKIKVDNHLFNKENLPSRFKFKEYCPMVFRNLRERFGIDDQDYQNS VTRSAPINSDSQGRCGTRFLTTYDRRFVIKTVSSEDVAEMHNILKKYHQFIVECHGNTLL PQFLGMYRLTVDGVETYMVVTRNVFSHRLTVHRKYDLKGSTVAREASDKEKAKDLPTFKD NDFLNEGQKLHVGEESKKNFLEKLKRDVEFLAQLKIMDYSLLVGIHDVDRAEQEEMEVEE RAEDEECENDGVGGNLLCSYGTPPDSPGNLLSFPRFFGPGEFDPSVDVYAMKSHESSPKK EVYFMAIIDILTPYDTKKKAAHAAKTVKHGAGAEISTVNPEQYSKRFNEFMSNILT >gi568815581r:38750135_38953107|GENSCAN_predicted_CDS_2|1251_bp atgtcgtccaactgcaccagcaccacggcggtggcggtggcgccgctcagcgccagcaag accaagaccaagaagaagcatttcgtgtgccagaaagtgaagctattccgggccagcgag ccgatcctcagcgtcctgatgtggggggtgaaccacacgatcaatgagctgagcaatgtt cctgttcctgtcatgctaatgccagatgacttcaaagcctacagcaagatcaaggtggac aatcatctcttcaataaggagaacctgcccagccgctttaagtttaaggagtattgcccc atggtgttccgaaaccttcgggagaggtttggaattgatgatcaggattaccagaattca gtgacgcgcagcgcccccatcaacagtgacagccagggtcggtgtggcacgcgtttcctc accacctacgaccggcgctttgtcatcaagactgtgtccagcgaggacgtggcggagatg cacaacatcttaaagaaataccaccagtttatagtggagtgtcatggcaacacgcttttg ccacagttcctgggcatgtaccgcctgaccgtggatggtgtggaaacctacatggtggtt accaggaacgtgttcagccatcggctcactgtgcatcgcaagtatgacctcaagggttct acggttgccagagaagcgagcgacaaggagaaggccaaggacttgccaacattcaaagac aatgacttcctcaatgaagggcagaagctgcatgtgggagaggagagtaaaaagaacttc ctggagaaactgaagcgggacgttgagttcttggcacagctgaagatcatggactacagc ctgctggtgggcatccacgacgtggaccgggcagagcaggaggagatggaggtggaggag cgggcagaggacgaggagtgtgagaatgatggggtgggtggcaacctactctgctcctat ggcacacctccggacagccctggcaacctcctcagctttcctcggttctttggtcctggg gaattcgacccctctgttgacgtctatgccatgaaaagccatgaaagttcccccaagaag gaggtgtatttcatggccatcattgatatcctcacgccatacgatacaaagaagaaagct gcacatgctgccaaaacggtgaaacacggggcaggggccgagatctcgactgtgaaccct gagcagtactccaaacgcttcaacgagtttatgtccaacatcctgacgtag >gi568815581r:38750135_38953107|GENSCAN_predicted_peptide_3|734_aa MAAARLPERLVPRRGRTACLISSSTCTPAESRREDTSYLLRFYLNSHFAERAKSDVVNLL RNKAAPRGGRGRFLLRPRRGGSSGAKFRISLGLPVGAVINCADNTGAKNLYIISVKGIKG RLNRLPAAGVGDMVMATVKKGKPELRKKVHPAVVIRQRKSYRRKDGVFLYFEDNAGVIVN NKGEMKERPSEATVVGMAYLSECRLRLEKGFILDGVAVSTAARAYGRSRPKLWSAIPPYN AQQDYHARSYFQSHVVPPLLRKTDQSLRTLVDCVFSMGNASDYAWLASHTTPRGMGKPRG HSLQQVTGHDHYNADLKPIDGFNGRESGTSGNVATRSSQNETSTAKASTAEPGPLQNLQV TPTSQSATEPCTSSFRDADAMCERLLEENLAIVEDILPSHACLGYGRNLKKSWHPQTLRN VEKVWKAEQKHEAERKKIEELQRELREERAREEMQRYAEDVGAVKKKEEKLDWMYQGPGG MVNRDEYLLGRPIDKYVFEKMEEKEAGCSSETGLLPGSIFAPSGANSLLDMASKIREDPL FIIRKKEEEKKREVLNNPVKMKKIKELLQMSLEKKEKKKKKEKKKKHKKHKHRSSSSDRS SSEDEHSAGRHNSKVNRRETGQTRSPSPKKEVYQRRHAPGYTRKLSAEELERKRQEMMEN AKWREEERLNILKRHAKDEEREQRLEKLDSRDGKFIHRMKLESASTSSLEDRVKRNIYSL QRTSVALEKNFMKR >gi568815581r:38750135_38953107|GENSCAN_predicted_CDS_3|2205_bp atggctgctgcgcggcttcctgagcgactggttcctcggcgagggcgaacagcgtgtcta atatcctcctccacctgcaccccagcagaatctcgtagagaagatacttcctatctcttg cgcttctacctcaactcccacttcgcagagagggctaaaagcgacgtcgtaaacttatta cgtaataaggcagcgcccagaggcggaagaggccggtttttgctccggccacgacgtggt gggtcctctggtgcgaaattccggatttccttgggtcttccggtaggagctgtaatcaat tgtgctgacaacacaggagccaaaaacctgtatatcatctccgtgaaggggatcaaggga cggctgaacagacttcccgctgctggtgtgggtgacatggtgatggccacagtcaagaaa ggcaaaccagagctcagaaaaaaggtacatccagcagtggtcattcgacaacgaaagtca taccgtagaaaagatggcgtgtttctttattttgaagataatgcaggagtcatagtgaac aataaaggcgagatgaaagaaaggccctccgaggctacggtggttgggatggcgtacctg agcgagtgtcgcctgcgactggagaaaggctttatcttggacggggtggctgtgagcacc gctgcccgcgcttatgggcgctctaggcccaagctgtggtcggcgattccgccctacaac gcgcagcaggactaccacgcccgcagctacttccagagtcacgtggttccgccccttttg cggaaaactgatcagagtctcaggacacttgtggattgcgtgttcagcatggggaatgca agtgactacgcctggttggcatcccataccacacccaggggaatggggaagccaagaggg cattctctccagcaggtgactgggcatgaccactacaatgctgatctgaaaccgatcgat gggttcaatggaagagagtctgggacatcaggcaacgtggcgacccgaagcagccaaaat gagacctcgacagcaaaggcctcgacagcggagcccggacctttgcaaaatctgcaagta acgccaacgtcacaaagcgcgaccgagccctgtacgtcatcgttccgcgacgccgacgcg atgtgtgagagattattggaagaaaaccttgccattgttgaagacatcttgccctcacac gcctgccttgggtatggaaggaatctgaagaagagctggcacccgcagaccctcaggaat gtggagaaagtgtggaaggccgagcagaagcatgaggctgagcggaagaagattgaggag cttcagcgggagctgcgagaagagagagcccgggaagagatgcagcgctatgcggaggat gttggggccgtcaagaaaaaagaagaaaagttggactggatgtaccagggtcctggtggg atggtgaaccgtgacgagtacctgctggggcgccccattgacaaatatgtttttgagaag atggaggagaaggaggcaggctgctcttctgaaacaggacttctcccaggctctatcttt gccccatcaggtgccaattcccttcttgacatggccagcaagatccgggaggacccactc ttcatcatcaggaagaaggaggaggagaaaaaacgagaggtattaaataatccagtgaaa atgaagaaaatcaaagaattgttgcaaatgagtctggaaaaaaaggagaagaagaaaaag aaggagaagaaaaagaagcacaagaaacataagcacagaagctcgagtagtgatcgttcc agcagcgaggatgagcacagtgcagggaggcacaactctaaggtgaacaggagagagaca ggccaaactaggagcccatcacctaaaaaagaggtctaccaaaggcgacatgctcccgga tacaccagaaaactctctgcagaggaattagagcgaaaacggcaagagatgatggaaaac gccaaatggagggaggaggagagactgaacatcctcaagaggcatgctaaggatgaggaa cgggagcagaggctagagaagctggactcccgggatgggaagttcatccaccgcatgaag ctggagagtgcatctacttcctccctggaggatcgggtgaagcggaatatctactcttta cagagaacttcggtagctctggagaagaactttatgaaaagatga >gi568815581r:38750135_38953107|GENSCAN_predicted_peptide_4|569_aa MSTSQSPCESICDYVTSHDKSNFTDMIKLNILRCEVILDYPAPGGGSLGAKHCCSCYTVS SGVTEGERNAGEKGVKLNADGARIRGTPGRGRRAEAEASSPAPAAVAAACVVAAAAASRQ LASGNRTRVSSGVPAPAFLGTMNPNCARCGKIVYPTEKVNCLDKFWHKACFHCETCKMTL NMKNYKGYEKKPYCNAPRLLPAADTCVCLSSSSLPLLEELRVWADLGGQDPGEGWGTGTA RIIWIRDSETGPREKGFAKVTQHYPKQSFTMVADTPENLRLKQQSELQSQPPDERGALII PIIQIGKLRFTEAALLVQSPGTGADIKSHDLALEPVALNSADGTGTEPCAPTLQDAPVRY KEEFEKNKGKGFSVVADTPELQRIKKTQDQISNPGLQELLAVQPAGAATMQIKYHEEFEK SRMGPSGGEGMEPERRDSQDGSSYRRPLEQQQPHHIPTSAPVYQQPQQQPVAQSYGGYKE PAAPVSIQRSAPGGGGALLQGRGELEKNAQTQKRYRAVYDYSAADEDEVSFQDGDTIVNV QQIDDGWMYGTVERTGDTGMLPANYVEAI >gi568815581r:38750135_38953107|GENSCAN_predicted_CDS_4|1710_bp atgtctacatcccagtctccttgtgaatctatttgtgactatgttacctctcatgataaa agcaactttacagatatgattaaattaaatatcttgagatgtgaggttatcctggattat ccagcgcctggaggcggctccttgggtgctaagcattgttgcagctgctacactgtgagt agtggtgtaacagaaggagaaaggaatgctggagaaaaaggggtcaagctgaacgcagac ggtgcccgcatccggggaaccccgggaaggggaaggagggcggaggcggaggccagttcc ccagctccagccgccgtcgctgctgcctgtgtagttgcagccgcggccgcctcccgccag ctcgcctcggggaacaggacgcgcgtgagctcaggcgtccccgccccagcttttctcgga accatgaaccccaactgcgcccggtgcggcaagatcgtgtatcccacggagaaggtgaac tgtctggataagttctggcataaagcatgcttccattgcgagacctgcaagatgacactg aacatgaagaactacaagggctacgagaagaagccctactgcaacgcgccacgtctgttg ccagctgcggacacctgtgtttgcctttcctcatcttcccttcctctgctggaggaactg cgagtctgggctgacctgggtggccaggacccaggtgagggctgggggacgggcaccgct aggatcatctggatccgggacagtgaaactggcccaagggaaaaaggatttgccaaggtc acccaacactaccccaagcagtccttcaccatggtggcggacaccccggaaaaccttcgc ctcaagcaacagagtgagctccagagtcagccacctgatgaaagaggggctcttattatc cccatcatacagatagggaaactgaggtttacagaggctgccctgcttgtgcagagccca gggacaggggcagacatcaagtcccatgacctggccctggagcccgttgctctgaacagt gctgatggcacaggcaccgagccctgtgctcccaccctgcaggatgcaccagtgcgctac aaggaggagtttgagaagaacaagggcaaaggtttcagcgtagtggcagacacgcccgag ctccagagaatcaagaagacccaggaccagatcagtaaccccgggctgcaggaactattg gctgttcagccagcaggagcagcaaccatgcagataaaataccatgaggagtttgagaag agccgcatgggccctagcgggggcgagggcatggagccagagcgtcgggattcacaggac ggcagcagctaccggcggcccctggagcagcagcagcctcaccacatcccgaccagtgcc ccggtttaccagcagccccagcagcagccggtggcccagtcctatggtggctacaaggag cctgcagccccagtctccatacagcgcagcgccccaggtggtggcggggctctgctccag ggtcgtggagagttagaaaaaaatgctcagacccagaagcggtaccgcgcggtgtatgac tacagcgccgccgacgaggacgaggtctccttccaggacggggacaccatcgtcaacgtg cagcagatcgacgacggctggatgtacgggacggtggagcgcaccggcgacacggggatg ctgccggccaactacgtggaggccatctga >gi568815581r:38750135_38953107|GENSCAN_predicted_peptide_5|237_aa TLTAGWDELECHRVYNFLCELTNLCRKIQMAVCSKPGQVVWQEMIEEPTDEFSLKGLADA IKLLYDASTKEWTADDVISLVDELSVVPREWLLENNARLLMLSGNNICFSFMASKAVNGR TIELARLVVFLALVCEKELYCMDWTVKMMQKVCKVFSTPVERKNFLQNVANAFACVIMEM LQSIMSGSGRSSRPLEQIDNVGQNDQAYDGKEHQDENIQHCGFLGRPSPWILVNREL >gi568815581r:38750135_38953107|GENSCAN_predicted_CDS_5|714_bp accttaacagcaggttgggatgaacttgagtgccatcgcgtttataatttcttatgcgaa ctgactaatctctgccgcaagatacaaatggctgtctgcagcaaaccaggacaggtggtt tggcaggaaatgatagaagaacctacagatgaattcagtctgaaaggtttggctgatgcc attaagttactatatgacgctagcactaaagagtggacagcagatgatgttatcagtctt gtagatgaactatcagtggttccccgtgagtggcttctagagaataatgcacgtctccta atgctaagtggaaacaacatctgtttcagtttcatggctagtaaagctgtgaatggacgc accattgaactggcaaggctcgtagtctttttggctttggtgtgtgagaaagaactgtac tgcatggattggacagttaaaatgatgcaaaaagtctgcaaagtctttagcactccagtg gaaagaaagaacttcctgcagaatgtggcaaatgcatttgcatgtgttataatggaaatg ctgcaatcaattatgtctggctccggccgcagctccaggccactagagcagatagataat gtaggacagaatgaccaggcctatgatggcaaagaacatcaggatgaaaatatccagcat tgtggattcctaggccgaccaagcccctggattctggtgaaccgtgagctgtga