GENSCAN 1.0    Date run: 4-Nov-116    Time: 20:31:17

Sequence gi568815581f:38770190_39018775 : 248586 bp : 47.45% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.09 Intr -    350    247  104  0  2  139   84   126 0.995  17.19
 1.08 Intr -   1083    825  259  0  1   78  106   397 0.990  37.54
 1.07 Intr -   7611   7498  114  0  0  108   65   175 0.993  17.84
 1.06 Intr -   8183   8145   39  2  0  128   91    31 0.983   5.82
 1.05 Intr -   9340   9194  147  1  0  102   98   196 0.997  22.43
 1.04 Intr -  10415  10263  153  2  0   65   80   273 0.995  24.47
 1.03 Intr -  14150  14054   97  1  1   81   80   103 0.921   8.81
 1.02 Intr -  16731  16634   98  0  2  116   97   164 0.995  18.91
 1.01 Init -  29235  29077  159  0  0   70   43   236 0.688  17.43
 1.00 Prom -  29998  29959   40                              -4.46

 2.19 PlyA -  30286  30281    6                               1.05
 2.18 Term -  32017  31903  115  0  1   96   48    80 0.959   2.84
 2.17 Intr -  32672  32511  162  1  0   70   81   231 0.943  19.69
 2.16 Intr -  36206  36108   99  1  0   79  109    18 0.754   2.23
 2.15 Intr -  40406  40279  128  2  2   51   83   150 0.998  10.28
 2.14 Intr -  42675  42606   70  2  1   81  111    84 0.999   9.18
 2.13 Intr -  44908  44672  237  0  0   55   79   232 0.983  15.73
 2.12 Intr -  50884  50712  173  1  2   93  108   313 0.983  32.54
 2.11 Intr -  51043  50963   81  1  0   98   59    51 0.891   3.03
 2.10 Intr -  55329  55164  166  2  1   28   81   174 0.504  10.66
 2.09 Intr -  65157  65082   76  1  1  100   42    60 0.024   1.17
 2.08 Intr -  65304  65202  103  0  1   43   64    39 0.286  -3.25
 2.07 Intr -  71229  70994  236  1  2   47   80   256 0.428  18.11
 2.06 Intr -  80286  80173  114  1  0  129   97    87 0.997  14.12
 2.05 Intr -  82543  82415  129  2  0   77   96   154 0.999  15.77
 2.04 Intr -  82916  82833   84  0  0  109   94    15 0.932   3.99
 2.03 Intr -  83746  83606  141  2  0   47   66    76 0.417   1.62
 2.02 Intr -  84577  84541   37  1  1   38   75     4 0.404  -8.06
 2.01 Init -  85126  85073   54  1  0   85  101    82 0.614   8.87
 2.00 Prom -  87730  87691   40                              -8.76

 3.00 Prom +  88261  88300   40                              -8.16
 3.01 Init +  94583  94706  124  1  1   76   99    16 0.818   1.83
 3.02 Intr +  99187  99331  145  2  1   89   50   157 0.988  11.34
 3.03 Intr +  99847 100069  223  1  1   80  105   148 0.956  13.83
 3.04 Intr + 107897 107991   95  1  2   94   77   211 0.879  19.36
 3.05 Intr + 117335 117532  198  2  0   68  100    82 0.756   5.87
 3.06 Intr + 120231 120315   85  0  1  125   94   104 0.997  14.52
 3.07 Intr + 122879 123079  201  1  0   74    0   155 0.291   4.78
 3.08 Intr + 128223 128330  108  2  0  153   66   199 0.913  24.68
 3.09 Intr + 138548 138601   54  1  0  106   37    48 0.217   0.68
 3.10 Intr + 144136 144286  151  0  1  100  100   256 0.998  27.74
 3.11 Intr + 144854 144957  104  0  2   78   84   157 0.984  14.19
 3.12 Intr + 148322 148369   48  1  0   86   99    17 0.574   1.48
 3.13 Term + 148416 148589  174  2  0   77   46   573 0.995  49.76
 3.14 PlyA + 151555 151560    6                               1.05

 4.07 PlyA - 154340 154335    6                               1.05
 4.06 Term - 155605 155451  155  2  2   41   53   137 0.257   3.58
 4.05 Intr - 168543 168384  160  0  1   36  105    69 0.838   2.96
 4.04 Intr - 172731 172589  143  1  2   47  111    12 0.506  -0.53
 4.03 Intr - 173547 173401  147  1  0   79   98    82 0.979   8.51
 4.02 Intr - 181500 181392  109  0  1   84   98    61 0.908   6.56
 4.01 Init - 182506 182501    6  1  0   71   81     0 0.538  -1.35
 4.00 Prom - 201889 201850   40                              -3.06

 5.00 Prom + 206639 206678   40                              -1.76
 5.01 Init + 211316 211410   95  1  2   97   48    17 0.097  -1.55
 5.02 Intr + 226105 226303  199  1  1   48   72   118 0.179   5.55
 5.03 Term + 233403 233501   99  2  0   81   42    71 0.540  -0.07
 5.04 PlyA + 234226 234231    6                               1.05

 6.02 PlyA - 234239 234234    6                               1.05
 6.01 Term - 242840 242766   75  2  0   83   49   117 0.599   5.14

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:38770190_39018775|GENSCAN_predicted_peptide_1|390_aa MSSNCTSTTAVAVAPLSASKTKTKKKHFVCQKVKLFRASEPILSVLMWGVNHTINELSNV PVPVMLMPDDFKAYSKIKVDNHLFNKENLPSRFKFKEYCPMVFRNLRERFGIDDQDYQNS VTRSAPINSDSQGRCGTRFLTTYDRRFVIKTVSSEDVAEMHNILKKYHQFIVECHGNTLL PQFLGMYRLTVDGVETYMVVTRNVFSHRLTVHRKYDLKGSTVAREASDKEKAKDLPTFKD NDFLNEGQKLHVGEESKKNFLEKLKRDVEFLAQLKIMDYSLLVGIHDVDRAEQEEMEVEE RAEDEECENDGVGGNLLCSYGTPPDSPGNLLSFPRFFGPGEFDPSVDVYAMKSHESSPKK EVYFMAIIDILTPYDTKKKAAHAAKTVKHG >gi568815581f:38770190_39018775|GENSCAN_predicted_CDS_1|1170_bp atgtcgtccaactgcaccagcaccacggcggtggcggtggcgccgctcagcgccagcaag accaagaccaagaagaagcatttcgtgtgccagaaagtgaagctattccgggccagcgag ccgatcctcagcgtcctgatgtggggggtgaaccacacgatcaatgagctgagcaatgtt cctgttcctgtcatgctaatgccagatgacttcaaagcctacagcaagatcaaggtggac aatcatctcttcaataaggagaacctgcccagccgctttaagtttaaggagtattgcccc atggtgttccgaaaccttcgggagaggtttggaattgatgatcaggattaccagaattca gtgacgcgcagcgcccccatcaacagtgacagccagggtcggtgtggcacgcgtttcctc accacctacgaccggcgctttgtcatcaagactgtgtccagcgaggacgtggcggagatg cacaacatcttaaagaaataccaccagtttatagtggagtgtcatggcaacacgcttttg ccacagttcctgggcatgtaccgcctgaccgtggatggtgtggaaacctacatggtggtt accaggaacgtgttcagccatcggctcactgtgcatcgcaagtatgacctcaagggttct acggttgccagagaagcgagcgacaaggagaaggccaaggacttgccaacattcaaagac aatgacttcctcaatgaagggcagaagctgcatgtgggagaggagagtaaaaagaacttc ctggagaaactgaagcgggacgttgagttcttggcacagctgaagatcatggactacagc ctgctggtgggcatccacgacgtggaccgggcagagcaggaggagatggaggtggaggag cgggcagaggacgaggagtgtgagaatgatggggtgggtggcaacctactctgctcctat ggcacacctccggacagccctggcaacctcctcagctttcctcggttctttggtcctggg gaattcgacccctctgttgacgtctatgccatgaaaagccatgaaagttcccccaagaag gaggtgtatttcatggccatcattgatatcctcacgccatacgatacaaagaagaaagct gcacatgctgccaaaacggtgaaacacggg >gi568815581f:38770190_39018775|GENSCAN_predicted_peptide_2|734_aa MAAARLPERLVPRRGRTACLISSSTCTPAESRREDTSYLLRFYLNSHFAERAKSDVVNLL 
RNKAAPRGGRGRFLLRPRRGGSSGAKFRISLGLPVGAVINCADNTGAKNLYIISVKGIKG RLNRLPAAGVGDMVMATVKKGKPELRKKVHPAVVIRQRKSYRRKDGVFLYFEDNAGVIVN NKGEMKERPSEATVVGMAYLSECRLRLEKGFILDGVAVSTAARAYGRSRPKLWSAIPPYN AQQDYHARSYFQSHVVPPLLRKTDQSLRTLVDCVFSMGNASDYAWLASHTTPRGMGKPRG HSLQQVTGHDHYNADLKPIDGFNGRESGTSGNVATRSSQNETSTAKASTAEPGPLQNLQV TPTSQSATEPCTSSFRDADAMCERLLEENLAIVEDILPSHACLGYGRNLKKSWHPQTLRN VEKVWKAEQKHEAERKKIEELQRELREERAREEMQRYAEDVGAVKKKEEKLDWMYQGPGG MVNRDEYLLGRPIDKYVFEKMEEKEAGCSSETGLLPGSIFAPSGANSLLDMASKIREDPL FIIRKKEEEKKREVLNNPVKMKKIKELLQMSLEKKEKKKKKEKKKKHKKHKHRSSSSDRS SSEDEHSAGRHNSKVNRRETGQTRSPSPKKEVYQRRHAPGYTRKLSAEELERKRQEMMEN AKWREEERLNILKRHAKDEEREQRLEKLDSRDGKFIHRMKLESASTSSLEDRVKRNIYSL QRTSVALEKNFMKR >gi568815581f:38770190_39018775|GENSCAN_predicted_CDS_2|2205_bp atggctgctgcgcggcttcctgagcgactggttcctcggcgagggcgaacagcgtgtcta atatcctcctccacctgcaccccagcagaatctcgtagagaagatacttcctatctcttg cgcttctacctcaactcccacttcgcagagagggctaaaagcgacgtcgtaaacttatta cgtaataaggcagcgcccagaggcggaagaggccggtttttgctccggccacgacgtggt gggtcctctggtgcgaaattccggatttccttgggtcttccggtaggagctgtaatcaat tgtgctgacaacacaggagccaaaaacctgtatatcatctccgtgaaggggatcaaggga cggctgaacagacttcccgctgctggtgtgggtgacatggtgatggccacagtcaagaaa ggcaaaccagagctcagaaaaaaggtacatccagcagtggtcattcgacaacgaaagtca taccgtagaaaagatggcgtgtttctttattttgaagataatgcaggagtcatagtgaac aataaaggcgagatgaaagaaaggccctccgaggctacggtggttgggatggcgtacctg agcgagtgtcgcctgcgactggagaaaggctttatcttggacggggtggctgtgagcacc gctgcccgcgcttatgggcgctctaggcccaagctgtggtcggcgattccgccctacaac gcgcagcaggactaccacgcccgcagctacttccagagtcacgtggttccgccccttttg cggaaaactgatcagagtctcaggacacttgtggattgcgtgttcagcatggggaatgca agtgactacgcctggttggcatcccataccacacccaggggaatggggaagccaagaggg cattctctccagcaggtgactgggcatgaccactacaatgctgatctgaaaccgatcgat gggttcaatggaagagagtctgggacatcaggcaacgtggcgacccgaagcagccaaaat gagacctcgacagcaaaggcctcgacagcggagcccggacctttgcaaaatctgcaagta acgccaacgtcacaaagcgcgaccgagccctgtacgtcatcgttccgcgacgccgacgcg atgtgtgagagattattggaagaaaaccttgccattgttgaagacatcttgccctcacac 
gcctgccttgggtatggaaggaatctgaagaagagctggcacccgcagaccctcaggaat gtggagaaagtgtggaaggccgagcagaagcatgaggctgagcggaagaagattgaggag cttcagcgggagctgcgagaagagagagcccgggaagagatgcagcgctatgcggaggat gttggggccgtcaagaaaaaagaagaaaagttggactggatgtaccagggtcctggtggg atggtgaaccgtgacgagtacctgctggggcgccccattgacaaatatgtttttgagaag atggaggagaaggaggcaggctgctcttctgaaacaggacttctcccaggctctatcttt gccccatcaggtgccaattcccttcttgacatggccagcaagatccgggaggacccactc ttcatcatcaggaagaaggaggaggagaaaaaacgagaggtattaaataatccagtgaaa atgaagaaaatcaaagaattgttgcaaatgagtctggaaaaaaaggagaagaagaaaaag aaggagaagaaaaagaagcacaagaaacataagcacagaagctcgagtagtgatcgttcc agcagcgaggatgagcacagtgcagggaggcacaactctaaggtgaacaggagagagaca ggccaaactaggagcccatcacctaaaaaagaggtctaccaaaggcgacatgctcccgga tacaccagaaaactctctgcagaggaattagagcgaaaacggcaagagatgatggaaaac gccaaatggagggaggaggagagactgaacatcctcaagaggcatgctaaggatgaggaa cgggagcagaggctagagaagctggactcccgggatgggaagttcatccaccgcatgaag ctggagagtgcatctacttcctccctggaggatcgggtgaagcggaatatctactcttta cagagaacttcggtagctctggagaagaactttatgaaaagatga >gi568815581f:38770190_39018775|GENSCAN_predicted_peptide_3|569_aa MSTSQSPCESICDYVTSHDKSNFTDMIKLNILRCEVILDYPAPGGGSLGAKHCCSCYTVS SGVTEGERNAGEKGVKLNADGARIRGTPGRGRRAEAEASSPAPAAVAAACVVAAAAASRQ LASGNRTRVSSGVPAPAFLGTMNPNCARCGKIVYPTEKVNCLDKFWHKACFHCETCKMTL NMKNYKGYEKKPYCNAPRLLPAADTCVCLSSSSLPLLEELRVWADLGGQDPGEGWGTGTA RIIWIRDSETGPREKGFAKVTQHYPKQSFTMVADTPENLRLKQQSELQSQPPDERGALII PIIQIGKLRFTEAALLVQSPGTGADIKSHDLALEPVALNSADGTGTEPCAPTLQDAPVRY KEEFEKNKGKGFSVVADTPELQRIKKTQDQISNPGLQELLAVQPAGAATMQIKYHEEFEK SRMGPSGGEGMEPERRDSQDGSSYRRPLEQQQPHHIPTSAPVYQQPQQQPVAQSYGGYKE PAAPVSIQRSAPGGGGALLQGRGELEKNAQTQKRYRAVYDYSAADEDEVSFQDGDTIVNV QQIDDGWMYGTVERTGDTGMLPANYVEAI >gi568815581f:38770190_39018775|GENSCAN_predicted_CDS_3|1710_bp atgtctacatcccagtctccttgtgaatctatttgtgactatgttacctctcatgataaa agcaactttacagatatgattaaattaaatatcttgagatgtgaggttatcctggattat ccagcgcctggaggcggctccttgggtgctaagcattgttgcagctgctacactgtgagt agtggtgtaacagaaggagaaaggaatgctggagaaaaaggggtcaagctgaacgcagac 
ggtgcccgcatccggggaaccccgggaaggggaaggagggcggaggcggaggccagttcc ccagctccagccgccgtcgctgctgcctgtgtagttgcagccgcggccgcctcccgccag ctcgcctcggggaacaggacgcgcgtgagctcaggcgtccccgccccagcttttctcgga accatgaaccccaactgcgcccggtgcggcaagatcgtgtatcccacggagaaggtgaac tgtctggataagttctggcataaagcatgcttccattgcgagacctgcaagatgacactg aacatgaagaactacaagggctacgagaagaagccctactgcaacgcgccacgtctgttg ccagctgcggacacctgtgtttgcctttcctcatcttcccttcctctgctggaggaactg cgagtctgggctgacctgggtggccaggacccaggtgagggctgggggacgggcaccgct aggatcatctggatccgggacagtgaaactggcccaagggaaaaaggatttgccaaggtc acccaacactaccccaagcagtccttcaccatggtggcggacaccccggaaaaccttcgc ctcaagcaacagagtgagctccagagtcagccacctgatgaaagaggggctcttattatc cccatcatacagatagggaaactgaggtttacagaggctgccctgcttgtgcagagccca gggacaggggcagacatcaagtcccatgacctggccctggagcccgttgctctgaacagt gctgatggcacaggcaccgagccctgtgctcccaccctgcaggatgcaccagtgcgctac aaggaggagtttgagaagaacaagggcaaaggtttcagcgtagtggcagacacgcccgag ctccagagaatcaagaagacccaggaccagatcagtaaccccgggctgcaggaactattg gctgttcagccagcaggagcagcaaccatgcagataaaataccatgaggagtttgagaag agccgcatgggccctagcgggggcgagggcatggagccagagcgtcgggattcacaggac ggcagcagctaccggcggcccctggagcagcagcagcctcaccacatcccgaccagtgcc ccggtttaccagcagccccagcagcagccggtggcccagtcctatggtggctacaaggag cctgcagccccagtctccatacagcgcagcgccccaggtggtggcggggctctgctccag ggtcgtggagagttagaaaaaaatgctcagacccagaagcggtaccgcgcggtgtatgac tacagcgccgccgacgaggacgaggtctccttccaggacggggacaccatcgtcaacgtg cagcagatcgacgacggctggatgtacgggacggtggagcgcaccggcgacacggggatg ctgccggccaactacgtggaggccatctga >gi568815581f:38770190_39018775|GENSCAN_predicted_peptide_4|239_aa MVTLTAGWDELECHRVYNFLCELTNLCRKIQMAVCSKPGQVVWQEMIEEPTDEFSLKGLA DAIKLLYDASTKEWTADDVISLVDELSVVPREWLLENNARLLMLSGNNICFSFMASKAVN GRTIELARLVVFLALVCEKELYCMDWTVKMMQKVCKVFSTPVERKNFLQNVANAFACVIM EMLQSIMSGSGRSSRPLEQIDNVGQNDQAYDGKEHQDENIQHCGFLGRPSPWILVNREL >gi568815581f:38770190_39018775|GENSCAN_predicted_CDS_4|720_bp atggttaccttaacagcaggttgggatgaacttgagtgccatcgcgtttataatttctta tgcgaactgactaatctctgccgcaagatacaaatggctgtctgcagcaaaccaggacag 
gtggtttggcaggaaatgatagaagaacctacagatgaattcagtctgaaaggtttggct gatgccattaagttactatatgacgctagcactaaagagtggacagcagatgatgttatc agtcttgtagatgaactatcagtggttccccgtgagtggcttctagagaataatgcacgt ctcctaatgctaagtggaaacaacatctgtttcagtttcatggctagtaaagctgtgaat ggacgcaccattgaactggcaaggctcgtagtctttttggctttggtgtgtgagaaagaa ctgtactgcatggattggacagttaaaatgatgcaaaaagtctgcaaagtctttagcact ccagtggaaagaaagaacttcctgcagaatgtggcaaatgcatttgcatgtgttataatg gaaatgctgcaatcaattatgtctggctccggccgcagctccaggccactagagcagata gataatgtaggacagaatgaccaggcctatgatggcaaagaacatcaggatgaaaatatc cagcattgtggattcctaggccgaccaagcccctggattctggtgaaccgtgagctgtga >gi568815581f:38770190_39018775|GENSCAN_predicted_peptide_5|130_aa MAPACKFKLQCDTLTTLNAEGCGATGTLIHCWFHYAQTEEELSPCGSCFPNEDWGPAQSF PLPLAEDVCPHPFLDECEGQISRGYLEQLRGQVAGLFQDCHQALFADVFPDCRLALSSFL SCRSMYPAAG >gi568815581f:38770190_39018775|GENSCAN_predicted_CDS_5|393_bp atggcgcctgcctgcaaatttaaactacaatgtgatacactgacaacactaaatgctgaa ggatgtggagcaacaggaaccctcattcattgctggtttcattatgcacagacagaggag gagctcagtccttgtggcagctgctttcctaatgaagactggggccctgctcagtccttc ccactccctcttgctgaggatgtatgtccccacccttttcttgatgaatgtgagggccag atctcccgtggatatttggaacagctccgcgggcaagttgctggcttgtttcaggactgc catcaggccctctttgcagatgtcttcccagactgccgtttagcactgagctcctttctg agctgtcggtccatgtatccagctgctggctag >gi568815581f:38770190_39018775|GENSCAN_predicted_peptide_6|24_aa VRKPWFKAEITNCWTTDLHEFTAC >gi568815581f:38770190_39018775|GENSCAN_predicted_CDS_6|75_bp gtgaggaagccatggtttaaagcagagatcaccaactgctggaccacagacctgcacgag ttcacggcctgttag