GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:30:23 Sequence gi568815590f:117047030_117272678 : 225649 bp : 38.33% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 381 521 141 2 0 51 110 57 0.760 3.83 1.02 Term + 6406 6558 153 0 0 75 41 120 0.740 2.94 1.03 PlyA + 7721 7726 6 1.05 2.00 Prom + 14430 14469 40 -4.55 2.01 Init + 41414 41456 43 1 1 90 88 46 0.512 5.43 2.02 Intr + 41683 41818 136 2 1 60 74 68 0.422 1.31 2.03 Intr + 46772 46848 77 2 2 82 73 73 0.549 3.44 2.04 Intr + 47201 47352 152 0 2 68 5 128 0.269 1.46 2.05 Intr + 47553 47786 234 2 0 35 109 152 0.547 8.96 2.06 Intr + 48157 48271 115 0 1 48 18 76 0.055 -4.30 2.07 Intr + 53482 53580 99 2 0 72 53 71 0.151 1.16 2.08 Intr + 54650 54873 224 0 2 28 67 152 0.488 4.02 2.09 Term + 61997 62200 204 1 0 70 42 163 0.578 6.29 2.10 PlyA + 63076 63081 6 1.05 3.03 PlyA - 64192 64187 6 1.05 3.02 Term - 64902 64603 300 0 0 70 48 148 0.587 3.24 3.01 Init - 70417 70352 66 1 0 73 49 70 0.478 2.72 3.00 Prom - 71859 71820 40 -6.45 4.00 Prom + 72770 72809 40 -5.25 4.01 Init + 85071 85141 71 2 2 62 87 40 0.864 1.87 4.02 Intr + 86727 86894 168 0 0 109 52 100 0.631 6.64 4.03 Intr + 99925 100124 200 1 2 44 92 116 0.184 5.67 4.04 Intr + 105915 106061 147 1 0 94 96 104 0.990 11.09 4.05 Intr + 110662 110815 154 2 1 50 75 196 0.994 12.71 4.06 Intr + 114709 114859 151 1 1 103 113 90 0.998 12.24 4.07 Intr + 116396 116501 106 1 1 108 97 12 0.958 2.97 4.08 Intr + 124005 124139 135 1 0 108 82 26 0.788 3.62 4.09 Term + 125507 125652 146 0 2 73 36 172 0.739 7.59 4.10 PlyA + 126092 126097 6 1.05 5.00 Prom + 134505 134544 40 -3.65 5.01 Init + 135624 135813 190 2 1 76 115 103 0.946 11.02 5.02 Term + 139924 140126 203 2 2 17 43 104 0.363 -4.63 5.03 PlyA + 140180 140185 6 1.05 6.00 Prom + 142238 142277 40 -5.85 6.01 Init + 145790 146141 352 1 1 71 77 215 0.295 16.07 6.02 Term + 147722 148365 644 0 2 35 43 391 0.341 22.64 6.03 PlyA + 148553 148558 6 1.05 7.06 PlyA - 150096 150091 6 1.05 7.05 Term - 152265 152140 126 0 0 31 44 125 0.175 -0.30 7.04 Intr - 162539 162450 90 2 0 49 88 99 0.480 5.27 7.03 Intr - 168467 168330 138 2 0 40 76 101 0.492 3.84 7.02 Intr - 170528 170489 40 1 1 115 100 12 0.341 2.41 7.01 Init - 175005 174914 92 0 2 83 19 86 0.325 1.21 7.00 Prom - 175872 175833 40 -4.65 8.00 Prom + 183536 183575 40 -3.45 8.01 Init + 188568 188615 48 2 0 83 110 35 0.441 6.20 8.02 Intr + 192911 192959 49 1 1 103 81 22 0.098 0.33 8.03 Intr + 195781 195888 108 2 0 61 119 29 0.055 2.64 8.04 Intr + 198147 198330 184 1 1 -72 54 226 0.023 1.12 8.05 Term + 207444 207639 196 0 1 54 49 96 0.047 -1.80 8.06 PlyA + 207700 207705 6 1.05 9.03 PlyA - 208233 208228 6 1.05 9.02 Term - 209382 208874 509 1 2 81 50 165 0.297 5.58 9.01 Init - 211811 211721 91 2 1 70 107 24 0.302 3.20 9.00 Prom - 213699 213660 40 -1.85 10.04 PlyA - 214514 214509 6 1.05 10.03 Term - 215535 215396 140 1 2 49 41 132 0.595 1.74 10.02 Intr - 215843 215706 138 0 0 58 16 108 0.307 0.11 10.01 Init - 216537 216468 70 0 1 83 51 83 0.662 5.36 10.00 Prom - 218641 218602 40 -6.95 11.03 PlyA - 219252 219247 6 1.05 11.02 Term - 220429 220231 199 0 1 36 42 149 0.756 0.99 11.01 Init - 220830 220568 263 0 2 77 75 175 0.647 11.69 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:117047030_117272678|GENSCAN_predicted_peptide_1|97_aa REWNINNICKNAFESDLVQEVQRLITPLRIIIEYSFKVTDINSQVQQMVQTENRGRVAQI HILGCICMDGRTISWMSEPEPEPQPSYYRNVYYNSLY >gi568815590f:117047030_117272678|GENSCAN_predicted_CDS_1|294_bp agggaatggaatataaataatatttgcaaaaatgcttttgagtctgacctggtccaggag gttcagaggctgataactcctctgaggattattattgagtacagtttcaaggtgactgac atcaactcccaagtccaacagatggtgcagacggaaaacagaggccgtgttgctcagatc catatcctgggatgcatctgtatggatggaagaaccatttcctggatgtctgagcccgag cccgagccccagcccagctattataggaatgtctattataatagtctttattag >gi568815590f:117047030_117272678|GENSCAN_predicted_peptide_2|427_aa MHINNPNTCRELLLRNTHVFPSTPCARRPYDFTDGDGILLSPDVGGRACIWLTVLDDVGW LCLARATDLNPMPAKGKPGAKWQGVSLLAPPFGRSRVLVLCPGRMKYTDKWRVSKVKRSF IEQQLRGEQLRGDPQWRGISVCQLVHEWPWVGLEKAPQVPTPVCGTSSLAPRLQALPGLK VGLYLGPVPFHPGACLPPAVIHGVQAVHAMGHLQSPGCTSPTVASVIAAAALDGPLLSSI LVGFIHSMVCSTRHAWVLLEQAMEYWFVLETRSQYGYWRHLQVFYSGHNLISCQCGQNKS RQKNVERLDWFSLLAYIFLPCWMLPTLEYRTSSSSALGFRLTSLLFSLQMAYCGTSPCDR PGKLTEKEGVLTDGSSKSHEVEFYWIDLHHIPVTGPITVARGIGHTDWPGLCHAPSLEGS MCWDSPV >gi568815590f:117047030_117272678|GENSCAN_predicted_CDS_2|1284_bp atgcacattaataacccaaatacctgccgagagctgctacttcgtaatacacatgttttt ccaagtaccccgtgtgcacgtagaccctatgatttcacagatggtgatgggatactgttg agtccagatgttggaggcagagcttgtatttggctgactgtattagatgacgttggttgg ctgtgcttggctcgtgctactgacctgaatcccatgcctgccaagggaaaaccaggtgca aagtggcaaggggtatctcttttagccccaccatttggcaggtcccgagttcttgtcctg tgtccaggaagaatgaagtacacagacaagtggagggtgagcaaggtgaagaggagtttt attgagcagcagctcagaggagaacagctcagaggagacccgcagtggaggggaataagt gtgtgccagttggtccatgagtggccatgggtgggcctggaaaaagcaccgcaagttccc actcctgtttgtgggaccagcagcctggcccccaggcttcaggctctccctggcttgaag gtggggctttacctgggacccgtccctttccacccaggagcctgtctgcctcctgctgtc attcatggcgtccaggctgttcatgccatggggcacctgcagagccctggctgtacctct ccgactgtagccagtgtcatagcagcagctgctctagatgggccgctgctgtcatcaata ctagtcggttttattcactcaatggtttgttctactaggcatgcatgggtgctcttagag caggcaatggagtactggtttgtgctggaaacaagaagccagtatggctactggaggcac ttgcaggtcttctacagtgggcacaatctaatcagctgccagtgtggccagaataaaagc aggcagaagaatgtggaaagactagactggtttagtcttctggcctacatctttctcccc tgctggatgcttcctacccttgaatatcggacttcaagttcttcagctttgggatttaga ctgacttccttgcttttcagcttgcagatggcctattgcgggacctcaccttgtgatcgt cctggaaaactcacagaaaaagaaggcgtcttgactgatggttccagtaaaagtcacgag gttgagttttactggattgatctgcatcatatacccgtaactggaccaatcacagtggcc agagggattggacacactgattggccaggcctgtgtcatgcacctagccttgaaggcagc atgtgttgggacagccccgtgtaa >gi568815590f:117047030_117272678|GENSCAN_predicted_peptide_3|121_aa MSLVINKSDPLGPDSVAEALTETFPDIKQGGHFENQKLSIYKEEEKMPQPGAELVLCTVT SNHVIGELVSHYLSAVASSRSTSWHQKDAPFGAARWPPAPQTYIVVALQRQWKDSASLPL V >gi568815590f:117047030_117272678|GENSCAN_predicted_CDS_3|366_bp atgagtttagtaattaataaatcggatcctctgggacctgactctgtggctgaagctctt actgagacatttccagatattaaacaaggaggacattttgagaatcaaaaactgtccatt tataaagaggaagaaaaaatgcctcaaccaggagcagagctggttttatgcacagttacc tcaaaccatgtaattggggagttagtttcccactatctctcagctgtggcttcctccagg tcgacttcatggcatcaaaaagatgctccctttggggcagcaagatggcctccagcccct cagacctatattgtagtagctttgcagcgccagtggaaagacagtgcctctttaccacta gtttga >gi568815590f:117047030_117272678|GENSCAN_predicted_peptide_4|425_aa MVACTHWQHEVFLKYEFKYGEHKLITDYFVRLRVRIYEKKYIGFYSANSRPRISMPSLAR KRLPQSLLEAPNCVSKVKDHVELQQKPVNKDQCPRERPEELESGGMYHCHSGSKPTEKGA NEYAYAKWKLCSASAICFIFMIAEVVGGHIAGSLAVVTDAAHLLIDLTSFLLSLFSLWLS SKPPSKRLTFGWHRAEILGALLSILCIWVVTGVLVYLACERLLYPDYQIQATVMIIVSSC AVAANIVLTVVLHQRCLGHNHKEVQANASVRAAFVHALGDLFQSISVLISALIIYFKPEY KIADPICTFIFSILVLASTITILKDFSILLMEGVPKSLNYSGVKELILAVDGVLSVHSLH IWSLTMNQVILSAHVATAASRDSQVVRREIAKALSKSFTMHSLTIQMESPVDQDPDCLFC EDPCD >gi568815590f:117047030_117272678|GENSCAN_predicted_CDS_4|1278_bp atggttgcatgcacacattggcaacatgaagtatttctgaaatacgagtttaaatatggg gaacacaaattaataactgattactttgtaaggttaagagtaagaatatacgagaaaaaa tacattggtttttattctgctaacagtagacccaggatttcaatgccttcacttgcaagg aaaaggctcccccagagcctcttagaggctccaaactgtgtgagcaaagttaaggatcat gtggaactccaacagaaaccggtgaataaagatcagtgtcccagagagagaccagaggag ctggagtcaggaggcatgtaccactgccacagtggctccaagcccacagaaaagggggcg aatgagtacgcctatgccaagtggaaactctgttctgcttcagcaatatgcttcattttc atgattgcagaggtcgtgggtgggcacattgctgggagtcttgctgttgtcacagatgct gcccacctcttaattgacctgaccagtttcctgctcagtctcttctccctgtggttgtca tcgaagcctccctctaagcggctgacatttggatggcaccgagcagagatccttggtgcc ctgctctccatcctgtgcatctgggtggtgactggcgtgctagtgtacctggcatgtgag cgcctgctgtatcctgattaccagatccaggcgactgtgatgatcatcgtttccagctgc gcagtggcggccaacattgtactaactgtggttttgcaccagagatgccttggccacaat cacaaggaagtacaagccaatgccagcgtcagagctgcttttgtgcatgcccttggagat ctatttcagagtatcagtgtgctaattagtgcacttattatctactttaagccagagtat aaaatagccgacccaatctgcacattcatcttttccatcctggtcttggccagcaccatc actatcttaaaggacttctccatcttactcatggaaggtgtgccaaagagcctgaattac agtggtgtgaaagagcttattttagcagtcgacggggtgctgtctgtgcacagcctgcac atctggtctctaacaatgaatcaagtaattctctcagctcatgttgctacagcagccagc cgggacagccaagtggttcggagagaaattgctaaagcccttagcaaaagctttacgatg cactcactcaccattcagatggaatctccagttgaccaggaccccgactgccttttctgt gaagacccctgtgactag >gi568815590f:117047030_117272678|GENSCAN_predicted_peptide_5|130_aa MQSGGGKGKIAWTEAPGRKNLAALRKRKEADVLSVVNKAGGRGVIWAGLGKSPVGHEKEL EFYGQGTADSLAVFLSGRDRAKSLRMPEWVEFTGHSTRGERDIQRERERERERERARDLQ EDFLKSSAEE >gi568815590f:117047030_117272678|GENSCAN_predicted_CDS_5|393_bp atgcaaagtggaggaggaaaaggaaagatagcttggacagaggccccaggtaggaagaac ctggcagccttgagaaaaagaaaggaagcagatgtactgagtgtagtgaacaaggctggg ggaagaggggtgatctgggctggactgggcaaaagccctgtaggccatgagaaggagcta gaattttatgggcagggaactgcagacagtctggcagtcttcctgagtgggagagataga gctaagagtctgaggatgcccgagtgggtagagttcacaggacatagtactagaggtgag agagatatacagagagagagagagagagagagagagagagagagagccagagatcttcag gaagacttcctcaagtcctcagctgaggagtga >gi568815590f:117047030_117272678|GENSCAN_predicted_peptide_6|331_aa MEDETNEMKHEEKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQD IIQENFPNLARQANIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKEIQ TTIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITGSEIVAIINSLPTK KSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGILPNSFYEASIILISKPSRDTTKKE KFRPISLMNIDAKILNKILAKRIQQHIKKLIHHDQVGFIPGMQGWFNIHKSINVIQHINR TKSKNHMIISIDAENDFDKIQQPFMLKLSIN >gi568815590f:117047030_117272678|GENSCAN_predicted_CDS_6|996_bp atggaagacgaaacaaatgaaatgaagcatgaagagaagtttagagaaaaaagaataaaa agaaatgaacaaagcctccaagaaatatgggactatgtgaaaagaccaaatctacgactg attggtgtacctgaaagtgacggggagaatggaaccaagttggaaaacactctgcaggat attatccaggagaacttccccaatctagcaaggcaggccaacattcaaattcaggaaata cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaagaaatacaa actaccatcagagaatactacaaacacctctatgcaaataaactagaaaatctagaagaa atggataaattcctcgacacatacaccctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggctctgaaattgtggcaataatcaatagcttaccaacgaaa aagagtccaggaccagatggattcacagccgaattctaccagagatacaaggaggaactg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaactca ttttatgaggccagcatcatcctgatatcaaagccaagcagagacacaaccaaaaaagag aagtttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactggca aaacgaatccagcagcacatcaaaaagcttatccaccatgaccaagtgggcttcatccct gggatgcaaggctggttcaatatacacaaatcaataaatgtaatccagcatataaacaga accaaaagcaaaaaccacatgattatctcaatagatgcagaaaacgactttgacaaaatt caacaacccttcatgctaaaactctcaataaattag >gi568815590f:117047030_117272678|GENSCAN_predicted_peptide_7|161_aa MAEGKGGKGTSYMAAGKTVCAGELPFIKTIRKSRNRQTIKFGAKLSTKLVKAGTFKWTRK PWAKILGFASWKESPDAMKLKDPCSSSLDYDQLGNFSNRFLEEDAANENLQQLPEGQYTQ EESKNLRRREIEYVKFLKKKVGTSDTWECGWSMNRGKQERC >gi568815590f:117047030_117272678|GENSCAN_predicted_CDS_7|486_bp atggcagaaggcaaaggaggcaaaggcacatcttacatggcagcaggcaagaccgtgtgt gcaggggaactgccctttataaaaaccatcagaaaatcaagaaataggcagacaattaaa tttggagctaagcttagcaccaagttggtgaaagcaggtacatttaaatggactaggaag ccctgggcaaagatcttgggctttgcttcctggaaggaaagccctgatgccatgaagctg aaggacccctgctcaagttcactggattacgatcaacttgggaatttttcaaatagattc ttagaggaggatgctgccaatgaaaatttacaacaactccccgaaggtcagtacacacag gaagaaagtaagaatcttcgaaggagagaaatagaatatgtgaagttcctgaagaagaaa gttgggacttctgatacatgggaatgtggctggagtatgaacagagggaagcaagagcgg tgttag >gi568815590f:117047030_117272678|GENSCAN_predicted_peptide_8|194_aa MAEAKGGAKSRLTWWQINLLNSVSGSSLFIVPIFSSVRWRLIITVPISEGVVMRIKTTCE SVYPTSVPVNSQEEAANQMNKCEVKMPTPVKTQPKLAKANGENAGSISELFLEMESNYIN AFRQRETDGRQYDFTAKNPEDSTKRLLELINDFSKVLEYKINVQKPVAFLYINNVQAESQ IKKEISFKIATKST >gi568815590f:117047030_117272678|GENSCAN_predicted_CDS_8|585_bp atggcagaagccaaaggaggagcaaagtcacgtcttacatggtggcagataaacctgctc aattccgtgtctggttctagtttattcattgtgcccattttctcatctgtaagatggaga ctgataataacggtaccaatatcagagggcgttgtgatgaggattaagacaacctgtgaa agtgtttaccccacatctgtacctgttaattcgcaagaggaagctgctaatcaaatgaac aagtgtgaagtgaaaatgccaaccccagtgaagacgcagccaaagctggcaaaggcaaat ggagaaaacgcaggcagcatcagtgagctgttcctggaaatggagtctaactacattaac gcattcagacagagggaaacagatggtcgacaatatgattttacagctaaaaaccctgaa gactccaccaaaaggctcctggaacttataaatgacttcagtaaagttttagaatacaaa atcaatgtacaaaaaccagtggcatttctatatatcaataatgttcaagctgagagtcaa atcaagaaagaaatctcatttaaaatagccacaaaaagtacctag >gi568815590f:117047030_117272678|GENSCAN_predicted_peptide_9|199_aa MYARAQAHLLEVKNGVPTGGLFSVLFMSLTGFLFTLMIVSFAVQMLFSLISKKNKAGGIM LPDFKLYYKATVTKTAQYWYQNRYIGPWNGTEAAEIMPHIYNHLIFDKPDKNKQWGKDSL FNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKGLNVRPKTIKTLEENLGNTIQDLGMG KDFMTKIPKAMSTQAKIDK >gi568815590f:117047030_117272678|GENSCAN_predicted_CDS_9|600_bp atgtacgcacgtgcacaggcacatttgctagaggtgaaaaatggagtacccactgggggt cttttctcagttcttttcatgagtctgactggtttcctgttcactctgatgatagtttct tttgctgtgcagatgctctttagtttaattagcaaaaagaacaaagctggaggcatcatg ctacctgacttcaaactatactacaaggctacagtaaccaaaacagcacagtactggtac caaaacagatatataggcccatggaatggaacagaggccgcagaaataatgccacacatc tacaaccatctgatctttgacaaacctgacaaaaacaagcaatgggggaaggattcccta tttaataaatggtgttgggaaaactggctagccatatgcagaaaactgaaactggacccc ttccttacaccttatacaaaaattaactcaagatggattaaaggcttaaacgtaagacct aaaaccataaaaaccctggaagaaaacctaggcaataccattcaggacttaggcatgggc aaggacttcatgactaaaataccaaaagcaatgtcaacacaagctaaaattgacaaatga >gi568815590f:117047030_117272678|GENSCAN_predicted_peptide_10|115_aa MAEGKEEQVTSYMDGSRQRENEEDVKSNDPLQERFLLAPVHFHSPRRTSQYGKKNMGFGI RQSQFPILAGLENIKLSKTNLALREPDEVLSADVMTVLSRVVSGINTDRKLQFIV >gi568815590f:117047030_117272678|GENSCAN_predicted_CDS_10|348_bp atggcagaaggcaaggaggagcaagtcacatcttacatggatggcagcaggcaaagagag aatgaggaagatgttaaatccaatgacccacttcaggaacgtttcctccttgctccagtc cacttccacagccccagacgtacttcacaatatggcaaaaagaacatgggttttggcatt agacagagccagtttccaatcttagcaggccttgaaaatataaagctaagtaagacaaat ttggccttgagagagccggatgaagttctaagtgccgatgtcatgacagtcctgtcccgt gtggttagtggaatcaacactgataggaagttacaatttatagtgtga >gi568815590f:117047030_117272678|GENSCAN_predicted_peptide_11|153_aa MVSCDPAAPAVAVAKRGQGTARAMASEGASSKPWWLPHCTGHMGVQKTRAELWEPLPSIQ RMYENAWMSRQKSALGEEPSWRTTTTARKASSTQHQPMKAAAGAEPRRYTASELLKALGA NPLHHHALDVRSGVKEGYFGALRLTECLASFQS >gi568815590f:117047030_117272678|GENSCAN_predicted_CDS_11|462_bp atggtgtcctgtgacccagctgctccagctgtagctgtggctaaaaggggccaaggtaca gctcgggccatggcttcagagggtgcaagctccaagccttggtggcttccacattgtacg ggtcatatgggtgtgcagaagacaagagctgagctttgggagcctctgcctagcattcag agaatgtatgaaaatgcctggatgtccaggcagaagtctgctttaggggaggaaccctca tggagaaccactactacggcaagaaaagcctcaagcactcaacaccagcccatgaaagca gctgctggagctgaaccccgcagatacacagcgtcagagcttctgaaggccttgggagcc aaccccttgcatcaccatgccctggatgtgagatctggagtcaaggaaggttattttgga gctttaagacttactgagtgccttgccagctttcagagttga