GENSCAN 1.0	Date run: 8-Nov-116	Time: 11:23:24

Sequence gi568815581r:54862736_55068646 : 205911 bp : 38.13% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  13748  14394  647  0  2   14   33   258 0.056   6.20
 1.02 PlyA +  15767  15772    6                               1.05

 2.00 Prom +  23282  23321   40                              -2.25
 2.01 Init +  30586  30670   85  0  1   64   59    53 0.336   1.03
 2.02 Intr +  32294  32426  133  0  1   37   97    90 0.598   3.58
 2.03 Intr +  37989  38188  200  0  2   89   82   177 0.909  15.27
 2.04 Intr +  40192  40274   83  2  2   -4   99    61 0.488  -3.56
 2.05 Intr +  40945  41057  113  0  2   62   92    85 0.545   4.56
 2.06 Intr +  42754  42832   79  1  1   52   89    66 0.666   1.73
 2.07 Intr +  49931  50080  150  1  0   61  115    37 0.727   3.14
 2.08 Intr +  51013  51138  126  0  0   44   75   104 0.815   4.66
 2.09 Intr +  51904  52008  105  0  0   72   92    48 0.886   3.09
 2.10 Intr +  53011  53127  117  0  0   87   41   112 0.446   6.14
 2.11 Intr +  67338  67471  134  2  2   35   83   187 0.036  11.42
 2.12 Intr +  73914  73974   61  0  1   59  101    30 0.645  -0.78
 2.13 Term +  74374  74499  126  0  0  109   41    82 0.802   2.90
 2.14 PlyA +  76629  76634    6                               1.05

 3.00 Prom +  78226  78265   40                              -2.85
 3.01 Init +  86591  86617   27  1  0  114   99   -13 0.482   2.14
 3.02 Intr +  88482  88577   96  2  0   31   82    78 0.046   0.79
 3.03 Intr +  91655  91843  189  1  0   63   43   130 0.206   4.86
 3.04 Term +  95305  95451  147  0  0   14   55   128 0.277  -0.98
 3.05 PlyA +  97668  97673    6                               1.05

 4.07 PlyA -  98344  98339    6                               1.05
 4.06 Term - 100180  99998  183  1  0   68   45   129 0.996   3.16
 4.05 Intr - 100696 100571  126  1  0   59   88    74 0.955   4.46
 4.04 Intr - 102117 101962  156  0  0   80   92   118 0.999  10.69
 4.03 Intr - 105831 105546  286  2  1   51   91   232 0.374  16.22
 4.02 Intr - 106514 106393  122  2  2  108   22    60 0.323  -0.23
 4.01 Init - 113803 113756   48  1  0   85  119    17 0.345   5.50
 4.00 Prom - 114881 114842   40                              -4.05

 5.00 Prom + 115023 115062   40                              -4.35
 5.01 Init + 124338 124454  117  2  0   34   45    71 0.306  -2.35
 5.02 Intr + 128106 128222  117  2  0   76   93   143 0.866  13.34
 5.03 Intr + 138073 138148   76  0  1  108   99    19 0.189   3.17
 5.04 Term + 144771 144931  161  1  2   90   54    23 0.092  -3.78
 5.05 PlyA + 145065 145070    6                               1.05

 6.04 PlyA - 145619 145614    6                               1.05
 6.03 Term - 149854 149546  309  1  0  -27   55   249 0.207   4.18
 6.02 Intr - 150250 150019  232  0  1   26   49   124 0.243  -0.85
 6.01 Init - 151360 150447  914  1  2   64   52   341 0.507  21.71
 6.00 Prom - 159441 159402   40                              -6.65

 7.00 Prom + 160221 160260   40                              -6.05
 7.01 Init + 162054 162107   54  2  0   69   83    26 0.303   1.53
 7.02 Intr + 168433 168529   97  0  1   92   75    84 0.752   6.16
 7.03 Intr + 171433 171524   92  2  2   77   80    38 0.274   0.69
 7.04 Intr + 180501 180590   90  2  0   41   86    96 0.170   3.97
 7.05 Term + 198806 198919  114  1  0   87   41    37 0.014  -3.51
 7.06 PlyA + 199125 199130    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  65146  64847  300  1  0   82   42   215 0.952  11.84

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581r:54862736_55068646|GENSCAN_predicted_peptide_1|215_aa
XGDEKLVGYWSKGDSCYVLAKRLAAFCSCPRDLWNFELERDGLGYLAKKISEQQSIQEVT
WVLLKAFSFIKEAEHKSLGNLQPDNVIEKENPFSEEKLKPAAEICIRNKEPNVNLQHNGE
NVSRACQRASQQPLPSQAPRPRRKWFPGPGPGSPCCVQSMDLVHHIPATRAMTKRGQGTA
RAVASEGGSPKPWQLPCGVEPECEQKSQTGLGTST
>gi568815581r:54862736_55068646|GENSCAN_predicted_CDS_1|648_bp
ngtggagatgagaaacttgttgggtactggagcaaaggtgactcttgttatgttttagca
aagagactggcagcattttgctcctgccctagagatttgtggaactttgaactggagaga
gatggtttagggtatctagcgaaaaaaatttctgagcagcaaagcattcaagaggtgact
tgggtgctattaaaggcattcagttttataaaggaagcagaacataaaagtttgggaaat
ttacagcctgacaatgtgatagaaaaggaaaacccattttctgaggagaaattgaagcca
gctgcagaaatttgcatacgtaacaaggagccgaatgttaatctccaacacaatggggaa
aatgtctctagggcatgtcagagggcttcacagcagcccctcccatcacaggctccgagg
cctagaagaaaatggtttcctgggccagggccagggtctccttgctgtgtgcagtccatg
gacttggtgcaccacatcccagccactcgagccatgactaaaaggggccaaggtacagct
cgggctgttgcttcagagggtggaagccccaagccttggcagcttccatgtggtgttgag
cctgagtgtgaacagaagtcacaaacaggtttgggaacttccacctag
>gi568815581r:54862736_55068646|GENSCAN_predicted_peptide_2|503_aa
MQTNEEMRAFQIAVNAMKTITQNKRIKRDTLQDQPQILAGGDGRGWQCDAMEKGLSLEMR
KTQIHLHIQSDFSGSAERTASARAREALGRRGGAKGTERVESGAGERWRRISWARPSGAT
MAFGKSHRDPYATSVGHLIVRIVIQRQFEIIKAESKSVCNGSFYIVELRLNSFFLAEKAT
FAGVQTEDWGQFMHICDIINTTQDGPKDAVKALKKRISKNYNHKEIQLTLSLIDMCVQNC
GPSFQSLIVKKEFVKENLVKLLNPRYNLPLDIQNRILNFIKTWSQGFPGGVDVSEVKEVY
LDLVKKGVQFPPSEAEAETARQETAQISSNPPTSVPTAPALSSVIAPKNSTVTLVPEQIG
KLHSELDMVKMNVRVMSAILMENTPGSENHEDIELLQKLYKTGREMQERIMDLLVVVENE
DVTVELIQVNEDLNNAILGYERFTRNQQRILEQNKNQKEATNTTSEPSAPSQDLLDLSPS
PRMPRATLGELNTMNNQLSGLSK
>gi568815581r:54862736_55068646|GENSCAN_predicted_CDS_2|1512_bp
atgcaaacaaatgaagaaatgagagcatttcaaatagcagtaaatgctatgaagacaata
acccagaataagaggataaagagagatactctccaagatcagccacagatccttgctggt
ggggatggcaggggctggcagtgtgatgcaatggaaaaaggcttaagcttggaaatgaga
aaaacccaaattcacctccatattcagagtgactttagcggcagcgcagaacgcaccgcc
tctgccagagcccgggaagcgctcgggcgaagaggaggagccaagggtaccgagcgggtg
gagtcgggagccggagagcggtggaggcggatttcctgggcccggccctctggcgctacc
atggcgtttggcaagagtcaccgggatccctacgcgacctccgtgggccacctcatagtt
cggatagttatccagcgccagtttgaaataattaaggctgaaagtaaaagtgtatgcaat
ggaagtttttacatagttgagctcagactcaacagctttttcttggcagaaaaggctaca
tttgctggagttcagactgaagattggggccagttcatgcacatctgtgacataattaac
actacccaggatgggccaaaagatgcagtgaaagctttgaagaaaaggatttccaaaaac
tacaatcataaagaaatccaacttaccttgtcacttattgacatgtgtgtgcagaactgt
ggtccaagtttccagtctctgattgtgaagaaggaatttgttaaagagaatttagttaag
ctactgaatcccagatacaacttgccattagacattcagaatagaatcttgaatttcatt
aagacttggtcacagggcttcccaggaggtgtggatgtaagcgaagtcaaagaagtatac
ctcgacctggttaagaaaggcgttcagtttcctccctcagaagcagaggctgaaacagca
agacaagagactgctcaaatctcatcaaatcctccaacatctgtccctactgcaccagct
ctttcttctgtaattgctccaaagaactcgactgttacattggtcccagaacagattgga
aaactgcacagtgaattggatatggtgaaaatgaatgtgcgagtgatgtccgccatattg
atggagaatactcctgggtctgaaaaccatgaagacatagagcttctgcagaaactctat
aaaacaggtcgggagatgcaggagaggatcatggacctgcttgtggtggtggagaacgaa
gatgtaactgttgagctaattcaggtgaatgaggatttgaataatgctatccttggatat
gagaggtttactagaaaccaacaaaggattttggagcaaaataagaaccagaaggaagcc
accaatactaccagtgagccttctgccccatctcaagatctcctcgacctaagtcccagt
ccccggatgcctagggccactctgggagaactcaacaccatgaataatcaactttcaggc
ttaagtaaataa
>gi568815581r:54862736_55068646|GENSCAN_predicted_peptide_3|152_aa
MPSRKVVNLDIAKDADEEMHRARHAEKAMEFPCPPWSPTLQACSEWQQSPSRVICVPTMS
GAPWQRRVQQGPAIAIAGATAGSAEFGGPGKYRTGPCFSLEVQRLSEKFGNEYCRPKEQP
VQRPRGQSSGTFKEMEMLKPRVKKVNRELSTR
>gi568815581r:54862736_55068646|GENSCAN_predicted_CDS_3|459_bp
atgcccagcagaaaagtagtaaatctcgacattgcaaaggatgcagatgaagagatgcac
agggcgaggcatgcggaaaaggccatggagtttccgtgccctccctggagtcccaccctc
caggcatgctctgagtggcaacagtcaccctccagggttatatgcgttcccactatgagt
ggtgctccctggcaaagaagagtgcaacaggggcctgctattgccatagcaggagccaca
gctggctctgcagagtttggagggcctggcaaatacagaacgggcccttgcttttccctg
gaggtgcagaggttaagtgaaaagtttggaaatgagtattgtaggcccaaggaacaacct
gtacaaagacccagaggacagagctctggcacattcaaggaaatggaaatgctaaaacca
agagtaaagaaagttaacagggagctcagtaccagatga
>gi568815581r:54862736_55068646|GENSCAN_predicted_peptide_4|306_aa
MMAEGSGIILSRCNDLNKLQRDSLPPAPAVVLSTGQVVRGIRAGEADCPRAEKMQVSAAE
RVEPFLRPEWSGTGGAERGLRWLGTWKRCSLRARHPALQPPRRPKSSNPFTRAQEEERRR
QNKTTLTYVAAVAVGMLGASYAAVPLYRLYCQTTGLGGSAVAGHASDKIENMVPVKDRII
KISFNADVHASLQWNFRPQQTEIYVVPGETALAFYRAKNPTDKPVIGISTYNIVPFEAGQ
YFNKIQCFCFEEQRLNPQEEVDMPVFFYIDPEFAEDPRMIKVDLITLSYTFFEAKEGHKL
PVPGYN
>gi568815581r:54862736_55068646|GENSCAN_predicted_CDS_4|921_bp
atgatggctgaagggtctggtatcatattgtccagatgcaatgatctgaacaaacttcag
cgtgacagcctccctcctgctccagctgtggtgctttccactgggcaagttgtaaggggc
atcagagcaggggaagccgactgcccacgggctgagaaaatgcaggtgtcggctgcagag
agggtagagccgtttcttaggccagagtggagtgggacaggaggtgccgagagaggactg
aggtggcttgggacatggaagcgctgcagccttcgagcccggcatccagcattgcagccg
ccgcggcggcctaagagctcgaaccctttcacacgcgcgcaggaggaggagcggcggcgg
cagaacaagacgaccctcacttacgtggccgctgtcgccgtgggcatgctgggggcgtcc
tacgctgccgtacccctttatcggctctattgccagactactggacttggaggatcagca
gttgcaggtcatgcctcagacaagattgaaaacatggtgcctgttaaagatcgaatcatt
aaaattagctttaatgcagatgtgcatgcaagtctccagtggaactttagacctcagcaa
acagaaatatatgtggtgccaggagagactgcactggcgttttacagagctaagaatcct
actgacaaaccagtaattggaatttctacatacaatattgttccatttgaagctggacag
tatttcaataaaatacagtgcttctgttttgaagaacaaaggcttaatccccaagaggaa
gtagatatgccagtgtttttctacattgatcctgaatttgctgaagatccaagaatgatt
aaagttgatcttatcactctttcttacactttttttgaagcaaaggaagggcacaagttg
ccagttccaggatataattga
>gi568815581r:54862736_55068646|GENSCAN_predicted_peptide_5|156_aa
MKIQGVEIQTLLLVKGSSKVTLPRCMQTGMGGIGGHFANMITIAKETGLGLKVLGGINRN
EGPLVYIQEIIPGGDCYKIKTGYNKTVQIPITSENSTVGLSNTDVASAWTENYGLQEKIS
LNPSVRFKAEKLEMVKPLNFHFSSIRQKQERSILLL
>gi568815581r:54862736_55068646|GENSCAN_predicted_CDS_5|471_bp
atgaaaattcaaggagtggagatacagactttgcttttagtgaaaggaagcagtaaagtc
acattaccaagatgcatgcagacaggaatgggaggaattggtggccattttgcaaacatg
attacaattgccaaggaaacaggccttggcctgaaggtactaggaggaattaaccggaat
gaaggcccattggtatatattcaggaaattattcctggaggagactgttataagataaaa
actggatacaacaaaacagtacagattccaattacttcagaaaacagtactgtgggtttg
tctaatacagatgttgcttctgcctggactgaaaattatgggctacaagaaaagatctcc
ctaaatccctctgttcgctttaaggcagagaaactggaaatggtaaaacctttaaatttt
catttttcttccataagacaaaaacaggaaagaagtattctccttctttga
>gi568815581r:54862736_55068646|GENSCAN_predicted_peptide_6|484_aa
MQQPCNTPILGAQKPSGQWRLVQDLRIINEAVVPLYPAVPNPYTLLSQIPEEAEWFTVLD
LKDVFFCIPVHPDSQFLFAFEDPLNPMSQLTCTVLPQGFRDSPHLFGQALAQDLSQLSYL
DTLVLQYVDDLLLAACSETLCHQATQALLNFLATCGYKVSKEKAQLCSQQVKYLGLKLSK
GTKALSEECIQPILAYPHLKTLKQLREFLGITGFCRIWIPRYGKIARPLYTVIKETQKAN
THLVRWIPEAKVAFQALKKALTQVPVLSLPTGQDFSLYITEKNRNSSGSPYTGPRDKLAT
HGIPELYYLKGQCCNCALVQLLTQSHFFQTMKIEYNCQQIISQTYATRGDLLEVPLTDPD
LNLYTDGSSFVEKGPQKAGYAVAICREREFLTSEGTPITHQEAIRRLLLAVQKPKEVEVL
HCWGHQKGKKREIEGNCQADIEAKRAARQDPPLEMLIEGTLLWGNALRETKPQYSEEEIE
WGTS
>gi568815581r:54862736_55068646|GENSCAN_predicted_CDS_6|1455_bp
atgcaacagccctgcaatactccaattttaggagcacagaaacccagtggacagtggagg
ttagtgcaagatctcaggattatcaatgaggccgttgtccctctatacccagctgtacct
aacccttatactctgctttcccaaataccagaggaagcagagtggtttacagtcctggac
cttaaggatgtctttttctgcatccctgtacatcctgactctcaattcttgtttgccttt
gaagatcctttgaacccaatgtctcaactcacctgcactgttttaccccaagggttcagg
gatagcccccatctatttggccaggcattagcccaagacttgagtcagttatcatacctg
gacactcttgtccttcagtatgtggatgatttacttttagctgcctgttcagaaaccttg
tgccatcaagccacccaagcactcttaaatttcctcgccacctgtggctacaaggtttcc
aaagagaaggctcagctctgctcacagcaggttaaatacttaggactaaaattatccaaa
ggcaccaaggccctcagtgaggaatgtatccagcctatactggcttatcctcatctcaaa
accctaaagcaactaagagagttccttggcataacaggcttctgccgaatatggattccc
aggtatggcaaaatagccaggccattatatacagtaattaaggaaactcagaaagccaat
acccatttagtaagatggatacctgaagcaaaagtggctttccaggccctaaagaaggcc
ttaacccaagtcccagtgttaagcttgccaacggggcaagacttttctttatacatcaca
gaaaaaaacagaaacagctctgggagtccttacacaggtccaagggacaagcttgcaacc
catggcatacctgagctctattacttgaaaggccagtgctgcaactgtgcacttgtgcaa
ctcttaacccagtcacatttcttccagacaatgaagatagaatataactgtcaacaaata
atttctcaaacctatgccactcgaggggaccttctagaggttcccttgactgatcctgac
ctcaacttgtatactgatggaagttcctttgtagaaaaaggacctcaaaaagcggggtat
gcagtggcaatatgcagagaaagggaattcctaacttccgagggaacacctatcacacat
caggaagccattaggagattattactggcagtacagaaacctaaagaggtggaagtctta
cactgctggggtcatcagaaaggaaagaaaagggaaatagaagggaactgccaagcagat
attgaagcaaaaagagctgcaaggcaggaccctccattagaaatgcttatagaaggaacc
ctactatggggtaatgccctccgggaaacaaagccccagtactcagaagaagaaatagaa
tggggaacctcatga
>gi568815581r:54862736_55068646|GENSCAN_predicted_peptide_7|148_aa
MKEQEPPASAQISQCHVLALNYLGIQPTKEQHQALRQQVQADSKGTVSFGDFVQVARNLF
CLQLDEVNVGAHEISNILDSQLLPCDSSEADEMERLKCERDDALKEVNTLKIVTVASECQ
LLFSTKNPLYSDPSCYSFIATATSYLLP
>gi568815581r:54862736_55068646|GENSCAN_predicted_CDS_7|447_bp
atgaaggaacaagagcctcctgcatcagcccaaatttcacaatgtcatgttttagctcta
aattatcttggtattcagcccacaaaggaacaacaccaagccctgagacagcaagtacaa
gcagactcaaaagggacagtgtcttttggagattttgtccaggttgccagaaacttgttt
tgcttgcagttggatgaagtaaatgttggtgcacatgaaatttccaatatattagattca
cagcttcttccttgtgattcttcagaagcagatgaaatggaaaggctcaagtgtgaaaga
gatgatgccttgaaagaagtaaatacacttaagatagtcactgtggcctcagaatgccag
cttttattttctaccaaaaaccctctttactcagatccctcatgctattcttttattgcc
acagccacatcctacctgctcccctaa