GENSCAN 1.0 Date run: 7-Nov-116 Time: 15:46:47 Sequence gi568815592f:125886551_126138833 : 252283 bp : 38.51% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2422 3431 1010 1 2 91 121 988 0.870 91.83 1.02 Intr + 4092 4260 169 1 1 44 105 197 0.716 15.08 1.03 Intr + 28783 28930 148 1 1 78 83 162 0.009 14.02 1.04 Intr + 34393 34518 126 0 0 100 80 209 0.982 21.26 1.05 Intr + 36132 36284 153 2 0 75 94 201 0.999 18.75 1.06 Intr + 41113 41208 96 0 0 75 77 52 0.797 2.09 1.07 Intr + 41624 41697 74 1 2 97 86 34 0.918 1.39 1.08 Intr + 42086 42203 118 2 1 88 48 79 0.185 3.45 1.09 Term + 52723 52743 21 0 0 141 48 10 0.101 -0.37 1.10 PlyA + 52825 52830 6 1.05 2.04 PlyA - 54352 54347 6 1.05 2.03 Term - 62791 62667 125 2 2 99 48 70 0.694 1.67 2.02 Intr - 63353 63171 183 0 0 64 61 192 0.556 12.94 2.01 Init - 67114 67069 46 1 1 71 53 53 0.190 -0.35 2.00 Prom - 67993 67954 40 -3.35 3.00 Prom + 69137 69176 40 -8.75 3.01 Init + 70428 70652 225 2 0 84 47 233 0.166 17.48 3.02 Intr + 80337 80454 118 2 1 87 86 86 0.088 7.42 3.03 Intr + 85709 85778 70 0 1 104 85 40 0.067 2.62 3.04 Intr + 88297 88423 127 1 1 93 79 -11 0.051 -1.74 3.05 Intr + 99590 100072 483 1 0 10 117 352 0.028 22.59 3.06 Intr + 107207 107272 66 1 0 67 115 52 0.429 3.98 3.07 Intr + 111673 111765 93 0 0 97 95 36 0.955 4.44 3.08 Intr + 112000 112134 135 0 0 85 80 63 0.967 5.04 3.09 Intr + 112907 113063 157 1 1 23 68 104 0.960 0.56 3.10 Intr + 121842 121922 81 1 0 63 92 75 0.908 4.09 3.11 Intr + 124703 124867 165 0 0 144 97 178 0.999 23.31 3.12 Intr + 126221 126302 82 0 1 97 115 33 0.879 4.68 3.13 Intr + 153401 153569 169 2 1 110 105 -26 0.002 0.33 3.14 Term + 160913 161092 180 1 0 78 37 119 0.569 2.43 3.15 PlyA + 162839 162844 6 1.05 4.05 PlyA - 165517 165512 6 1.05 4.04 Term - 173423 173292 132 2 0 127 50 37 0.917 0.91 4.03 Intr - 174070 173908 163 0 1 38 59 180 0.605 9.16 4.02 Intr - 177906 177825 82 1 1 70 74 47 0.130 -0.72 4.01 Init - 182755 182584 172 1 1 102 78 83 0.588 8.35 4.00 Prom - 202367 202328 40 -4.35 5.04 PlyA - 202466 202461 6 1.05 5.03 Term - 204965 204744 222 2 0 6 39 207 0.843 3.53 5.02 Intr - 210160 210049 112 0 1 54 89 62 0.524 2.36 5.01 Init - 211685 211624 62 2 2 53 98 75 0.594 5.77 5.00 Prom - 221337 221298 40 -4.65 6.03 PlyA - 222386 222381 6 1.05 6.02 Term - 238155 237832 324 0 0 10 42 222 0.847 3.58 6.01 Init - 240021 239929 93 0 0 97 81 29 0.876 3.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 99626 100072 447 1 0 60 117 315 0.897 27.61 S.002 Intr + 134610 134730 121 0 1 80 84 97 0.893 8.08 S.003 Init + 158912 159016 105 1 0 72 56 115 0.871 6.97 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592f:125886551_126138833|GENSCAN_predicted_peptide_1|638_aa XPGEWEDLASEKDINPFSKFKSINKEKRQQNGEKIMTSDSRPIVPLEKSTGHTPTKPSGS SVSEKLKKLDSSRETSHGSPTVTKLSKEPSDTSSAFESTAKENFLGEDDDFVDLEELSSQ TGGGMHKKDTLKECLSLDPEERKKAESQINNSAVEMQVQSALAFLGTENDVELKGALDLE TCEKQDIMPEVDKQSGSPESRVENTLNIHEDLDKVKLIEYYLTKNKEGPQVSENLQKTEL SDGKSIEPGGIDITLSSSLSQAGDPITEGNKEPDKTWVKKGEPLPVKLNSSTEANVIKEA LDSSLESTLDNSCQGAQMDNKSEVQLWLLKRIQVPIEDILPSKEEKSKTPPMFLCIKVGK PMRKSFATHTAAMVQQYGKRRKQPEYWFAVPRERVDHLYTFFVQWSPDVYGKDAKEQGFV VVEKEELNMIDNFFSEPTTKSWEIITVEEAKRRKSTCSYYEDEDEEVLPVLRPHSALLEN MHIEQLARRLPARVQGYPWRLAYSTLEHGTSLKTLYRKSASLDSPVLLVIKDMDNQIFGA YATHPFKFSDHYYGTGETFLYTFSPHFKVFKWSGENSYFINGDISSLELGGGGGRFGLWL DADLYHGRSNSCSTFNNDILSKKEDFIVQDLEIQEIIN >gi568815592f:125886551_126138833|GENSCAN_predicted_CDS_1|1917_bp nngcctggagaatgggaagacctggcttcagaaaaggatatcaacccattcagtaagttc aaatctatcaacaaggaaaaacgacagcagaatggagagaaaattatgacttcggattcc agaccaatagtacctttggagaagtccacaggacatacacctacaaagccctcaggcagc tctgtgtcagagaaattaaagaaactggactcctctagggagacatcccatggttctccc acagtgactaagctcagcaaggaaccttccgacacttcttctgcatttgaatctacagcc aaagaaaactttctaggggaagatgatgattttgttgacttggaagaactttcttctcaa actggtggtggaatgcacaaaaaagacaccttgaaggagtgcctttctcttgacccagag gaacgaaagaaagctgagtcacaaataaacaattctgccgtggaaatgcaggtgcagtca gccctagcctttttgggaacagagaatgatgttgaactgaagggggcgctagatttagaa acctgtgagaagcaagatataatgccagaagtggacaagcagtctggttcgccagaaagc cgagtagaaaacacactgaacatacatgaagatttagataaagttaaactcattgaatat tacctgactaagaacaaagaagggccacaggtatctgaaaatttgcagaaaacagaatta agtgatggaaaaagtattgaaccagggggaatagacattacccttagtagttctctttcc caggcgggtgatcccataactgagggcaataaagagccagataagacctgggtgaaaaag ggagagcccctcccggtaaaactgaactcttctacagaagcaaatgtgattaaagaggct ctagactcctctttggaatctactctggacaacagctgtcaaggtgcacaaatggataat aaatctgaagttcagttgtggctgttaaagagaattcaggtacccattgaagatatactt ccttcaaaagaagaaaaaagcaagaccccacccatgttcctgtgcatcaaagtgggaaaa ccaatgagaaaatcctttgccactcacactgcagccatggtccagcagtacggcaaacgg agaaagcagccagagtactggtttgctgttcctcgggagagggtggatcatttgtacaca ttctttgttcagtggtctcccgatgtctatggaaaagatgccaaagagcaaggctttgtg gtggtggagaaggaagaactgaacatgattgacaacttcttcagtgagccaacaaccaag agctgggagatcatcactgttgaagaggcaaagcgcaggaagagcacatgcagctactat gaagacgaggacgaagaggtgctgcctgtcctacggccccacagcgcgctcctggagaat atgcacatcgagcagctggcccgacgccttcctgcaagggtgcaagggtatccatggaga ctggcctatagcacgttagagcacgggaccagcttaaagacgctctaccggaaatcggca tcactagacagtcctgtcctattggtcatcaaagatatggataatcagatttttggagca tatgcaactcatcctttcaagttcagtgaccactattatggcacaggcgaaacttttctc tacacattcagccctcattttaaggtctttaagtggagtggagaaaattcatactttatc aatggagacataagttctttagaacttggtggtggagggggacgatttggtttatggcta gatgctgatttataccacggacgaagcaactcttgcagcactttcaataatgatattctt tccaaaaaggaagacttcatagttcaggatctggagatccaggagataatcaactaa >gi568815592f:125886551_126138833|GENSCAN_predicted_peptide_2|117_aa MLAAAEGQAAPGTRTVNNSRRIEALAGIPMYQNQFTAWYWQMSVVSTRTLVRSMFYFGQK KEKPTGDEVEQKDASTSQLCLRGLYLMTQAEGAASIWNVSGVRAEGRNNTMEPHDGF >gi568815592f:125886551_126138833|GENSCAN_predicted_CDS_2|354_bp atgctggctgctgcggaggggcaggcagctccaggcacccgcacagtaaacaactccagg aggatagaggccctcgctggcatccccatgtatcaaaaccagttcacggcctggtactgg cagatgtccgtggtcagcaccaggaccttggtgcgctcaatgttttactttggccagaaa aaggagaagccaacaggtgatgaagtagaacaaaaggatgcctcaacaagtcaactgtgt cttcgtggactttatctcatgacccaggctgaaggagcagcctccatctggaatgtttct ggtgtcagggcagagggaagaaataacacaatggagccacatgatggcttttaa >gi568815592f:125886551_126138833|GENSCAN_predicted_peptide_3|716_aa MAEEQVNRSAGLAPDCEASATAETTVSSVGTCEAAGKSPEPKDYDSTCVFCRIAGRQDPG TELLHCEVGGDARPGNEDLICFKDIKPAATHHYLVVPKKHIGNCRTLRKDQVELVENMVT VGKTILERNNFTDFTNVRMGFHMPPFCSISHLHLHVLAPVDQLGFLSKLVYRVNSYWFIT TNLQGNGEISKEMVKFQSNFEEQVLKGCRESWKHTPSGTITWKYTVLLSQAAPESHASPL TSYQRWETPNHQAFTGEEAPPTAAGVSDSEREKRRDSACLAELPLPALQALSLWRLGFGR EKEGPSAGRAGARWAAAMALSCTLNRYLLLMAQEHLEFRLPEIKSLLLLFGGQFASSQET YGKVPFLHSDSTYKIKIHTFNKTLTQEEKIKRIDALEFLPFEGKVNLKKPQHVFSVLEDY GLDPNCIPENPHNIYFGRWIADGQRELIESYSVKKRHFIGNTSMDAGLSFIMANHGKVKE NDIVFDPFVGTGGLLIACAHFGAYVYGTDIDYNTVHGLGKATRKNQKWRGPDENIRANLR QYGLEKYYLDVLVSDASKPSWRKGTYFDAIITDPPYGIRESTRRTGSQKEIPKGIEKWEK CSGFQQQISNISLIILFGFLDVICGTLGGNLKVFKDDLFSVFLFKPSYLKCFEVIWELPI HPAESHLYHSVKAPYSPSFKFVCDLILPACWTRTRVSRGQGVKGCHPDSPLSWFNT >gi568815592f:125886551_126138833|GENSCAN_predicted_CDS_3|2151_bp atggcggaggaacaggtgaaccgcagcgccggcctggcccccgactgtgaggcctcggcg actgcagaaactacggtttcctcagtggggacctgtgaagccgctggcaagtcaccagag cccaaggactacgacagcacctgcgtgttctgccggatcgcggggcggcaggacccgggc accgaactcctgcactgcgaggtgggcggcgacgcgcggccggggaatgaggacctaatt tgcttcaaagatatcaaaccagcagcaactcatcattatcttgtggtgccaaagaagcat attggaaactgcagaactctaaggaaagatcaagtagaactggttgagaacatggtaact gttggaaaaaccattcttgaaagaaataatttcactgacttcacgaatgtgaggatgggt tttcatatgccaccattctgttccatttcccacttgcaccttcatgttctggcaccagtg gatcagcttggcttcttatccaagttggtttatagagtcaattcctattggtttatcaca acaaacctacaaggaaatggtgaaatttccaaggaaatggtgaaatttcagtccaatttt gaagaacaggttttgaagggctgtagagaaagctggaagcacactccaagtggaacgata acttggaaatatactgtcttattatcacaggctgctcctgaaagccacgcgtccccgctc accagctaccagcgctgggagacccctaaccaccaggcattcacaggtgaggaagcccct cccacagcggccggagtttcggactcggagcgcgagaaacgtcgggatagcgcctgcctc gcagagcttccgcttccggcccttcaggctctgtctctgtggagactgggctttgggagg gagaaagagggacctagcgcgggccgcgcaggcgcacggtgggcagctgcaatggcgctg tcgtgtacccttaacaggtatctgctcctcatggcgcaggagcatctggagttccgcctg ccggaaataaagtctttgcttttgctttttggaggtcagtttgccagcagtcaagaaact tatggaaaggttccatttctacattcggactctacatataaaataaagattcacactttt aataagacattgacacaagaagagaaaatcaagcgaatagatgcacttgaatttctgcca tttgaaggaaaagtgaatttaaagaaaccgcaacatgtattttctgttttggaggattat ggtttagacccaaactgcatccctgagaatccacataatatttattttggtagatggatt gcagatggacagagagagcttattgagtcatacagtgtcaaaaagagacactttattgga aatacaagtatggatgctggtttgtcattcattatggctaaccatggaaaagtgaaagaa aatgatattgtctttgatccatttgttggaacaggtggcctgctgatagcatgtgctcat tttggtgcatatgtgtatgggacagacatagactacaacacagttcatggcttgggaaag gctactaggaaaaaccagaagtggagaggaccagatgaaaacattagggccaatcttcgt caatatggtttagagaagtattaccttgatgtcctggtttcagatgcatctaaaccttcc tggaggaagggcacatattttgatgcaatcattactgatcctccatatggtatcagagaa tctacaagaagaacaggttcacagaaggagataccaaaggggatagaaaaatgggaaaaa tgttcaggatttcagcagcagatctcaaacatttcactgattattttgtttgggtttctg gatgtgatttgtggtacactggggggaaatctaaaagtattcaaggatgatttattttca gtttttctgtttaaaccatcttatttaaaatgtttcgaggttatatgggagctccccatc catcccgctgagagccacctctaccactcagttaaagccccgtattcaccatccttcaag tttgtgtgtgacctgattcttcctgcatgctggacaaggacccgggtatcaagagggcaa ggtgtaaaaggctgtcaccctgactctccactgagctggtttaacacttag >gi568815592f:125886551_126138833|GENSCAN_predicted_peptide_4|182_aa MDNMAEPKEFPFPDHKLQSHWSTAVVACGPTFMAWFSCVLSQAQPYRWLRQSQGMEGASI QEPKALGYPSSCVLHLFESCFENRRANPDSLGAVTPFDLRCVTYALSDGAIREWTILSGE RAESTGLLDYTRSELPEGEINFLSISSELFLDVVTCSSMPMNASGFCEGGFNFIQPDTMI RH >gi568815592f:125886551_126138833|GENSCAN_predicted_CDS_4|549_bp atggacaacatggcagagccaaaagaatttcctttccctgatcacaaactccaaagccat tggtctactgctgttgtagcatgtggcccaacatttatggcctggttcagctgtgtgctt agtcaagcacagccttacaggtggttaaggcaaagccaaggaatggaaggagcctcaatc caggagcccaaggctcttggctaccccagcagctgtgtattacatttatttgaatcctgt tttgagaatagaagggccaacccagactctctgggtgctgtgacgccctttgatctgcgt tgtgtcacatacgcactgagtgatggagccattcgtgaatggactattctctctggagag agggcagagtcaacagggctcttagactacactcggtctgagttgcctgaaggagagata aattttctgagtatttcctcagagctgtttttggatgtggtgacatgcagttccatgcca atgaatgcctcagggttttgtgaaggaggatttaatttcattcaaccagacactatgatt agacattga >gi568815592f:125886551_126138833|GENSCAN_predicted_peptide_5|131_aa MNAAANTSGDQRWEVSGRQAQYRIILGTTGDTKIKKTHPDLQRLKDKTAISTSALMPMNR FRETLALQLGGRFMDRKRSDVQKTEVKYRNSWIGYSSTFALFEHHLNSCPPVSGEKSVIG TRRLRPAYTSS >gi568815592f:125886551_126138833|GENSCAN_predicted_CDS_5|396_bp atgaacgcagcagcaaacacatcaggagatcagaggtgggaagtgagtggacgccaagca cagtacaggatcattctagggactacgggggatacaaagattaagaagactcatcctgac ttacagaggcttaaagacaagactgcaatcagtaccagtgccctaatgcccatgaacagg tttagagagactctggcactgcaacttggtggaagatttatggaccgaaaaaggagtgat gtacagaaaacagaagtgaagtacagaaacagctggattggctacagctcgacatttgcc ttatttgaacaccatttaaacagttgtccacctgtgagtggcgaaaagtcggtgattggt acgcgtaggttacggcctgcttatacttccagttag >gi568815592f:125886551_126138833|GENSCAN_predicted_peptide_6|138_aa MGSELCGNWIKVKTGQKQFRGGCRPREQMSPGDYVDSTLIYKPLIIWQPCSKCSYASDRP ERTDKNAHFLKPNLGSDTVSILSYFIAQSRIDSGPHKKTPPIQGEKQQSHSKGCEYKDAC RIGAVIAIYDTCTKKEGN >gi568815592f:125886551_126138833|GENSCAN_predicted_CDS_6|417_bp atgggttcagaactgtgtggaaactggatcaaagtgaaaacagggcagaagcaattcaga ggaggctgtagaccaagagagcaaatgagtccaggtgactatgtggactccactctgatc tacaagcctctcatcatctggcagccatgttcaaaatgttcttatgccagtgacaggcct gagagaacagacaaaaatgcacactttttgaagcctaaccttggaagtgatacagtgtca atcctgtcatattttattgcccaaagtagaatagactcgggaccccataaaaagacccca cctattcagggagagaaacagcagtctcacagcaaaggatgtgaatacaaggacgcatgc agaattggtgctgttattgcaatctatgacacctgcactaaaaaagagggcaattaa