GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:47:00 Sequence gi568815575r:119489355_119692497 : 203143 bp : 45.29% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 841 930 90 0 0 60 92 51 0.380 3.19 1.02 Intr + 6451 6582 132 0 0 65 59 67 0.292 2.34 1.03 Intr + 13379 13471 93 1 0 80 62 54 0.239 2.16 1.04 Term + 30833 30955 123 1 0 64 42 66 0.156 -2.02 1.05 PlyA + 31298 31303 6 1.05 2.08 PlyA - 33823 33818 6 1.05 2.07 Term - 50435 50373 63 2 0 106 41 66 0.767 1.59 2.06 Intr - 52066 51974 93 1 0 95 105 148 0.998 17.36 2.05 Intr - 53240 53151 90 2 0 109 87 180 0.996 20.19 2.04 Intr - 55137 54999 139 2 1 68 86 99 0.998 8.17 2.03 Intr - 56150 56109 42 1 0 90 106 46 0.939 3.96 2.02 Intr - 71031 70914 118 1 1 100 94 129 0.874 14.22 2.01 Init - 76001 75878 124 2 1 93 92 109 0.997 12.13 2.00 Prom - 79927 79888 40 -5.96 3.17 PlyA - 80673 80668 6 1.05 3.16 Term - 82949 82879 71 0 2 114 50 32 0.076 0.10 3.15 Intr - 87417 87382 36 1 0 125 98 11 0.126 4.03 3.14 Intr - 101880 100132 1749 1 0 56 89 1194 0.498 103.62 3.13 Intr - 103159 103029 131 0 2 43 101 73 0.130 4.44 3.12 Intr - 116681 116377 305 2 2 14 94 248 0.204 13.39 3.11 Intr - 117176 117047 130 1 1 75 77 110 0.365 9.30 3.10 Intr - 140154 139964 191 0 2 69 106 313 0.942 29.58 3.09 Intr - 144138 144006 133 2 1 -19 84 145 0.466 4.05 3.08 Intr - 147841 147673 169 2 1 52 86 385 0.679 33.70 3.07 Intr - 151434 151338 97 0 1 104 96 104 0.621 12.38 3.06 Intr - 159294 159238 57 0 0 68 86 39 0.623 0.78 3.05 Intr - 160744 160583 162 1 0 143 92 273 0.999 33.37 3.04 Intr - 163686 163500 187 2 1 117 75 245 0.999 25.79 3.03 Intr - 174323 174128 196 0 1 93 103 179 0.971 18.37 3.02 Intr - 186314 186200 115 2 1 66 92 104 0.924 8.62 3.01 Init - 195713 195693 21 2 0 79 95 46 0.518 2.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:119489355_119692497|GENSCAN_predicted_peptide_1|145_aa MSSGLVPWVCIGKDGGCCDGAEWFQAQPSQRAGKSPQESFPSQCLLGRVAVATELIKGLS LHKEACAAVAMGLRRGQHHLLSRTGYSQADAKSKSLPASTLLPALKNAHLFTSTQNIFNF KELGDSLKLIPEAHGAQVKSFYCVG >gi568815575r:119489355_119692497|GENSCAN_predicted_CDS_1|438_bp atgtcttcagggctggtcccatgggtctgcatcggcaaggatggaggctgctgtgatggg gctgagtggtttcaggcccagcccagccagagggctggtaagagtccacaggagagcttc ccaagccagtgccttttaggaagagttgctgtggctactgagctgatcaaaggtctcagc ctccacaaagaagcatgtgctgctgtggcaatgggcctgaggagagggcagcaccacctt ctcagccgcacaggatattctcaggctgatgccaagagcaaatcactcccagccagcaca cttctgccagccttgaaaaatgcccatctgttcacaagcacacagaacatatttaatttc aaggagcttggggactccctgaagctcatccctgaagcccatggagcccaggttaagagc ttctattgtgtaggctag >gi568815575r:119489355_119692497|GENSCAN_predicted_peptide_2|222_aa MPKVVSRSVVCSDTRDREEYDDGEKPLHVYYCLCGQMVLVLDCQLEKLPMRPRDRSRVID AAKHAHKFCNTEDEETMYLRRPEGIERQYRKKCAKCGLPLFYQSQPKNAPVTFIVDGAVV KFGQGFGKTNIYTQKQEPPKKVMMTKRTKDMGKFSSVTVSTIDEEEEEIEAREVADSYAQ NAKVIEKQLERKGMSKRRLQELAELEAKKAKMKGTLIDNQFK >gi568815575r:119489355_119692497|GENSCAN_predicted_CDS_2|669_bp atgccgaaagtagtgtctcggtcagtagtctgctctgacactcgggaccgggaggaatat gacgacggcgagaagcccctccatgtttactactgtttgtgcggccagatggtcctagtg ctggactgccagttagagaaattgcccatgaggccccgggaccggtcccgtgtgattgat gctgccaaacatgcccataagttttgtaacacagaagatgaggagactatgtatctgcgg agacctgaaggcattgaacgacagtacaggaagaaatgtgcaaagtgtggactgccgctc ttctaccaatcccagccaaagaatgctcctgttaccttcattgtggatggagcagtagtc aagtttggccagggctttgggaaaacgaacatatatactcagaaacaagagcctcctaag aaggtgatgatgaccaaacggaccaaagacatgggcaagttcagttctgtcaccgtgtct accattgatgaagaggaagaggagattgaggctagggaagttgctgactcatatgcacag aatgccaaagtgattgaaaaacagctggagcgcaaaggcatgagcaagaggcgactgcaa gagctggctgaattggaagccaagaaagcgaaaatgaaggggaccttgattgacaaccag ttcaaataa >gi568815575r:119489355_119692497|GENSCAN_predicted_peptide_3|1249_aa MGLLLCLGEGCRTVPLAGHVGFDSLPDQLVNKSVSQGFCFNILCVGETGLGKSTLMDTLF NTKFEGEPATHTQPGVQLQSNTYDLQESNVRLKLTIVSTVGFGDQINKEDSYKPIVEFID AQFEAYLQEELKIRRVLHTYHDSRIHVCLYFIAPTGHSLKSLDLVTMKKLDSKVNIIPII AKADAISKSELTKFKIKITSELVSNGVQIYQFPTDDESVAEINGTMNYTRDVFVITITIL DKSGSRAHLPFAVIGSTEELKIGNKMMRARQYPWGTVQVENEAHCDFVKLREMLIRVNME DLREQTHTRHYELYRRCKLEEMGFKDTDPDSKPFSLQETYEAKRNEFLGELQKKEEEMRQ MFVQRVKEKEAELKEAEKELHEKFDRLKKLHQDEKKKLEDKKKSLDDEVNAFKQRKTAAE LLQSQGSQAGGSQTLKRDKEKKNASKARSRHWREKSTSREILRRDLEVREQPRRKRTGLT TPDGTRSASSLGSLLGGGEDGWRTSAVGGRLPVAPPLPPLPPPPLPPLPPPPPEPVLEQW RYSHESDWQWALRRSFICRHLHSYPGAALDQLLALSAAWTNHVFLGCRYSPRLMEKILQM AEGIDIGEMPSYDLVLSKPSKGQKRHLSTCDASSSKDERQEDPYGPQTKEVNEQTHFASM PRDIYQDYTQDSFSIQDGNSQYCDSSGFILTKDQPVTANMYFDSGNPAPSTTSQQANSQS TPEPSPSQTFPESVVAEKQYFIEKLTATIWKNLSNPEMTSGSDKINYTYMLTRCIQACKT NPEYIYAPLKEIPPADIPKNKKLLTDGYACEVRCQNIYLTTGYAGSKNGSRDRATELAVK LLQKRIEVRVVRRKFKHTFGEDLVVCQIGMSSYEFPPALKPPEDLVVLGKDASGQPIFNA SAKHWTNFVITENANDAIGILNNSASFNKMSIEYKYEMMPNRTWRCRVFLQDHCLAEGYG TKKTSKHAAADEALKILQKTQPTYPSVKSSQCHTGSSPRGSGKKKDIKDLVVYENSSNPV CTLNDTAQFNRMTVEYVYERMTGLRWKCKVILESEVIAEAVGVKKTVKYEAAGEAVKTLK KTQPTVINNLKKGAVEDVISRNEIQGRSAEEAYKQQIKEDNIGNQLLRKMGWTGGGLGKS GEGIREPISVKEQHKREGLGLDVERVNKIAKRDIEQIIRNYARSESHTDLTFSRELTNDE RKQIHQIAQKYGLKTLKSNDIYPGGKDFGIICIIKVSGFGAFRILDFQI >gi568815575r:119489355_119692497|GENSCAN_predicted_CDS_3|3750_bp atgggcctcctcctctgtctgggtgaaggttgccgaactgtccccctggctggacatgtg gggtttgacagcttgcctgaccagctggtgaataagtccgtcagccagggcttctgcttc aacatcctgtgcgtgggagagacaggtttgggcaagtccaccctcatggacaccctgttc aacaccaaattcgaaggggagccagccacccacacacagccgggtgtccagctccagtct aatacctatgacctccaagagagcaacgtgaggctaaagctcacgatcgttagcacagtt ggctttggggaccagatcaacaaagaggacagctacaagcctatcgtggaattcatcgat gcacaattcgaggcctacctgcaggaagagctaaagatccgaagagtgctacacacctac catgactcccgaatccatgtctgcttgtatttcattgcccccacgggtcattccctgaag tctctggacctagtgactatgaagaagctggacagtaaggtgaacatcatccccatcatt gccaaagcagatgccatttcgaagagtgagctaacaaagttcaaaatcaaaatcaccagc gagcttgtcagcaacggagtccagatctatcagtttcctacagatgatgagtcggtggca gagatcaatggaaccatgaactacacccgagatgtgtttgttatcaccatcaccatctta gacaaatctggctcaagggcccacctgccgtttgctgtcattggcagcacagaagaactg aagataggcaacaagatgatgagggcgcggcagtatccttggggcactgtgcaggttgaa aacgaggcccactgcgactttgtgaagctgcgggagatgctgattcgggtcaacatggag gatctgcgggagcagacccacacccggcactatgagctgtatcgccgctgtaagctggag gagatgggcttcaaggacaccgaccctgacagcaaacccttcagtttacaggagacatat gaggccaaaaggaacgagttcctaggggaactccagaaaaaagaagaggagatgagacag atgttcgtccagcgagtcaaagagaaagaagcggagctcaaagaggcagagaaagagctg cacgagaagtttgaccgtctgaagaaactgcaccaggacgagaagaagaaactggaggat aagaagaaatccctggatgatgaagtgaatgctttcaagcaaagaaagacggcggctgag ctgctccagtcccagggctcccaggctggaggctcacagactctgaagagagacaaagag aagaaaaatgcgtcaaaggcccgtagtcgtcactggagggaaaaaagtacgtcgcgcgag attctgcgacgggatttggaagttagggaacagccgcggcgcaagcgcactggcctcaca accccggacggcacgcggtcggcttcgtccctggggtccctgcttgggggcggagaagat ggctggaggacgtctgctgttggggggcgacttcctgtcgcgccgccgctgccccccctc ccgccgccgccgctgccgcccctcccgccgcccccgcccgagccagtgctggagcagtgg cgctatagccacgaaagtgactggcagtgggctctgcggcgcagcttcatctgtcggcac ctgcacagctatcccggggctgccctcgaccagctcctcgcgctctccgccgcctggacc aaccacgtcttcctgggctgcaggtacagcccacgcttgatggaaaaaattctccaaatg gctgaaggtattgatattggggagatgccttcatatgatctggtgctgtccaaaccttcc aaaggtcaaaaacgccacctctcaacatgtgatgctagtagttcaaaagatgaaagacag gaagatccttatggccctcaaacaaaagaggtaaatgaacaaacacattttgccagcatg ccaagagacatctaccaagattatactcaagactctttcagtatacaagatgggaattct cagtattgtgattcatcaggattcattctcacaaaagaccagcctgtaacagccaacatg tattttgacagtgggaaccctgccccaagcaccacatcacagcaggcaaactctcagtca actcctgagccttcaccatcacagacatttcccgagtctgtggtagccgagaagcagtat tttattgaaaaattaacggcgacaatctggaagaacctttctaatccagaaatgacttct ggatctgataaaattaattatacatatatgttaactcgttgtattcaggcgtgtaagaca aatcctgagtatatatatgctcctttaaaggaaattcctcctgccgacatccccaaaaat aaaaaacttctaactgatggctatgcttgtgaagttagatgccaaaatatctacttaact acaggttatgctggcagcaagaatgggtccagggatcgagctacagagctagctgtaaaa ctcttgcagaaacgtattgaagttagagttgtccggcggaaattcaagcatacatttgga gaggacctcgtggtgtgtcagattggcatgtcctcctatgaatttcctccagctctgaag ccaccagaagacctggtggtgctgggtaaagatgcttccgggcagccaatttttaatgct tctgccaaacactggaccaattttgtcattacagaaaatgcaaatgatgcaattggtatc cttaacaattctgcctcattcaacaagatgtcaattgaatacaaatatgagatgatgcca aatcgcacatggcgttgtcgagtgtttttacaagatcactgcttagctgaaggttatgga accaagaaaacaagtaaacatgcagctgccgacgaggctttgaaaattcttcaaaaaaca cagcccacttatccatctgtcaaaagttcacaatgccatacaggctcttcacccagagga tctggaaagaagaaagatataaaggatcttgtagtttatgagaattcttcaaatcccgtg tgcacgctgaacgacacagctcagtttaaccgaatgacagttgagtatgtctatgaaagg atgacaggcctccgctggaaatgcaaagtgattctagagagtgaagtaattgcagaagca gttggggtgaagaaaactgtcaaatatgaagctgctggggaagctgtgaaaaccctcaaa aagacccagccaactgtcattaacaacttgaagaaaggagctgttgaagatgtgatttca agaaatgaaattcagggccgctcagcagaggaggcttacaaacagcaaatcaaagaagat aatattggaaatcagctgctgagaaagatgggttggactggtggtggtttaggtaaatct ggtgagggcatacgggagcctatctcagtgaaagagcagcataagcgggaagggcttggt ctggatgtagagagggtgaataaaattgccaagagagatattgaacagatcatcagaaac tacgcccgctccgagagccacacagatttgactttctctagagagctgactaatgatgaa cggaagcaaatacatcagattgcccagaagtatggtcttaagaccctgaagagtaatgat atttacccagggggtaaagattttggaattatttgcattataaaagtttcaggttttgga gcatttcggattttagattttcagatttga