GENSCAN 1.0 Date run: 2-Nov-116 Time: 18:10:32 Sequence gi568815575r:119529318_119775669 : 246352 bp : 46.04% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 328 323 6 1.05 1.07 Term - 10472 10410 63 2 0 106 41 66 0.780 1.59 1.06 Intr - 12103 12011 93 1 0 95 105 148 0.998 17.36 1.05 Intr - 13277 13188 90 2 0 109 87 180 0.996 20.19 1.04 Intr - 15174 15036 139 2 1 68 86 99 0.998 8.17 1.03 Intr - 16187 16146 42 1 0 90 106 46 0.939 3.96 1.02 Intr - 31068 30951 118 1 1 100 94 129 0.874 14.22 1.01 Init - 36038 35915 124 2 1 93 92 109 0.997 12.13 1.00 Prom - 39964 39925 40 -5.96 2.20 PlyA - 40710 40705 6 1.05 2.19 Term - 42986 42916 71 0 2 114 50 32 0.076 0.10 2.18 Intr - 47454 47419 36 1 0 125 98 11 0.126 4.03 2.17 Intr - 61917 60169 1749 1 0 56 89 1194 0.498 103.62 2.16 Intr - 63196 63066 131 0 2 43 101 73 0.130 4.44 2.15 Intr - 76718 76414 305 2 2 14 94 248 0.204 13.39 2.14 Intr - 77213 77084 130 1 1 75 77 110 0.365 9.30 2.13 Intr - 100191 100001 191 0 2 69 106 313 0.942 29.58 2.12 Intr - 104175 104043 133 2 1 -19 84 145 0.466 4.05 2.11 Intr - 107878 107710 169 2 1 52 86 385 0.679 33.70 2.10 Intr - 111471 111375 97 0 1 104 96 104 0.621 12.38 2.09 Intr - 119331 119275 57 0 0 68 86 39 0.623 0.78 2.08 Intr - 120781 120620 162 1 0 143 92 273 0.999 33.37 2.07 Intr - 123723 123537 187 2 1 117 75 245 0.999 25.79 2.06 Intr - 134360 134165 196 0 1 93 103 179 0.971 18.37 2.05 Intr - 146351 146237 115 2 1 66 92 104 0.924 8.62 2.04 Intr - 158772 158732 41 1 2 85 53 24 0.009 -3.46 2.03 Intr - 163465 163316 150 2 0 54 20 136 0.077 3.53 2.02 Intr - 163885 163759 127 1 1 36 103 102 0.425 6.75 2.01 Init - 166266 166207 60 0 0 57 131 12 0.363 3.70 2.00 Prom - 181023 180984 40 -4.76 3.00 Prom + 181685 181724 40 -3.66 3.01 Init + 211827 211833 7 2 1 99 83 0 0.380 1.69 3.02 Intr + 221419 221694 276 2 0 75 68 144 0.593 8.59 3.03 Term + 221787 221917 131 1 2 37 55 89 0.640 -1.16 3.04 PlyA + 224636 224641 6 1.05 4.00 Prom + 226372 226411 40 -7.06 4.01 Sngl + 229351 230298 948 0 0 91 48 1042 0.987 95.08 4.02 PlyA + 232248 232253 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 193366 193430 65 2 2 116 47 42 0.874 0.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:119529318_119775669|GENSCAN_predicted_peptide_1|222_aa MPKVVSRSVVCSDTRDREEYDDGEKPLHVYYCLCGQMVLVLDCQLEKLPMRPRDRSRVID AAKHAHKFCNTEDEETMYLRRPEGIERQYRKKCAKCGLPLFYQSQPKNAPVTFIVDGAVV KFGQGFGKTNIYTQKQEPPKKVMMTKRTKDMGKFSSVTVSTIDEEEEEIEAREVADSYAQ NAKVIEKQLERKGMSKRRLQELAELEAKKAKMKGTLIDNQFK >gi568815575r:119529318_119775669|GENSCAN_predicted_CDS_1|669_bp atgccgaaagtagtgtctcggtcagtagtctgctctgacactcgggaccgggaggaatat gacgacggcgagaagcccctccatgtttactactgtttgtgcggccagatggtcctagtg ctggactgccagttagagaaattgcccatgaggccccgggaccggtcccgtgtgattgat gctgccaaacatgcccataagttttgtaacacagaagatgaggagactatgtatctgcgg agacctgaaggcattgaacgacagtacaggaagaaatgtgcaaagtgtggactgccgctc ttctaccaatcccagccaaagaatgctcctgttaccttcattgtggatggagcagtagtc aagtttggccagggctttgggaaaacgaacatatatactcagaaacaagagcctcctaag aaggtgatgatgaccaaacggaccaaagacatgggcaagttcagttctgtcaccgtgtct accattgatgaagaggaagaggagattgaggctagggaagttgctgactcatatgcacag aatgccaaagtgattgaaaaacagctggagcgcaaaggcatgagcaagaggcgactgcaa gagctggctgaattggaagccaagaaagcgaaaatgaaggggaccttgattgacaaccag ttcaaataa >gi568815575r:119529318_119775669|GENSCAN_predicted_peptide_2|1368_aa MGHMDETPRIPCSGAPSAGPCNTFSLSEELLCFLCSSSRCGGTRGSPGGRRSDGSDRYSS PGAAAPRLGRCRMNGRRSRRALRVPGTLALPRPCARLPVVTRDRTGKEARGWAHIHWALI GQAPCQGEGCRTVPLAGHVGFDSLPDQLVNKSVSQGFCFNILCVGETGLGKSTLMDTLFN TKFEGEPATHTQPGVQLQSNTYDLQESNVRLKLTIVSTVGFGDQINKEDSYKPIVEFIDA QFEAYLQEELKIRRVLHTYHDSRIHVCLYFIAPTGHSLKSLDLVTMKKLDSKVNIIPIIA KADAISKSELTKFKIKITSELVSNGVQIYQFPTDDESVAEINGTMNYTRDVFVITITILD KSGSRAHLPFAVIGSTEELKIGNKMMRARQYPWGTVQVENEAHCDFVKLREMLIRVNMED LREQTHTRHYELYRRCKLEEMGFKDTDPDSKPFSLQETYEAKRNEFLGELQKKEEEMRQM FVQRVKEKEAELKEAEKELHEKFDRLKKLHQDEKKKLEDKKKSLDDEVNAFKQRKTAAEL LQSQGSQAGGSQTLKRDKEKKNASKARSRHWREKSTSREILRRDLEVREQPRRKRTGLTT PDGTRSASSLGSLLGGGEDGWRTSAVGGRLPVAPPLPPLPPPPLPPLPPPPPEPVLEQWR YSHESDWQWALRRSFICRHLHSYPGAALDQLLALSAAWTNHVFLGCRYSPRLMEKILQMA EGIDIGEMPSYDLVLSKPSKGQKRHLSTCDASSSKDERQEDPYGPQTKEVNEQTHFASMP RDIYQDYTQDSFSIQDGNSQYCDSSGFILTKDQPVTANMYFDSGNPAPSTTSQQANSQST PEPSPSQTFPESVVAEKQYFIEKLTATIWKNLSNPEMTSGSDKINYTYMLTRCIQACKTN PEYIYAPLKEIPPADIPKNKKLLTDGYACEVRCQNIYLTTGYAGSKNGSRDRATELAVKL LQKRIEVRVVRRKFKHTFGEDLVVCQIGMSSYEFPPALKPPEDLVVLGKDASGQPIFNAS AKHWTNFVITENANDAIGILNNSASFNKMSIEYKYEMMPNRTWRCRVFLQDHCLAEGYGT KKTSKHAAADEALKILQKTQPTYPSVKSSQCHTGSSPRGSGKKKDIKDLVVYENSSNPVC TLNDTAQFNRMTVEYVYERMTGLRWKCKVILESEVIAEAVGVKKTVKYEAAGEAVKTLKK TQPTVINNLKKGAVEDVISRNEIQGRSAEEAYKQQIKEDNIGNQLLRKMGWTGGGLGKSG EGIREPISVKEQHKREGLGLDVERVNKIAKRDIEQIIRNYARSESHTDLTFSRELTNDER KQIHQIAQKYGLKTLKSNDIYPGGKDFGIICIIKVSGFGAFRILDFQI >gi568815575r:119529318_119775669|GENSCAN_predicted_CDS_2|4107_bp atggggcatatggatgagacaccaaggattccgtgctcaggtgcccccagcgctgggccg tgtaacactttctctttgtcggaggagctcctctgtttcctgtgcagtagctcccgttgc ggcggcacccgtggcagccctggcggacgcaggagcgatggcagcgaccgatatagctcg ccaggtgctgcggccccacggctggggcgttgccgaatgaatggccgtaggagccgccga gccctccgggtccctggcaccctggccttgccgcgcccctgcgcacggctgcccgttgtg acccgggaccgcacaggcaaagaggcgcggggctgggcacatattcactgggcactgatc ggccaggccccgtgccagggtgaaggttgccgaactgtccccctggctggacatgtgggg tttgacagcttgcctgaccagctggtgaataagtccgtcagccagggcttctgcttcaac atcctgtgcgtgggagagacaggtttgggcaagtccaccctcatggacaccctgttcaac accaaattcgaaggggagccagccacccacacacagccgggtgtccagctccagtctaat acctatgacctccaagagagcaacgtgaggctaaagctcacgatcgttagcacagttggc tttggggaccagatcaacaaagaggacagctacaagcctatcgtggaattcatcgatgca caattcgaggcctacctgcaggaagagctaaagatccgaagagtgctacacacctaccat gactcccgaatccatgtctgcttgtatttcattgcccccacgggtcattccctgaagtct ctggacctagtgactatgaagaagctggacagtaaggtgaacatcatccccatcattgcc aaagcagatgccatttcgaagagtgagctaacaaagttcaaaatcaaaatcaccagcgag cttgtcagcaacggagtccagatctatcagtttcctacagatgatgagtcggtggcagag atcaatggaaccatgaactacacccgagatgtgtttgttatcaccatcaccatcttagac aaatctggctcaagggcccacctgccgtttgctgtcattggcagcacagaagaactgaag ataggcaacaagatgatgagggcgcggcagtatccttggggcactgtgcaggttgaaaac gaggcccactgcgactttgtgaagctgcgggagatgctgattcgggtcaacatggaggat ctgcgggagcagacccacacccggcactatgagctgtatcgccgctgtaagctggaggag atgggcttcaaggacaccgaccctgacagcaaacccttcagtttacaggagacatatgag gccaaaaggaacgagttcctaggggaactccagaaaaaagaagaggagatgagacagatg ttcgtccagcgagtcaaagagaaagaagcggagctcaaagaggcagagaaagagctgcac gagaagtttgaccgtctgaagaaactgcaccaggacgagaagaagaaactggaggataag aagaaatccctggatgatgaagtgaatgctttcaagcaaagaaagacggcggctgagctg ctccagtcccagggctcccaggctggaggctcacagactctgaagagagacaaagagaag aaaaatgcgtcaaaggcccgtagtcgtcactggagggaaaaaagtacgtcgcgcgagatt ctgcgacgggatttggaagttagggaacagccgcggcgcaagcgcactggcctcacaacc ccggacggcacgcggtcggcttcgtccctggggtccctgcttgggggcggagaagatggc tggaggacgtctgctgttggggggcgacttcctgtcgcgccgccgctgccccccctcccg ccgccgccgctgccgcccctcccgccgcccccgcccgagccagtgctggagcagtggcgc tatagccacgaaagtgactggcagtgggctctgcggcgcagcttcatctgtcggcacctg cacagctatcccggggctgccctcgaccagctcctcgcgctctccgccgcctggaccaac cacgtcttcctgggctgcaggtacagcccacgcttgatggaaaaaattctccaaatggct gaaggtattgatattggggagatgccttcatatgatctggtgctgtccaaaccttccaaa ggtcaaaaacgccacctctcaacatgtgatgctagtagttcaaaagatgaaagacaggaa gatccttatggccctcaaacaaaagaggtaaatgaacaaacacattttgccagcatgcca agagacatctaccaagattatactcaagactctttcagtatacaagatgggaattctcag tattgtgattcatcaggattcattctcacaaaagaccagcctgtaacagccaacatgtat tttgacagtgggaaccctgccccaagcaccacatcacagcaggcaaactctcagtcaact cctgagccttcaccatcacagacatttcccgagtctgtggtagccgagaagcagtatttt attgaaaaattaacggcgacaatctggaagaacctttctaatccagaaatgacttctgga tctgataaaattaattatacatatatgttaactcgttgtattcaggcgtgtaagacaaat cctgagtatatatatgctcctttaaaggaaattcctcctgccgacatccccaaaaataaa aaacttctaactgatggctatgcttgtgaagttagatgccaaaatatctacttaactaca ggttatgctggcagcaagaatgggtccagggatcgagctacagagctagctgtaaaactc ttgcagaaacgtattgaagttagagttgtccggcggaaattcaagcatacatttggagag gacctcgtggtgtgtcagattggcatgtcctcctatgaatttcctccagctctgaagcca ccagaagacctggtggtgctgggtaaagatgcttccgggcagccaatttttaatgcttct gccaaacactggaccaattttgtcattacagaaaatgcaaatgatgcaattggtatcctt aacaattctgcctcattcaacaagatgtcaattgaatacaaatatgagatgatgccaaat cgcacatggcgttgtcgagtgtttttacaagatcactgcttagctgaaggttatggaacc aagaaaacaagtaaacatgcagctgccgacgaggctttgaaaattcttcaaaaaacacag cccacttatccatctgtcaaaagttcacaatgccatacaggctcttcacccagaggatct ggaaagaagaaagatataaaggatcttgtagtttatgagaattcttcaaatcccgtgtgc acgctgaacgacacagctcagtttaaccgaatgacagttgagtatgtctatgaaaggatg acaggcctccgctggaaatgcaaagtgattctagagagtgaagtaattgcagaagcagtt ggggtgaagaaaactgtcaaatatgaagctgctggggaagctgtgaaaaccctcaaaaag acccagccaactgtcattaacaacttgaagaaaggagctgttgaagatgtgatttcaaga aatgaaattcagggccgctcagcagaggaggcttacaaacagcaaatcaaagaagataat attggaaatcagctgctgagaaagatgggttggactggtggtggtttaggtaaatctggt gagggcatacgggagcctatctcagtgaaagagcagcataagcgggaagggcttggtctg gatgtagagagggtgaataaaattgccaagagagatattgaacagatcatcagaaactac gcccgctccgagagccacacagatttgactttctctagagagctgactaatgatgaacgg aagcaaatacatcagattgcccagaagtatggtcttaagaccctgaagagtaatgatatt tacccagggggtaaagattttggaattatttgcattataaaagtttcaggttttggagca tttcggattttagattttcagatttga >gi568815575r:119529318_119775669|GENSCAN_predicted_peptide_3|137_aa MKHSGAQLASPSGSRTGAAGGAACQSRAVCLHFSALGRSMGLGAVEQGVVLVGEAGAAQE PMEWVGGSGMAGCRSRALPRGKAAKAQREIQHSAGPAGCSECGARQAHAHPELQLARKRC AQPRFLLAPLPPHLPAS >gi568815575r:119529318_119775669|GENSCAN_predicted_CDS_3|414_bp atgaaacactcaggagcccagctggcttcacctagtggatcccgcactggggccgcaggt ggagctgcctgccagtcccgtgccgtgtgcttgcacttctcagcccttgggcggtcgatg ggactgggcgccgtggagcagggggtggtgctcgtcggggaggctggggccgcacaggag cccatggagtgggtgggaggctcaggcatggcgggctgcaggtcccgagccctgccccgc gggaaggcagctaaggcccagcgagaaatccagcacagcgccgggccggccggctgctcc gagtgcggggcccgccaagcccacgcccacccagaactccagctggcccgcaagcgctgc gcgcagccccggttcctgctcgcgcctctccctccacacctccctgcaagctga >gi568815575r:119529318_119775669|GENSCAN_predicted_peptide_4|315_aa MAQLGGAANRAPTASLAPTSQSLRCAPQPRPSRADTGSLGRYWGKAAAAASREHPFPGTL MHSAAGSGRRRGALRELLGLQRAAPAGWLSEERAEELGGPSGPGSSRLCLEPREHAWILA AAEGRYEVLRELLEAEPELLLRGDPITGYSVLHWLAKHGRHEELILVHDFALRRGLRLDV SAPGSGGLTPLHLAALQGHDMVIKVLVGALGADATRRDHSGHRACHYLRPDAPWRLRELS GAEEWEMESGSGCTNLNNNSSGTTAWRAASAVGATAVETSRRVAASRTKAKDTAGSRVAQ MHSLFRHLFPSFQDR >gi568815575r:119529318_119775669|GENSCAN_predicted_CDS_4|948_bp atggcccagctcggaggggccgcgaaccgggcacccacggcctctctcgcgccgacctcg cagagcctgcggtgcgccccgcagccccgcccctcgagagcggacactggtagcctgggc aggtactggggcaaagccgcagccgccgcctcccgggagcaccccttcccaggcacgctg atgcactctgcagcgggctcagggcgccggcggggagcgctgcgggaactgctggggctg cagcgggcggctcctgcggggtggctgtcggaggagcgcgccgaggagctgggcgggccg agtgggccgggcagcagcaggctgtgcctggaaccgcgggagcacgcgtggattctggca gccgccgagggccgctatgaggtgctgcgggagctgctggaggctgagccggagctgctg ctgcggggcgacccgatcaccggctactcggttctgcactggctggccaagcacgggcgc cacgaggagctcattctggtacacgatttcgccctacgccgggggctgaggctcgacgtg agcgccccaggcagcggcggcctcacgcccctccacctggcggcccttcagggccacgac atggtcatcaaggtgctggtgggcgccctgggtgctgacgctacgcgccgcgaccacagc ggccaccgggcctgccactacctgcggcccgacgcgccttggaggttgcgggagctgtcg ggagccgaggaatgggagatggagagcggcagcgggtgcaccaacctgaacaacaacagc agtggcaccactgcgtggagggccgcgagcgcagtgggcgcgacggctgtggagacaagc aggagagtggcagcgtcgcggaccaaggcgaaggacaccgcgggcagccgggtggcgcaa atgcatagccttttccgccatctgttcccctcattccaggaccgttga