GENSCAN 1.0	Date run: 3-Nov-116	Time: 15:13:01

Sequence gi568815591f:77437547_77739277 : 301731 bp : 40.39% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5044   5098   55  0  1   90   30    92 0.838   5.10
 1.02 Intr +  10125  10256  132  1  0   57   68    70 0.703   1.60
 1.03 Term +  10440  10543  104  1  2  129   43   119 0.997   8.96
 1.04 PlyA +  10575  10580    6                               1.05

 2.05 PlyA -  12429  12424    6                               1.05
 2.04 Term -  18104  17937  168  2  0   70   54   199 0.158  11.60
 2.03 Intr -  24563  24425  139  1  1   15   94   122 0.093   5.05
 2.02 Intr -  28674  28598   77  0  2   97   67    40 0.088   0.19
 2.01 Init -  34273  34124  150  1  0  103   77    50 0.329   5.49
 2.00 Prom -  36252  36213   40                              -8.75

 3.00 Prom +  38750  38789   40                              -6.55
 3.01 Init +  48462  48613  152  2  2   62   65   104 0.717   5.06
 3.02 Intr +  52197  52375  179  0  2  -36   33   248 0.007   5.54
 3.03 Term +  69299  69486  188  0  2  123   38   128 0.416   7.97
 3.04 PlyA +  69910  69915    6                              -0.45

 4.07 PlyA -  70022  70017    6                               1.05
 4.06 Term -  72637  72543   95  2  2   85   49    80 0.249   0.81
 4.05 Intr -  92116  92038   79  1  1   68  110    64 0.574   4.81
 4.04 Intr -  99844  99544  301  0  1   93  -51   302 0.974  12.61
 4.03 Intr - 100190 100064  127  0  1   97   25   101 0.940   3.52
 4.02 Intr - 100774 100700   75  2  0   42   65   116 0.640   3.27
 4.01 Init - 107839 107761   79  1  1   60   87    42 0.707   2.57
 4.00 Prom - 117738 117699   40                              -5.75

 5.00 Prom + 131811 131850   40                              -3.25
 5.01 Init + 136518 136685  168  2  0   35  110    56 0.255   2.08
 5.02 Intr + 139217 139444  228  1  0   61   62   122 0.450   4.04
 5.03 Intr + 146009 146104   96  1  0   72  103    64 0.770   5.59
 5.04 Intr + 163118 163260  143  1  2   69   87    78 0.366   4.03
 5.05 Intr + 169689 169755   67  0  1   41   78    48 0.225  -2.91
 5.06 Intr + 173402 173500   99  1  0  104   66   158 0.999  14.59
 5.07 Intr + 180934 181019   86  0  2   98   72   122 0.974   9.30
 5.08 Intr + 189159 190129  971  0  2   82   85   759 0.984  64.71
 5.09 Intr + 198185 198303  119  0  2    6  100   100 0.528   2.26
 5.10 Intr + 216313 216451  139  0  1   12   81   122 0.005   3.02
 5.11 Intr + 225384 225645  262  1  1  -86  106   201 0.053   0.12
 5.12 Term + 235026 235122   97  0  1   42   48   143 0.130   2.16
 5.13 PlyA + 236545 236550    6                               1.05

 6.00 Prom + 246413 246452   40                              -4.85
 6.01 Init + 258924 259509  586  2  1   94   90   504 0.900  44.61
 6.02 Intr + 260184 260289  106  1  1  102   56    80 0.877   4.55
 6.03 Intr + 265385 265493  109  2  1   42   80    78 0.581   1.77
 6.04 Intr + 273438 273549  112  2  1   40   28    57 0.001  -6.07
 6.05 Intr + 298864 298980  117  2  0  128  106    84 0.058  13.72

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  24431  24605  175  2  1   51   48   136 0.803   2.15
S.002 Term + 201673 201747   75  0  0   93   43    98 0.886   2.66

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:77437547_77739277|GENSCAN_predicted_peptide_1|96_aa
MDEIKEEIDSQTYIGQETVLRKDIIKPGEQPNLPNSSHISGDINHNPLPFTPGHLLRDRN
RMDCNVNVLQEKGPNPDPKRGFLDLAQEIIQGKSIE

>gi568815591f:77437547_77739277|GENSCAN_predicted_CDS_1|291_bp
atggatgaaatcaaagaagaaattgattcacaaacatatatcggacaagagactgtgctc
aggaaagacatcatcaaacctggtgagcagccaaatctccccaactcatcccacatcagt
ggggacataaaccacaaccctcttcccttcacaccagggcatcttcttagggacagaaac
aggatggactgcaatgtgaacgtgttacaggaaaagggtccgaatccagaccccaagaga
gggttcttggatcttgcgcaagaaataattcagggcaagtccatagagtaa

>gi568815591f:77437547_77739277|GENSCAN_predicted_peptide_2|177_aa
MESEQAEVRGKKVFQVDGSVCVKAERWERAQSIRGTKIPHGFPPIKLKRVSILMEEDETG
NVPIWGSTESLWCNFLCSSKHWMAKRSPPVSGLLANKWCLHSVQFKVPAPKRHTERTWAL
LQLLLGTWCSQADESSVDFGSGLVTQSIRDDAESALARATAAIAAASGSQWQQTQLF

>gi568815591f:77437547_77739277|GENSCAN_predicted_CDS_2|534_bp
atggagagtgaacaggcagaggtaaggggaaaaaaagtattccaggtagatgggtcagta
tgtgtgaaggccgaaaggtgggagagagcacagtctattcgaggaactaaaattccacat
ggctttcccccaataaagctgaagagagtgtccatactgatggaagaagatgagacaggg
aatgttcctatttggggctccactgaaagcctctggtgcaatttcctatgctcatctaaa
cactggatggcaaagaggtcaccacccgtgtcaggcttattggcaaacaaatggtgcttg
cacagtgtccagttcaaggtgccagctccgaaaaggcacactgagcgcacatgggcccta
ctgcagctgcttttggggacttggtgcagccaagcagatgaatcctctgttgactttggt
tcggggctggtgactcagagcatcagggatgatgcagaatccgctctggccagggccaca
gcagcaatagcagctgcatctggttcccagtggcagcagacacagctgttctga

>gi568815591f:77437547_77739277|GENSCAN_predicted_peptide_3|172_aa
MWWIVYGKRKDELPDYKDEKHWNQLLREREFVESVSWKKWLSHRKKFFDRSKLEEVDIKE
AFNIDNEPPVVHSLTNGEIAEIVLDQDDHNNSDDEDDVINTKEKVPIEDMGKQSMAVHGN
RLYKRADGSFAPDYSFLPSDRLEYGCDGWSSPAETRGDLGNGSYKSTRPPQL

>gi568815591f:77437547_77739277|GENSCAN_predicted_CDS_3|519_bp
atgtggtggattgtatatggtaaacgcaaagatgaacttcctgattataaggatgagaaa
cactggaaccagctccttagagagagggagtttgtagaatctgtatcctggaagaagtgg
ctctctcacaggaaaaagttctttgatagaagtaagctggaagaagtagatatcaaagaa
gcttttaacattgataatgagcctccagttgttcattcattgaccaatggtgaaatagct
gaaattgttctggatcaagatgatcacaataatagtgacgatgaagatgatgtcattaac
accaaagaaaaggtgcctatagaagacatgggtaagcaaagtatggcagttcatggaaac
cgtctttataagagagctgatgggtcctttgcccccgattattcttttcttccttcagac
agactggaatatggttgtgatggctggagtagcccagctgaaacaagaggtgaccttggg
aatggaagctacaagagtaccaggccacctcagctctag

>gi568815591f:77437547_77739277|GENSCAN_predicted_peptide_4|251_aa
MLPVQPTSPDCTTSMVSISLQMLPLEITRTPRPPQTWASGEEKRDAGKWKLEVHKALAAR
PGEAAMGSGAGKKTRQRGERLTMKSRAKLSSPLWRRLRRHRLGSRHHNFQEQPPLSHSTR
RAAALAPTPTPSLLHPPSPPSSSVGDAAAAAAAAAASGTQLALRSPCPPFPVPPPPHPPP
AAAAAAAAASPHGRNRAVIKSVGSEVREMKVQKPVPFLVQDTATIRFLNLFPTFPAFLFH
KAAIVILARSQ

>gi568815591f:77437547_77739277|GENSCAN_predicted_CDS_4|756_bp
atgcttccagtgcagcccacatctcctgattgtacaaccagtatggtttccattagccta
cagatgctgccacttgaaatcacacggaccccaagaccacctcaaacctgggcctcagga
gaggagaagcgggacgcgggtaaatggaaattagaggtacacaaagcactggcggcccgg
ccgggagaggcggcgatgggctccggcgccggcaagaaaacgcgacagcgaggggagaga
ctcaccatgaagtcccgggcgaagttgtcctccccattgtggcggcggctgcggcggcac
cggctgggctcccgacaccacaacttccaggagcagccgccgctctcccacagtaccagg
cgagccgctgcgctagcgccaaccccaacccccagcttgctccatcctccttccccgccc
tcctcctccgtcggagacgccgccgccgccgccgccgccgccgccgcctcaggaacacag
ctagctctccgctcgccctgcccccccttcccggttcctcctcccccccaccccccgcca
gcagccgcagccgcagccgcagccgcctctccccacgggcgcaatagggcagtaataaag
agcgtgggctctgaagtcagagaaatgaaggttcaaaaacctgttccatttcttgttcag
gacacggcaaccatccgatttctcaatcttttccccacctttcccgcctttctattccac
aaagccgccattgtcatcctggcccgttctcaatga

>gi568815591f:77437547_77739277|GENSCAN_predicted_peptide_5|824_aa
MQQKTNWQKSLPLWSLYVSQGDGRYILTYMVCHVVVISVIESVVRMRDREKAENRRFPAL
CRVLMVAQNHGCCGFLPKVSQELRVPLNKLQKMQAGEGHPVCCGSGMWQKSQLGGRGAIL
EGLADPTIEQGLGVYGPKAYVATQGPLANTVIDFWRMIWEYNVVESRRLYQFHYVNWPDH
DVPSSFDSILDMISLMRKYQEHEDVPICIHCSAGCGRTGAICAIDYTWNLLKAGEQYELV
HRAIAQLFEKQLQLYEIHGAQKIADGVNEINTENMVSSIEPEKQDSPPPKPPRTRSCLVE
GDAKEEILQPPEPHPVPPILTPSPPSAFPTVTTVWQDNDRYHPKPVLHMVSSEQHSADLN
RNYSKSTELPGKNESTIEQIDKKLERNLSFEIKKVPLQEGPKSFDGNTLLNRGHAIKIKS
ASPCIADKISKPQELSSDLNVGDTSQNSCVDCSVTQSNKVSVTPPEESQNSDTPPRPDRL
PLDEKGHVTWSFHGPENAIPIPDLSEGNSSDINYQTRKTVSLTPSPTTQVETPDLVDHDN
TSPLFRTPLSFTNPLHSDDSDSDERNSDGAVTQNKTNISTASATVSAATSTESISTRKVL
PMSIARHNIAGTTHSGAEKEISEIQGVNNNKISFFSDTPVRSEWSELQSQERSEQKKSES
LGVLLACKEAQAKPWTDERPRGEIPNKSQPTSHGRKSYSSALMQNEKKEKGRKEGRKEGT
KEGDREREEGERGRKEGRKETERGKKEREEGRRERKEGEKKEGERKKEGKKEKERKEKEK
EKERKKKEEERRNKHAEILAKTAVDETAKGEPTIMVGVKLDELK

>gi568815591f:77437547_77739277|GENSCAN_predicted_CDS_5|2475_bp
atgcaacaaaaaacaaactggcaaaaatccctgcccttgtggagcttatatgttagtcaa
ggtgatggacgatacatattaacatatatggtctgtcatgtggtggtaattagtgttatt
gagagtgtggttcgaatgagggatagggagaaagcagagaatagaaggttcccggcgctt
tgccgggtgttaatggtggcccaaaatcatgggtgctgtggctttctccctaaggtgtca
caggagctaagggtgccacttaacaaactgcagaagatgcaggcaggtgaaggacaccca
gtctgctgtggcagtggaatgtggcagaaaagccagctgggagggcggggagcaatcctg
gaaggcctggctgaccccacaattgaacagggtttgggcgtctatgggccaaaagcatat
gtagcaactcaaggacctttagcaaatacagtaatagatttttggaggatgatatgggag
tataatgttgtggaatctcgtaggctgtatcagtttcattatgtgaactggccagaccat
gatgttccttcatcatttgattctattctggacatgataagcttaatgaggaaatatcaa
gaacatgaagatgttcctatttgtattcattgcagtgcaggctgtggaagaacaggtgcc
atttgtgccatagattatacgtggaatttactaaaagctggggagcaatatgaacttgtt
catagagctattgcccaactgtttgaaaaacagctacaactatatgaaattcatggagct
cagaaaattgctgatggagtgaatgaaattaacactgaaaacatggtcagctccatagag
cctgaaaaacaagattctcctcctccaaaaccaccaaggacccgcagttgccttgttgaa
ggggatgctaaagaagaaatactgcagccaccggaacctcatccagtgccacccatcttg
acaccttctcccccttcagcttttccaacagtcactactgtgtggcaggacaatgataga
taccatccaaagccagtgttgcatatggtttcatcagaacaacattcagcagacctcaac
agaaactatagtaaatcaacagaacttccagggaaaaatgaatcaacaattgaacagata
gataaaaaattggaacgaaatttaagttttgagattaagaaggtccctctccaagaggga
ccaaaaagttttgatgggaacacacttttgaataggggacatgcaattaaaattaaatct
gcttcaccttgtatagctgataaaatctctaagccacaggaattaagttcagatctaaat
gtcggtgatacttcccagaattcttgtgtggactgcagtgtaacacaatcaaacaaagtt
tcagttactccaccagaagaatcccagaattcagacacacctccaaggccagaccgcttg
cctcttgatgagaaaggacatgtaacgtggtcatttcatggacctgaaaatgccataccc
atacctgatttatctgaaggcaattcctcagatatcaactatcaaactaggaaaactgtg
agtttaacaccaagtcctacaacacaagttgaaacacctgatcttgtggatcatgataac
acttcaccactcttcagaacacccctcagttttactaatccacttcactctgatgactca
gactcagatgaaagaaactctgatggtgctgtgacccagaataaaactaatatttcaaca
gcaagtgccacagtttctgctgccactagtactgaaagcatttctactaggaaagtattg
ccaatgtccattgctagacataatatagcaggaacaacacattcaggtgctgaaaaagaa
atttcagaaattcaaggtgttaataacaataagatttcatttttctcagatacacctgta
agatcggaatggagtgaacttcaaagtcaggaacgatctgaacaaaaaaagtctgaatct
cttggggtcctgctggcatgtaaagaagctcaggctaagccatggacggatgagagacca
cgtggggagattcccaacaagtcccaacccacatcacatggaagaaagtcatacagcagc
gccctcatgcagaatgaaaagaaagaaaaaggaaggaaggaaggaaggaaggaaggaacg
aaggaaggagacagagagagggaagaaggagagagaggaaggaaggagggaaggaaggag
acagagagaggaaagaaggagagagaggaaggaaggagagagaggaaagaaggagagaag
aaggagggagagagaaagaaagaaggaaagaaagaaaaagaaagaaaagaaaaagagaaa
gaaaaagaaagaaagaaaaaagaggaagaaagaagaaacaaacatgcggaaatactggca
aagacagctgtagatgaaacagccaagggagagccaacaattatggttggtgttaagttg
gatgaattgaaataa

>gi568815591f:77437547_77739277|GENSCAN_predicted_peptide_6|344_aa
MAEPPSPVHCVAAAAPTATVSEKEPFGKLQLSSRDPPGSLSAKKVRTEEKKAPRRVNGEG
GSGGNSRQLQPPAAPSPQSYGSPASWSFAPLSAAPSPSSSRSSFSFSAGTAVPSSASASL
SQPVPRKLLVPPTLLHAQPHHLLLPAAAAAASANAKSRRPKEKREKERRRHGLGGAREAG
GASREENGEVKPLPRANDKTRSFDDFSPDQAAAECLSIRKRRILKGSCQTGYLRKALFSD
TVTLGIKASTYGFWGGMIQSIVKGLAETFNFWIHIWELNDLQYISALFACSEEMPDYHLE
KQLADKIKDKIKERDKEKEREKKKHKVMNEIKKENGEVKILLKX

>gi568815591f:77437547_77739277|GENSCAN_predicted_CDS_6|1032_bp
atggcggaaccgccgagccccgtgcactgtgtcgctgccgcggcccccaccgccaccgtc
tcggagaaagaaccgtttggcaagctgcaactctcctcccgggaccctccgggttctctg
tccgccaagaaggtccggactgaggagaagaaggcaccgcggagagtgaacggagaaggg
ggcagcggcgggaacagcaggcagctgcagccgccggcagcaccttcgcctcagagctat
ggcagccccgcgtcttggagctttgcccctctgtctgctgctccctccccgtcctcttct
cggagcagtttctctttctccgctggcacggccgttccctcctcagcctccgcttccttg
tctcagccggtgccgcgcaaactgctggtccctcctacgctgctgcacgctcagcctcac
catctcctcctgcccgccgccgccgccgctgcctcggctaacgccaagtcgcgcagacct
aaggagaagcgggagaaggagaggaggaggcacggtctcggtggggcccgagaggccggc
ggggcctcccgggaggagaacggggaggtgaagccgctgccccgagcaaatgataaaacc
aggagctttgacgatttttcgccagatcaagcggctgcagaatgtttgagcattagaaaa
aggcgaattcttaagggttcttgtcagacaggttacctccgaaaagccttattttcagat
acagtcacattgggcattaaggcttcaacatatggattttggggaggcatgattcagtcc
atagtgaagggcttggcagagaccttcaacttctggatacatatttgggaactgaatgac
ttgcagtatatttcagccctttttgcctgctcagaagagatgcccgattatcatttagaa
aaacaattggctgataaaatcaaagacaaaattaaagagagagacaaagaaaaagaaaga
gaaaaaaagaaacataaagtaatgaatgagatcaagaaagagaatggagaagtaaagatt
ttgctgaaaann