GENSCAN 1.0 Date run: 7-Nov-116 Time: 01:33:09 Sequence gi568815583f:65769586_65987837 : 218252 bp : 44.09% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 84 79 6 1.05 1.02 Term - 1887 1587 301 2 1 101 43 261 0.928 17.59 1.01 Init - 2527 2229 299 1 2 38 8 154 0.403 -0.90 1.00 Prom - 36319 36280 40 -2.26 2.02 PlyA - 37289 37284 6 1.05 2.01 Sngl - 47081 46674 408 2 0 72 32 218 0.982 10.89 2.00 Prom - 48453 48414 40 -4.66 3.00 Prom + 70384 70423 40 -3.36 3.01 Init + 99866 100040 175 1 1 87 100 199 0.936 18.59 3.02 Intr + 107747 107942 196 0 1 110 94 150 0.999 16.27 3.03 Intr + 108177 108370 194 0 2 96 92 152 0.999 15.54 3.04 Intr + 110086 110166 81 2 0 107 110 6 0.950 4.31 3.05 Term + 118116 118255 140 1 2 93 48 138 0.964 8.43 3.06 PlyA + 118474 118479 6 1.05 4.20 PlyA - 119073 119068 6 1.05 4.19 Term - 128509 128349 161 2 2 71 48 57 0.702 -1.90 4.18 Intr - 129349 129143 207 2 0 84 74 112 0.986 8.45 4.17 Intr - 136556 136500 57 0 0 84 75 58 0.862 2.96 4.16 Intr - 139550 139449 102 0 0 86 71 82 0.815 6.45 4.15 Intr - 144388 144152 237 2 0 105 110 252 0.999 26.69 4.14 Intr - 146013 145885 129 1 0 78 85 100 0.976 9.37 4.13 Intr - 146691 146563 129 1 0 97 63 118 0.396 10.87 4.12 Intr - 147371 147027 345 0 0 60 77 96 0.417 1.06 4.11 Intr - 148483 148381 103 1 1 20 98 108 0.476 4.75 4.10 Intr - 148940 148840 101 0 2 77 74 83 0.483 5.63 4.09 Intr - 151981 151817 165 2 0 75 25 81 0.473 0.43 4.08 Intr - 152887 152753 135 2 0 105 79 87 0.826 10.04 4.07 Intr - 153384 153238 147 1 0 106 72 175 0.999 17.91 4.06 Intr - 158942 158840 103 2 1 145 91 43 0.884 10.05 4.05 Intr - 160298 160135 164 0 2 105 117 82 0.994 12.49 4.04 Intr - 161358 161238 121 0 1 98 90 76 0.783 8.87 4.03 Intr - 162855 162749 107 1 2 43 94 19 0.391 -2.07 4.02 Intr - 168646 168552 95 0 2 73 89 67 0.749 4.91 4.01 Init - 168964 168906 59 1 2 90 36 42 0.558 -0.02 4.00 Prom - 169105 169066 40 -6.76 5.00 Prom + 175010 175049 40 -4.66 5.01 Init + 176995 177043 49 0 1 75 58 48 0.346 -0.39 5.02 Term + 180481 180593 113 2 2 101 38 156 0.988 10.62 5.03 PlyA + 181049 181054 6 1.05 6.08 PlyA - 182735 182730 6 1.05 6.07 Term - 184139 184104 36 2 0 108 44 49 0.590 -0.16 6.06 Intr - 188136 187962 175 2 1 90 95 95 0.898 10.34 6.05 Intr - 195535 195323 213 0 0 140 80 200 0.605 22.23 6.04 Intr - 201104 200968 137 2 2 73 110 28 0.228 2.87 6.03 Intr - 211313 211193 121 1 1 93 127 73 0.865 12.20 6.02 Intr - 212903 212657 247 0 1 123 75 403 0.999 39.02 6.01 Intr - 217603 217464 140 0 2 -6 84 103 0.279 0.61 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583f:65769586_65987837|GENSCAN_predicted_peptide_1|199_aa MCRHRPGGSECWQPREGPAFPAEGREALPTALGLRINKYKNYYEVLGVTKDSGDEDLKKA YRKLALKFHPDKNHAPGATDAFKKVGNAYAVLSNPEKRKQSGTGQTIKTQTENLGGVYYV SKDFKNEYKGMLLQKAEKSVEEDYVTNIRNNCWKERQQKTDMQYAAKVYRGDRLRRKSDA LSMDNCKELERLTGLYKGG >gi568815583f:65769586_65987837|GENSCAN_predicted_CDS_1|600_bp atgtgtcgacatcgtccaggaggctctgaatgctggcaaccgcgagaaggcccagcgttt cctgcagaaggccgagaagctctacccactgcccttggcctgcgcataaacaaatataaa aattactatgaagtacttggagttacgaaagatagtggtgatgaagatttgaaaaaagct tatagaaagcttgctttgaagtttcatccagacaaaaaccatgcacctggagcaacagat gcttttaaaaaggttgggaatgcttatgctgttttaagtaatccagaaaagcgaaaacaa tctggaacagggcaaactattaaaacgcagacagaaaacttgggtggtgtttattatgtc agcaaggactttaaaaatgaatataaaggaatgttattacaaaaggcagaaaagagtgtg gaggaagattatgtgactaatattcgaaataactgctggaaagaaagacaacaaaaaaca gatatgcagtatgcagcaaaagtataccgtggtgatcgactccgaaggaagtcagatgcc ttgagcatggacaactgtaaagaattagagcggcttaccggtctttataaaggaggatga >gi568815583f:65769586_65987837|GENSCAN_predicted_peptide_2|135_aa MAGASASHRGQSGSGNFGGSHGGSFGGNDNFGHGGNLSGLGGFNDSSGGGGYGGSGDGYN GFSNDGSNFGGGGSYDDFGNYNNRSSNFRPMKGRNFGGRSSGPYGGGGQYFAQPRNQGGR GGSSSSSSYGSGRRF >gi568815583f:65769586_65987837|GENSCAN_predicted_CDS_2|408_bp atggctggtgcttcagccagccacagaggtcaaagtggctctggaaactttggtggtagt catggaggtagtttcggtgggaatgacaattttggtcatggaggaaacttaagtggtctt ggtggctttaatgacagctctggtggtggtggatatggtggcagtggggatgggtataat ggatttagtaatgatggaagcaattttgggggtggtggaagctacgatgattttggcaat tacaacaatcggtcttcaaatttcagacccatgaagggaagaaactttggaggcagaagc tctggcccctatggtggtggaggccaatactttgcccaaccacgaaaccaaggtggccgt ggcggttccagtagcagcagtagctatggcagtggcagaagattttaa >gi568815583f:65769586_65987837|GENSCAN_predicted_peptide_3|261_aa MAQRQGGALHPVRQLKLGARVTPAATPPGPTDTTAAPALSLLGRAMGTRDDEYDYLFKVV LIGDSGVGKSNLLSRFTRNEFNLESKSTIGVEFATRSIQVDGKTIKAQIWDTAGQERYRA ITSAYYRGAVGALLVYDIAKHLTYENVERWLKELRDHADSNIVIMLVGNKSDLRHLRAVP TDEARAFAEKNGLSFIETSALDSTNVEAAFQTILTEIYRIVSQKQMSDRRENDMSPSNNV VPIHVPPTTENKPKVQCCQNI >gi568815583f:65769586_65987837|GENSCAN_predicted_CDS_3|786_bp atggcgcagcggcagggaggggctcttcacccagtccggcagttgaagctcggcgctcgg gttacccctgcagcgacgccccctggtcccacagataccactgctgctcccgccctttcg ctcctcggccgcgcaatgggcacccgcgacgacgagtacgactacctctttaaagttgtc cttattggagattctggtgttggaaagagtaatctcctgtctcgatttactcgaaatgag tttaatctggaaagcaagagcaccattggagtagagtttgcaacaagaagcatccaggtt gatggaaaaacaataaaggcacagatatgggacacagcagggcaagagcgatatcgagct ataacatcagcatattatcgtggagctgtaggtgccttattggtttatgacattgctaaa catctcacatatgaaaatgtagagcgatggctgaaagaactgagagatcatgctgatagt aacattgttatcatgcttgtgggcaataagagtgatctacgtcatctcagggcagttcct acagatgaagcaagagcttttgcagaaaagaatggtttgtcattcattgaaacttcggcc ctagactctacaaatgtagaagctgcttttcagacaattttaacagagatttaccgcatt gtttctcagaagcaaatgtcagacagacgcgaaaatgacatgtctccaagcaacaatgtg gttcctattcatgttccaccaaccactgaaaacaagccaaaggtgcagtgctgtcagaac atctaa >gi568815583f:65769586_65987837|GENSCAN_predicted_peptide_4|888_aa MGIDEDRKTPKHQLPGAKKGLLTYKTRQLDQVIPSVPFSSSIMGYLPTIRTVELTAYPLT GRVDAQIHPSGQKHSFFPSCQKFSGSQGEVCAVSCAAGTYGPNCSSICSCNNGGTCSPVD GSCTCKEGWQGLDCTLPCPSGTWGLNCNESCTCANGAACSPIDGSCSCTPGWLGDTCELP CPDGTFGLNCSEHCDCSHADGCDPVTGHCCCLAGWTGIRCDSTCPPGRWGPNCSVSCSCE NGGSCSPEDGSCECAPGFRGPLCQRICPPGFYGHGCAQPCPLCVHSSRPCHHISGICECL PGFSGALCNQGQDSVDLGEGSMDHAGLQLGLSLKGGKGRKAFCVEEAAWAKHRDMGMLQG TADGYAVHILYPRGCGIRDVGESGNRKQKAVEQYREHWTDCAQLCSCANNGTCSPIDGSC QCFPGWIGKDCSQACPPGFWGPACFHACSCHNGASCSAEDGACHCTPGWTGLFCTQRKPH LLASQPLRIPCCGLLATVGIVQTSREGGMQAAPGLVVPDSCPTRTEELCRGSSRPDWIQG IDKPKVLEGCPAAFFGKDCGRVCQCQNGASCDHISGKCTCRTGFTGQHCEQRCAPGTFGY GCQQLCECMNNSTCDHVTGTCYCSPGFKGIRCDQAALMMEELNPYTKISPALGAERHSVG AVTGIMLLLFLIVVLLGLFAWHRRRQKEKGRDLAPRVSYTPAMRMTSTDYSLSDSHFQIS ALEARYPPEDFYIELRHLSRPAEPHSPGACGMDRRQNTYIMDKGFKDYMKESVCSSSTCS LNSSENPYATIKDPPILTCKLPESSYVEMKSPVHMGSPYTDVPSLSTSNKNIYEVEPTVS VVQEGCGHNSSYIQNAYDLPRNSHIPGHYDLLPVRQSPANGPSQDKQS >gi568815583f:65769586_65987837|GENSCAN_predicted_CDS_4|2667_bp atggggatagatgaagatagaaagacacccaagcatcaacttccaggtgctaagaaaggt ttactcacctataaaacaaggcagctggaccaggtgattcccagtgtccctttcagctct agcatcatgggctatctacccaccattaggacagtggagctaacagcatacccacttacc ggtagagtggacgcccagattcacccctcaggacagaagcactcattcttccccagctgc cagaaatttagtggctcgcagggagaggtctgtgccgtttcctgtgcagcagggacctat ggccccaactgctcgtccatctgtagctgtaacaatggtggcacctgctccccagtagat ggctcctgtacctgcaaggaagggtggcagggcctggactgcaccctgccatgtcccagt gggacgtggggcctgaactgcaacgagagctgcacctgtgccaatggggcagcctgcagc cccatagacggctcctgctcctgcactcctggctggctgggagacacctgtgagctgcct tgcccggatggcacatttgggctgaactgcagtgaacactgtgactgcagccatgctgat ggatgtgaccccgtcacaggccactgctgctgcctggccggatggacaggcatccgctgt gacagcacgtgtccacctggccgctggggccccaactgctctgtctcctgcagctgtgag aatggaggctcctgctccccagaggatgggagctgcgagtgtgcccctggcttccgagga cccttatgccagagaatctgcccccctgggttctatggccacggctgcgcccagccatgc cccctctgcgtgcacagcagcaggccctgccaccacatcagcggcatctgtgagtgcctc ccaggattctctggagctctctgcaaccaaggacaagacagtgtggatttgggtgaaggc tccatggaccatgcaggacttcagctgggccttagcttgaaaggtggaaaaggaagaaaa gcattctgtgtagaggaagcagcttgggcaaagcacagagacatgggcatgctacaggga acagctgatggatatgctgtacatattctctaccctcgtggttgtggaatccgtgatgtt ggtgaaagtggtaatagaaagcagaaggctgtggaacagtacagggagcactggactgac tgtgcccagctctgctcctgtgccaacaacgggacctgcagccctatcgatggctcctgc cagtgctttcctggatggattggcaaggactgctcacaggcttgcccacccgggttctgg ggccccgcctgcttccacgcatgcagctgccacaacggggcgagctgcagcgccgaggac ggggcctgccactgcacccctggctggactggactcttctgcacacagcgtaagccccac ctcctggcctcccagccacttagaataccatgctgcggcctactggccactgtgggcatt gtccagaccagcagggagggaggaatgcaggcagccccaggactggtggtacctgactcc tgccccacgagaactgaggagctctgcaggggcagctccagacctgactggattcagggg attgacaagcccaaagtcctggaaggctgcccagcagcattttttgggaaggactgtggg cgcgtatgccagtgtcagaatggcgccagctgtgaccacatcagtggcaagtgcacctgc cgcacaggcttcaccgggcaacactgtgagcagagatgtgccccaggaacctttggctat gggtgtcagcagctatgtgagtgcatgaacaactccacctgtgaccatgtcaccggcacc tgttactgcagccctggcttcaaaggaatcaggtgtgaccaagctgccctcatgatggag gagctgaatccctacaccaagatcagcccagcactgggtgcagagcggcactcggtgggt gctgtcacaggcatcatgctcctgttattcctcattgtggtgctgctgggcctatttgcc tggcatcggcggcggcagaaagagaagggccgagacctggctccccgtgtctcctacaca cctgccatgaggatgaccagcaccgactactccctctcagactcccatttccagatcagt gccctggaggccaggtacccgcccgaggacttctacattgaacttagacacctcagccgc cccgctgagccacactcaccaggtgcttgtggaatggatagacgtcagaacacatacatt atggacaaaggcttcaaagattacatgaaagaatccgtgtgcagttctagtacttgttcc ttgaatagcagtgaaaacccttacgccacaattaaggacccacccatcctcacctgcaag cttccagaaagcagctatgtagaaatgaagtcgcctgtgcacatggggtctccgtacaca gatgtgccatccttgtcgacatctaataaaaatatatatgaagttgagcccacagtcagt gtggtccaagaaggttgcggtcataactccagctatatccagaatgcatacgacctacct aggaacagccatattcctggtcattatgacctcctcccagtaagacagagccctgccaat gggccgtcccaggacaagcaatcttaa >gi568815583f:65769586_65987837|GENSCAN_predicted_peptide_5|53_aa MGFHHVGQAALELLSSGHSGQDQSSSSSERPLGVNNNSSSSSGIIIIINSNNS >gi568815583f:65769586_65987837|GENSCAN_predicted_CDS_5|162_bp atggggtttcaccatgttggccaggctgctctcgaactcctgagctcagggcactcaggc caggaccagtcatcctcttcaagtgagagaccccttggtgtaaataacaacagcagcagc agtagtggcatcatcatcatcatcaacagcaataacagctaa >gi568815583f:65769586_65987837|GENSCAN_predicted_peptide_6|356_aa XNCLINCSYKFKSLSVKKTVVKPFALLGTQGHTKPGPQSLKQSAGEGGCDSDHWGPHCSN RCQCQNGALCNPITGACVCAAGFRGWRCEELCAPGTHGKGCQLPCQCRHGASCDPRAGEC LCAPGYTGVYCEELCPPGSHGAHCELRCPCQNGGTCHHITGECACPPGWTGAVCAQPCPP GTFGQNCSQDCPCHHGGQCDHVTGQCHCTAGYMGDRCQEECPFGSFGFQCSQHCDCHNGG QCSPTTGACECEPGYKGPRCQERLCPEGLHGPGCTLPCPCDADNTISCHPVTGACTCQPG WSGHHCNESCPVGYYGDGCQLPCTCQNGADCHSITGGCTCAPGFMDIGSGILFTEM >gi568815583f:65769586_65987837|GENSCAN_predicted_CDS_6|1071_bp nntaactgcctgatcaactgtagttacaaatttaaaagcctgtcggtgaagaagacggtg gtgaagccatttgccctcttggggacccaggggcacactaagcctggaccccaaagtctg aagcagtcagcaggggaagggggctgcgacagcgaccactgggggccccactgcagcaac cggtgccagtgccagaacggcgccctgtgtaaccccatcacaggcgcctgcgtgtgcgcc gccggcttccgtggatggcgctgcgaggagctctgcgcacctggcacccacggcaaggga tgccagctgccgtgccagtgccgacacggtgccagctgcgacccccgcgccggcgagtgc ctctgcgcacctggctacaccggcgtctactgcgaggagctgtgccctcctgggagccat ggagctcactgtgagctgcgctgcccctgtcagaatgggggcacctgccaccacatcact ggcgagtgtgcctgccccccaggctggacgggagcagtgtgtgcccagccctgcccacca gggacatttggccagaactgcagccaggattgtccttgccaccatggagggcagtgtgac cacgtgactggacagtgccactgtacagctggatacatgggggacaggtgccaagaggag tgccccttcgggtccttcggcttccagtgctcacagcactgtgactgccacaatgggggg cagtgttcacccaccacgggtgcctgcgagtgtgagcctggctacaagggcccacgctgc caggagcgactgtgcccggagggcctgcatggcccaggctgcaccctgccctgcccctgt gacgctgacaacaccatcagctgccacccagtaactggagcttgtacctgccagccaggc tggtctggtcaccactgcaatgaatcctgccctgttggctactatggcgatggctgccag ctgccttgcacctgtcagaatggcgccgactgccacagcatcactgggggctgcacttgt gctccgggcttcatggacattggaagtggaatcctgttcactgaaatgtga