GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:15:18 Sequence gi568815586r:122373500_122582088 : 208589 bp : 43.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 Intr - 4461 3890 572 1 2 76 97 427 0.012 34.88 1.01 Init - 6953 6869 85 2 1 50 50 82 0.577 1.58 1.00 Prom - 56174 56135 40 -4.06 2.10 PlyA - 56971 56966 6 1.05 2.09 Term - 100776 99998 779 1 2 73 39 557 0.943 42.83 2.08 Intr - 104459 104342 118 2 1 71 96 122 0.998 11.34 2.07 Intr - 104793 104707 87 0 0 90 41 69 0.853 2.57 2.06 Intr - 106812 106691 122 1 2 52 105 22 0.986 0.51 2.05 Intr - 108165 108023 143 2 2 73 75 84 0.988 5.60 2.04 Intr - 108588 108446 143 0 2 8 47 134 0.953 0.45 2.03 Intr - 115964 115887 78 2 0 58 84 107 0.981 7.05 2.02 Intr - 117068 116963 106 1 1 111 68 60 0.995 6.52 2.01 Init - 127341 127109 233 0 2 95 41 504 0.967 44.13 2.00 Prom - 131779 131740 40 -3.36 3.07 PlyA - 131834 131829 6 -1.95 3.06 Term - 132207 132028 180 0 0 58 48 155 0.979 6.11 3.05 Intr - 133424 133335 90 2 0 62 113 22 0.830 2.29 3.04 Intr - 134951 134719 233 0 2 49 98 133 0.366 7.89 3.03 Intr - 137689 137610 80 0 2 71 91 51 0.775 2.89 3.02 Intr - 145530 145340 191 0 2 40 115 122 0.521 8.48 3.01 Init - 153243 153184 60 0 0 95 72 62 0.772 4.61 3.00 Prom - 154193 154154 40 -7.46 4.00 Prom + 155334 155373 40 -4.36 4.01 Init + 156565 156693 129 0 0 70 67 98 0.962 6.05 4.02 Intr + 166177 166255 79 0 1 31 115 130 0.956 9.12 4.03 Intr + 170101 170135 35 2 2 75 116 30 0.971 2.44 4.04 Intr + 172677 172770 94 2 1 83 93 32 0.799 2.74 4.05 Intr + 178061 178197 137 0 2 57 74 44 0.407 0.19 4.06 Intr + 183885 184010 126 2 0 70 91 117 0.992 11.08 4.07 Intr + 184101 184190 90 2 0 57 73 126 0.866 8.19 4.08 Intr + 188422 188475 54 0 0 74 76 37 0.489 0.28 4.09 Intr + 196182 196325 144 2 0 71 109 46 0.918 5.48 4.10 Intr + 199438 199557 120 0 0 63 98 39 0.892 3.09 4.11 Intr + 199643 199786 144 1 0 71 78 85 0.987 6.28 4.12 Intr + 200783 200881 99 1 0 99 105 23 0.977 5.41 4.13 Intr + 202044 202147 104 2 2 65 71 147 0.735 9.67 4.14 Intr + 202301 202400 100 2 1 41 115 21 0.599 0.21 4.15 Intr + 203396 203534 139 1 1 58 78 72 0.203 3.24 4.16 Term + 206406 206455 50 1 2 77 44 55 0.124 -2.53 4.17 PlyA + 206813 206818 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:122373500_122582088|GENSCAN_predicted_peptide_1|219_aa MSMLKPSGLKAPTKILKPGSTALKTPTAVVAPVEKTISSEKASSTPSSETQEEFVDDFRV GERVWVNGNKPGFIQFLGETQFAPGQWAGIVLDEPIGKNDGSVAGVRYFQCEPLKGIFTR PSKLTRKVQAEDEANGLQTTPASRATSPLCTSTASMVSSSPSTPSNIPQKPSQPAAKEPS ATPPISNLTKTASESISNLSEAGSIKKGERELKIGDRVL >gi568815586r:122373500_122582088|GENSCAN_predicted_CDS_1|657_bp atgagtatgctaaagccaagtgggcttaaggcccccaccaagatcctgaagcctggaagc acagctctgaagacacctacggctgttgtagctccagtagaaaaaaccatatccagtgaa aaagcatcaagcactccatcatctgagactcaggaggaatttgtggatgactttcgagtt ggggagcgagtttgggtgaatggaaataagcctggatttatccagtttcttggagaaacc cagtttgcaccaggccagtgggctggaattgttttagatgaacccataggcaagaacgat ggttcggtggcaggagttcggtatttccagtgtgaacctttaaagggcatatttacccga ccttcaaagttaacaaggaaggtgcaagcagaagatgaagctaatggcctgcagacaacg cccgcctcccgagctacttcaccgctgtgcacttctacggccagcatggtgtcttcctcc ccctccaccccttcaaacatccctcagaaaccatcacagccagcagcaaaggaaccttca gctacgcctccgatcagcaaccttacaaaaactgccagtgaatctatctccaacctttca gaggctggctcaatcaagaaaggagaaagagagctcaaaatcggagacagagtattg >gi568815586r:122373500_122582088|GENSCAN_predicted_peptide_2|602_aa MAAEVYFGDLELFEPFDHPEESIPKPVHTRFKDDDGDEEDENGVGDAELRERLRQCEETI EQLRAENILSSRWALSPGQYHQEIEEFVSNLVKRFEEQQKNDVEKTSFNLLPQPSSIVLE EDHKVEESCAIKNNKEAFSPRNAARISEKRKEYMDACGEANNQNFQQRYHAEEVEERFGR FKPGVISEELQDALGVTDKSLPPFIYRMRQLGYPPGWLKEAELENSGLALYDGKDGTDGE TEVGEIQQNKSVTYDLSKLVNYPGFNISTPRGIPDEWRIFGSIPMQACQQKDVFANYLTS NFQAPGVKSGNKRSSSHSSPGSPKKQKNESNSAGSPADMELDSDMEVPHGSQSSESFQFQ PPLPPDTPPLPRGTPPPVFTPPLPKGTPPLTPSDSPQTRTASGAVDEDALTLEELEEQQR RIWAALEQAESVNSDSDVPVDTPLTGNSVASSPCPNELDLPVPEGKTSEKQTLDEPEVPE IFTKKSEAGHASSPDSEVTSLCQKEKAELAPVNTEGALLDNGSVVPNCDISNGGSQKLFP ADTSPSTATKIHSPIPDMSKFATGITPFEFENMAESTGMYLRIRSLLKNSPRNQQKNKKA SE >gi568815586r:122373500_122582088|GENSCAN_predicted_CDS_2|1809_bp atggccgcagaggtgtattttggcgatctagagctcttcgagccgttcgaccacccagag gagtcgattccgaagcccgttcacactcgcttcaaggacgacgacggcgacgaggaggac gaaaatggggtcggcgacgcggagctacgggagcggcttcggcagtgcgaggagaccatc gagcagctccgcgccgagaatatcctttcctctcgctgggccctgtcacccgggcaatat catcaagaaatagaggaatttgtatcaaatttagtaaaaagatttgaggaacagcagaaa aatgatgtggaaaagacttcctttaatcttttgccccagccatccagtattgtgctagag gaggaccacaaagtggaagagtcctgtgccattaaaaacaacaaggaagctttcagtcct cggaatgctgctcgaataagtgaaaagagaaaagagtatatggatgcctgtggagaagca aacaatcagaatttccagcagcgataccacgcagaagaagtagaagaaagatttggaaga ttcaagccaggagttattagtgaggaacttcaagatgcactaggtgtgacagacaagagt cttccaccttttatctatcggatgcgccagctagggtacccaccagggtggctcaaagag gctgaattggagaattcggggcttgcactctatgatggaaaagatggcactgatggggaa acagaagttggagaaatacaacagaataaaagtgtcacttacgatctctcaaaattggtc aactatcctggttttaatatatctactcccagaggaattccagacgaatggaggatcttt ggttccataccaatgcaggcatgtcagcagaaggatgtgtttgccaattaccttacttct aacttccaagcgccaggtgtgaagtctggcaacaagaggtcttcatctcactctagccca ggtagtccaaagaagcagaagaatgaaagcaactcagcgggatctcccgccgacatggag ctcgattcagatatggaggtaccacatggttctcagagcagcgaaagttttcagtttcaa ccaccattacctcctgacactcctccactcccccggggaactcctccacccgtcttcacc cctccactcccaaagggcaccccgccgctgactcccagtgactcaccccagaccagaaca gcatctggagctgtggatgaggacgcactgactctagaagaacttgaagaacagcagagg cggatctgggcagctcttgagcaggccgagagcgtaaacagcgactccgacgttcctgtg gacacacctttaactggcaattccgttgcctcatcaccttgtccaaatgagctagacctc cctgtcccggagggaaaaacatctgaaaagcagacgctggatgagcctgaggtaccagag atttttacaaagaaatcagaagctggacatgcctccagtccagactctgaggtgacatca ctttgtcagaaggaaaaagcagagttggctccggtaaacactgaaggtgcccttcttgat aatggcagtgtcgtaccaaactgtgacatcagcaatgggggcagccagaagctctttcct gcagacaccagtccttcaacggccactaaaattcatagccctatacctgacatgagcaaa tttgcaactggaatcacgccatttgaatttgagaatatggcagaatctactggaatgtac ctcaggataagaagcttgttaaagaactcaccccgaaaccagcagaaaaacaaaaaggcc tctgaataa >gi568815586r:122373500_122582088|GENSCAN_predicted_peptide_3|277_aa MAPRPSSAPAIMAACCREGEGRRHESKDKSSKKHKSEEHNDKEHSSDKGRERLNSSENGE DRHKRKERKSSRGRSHSRSRSRERLERAKKLQEQREKEMVEKQKQQEIAAAAAATGGSVL NVAALLASGTQVTPQIAMAAQMAALQAKALAETGIAVPSYYNPAAVNPMKFAEQEKKRKM LWQGKKEGDKSQSAEIWEKLNFGNKDQNVKFRKLMGIKSEDEAGCSSVDEESYKTLKQQE EVFRNLDAQYEMARSQTHTQRGMGLGFTSSMRGMDAV >gi568815586r:122373500_122582088|GENSCAN_predicted_CDS_3|834_bp atggcgccgcggccttcttctgcgcccgccatcatggctgcctgctgtagggagggggaa ggaagaagacatgaatccaaagataaatcctctaagaaacataagtctgaggaacataat gacaaagaacattcttctgataaaggaagagagcgactaaattcatctgaaaatggtgag gacaggcacaaacgcaaagaaagaaagtcatcaagaggcagaagtcactcaagatctagg tctcgtgaaaggttggaaagggcaaagaaattacaagaacagcgagaaaaggaaatggtt gaaaaacaaaaacaacaagaaatagctgcagcagctgcagctactggaggttctgttctc aatgttgctgccctgttggcatcaggaacacaagtaacacctcagatagccatggcagct cagatggcagccctgcaagctaaagctttggcagagacaggaatagctgttcctagctac tataacccagccgctgttaatccaatgaaatttgctgaacaagagaaaaaaaggaaaatg ctttggcagggcaagaaagaaggggacaaatcccaatctgctgaaatatgggaaaaattg aattttggaaacaaggaccaaaatgtcaaatttaggaaattgatgggtattaagagtgaa gatgaagctggatgtagctcagttgatgaagaaagttacaagactctgaagcagcaggaa gaagtatttcgaaatttagatgctcagtatgaaatggcaagatcacaaacccacacacaa agaggaatgggtttgggtttcacatcttcaatgcgaggaatggatgcagtttga >gi568815586r:122373500_122582088|GENSCAN_predicted_peptide_4|547_aa MWNDIELLTNDDTGSGYLSVGSRKEHGTALYQVDLLVKISSEKAFVQKANDENRRTYQNL VIEKDGSNEAIENVDFSTAKKGTGNCAFSKWEPDSSKKGMTVKNLIDAEIIKVNNLNSLN NKHLTKFLFQLNRLSRLLHKHRFAEAESFAIQFGLDVELVYKVKSNHILEKLALSSVDAS EQTEWQQLVDDAKENLHKIQDDEFVVNYCLKAQWITYETTQEMLNYAKTRLLKKEDKTAL IYSDGLKEANFESRFDVKMLESLLNSMSASVSLQKLCPWFKNDVIPFVRRTVPEGQKDYQ NTEEVCQLRTLVNNLRELITLHRKYNCKLALSDFEKENTTTIVFRMFDKVLAPELIPSIL EKFIRVYMREHDLQEEELLLLYIEDLLNRCSSKSTSLFETAWEAKAMAVIACLSDTDLIF DAVLKIMYAAVVPWSAAVEQLVKQHLEMDHPKVKLLQESYKLMEMKKLLRGYGIREVNLL NKEIMRVVRYILKQDVPSSLEDALKVAQAFMLSDDEIYSLRIIDLIDREQVWQGLENVCS EDIRGHS >gi568815586r:122373500_122582088|GENSCAN_predicted_CDS_4|1644_bp atgtggaatgatattgagctgctaacaaatgatgataccggaagtgggtacctgagtgtc ggttcaagaaaagaacatggaactgctttatatcaagtagatttgctagtgaagatctct tctgaaaaggcatttgttcagaaagctaacgatgaaaatcggcggacttaccagaatctt gtcattgagaaggatggttcaaatgaagcaattgagaatgtagacttcagtacagcaaaa aagggaaccggtaattgtgcattctcaaaatgggaaccagattcttccaagaaaggaatg acagttaagaaccttattgatgcagagattattaaagtgaataacttaaattccctgaat aataagcatttaacaaaatttctcttccaacttaacagattgagtcggttacttcacaaa cacagatttgctgaagctgagagttttgccattcagtttggactagatgttgagcttgtt tacaaggtcaagtcaaatcatatattggagaaactggcattgagttctgtggatgccagt gaacagaccgaatggcaacaacttgtagacgacgctaaggaaaatctacataagatccag gatgatgaatttgtggtgaattactgcctgaaagctcagtggataacctatgaaaccact caagagatgctgaattatgccaaaaccaggcttttgaagaaagaagataaaactgctctc atttattctgatggcttgaaagaggcaaactttgaaagcagatttgatgtgaaaatgctg gagagcttgctcaactcaatgtctgcatcagtctctttgcaaaagctgtgtccatggttt aaaaatgatgtgattccatttgtaagaaggactgtgcctgaaggacagaaagattatcag aacacagaggaagtatgtcagctaaggactttggtaaataacttgcgagagttgatcacg ttgcataggaagtacaactgcaaattagccctctctgattttgagaaggaaaatacaacc accatagtgttccgaatgtttgataaagtgctggccccagagcttattccctccatctta gagaagtttataagagtttacatgagagaacatgacttgcaagaggaggaacttctcttg ctgtacatagaggatttactgaatagatgcagctcaaagtccacatcactctttgaaaca gcatgggaagcaaaggccatggcagtaatagcgtgtttatctgacacggacctcatattt gatgccgtgctcaagatcatgtatgcggcagtggttccttggagtgcagctgtggagcaa ctggtgaaacagcacctggaaatggaccatcccaaagtcaagttattacaggaaagttac aaactaatggagatgaaaaaacttttacgaggctatggaataagagaggtaaatctctta aacaaggaaataatgagagtggttagatacattctcaaacaagatgtcccatcttcttta gaagatgctttaaaggtagcccaagcgtttatgttatctgatgatgagatctacagtcta agaattattgacctgattgatagagaacaggtttggcaaggcctggagaatgtctgtagc gaagacatccgtggacattcttaa