GENSCAN 1.0	Date run: 3-Nov-116	Time: 17:30:26

Sequence gi568815597f:101138985_101340130 : 201146 bp : 40.13% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -   3891   3755  137  0  2   73  109    95 0.190   9.77
 1.00 Prom -  10994  10955   40                              -2.55

 2.02 PlyA -  11828  11823    6                               1.05
 2.01 Sngl -  21057  20632  426  0  0   69   50   231 0.781  13.44
 2.00 Prom -  26884  26845   40                              -5.85

 3.00 Prom +  50033  50072   40                              -3.75
 3.01 Init +  52800  52860   61  2  1   50  100    76 0.290   6.46
 3.02 Intr +  62990  63033   44  0  2   55  113    63 0.161   2.54
 3.03 Intr +  97375  97505  131  0  2   86   62    95 0.684   5.27
 3.04 Intr +  97889  98115  227  2  2   42   75   198 0.853  10.61
 3.05 Intr +  99341  99449  109  0  1   60   25   129 0.954   2.22
 3.06 Term +  99838 101149 1312  1  1   88   42  1475 0.925 132.95
 3.07 PlyA + 102367 102372    6                               1.05

 4.03 PlyA - 104356 104351    6                               1.05
 4.02 Term - 132305 131891  415  1  1   64   38   395 0.654  25.85
 4.01 Init - 134144 133726  419  2  2   62   53   179 0.545   7.75
 4.00 Prom - 134289 134250   40                              -9.05

 5.02 PlyA - 134315 134310    6                               1.05
 5.01 Sngl - 134994 134359  636  0  0   58   48   248 0.940  13.74
 5.00 Prom - 136173 136134   40                              -2.15

 6.00 Prom + 140788 140827   40                              -6.75
 6.01 Init + 142262 142402  141  1  0   65   84    73 0.738   4.78
 6.02 Intr + 143824 143925  102  0  0   93   57    47 0.485   1.55
 6.03 Intr + 145048 145188  141  0  0   85   55   132 0.522   9.23
 6.04 Term + 151549 151647   99  0  0  102   45    66 0.291   0.85
 6.05 PlyA + 154149 154154    6                               1.05

 7.00 Prom + 154371 154410   40                              -6.85
 7.01 Init + 158158 158218   61  0  1   71   62    43 0.204   1.48
 7.02 Intr + 165772 165801   30  2  0   71   92    57 0.476   1.58
 7.03 Term + 169311 169426  116  1  2   81   49   143 0.652   7.45
 7.04 PlyA + 169611 169616    6                              -0.45

 8.07 PlyA - 169788 169783    6                               1.05
 8.06 Term - 170855 170749  107  0  2   91   48   149 0.830   8.79
 8.05 Intr - 171261 170924  338  2  2   94   61   160 0.371   8.14
 8.04 Intr - 172928 172891   38  2  2   91  111    16 0.858   0.34
 8.03 Intr - 175601 175469  133  1  1   27   98   127 0.655   7.33
 8.02 Intr - 175976 175783  194  2  2  101   40    19 0.365  -4.23
 8.01 Init - 176250 176110  141  0  0   81   91    90 0.834   8.79
 8.00 Prom - 182528 182489   40                              -3.65

 9.04 PlyA - 182684 182679    6                               1.05
 9.03 Term - 185560 185368  193  0  1   86   36   136 0.966   4.11
 9.02 Intr - 189931 189642  290  1  2   81   55   175 0.610   8.62
 9.01 Init - 195807 195763   45  0  0   71   80    38 0.235   2.04

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597f:101138985_101340130|GENSCAN_predicted_peptide_1|46_aa
MAPKDVHVQISETCEYVILHGKREFTDAIKVMDLQVICPEYPGQPS

>gi568815597f:101138985_101340130|GENSCAN_predicted_CDS_1|138_bp
atggcccccaaagatgtccatgtccaaatctctgaaacttgtgaatatgtcatcttacat
ggcaaaagggaatttacagatgcaattaaggttatggaccttcaggttatctgtcctgaa
tatcctggacaacccagn

>gi568815597f:101138985_101340130|GENSCAN_predicted_peptide_2|141_aa
MDKGFMTKTPKAMATKAKIDKWDLIKLKSFCVAKETIIRVNKQPIEWEKILAIYLSDNRY
YPESTRNLNTFTRKKPNNRIKKRAKDMNRHFSKADIYVSKHKKKSSLSLVIREIQIKTTM
KYDLMPVRMAIITKSGKYVEK

>gi568815597f:101138985_101340130|GENSCAN_predicted_CDS_2|426_bp
atggacaaaggcttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcgtagcaaaagaaactatcatcagagtg
aacaagcaacctatagaatgggagaaaattttggcaatctatctatctgacaatcgctat
tatccagaatctacaaggaacttaaacacatttacgagaaaaaaaccaaacaaccgtatc
aaaaagagggcaaaggatatgaacagacatttctcaaaagcagacatttatgttagcaaa
cataagaaaaaaagctcattatcactggtcattagagaaatacaaatcaaaaccacaatg
aaatacgatctcatgccagttagaatggcgatcattacaaagtcaggaaagtatgtggag
aaatag

>gi568815597f:101138985_101340130|GENSCAN_predicted_peptide_3|627_aa
MQMMFEAMAMDETPWGISGEVQLDLYMQVPPSRPKSAADTRRAAAQVFCCSRESGSGSSW
PQAQHTDPPRAAGGVGPGRNHPALARRWVTSRSRQGTGRGERDWPLECSAAEGGDPDSSK
FARALRSQSGAAARCEASRTDPGLSERNFALLERDWQLSLFRGETWAGIERAAKLVTASE
NLMAGNPGVRGLRFPRPSPAKEKLHKKPGSLIEPPLKPVKALSPRPLAFVWSSATPASWG
HRVGTMGPTSVPLVKAHRSSVSDYVNYDIIVRHYNYTGKLNISADKENSIKLTSVVFILI
CCFIILENIFVLLTIWKTKKFHRPMYYFIGNLALSDLLAGVAYTANLLLSGATTYKLTPA
QWFLREGSMFVALSASVFSLLAIAIERYITMLKMKLHNGSNNFRLFLLISACWVISLILG
GLPIMGWNCISALSSCSTVLPLYHKHYILFCTTVFTLLLLSIVILYCRIYSLVRTRSRRL
TFRKNISKASRSSEKSLALLKTVIIVLSVFIACWAPLFILLLLDVGCKVKTCDILFRAEY
FLVLAVLNSGTNPIIYTLTNKEMRRAFIRIMSCCKCPSGDSAGKFKRPIIAGMEFSRSKS
DNSSHPQKDEGDNPETIMSSGNVNSSS

>gi568815597f:101138985_101340130|GENSCAN_predicted_CDS_3|1884_bp
atgcagatgatgtttgaagccatggcaatggatgagactccctggggaatcagtggggaa
gtgcagctggacctctatatgcaggttccaccttcacgaccaaagagcgccgccgacact
cgccgggccgcggctcaggttttctgctgctcgcgcgaaagtggctcggggagctcctgg
ccacaagctcagcacaccgatcctcctagggctgcggggggtgtgggaccagggcgaaac
cacccagccttggcgcggcgctgggtgacttcgcgtagcaggcagggaactggccgcggc
gagcgggactggccattggagtgctccgctgcggagggaggggaccccgactcgagtaag
tttgcgagagcactacgcagtcagtcgggggcagcagcaagatgcgaagcgagccgtaca
gatcccgggctctccgaacgcaacttcgccctgcttgagcgagactggcagttgagtctt
ttccgaggagagacgtgggctggtattgagcgtgcagctaagctggtgactgcttcagag
aatctcatggctggaaatccgggggtgagggggctgcggtttccgaggccctctccagcc
aaggaaaagctacacaaaaagcctggatcactcatcgaaccacccctgaagccagtgaag
gctctctcgcctcgccctctagcgttcgtctggagtagcgccaccccggcttcctgggga
cacagggttggcaccatggggcccaccagcgtcccgctggtcaaggcccaccgcagctcg
gtctctgactacgtcaactatgatatcatcgtccggcattacaactacacgggaaagctg
aatatcagcgcggacaaggagaacagcattaaactgacctcggtggtgttcattctcatc
tgctgctttatcatcctggagaacatctttgtcttgctgaccatttggaaaaccaagaaa
ttccaccgacccatgtactattttattggcaatctggccctctcagacctgttggcagga
gtagcctacacagctaacctgctcttgtctggggccaccacctacaagctcactcccgcc
cagtggtttctgcgggaagggagtatgtttgtggccctgtcagcctccgtgttcagtctc
ctcgccatcgccattgagcgctatatcacaatgctgaaaatgaaactccacaacgggagc
aataacttccgcctcttcctgctaatcagcgcctgctgggtcatctccctcatcctgggt
ggcctgcctatcatgggctggaactgcatcagtgcgctgtccagctgctccaccgtgctg
ccgctctaccacaagcactatatcctcttctgcaccacggtcttcactctgcttctgctc
tccatcgtcattctgtactgcagaatctactccttggtcaggactcggagccgccgcctg
acgttccgcaagaacatttccaaggccagccgcagctctgagaagtcgctggcgctgctc
aagaccgtaattatcgtcctgagcgtcttcatcgcctgctgggcaccgctcttcatcctg
ctcctgctggatgtgggctgcaaggtgaagacctgtgacatcctcttcagagcggagtac
ttcctggtgttagctgtgctcaactccggcaccaaccccatcatttacactctgaccaac
aaggagatgcgtcgggccttcatccggatcatgtcctgctgcaagtgcccgagcggagac
tctgctggcaaattcaagcgacccatcatcgccggcatggaattcagccgcagcaaatcg
gacaattcctcccacccccagaaagacgaaggggacaacccagagaccattatgtcttct
ggaaacgtcaactcttcttcctag

>gi568815597f:101138985_101340130|GENSCAN_predicted_peptide_4|277_aa
MSKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYSSDKGL
ISRIYKELKQIYKKKTNNPINKWVKDMNRHSSKEDIYAANRHMKKCSSSLAIREMQIKTT
MRYHLTPVRMAIIKKSGNNRQISKNNENFHALSTVEKGFGYKGSCFHRIISEFMCQGGDF
PCHNGTDGKFIYGEKFDDENFILKHTGPGILSIANARPNSNGSQFFICTAKTEWLDGKHV
VFGKVKEGMNIVEAMDRSGSRNGKTNKKIIIADCGQL

>gi568815597f:101138985_101340130|GENSCAN_predicted_CDS_4|834_bp
atgagcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactactatcagagtg
aacaggcaacctacagaatgggagaaaatttttgcaatctattcatctgacaaagggcta
atatctagaatctacaaagaactcaaacaaatttacaagaaaaaaacaaacaaccccatc
aacaagtgggtgaaagatatgaacagacactcctcaaaagaagacatttatgcagccaac
agacacatgaaaaaatgctcatcatcactcgccatcagagaaatgcaaatcaaaaccaca
atgagataccatctcacaccagttagaatggcgatcattaaaaagtcaggaaacaacaga
caaatttccaaaaacaacgaaaactttcatgctctgagcactgtagagaaaggatttggt
tataagggttcctgctttcacagaattatttcagagtttatgtgtcagggtggtgacttc
ccatgccataatggcactgatggcaagttcatctacggggagaaatttgatgatgagaac
ttcattctgaagcatacaggtcctggcatcttgtccatagcaaatgctagacccaactca
aacggttctcagtttttcatctgcactgccaagactgagtggttggatggcaagcatgtg
gtctttggcaaggtgaaagaaggcatgaatattgtggaggccatggaccgctctgggtcc
aggaatggcaagaccaacaagaagattatcattgctgactgtgggcaactctaa

>gi568815597f:101138985_101340130|GENSCAN_predicted_peptide_5|211_aa
MIVYLENPIVSAQNLLKLTSNFSKVSGYKINVQKSQAFLYINNRQTESQIMSELSFTIAS
KRIKYLGIQLTRDVKDLLKENYKPLLNEIKEDTNKWKNIPCLWVGRINFVKMAILPKVIY
RFNAIPTKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDILQGYSNQN
SMVLVPKQRYRPTEQNRALRNNTTHLQPSDL

>gi568815597f:101138985_101340130|GENSCAN_predicted_CDS_5|636_bp
atgattgtatatctagaaaaccccatcgtctcagcccaaaatctccttaagctgacaagc
aacttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatac
atcaataacagacaaacagagagccaaatcatgagtgaactctcattcacaattgcttca
aagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttgaaggag
aactacaaaccactgctcaatgaaataaaagaggacacaaacaaatggaagaatattcca
tgcttatgggtaggaagaatcaatttcgtgaaaatggccatactgcccaaggtaatttat
agattcaatgccatccccaccaagctaccaatgactttcttcacagaattggaaaaaacc
actttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaa
aagaacaaagctggaggcatcacgctacctgatatactacaaggctacagtaaccaaaac
agcatggtactggtgccaaaacagagatatagaccaacagaacagaacagagccctcaga
aataataccacacatctacaaccatctgatctttga

>gi568815597f:101138985_101340130|GENSCAN_predicted_peptide_6|160_aa
MGDPWTPQALGKFPSDCKVWGLRGQHIASESSRQIVKRSIQAGSGLLIKSKLLHRGPGFA
DPRQLYFLLGSSFAESRVVAYPWNWDEVLHGGALPKATQLVDGEMRRLPAASQAAFPVRK
EVPRASEEDQISLRTATLNPNCTFESSGSFQKLPQSEPHL

>gi568815597f:101138985_101340130|GENSCAN_predicted_CDS_6|483_bp
atgggggacccctggactcctcaggctctgggaaagtttccatctgactgcaaagtttgg
ggcctgaggggacagcatattgcatcagaatccagcaggcaaattgtaaagagatccata
caagctggcagtgggcttctgataaagtccaaactccttcacagggggcctggctttgct
gatcctcgccagctttatttccttctaggttccagttttgctgagtcccgagtagttgca
tacccgtggaattgggacgaagtgctgcacggaggggccttgcccaaggctacgcagctg
gtagacggggaaatgaggcggttgccggccgcctcgcaggcagctttccctgtgaggaag
gaggtgcctcgggcctctgaagaggatcaaatttcccttagaacagcgactctcaaccct
aattgcacgtttgaatcatctgggagctttcagaaactcccacagtcagagccccatctg
tag

>gi568815597f:101138985_101340130|GENSCAN_predicted_peptide_7|68_aa
MELFLPDLGKTVERAGLLQVGFEDGLLVDKGHKIQEVVGLPLKQVVDSSFERYPPYTSRK
GAEDTEMP

>gi568815597f:101138985_101340130|GENSCAN_predicted_CDS_7|207_bp
atggaattgtttctccctgatctggggaagactgtggagagagccggtttgctgcaagtt
ggctttgaagatggcttgctggtggacaaaggtcataaaatccaagaagttgttggcctt
cccctgaagcaggtcgtagactcctcatttgagagataccctccctatacctcgaggaaa
ggagcagaagacacagaaatgccgtga

>gi568815597f:101138985_101340130|GENSCAN_predicted_peptide_8|316_aa
MVQIHSEAFRMLLLGREQIAHQISRIWLSRMRQRVKKSPIFLNSRCLAEGLSLRVHSGCY
FHSSHNHIPNSRKEGKRHTLPFLQRFARSTSQHFRVKLTGWNLIIWPYLAASKKVFAQST
EQSTLSLSCGGQCWNHGSNGRKEVSENPIYQKRLWKVLNRLLDDSHLHCSNKCKNPPNRP
VPTTPKTRLPDGGVTWCQDEYAVLEREKRTSGEVLFFLPPFPAPVFKAVLSSGPHGLFRA
KPRCAGTGAVNSSTMRDSREAGGETGDTSATQEIKRWSPAKEGCRKELNAEPRRLLQPSD
LTPLTRWSLSAIKPDV

>gi568815597f:101138985_101340130|GENSCAN_predicted_CDS_8|951_bp
atggtccaaattcattcagaagcttttcgcatgctcctgctgggacgagaacaaattgca
caccagatcagtcggatttggcttagccgcatgagacagagagtcaaaaagagccctatc
ttcttgaattctaggtgtttggctgagggcctcagcctcagggtccacagtggctgttat
ttccacagcagtcacaaccacattccaaacagcaggaaagagggaaaaaggcacactctt
ccctttctacaaaggtttgccagaagtaccagccaacacttccgtgtaaaactcactggc
tggaatctaattatatggccatacctagctgcaagcaagaaggtctttgcacagtccaca
gaacaaagcaccctctccctgagctgtggaggacagtgctggaatcatggatccaatggg
aggaaagaagtttcagaaaatcccatctatcagaagaggctctggaaggtactgaacaga
ttattggatgattcccacctacattgctcaaacaaatgcaaaaatcccccaaatcgtcct
gtgcccactacccccaaaaccagacttcctgatgggggtgtgacttggtgtcaagatgaa
tatgcagtgctagagagagagaagaggaccagtggggaggttttgtttttcttaccccca
ttccccgccccagtattcaaggctgtactaagttctggccctcacggcctctttcgtgca
aaaccccgctgtgcagggactggggccgtgaacagctccacaatgagggactccagggaa
gctggaggagaaacgggggacacaagtgcaacacaagagatcaaaaggtggtcgccggct
aaagagggctgcaggaaggagctcaatgcggaaccgcgaaggctgctgcagccctcagat
ctaacgcctttgacgcgttggagcttgtcagcgataaaacctgacgtgtga

>gi568815597f:101138985_101340130|GENSCAN_predicted_peptide_9|175_aa
MWMELEVIILSEVNQDFRAENMVFLVPSPQQFRTYLAQASHQRQSQVPTLSALTPKSHTL
PPKKVTSCEIDFTLYYASVRTVNVNISVNSVRAFLSTFSVLARHCDESEGLRVLLVEGVK
VLGILNKELDKMHKQSKERMKQQKQRFIENKSTLHSVGAAQASGSRVLVTEFSRA

>gi568815597f:101138985_101340130|GENSCAN_predicted_CDS_9|528_bp
atgtggatggagctggaggtcattattctaagtgaagtaaatcaggatttcagagcagaa
aacatggtcttccttgtaccaagtcctcagcagtttaggacttacttagctcaagcaagt
catcaaaggcaaagccaagtcccaacccttagtgctttgactcccaagtcccacactctt
cctccgaaaaaagtcacatcgtgtgagattgattttaccctctactatgcttctgttcgc
acagtaaatgtcaacatttctgttaattcggttcgtgcatttttgagcactttttctgtt
cttgcaagacattgtgatgaaagtgaaggactgagggtgttactggtggagggtgtcaag
gttcttggcattttgaacaaagaattggacaaaatgcacaaacaaagcaaggaaagaatg
aaacaacaaaagcagagatttattgaaaacaaaagcacacttcacagtgtaggagcagcc
caagcaagcggatcaagagtgctggttacagaattttccagggcttaa