GENSCAN 1.0    Date run: 4-Nov-116    Time: 00:39:14

Sequence gi568815578f:1196240_1406051 : 209812 bp : 44.79% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Intr +   7235   7412  178  2  1   65   95   107 0.651   9.02
1.02  Intr +  10630  10733  104  0  2   70   42    72 0.296  -0.23
1.03  Term +  11294  11510  217  2  1  106   54    38 0.261  -1.18
1.04  PlyA +  12014  12019    6                               1.05

2.00  Prom +  14415  14454   40                              -6.16
2.01  Init +  21869  22241  373  1  1   24   58   220 0.249   9.73
2.02  Intr +  24616  24867  252  2  0   67   35   132 0.300   3.21
2.03  Term +  25552  25814  263  2  2   70   55   152 0.539   5.79
2.04  PlyA +  25833  25838    6                               1.05

3.00  Prom +  26595  26634   40                              -2.46
3.01  Init +  29177  29267   91  1  1   69   78    32 0.063  -0.99
3.02  Intr +  35287  35380   94  2  1   96   22    65 0.054  -0.28
3.03  Intr +  37846  37952  107  1  2  111   77    27 0.306   3.76
3.04  Intr +  49452  49520   69  1  0   85   62    54 0.254   1.65
3.05  Intr +  70057  70243  187  2  1  107   23    83 0.409   2.45
3.06  Intr +  70415  70573  159  2  0   98   75   104 0.541   9.20
3.07  Intr +  73325  73369   45  2  0  103   93    40 0.679   3.52
3.08  Term +  85365  85578  214  0  1   90   47   106 0.465   3.40
3.09  PlyA +  87917  87922    6                               1.05

4.04  PlyA -  88127  88122    6                               1.05
4.03  Term -  88385  88345   41  0  2  114   47    27 0.453  -1.45
4.02  Intr -  90168  90024  145  0  1  111   59   140 0.835  13.26
4.01  Init -  91526  91515   12  2  0   64   81     8 0.141  -2.05
4.00  Prom -  92754  92715   40                              -2.96

5.00  Prom +  93261  93300   40                              -7.46
5.01  Init +  99340  99472  133  0  1   94   62   110 0.485   8.89
5.02  Intr + 100086 100182   97  1  1  110  109    23 0.897   5.67
5.03  Intr + 100807 101013  207  1  0   39   86   260 0.667  19.09
5.04  Intr + 104323 104472  150  1  0  103  105   278 0.998  30.28
5.05  Term + 108639 109815 1177  0  1   79   55  1955 0.993 182.57
5.06  PlyA + 111807 111812    6                              -3.44

6.12  PlyA - 111842 111837    6                              -3.84
6.11  Term - 112340 112040  301  1  1  114   42   122 0.904   4.89
6.10  Intr - 114652 114561   92  1  2  102   78   176 0.966  16.79
6.09  Intr - 116269 116098  172  0  1   75  102   322 0.999  32.25
6.08  Intr - 116523 116348  176  0  2  114  109   224 0.998  25.84
6.07  Intr - 117259 117101  159  1  0  110   91   167 0.951  19.38
6.06  Intr - 122179 122079  101  2  2   96   31    48 0.778  -0.27
6.05  Intr - 122722 122673   50  0  2   60   81    29 0.103  -2.28
6.04  Intr - 123420 123351   70  1  1  121  109    26 0.130   6.24
6.03  Intr - 124152 124124   29  2  2   72  111     4 0.020  -1.14
6.02  Intr - 132947 132846  102  1  0   35  115    86 0.205   5.29
6.01  Init - 167483 167401   83  2  2   65   70    83 0.300   4.64
6.00  Prom - 171348 171309   40                              -4.86

7.07  PlyA - 172768 172763    6                               1.05
7.06  Term - 176001 175873  129  0  0  114   48   134 0.959  10.18
7.05  Intr - 179364 179252  113  1  2   97   85    32 0.133   3.90
7.04  Intr - 196642 196595   48  2  0  102  117    60 0.004   8.95
7.03  Intr - 196862 196723  140  1  2   30  113    66 0.003   3.41
7.02  Intr - 197078 196887  192  1  0   89   41    75 0.002   1.51
7.01  Intr - 197487 197316  172  1  1   53   77    84 0.003   2.80

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 185351 185451  101  1  2   58  114    35 0.864   2.83
S.002 Term + 189319 189427  109  1  1  107   48    74 0.899   3.28
S.003 Sngl + 196737 197099  363  2  0   78   38   266 0.968  14.69

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:1196240_1406051|GENSCAN_predicted_peptide_1|166_aa
XDLARTCCQSERVGGMYKSKIPRAQNQVSVKVTPKNTEMKIAEEPSPSLGQTLEWLRKEL
SEMQIQDQSLLLTLRHLHSVLEELRADSAHWEDARKPSHSPSSDNILGPVTTLAIGMGIL
PGLANDIIHGCPKKMIGPMDGHVNQQSQSPTFPEIATWKLFLLHSG

>gi568815578f:1196240_1406051|GENSCAN_predicted_CDS_1|501_bp
nntgatttggccagaacctgctgccagtctgaaagagttgggggaatgtacaagtcaaag
atccctcgggcccagaaccaggtcagtgtcaaggtcactcctaagaacactgagatgaaa
atagcagaagagcccagcccgagtcttgggcagaccttggagtggctgagaaaggagctg
tctgagatgcagattcaagatcagagtctcctgctcacactgaggcatcttcacagtgtc
ctggaggagctgcgtgcggacagcgcccactgggaggacgccaggaagccttcccattcc
ccttcatctgataacatattaggccctgtaaccaccctggccataggcatgggtatatta
ccaggcctggccaatgatataattcatggttgccctaaaaagatgattggcccaatggat
ggtcatgtgaaccaacagagccagtcaccgaccttccctgagattgctacctggaagctc
tttctattacactcaggctga

>gi568815578f:1196240_1406051|GENSCAN_predicted_peptide_2|295_aa
MEERVSVIEDQMNEMKREEKFREKRVKRNEQSLQEMWDYVKRPNLHLIGVPESDGENGTK
LENTLQDIIQENFPNLARQANIQIQEIQRTPQRDSSRRATPRHIIVRFTKVEMKEKMLRA
AREKVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIISAQNLLKLISNFS
KVSEYKINVRKSQTFLYTNNRQTESQIMKLEKTTLKFTWNQKRAHIAKTILSQKNKAGGV
TLPNFKLYYKATVTKTAWYWYRNRDIDQWNRTEPSEIIPHIYNHLIFDKPDKNKK

>gi568815578f:1196240_1406051|GENSCAN_predicted_CDS_2|888_bp
atggaagaaagggtatcagtgattgaagatcaaatgaatgaaatgaagcgagaagagaag
tttagagaaaaaagagtaaaaagaaatgaacaaagcctccaagaaatgtgggactacgtg
aaaagaccaaatctacatctgattggtgtacctgaaagtgacggagagaatggaaccaag
ttggaaaacactcttcaggatatcatccaggagaacttccccaacctagcaaggcaggcc
aacattcaaattcaggaaatacagagaacgccacaaagagactcctcgagaagagcaact
ccaagacacataattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggca
gccagagagaaagtgttggaagttctcgccagggcaatcaggcaagagaaagaaataaag
ggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgta
tatttagaaaaccccatcatctcagcccaaaacctccttaagctgataagcaacttcagc
aaagtctcagaatacaaaatcaacgtgcgaaaatcacaaacattcctatacaccaataac
agacaaacagagagccaaatcatgaaattggaaaaaactactttaaagttcacatggaac
caaaaaagagcccacattgccaagacaatcctaagccaaaagaacaaagctggaggcgtc
acgctacctaacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg
taccgaaacagagatatagaccaatggaacagaacagagccctcagaaataataccacac
atctacaaccatctcatctttgacaaacctgacaaaaacaagaaatga

>gi568815578f:1196240_1406051|GENSCAN_predicted_peptide_3|321_aa
MGPRPHPCHAPAQVSQVRAPGQSRRLRTRTGLVDLPKENFEASYNAITLPEEFHDFDTQN
MNAIDVSEHFTQNQSRPEEITLRENFDNDLIFQAESFDRFFFPFARDSSSSGGRNPVMVL
APAPASTQLAASCRGSGEQFGGPCRAAEAMEASAGDDKDPGDLRAHRPAAPAAVQSAECQ
GVGSLIRARRLQQRLQRRCAKPGRSGASRRGCGGPVDSGEEAAAERGAGPALHRGRWEQG
FGEEEAVNNLRMWNAPPQAAPGYLENNYAPVLQSSAQGHQLLSSAEKPLSLLPNPSVTRA
SIMGLLAEEEVAHNRHLVNID

>gi568815578f:1196240_1406051|GENSCAN_predicted_CDS_3|966_bp
atggggcccaggccccatccctgtcatgcccctgctcaggtgtctcaggtccgagcgcca
ggacaaagccggagactgcgcactaggaccggactggttgaccttccaaaagagaatttt
gaagcatcttacaatgctatcacattgccagaagaatttcatgattttgacacccaaaat
atgaatgctattgatgtttcagaacactttactcagaaccaaagcagaccagaagaaatc
actcttagagaaaattttgacaatgatctaattttccaagctgagagctttgacaggttc
ttcttcccttttgcccgtgacagctccagcagtggtggcaggaatcctgtcatggtcctc
gccccagccccggcgagcacccagctagccgcctcctgcaggggctcgggagagcaattc
ggcggcccctgcagggcagctgaagccatggaagcctccgcaggtgatgacaaggacccc
ggggatctccgggcccaccgcccagctgcacccgccgcagtccagagtgccgagtgccag
ggggtggggtcgctgatcagggccaggcggctgcagcagcgactgcagaggcgctgcgcc
aagccgggccggagtggtgcgagccggcggggctgcggagggccagtggactcaggtgag
gaggccgcggcggagcggggagctggccctgcgctgcaccgcggcaggtgggaacagggg
tttggagaagaagaagctgtgaacaatttacgaatgtggaatgctcctccacaggcagct
cctgggtacctggagaacaactatgcacctgtcctccagagctctgctcaagggcaccag
cttctgagttctgcagagaagcccctctctcttctgcccaacccctcagtgacacgtgct
tcgattatgggactattggcagaggaggaagtggcacacaatagacacttggtaaatatt
gactga

>gi568815578f:1196240_1406051|GENSCAN_predicted_peptide_4|65_aa
MRKMLQLPCCPQQLIPVHLIPRGTSHAMLNSFTFNKIQGLNNAQCGSHLLEEGLCSKVTL
SVNLL

>gi568815578f:1196240_1406051|GENSCAN_predicted_CDS_4|198_bp
atgaggaaaatgctgcagctgccctgctgtcctcagcagctgatccctgtccacctgatt
ccccgtggtacttcccatgccatgctgaattctttcaccttcaataagatccagggcctc
aacaatgcccagtgtggctcacacttgctggaagagggtctttgctccaaggtcaccctc
tcagtgaatcttctctga

>gi568815578f:1196240_1406051|GENSCAN_predicted_peptide_5|587_aa
MPVCRMGAGGPGCMSCVMGWLPEHVHGDACGMSWNGRDMVPSRVAPGIPPIPPLTRTHSL
MAMSLPGSRRTSAGSRSGGTLGRSGLAVFAQCPQLPASQNEHLPLLPASRRTSPPVSVRD
AYGTSSLSSSSNSGSYKGSDSSPTPRRSMKYTLCSDNHGIKPPTPEQYLTPLQQKEVCIR
HLKARLKDTQDRLQDRDTEIDDLKTQLSRMQEDWIEEECHRVEAQLALKEARKEIKQLKQ
VIDTVKNNLIDKDKGLQKYFVDINIQNKKLETLLHSMEVAQNGMAKEDGTGESAGGSPAR
SLTRSSTYTKLSDPAVCGDRQPGDPSSGSAEDGADSGFAAADDTLSRTDALEASSLLSSG
VDCGTEETSLHSSFGLGPRFPASNTYEKLLCGMEAGVQASCMQERAIQTDFVQYQPDLDT
ILEKVTQAQVCGTDPESGDRCPELDAHPSGPRDPNSAVVVTVGDELEAPEPITRGPTPQR
PGANPNPGQSVSVVCPMEEEEEAAVAEKEPKSYWSRHYIVDLLAVVVPAVPTVAWLCRSQ
RRQGQPIYNISSLLRGCCTVALHSIRRISCRSLSQPSPSPAGGGSQL

>gi568815578f:1196240_1406051|GENSCAN_predicted_CDS_5|1764_bp
atgcctgtgtgccgcatgggtgcgggaggacctggctgcatgtcctgcgtcatgggctgg
ctgcctgagcacgttcatggagacgcgtgtggcatgtcttggaatggcagggacatggta
ccttctcgtgttgcccccgggataccgcccatcccaccccttactcggacccacagcctc
atggccatgtccctgccaggaagtagacggacctctgctggatcacgcagcgggggcact
ttgggccgcagcggcctggcagtgttcgcccagtgtccgcagctgcccgccagccagaac
gagcacctgcctcttcttcctgcctccaggcgcacctctccacctgtgagcgtgcgggat
gcctacggcacctcttcgctcagcagcagcagcaattctggctcctacaagggcagtgac
agcagtcccacgccaaggcgctccatgaaatacacgctgtgcagtgacaaccatggcatc
aagcccccgaccccggagcagtacctgacccccctgcagcagaaggaggtgtgcatccgg
cacctgaaagcccggctgaaggacacacaggaccggctccaggaccgggacacagagatt
gatgacctgaagacgcagctgtcacgcatgcaggaggactggattgaggaggagtgccac
cgcgtggaggcccagctggccctgaaggaggcccgaaaggagatcaagcagctcaagcag
gtcatcgacactgtcaagaacaacctgattgacaaggacaaggggctgcagaagtacttc
gtggacatcaacatccagaacaagaagctggagacgctgctgcacagcatggaggtggcc
cagaatggcatggccaaggaggatggcactggggagtcagccggtgggtcccctgcccgc
tccctcacccgcagctccacctacaccaagctgagtgacccggctgtctgtggtgaccgc
cagccgggtgatccctccagcggctctgctgaggatggggcagacagtggctttgcagca
gccgatgacacactgagccggacggacgcgctggaagccagcagcctgctgtcgtcgggg
gtggactgtggcaccgaggagacctcgctgcacagctccttcggcctgggcccccgcttc
cctgccagcaacacctatgagaagctgctgtgtggcatggaggctggtgtgcaggccagc
tgcatgcaggagcgtgccatccagacagacttcgtgcagtaccagcctgaccttgacacc
atcctggagaaagtgacccaggcccaggtctgtgggacagaccctgagtcaggggacagg
tgcccagagctggatgcccacccttcagggcccagagaccccaactcagcagtggtggtg
acagtgggtgatgagctagaggccccagagcccatcacccgtggacccaccccacagcgg
cctggtgccaaccccaaccctggccagtcggtgagcgtggtgtgccccatggaagaggag
gaggaggctgccgtggctgagaaggagcccaagagctactggagccgccactacatcgtg
gatctgctggctgtggtggtgccggccgtgcccacggtggcctggctttgccgctcccag
cggcgccagggccagcccatctacaacatcagctccctgctgcggggctgctgcactgtg
gccttgcactccatccgcaggatcagctgccgctcgctgagccagccgagtcccagccca
gcgggcggcggctcccagctctga

>gi568815578f:1196240_1406051|GENSCAN_predicted_peptide_6|444_aa
MGPSEAEFAAAAVLYSMQQNGCQLLLGRSGSQLRDRATFQGGKQIQQRQQLEKVVEQHLA
CRGPKSGPSHSGPGQSLTQDASPASPGNSHFPTTSSGGWKPKIKVLANSVSVLYPNLAEL
ENYMGLSLSSQEVQESLLQIPEGDSTAVSGPGPGQMVAPVTGYSLGVRRAEIKPGVREIH
LCKDERGKTGLRLRKVDQGLFVQLVQANTPASLVGLRFGDQLLQIDGRDCAGWSSHKAHQ
VVKKASGDKIVVVVRDRPFQRTVTMHKDSMGHVGFVIKKGKIVSLVKGSSAARNGLLTNH
YVCEVDGQNVIGLKDKKIMEILATAGNVVTLTIIPSVIYEHMVKKAWHLSSSCRTSTAWS
LTGSLHLCIDAGSEHTHMHSRAGPGPPATTVPGPVPGLSGELLPPLTRRKAHRRPQESPR
KRGPSTLWARALVTHTGGLRPEQE

>gi568815578f:1196240_1406051|GENSCAN_predicted_CDS_6|1335_bp
atgggcccatcagaggctgagtttgcagcagcagctgtcctgtacagcatgcagcaaaac
ggctgtcagctcttattaggaagatcaggaagccagctgagagacagggctacgtttcag
ggagggaaacagattcagcagcggcagcagctggagaaggtcgtggagcagcaccttgcc
tgcagaggacctaaaagtggaccaagccattcaggcccaggtcagagcctcacccaagat
gccagccctgccagtccaggcaacagccatttccccaccaccagctctggaggctggaag
cccaagatcaaggttctggccaattcagtttctgttttgtacccaaacttggcagaactg
gaaaattatatgggtctttccctctccagccaagaagtccaggagagcctgcttcagatt
ccagagggtgacagtacagcggtctcgggccccgggcccggccagatggtggcaccggta
accgggtacagcctgggcgtgcggcgagctgagatcaagcccggggtgcgcgagatccac
ctgtgcaaggacgagcgcggcaagaccgggctgaggctgcggaaggtcgaccaggggctc
tttgtgcagttggtccaggccaacacccctgcatcccttgtggggctgcgctttggggac
cagctcctgcagattgacgggcgtgactgtgctgggtggagctcgcacaaagcccatcag
gtggtgaagaaggcatcaggcgataagattgtcgtggtggttcgggacaggccgttccag
cggactgtcaccatgcacaaggacagcatgggccacgtcggcttcgtgatcaagaagggg
aagattgtctctctggtcaaagggagttctgcggcccgcaacgggctcctcaccaaccac
tacgtgtgtgaggtggacgggcagaatgttatcgggctgaaggacaaaaagatcatggag
attctggccacggctgggaacgttgtcaccctgaccatcatccccagtgtgatctacgag
cacatggtcaaaaaggcatggcacctgagcagcagctgccgcacctccaccgcctggagc
ctcacaggcagcctgcacctttgcatagatgcgggatctgagcacacacacatgcactcc
agagctgggccagggccaccggccaccacagtcccagggcctgtcccagggctgagtggg
gagctgctccctccactgacaagaagaaaagcccatcgaaggccccaggagagcccccga
aagagaggaccctcaacactgtgggccagggccctcgtgacccacaccggtggtctcagg
ccagaacaggagtag

>gi568815578f:1196240_1406051|GENSCAN_predicted_peptide_7|264_aa
XGQQAISAAVSGNIGSWSRRCVPPVLFPWPLDSDSVGHKKSDPIRIINFHMLNKNVCMEM
LNRLEAQRKGRRTKQVTRQTQLYPEASGLPEVSGLLGFRSLFRRPGFPGRRAPCGARTQG
WARRAEVLGRAVEPPPGRCWSTPPVAPPARSASAAAMGVQVETISPGDGRTFPKRGQTCV
VHYTGMLEDGKKFDSSRDRNKPFKFMLGKQEVIRGWEEGVAQMSVGQRAKLTISPDYAYG
ATGHPGIIPPHATLVFDVELLKLE

>gi568815578f:1196240_1406051|GENSCAN_predicted_CDS_7|795_bp
nggggacagcaggccatttccgcagcagttagtggaaatattggttcttggagcaggcgc
tgcgtgcctccagtgcttttcccctggcccctggattctgattccgttgggcataagaag
agtgacccaattcgtattatcaactttcacatgctcaataagaatgtatgcatggagatg
cttaaccggctggaagcacagaggaagggcaggagaacgaagcaagtcacccggcaaaca
cagctgtatccggaggcctccgggcttccggaggtctcggggcttctgggcttccggtcc
ctcttccggaggcctgggtttccgggacgtcgcgcgccgtgtggggcgcgcacgcagggc
tgggcgcgacgcgccgaggtactaggcagagccgtggaaccgccgccaggtcgctgttgg
tccacgccgcccgtcgcgccgcccgcccgctcagcgtccgccgccgccatgggagtgcag
gtggaaaccatctccccaggagacgggcgcaccttccccaagcgcggccagacctgcgtg
gtgcactacaccgggatgcttgaagatggaaagaaatttgattcctcccgggacagaaac
aagccctttaagtttatgctaggcaagcaggaggtgatccgaggctgggaagaaggggtt
gcccagatgagtgtgggtcagagagccaaactgactatatctccagattatgcctatggt
gccactgggcacccaggcatcatcccaccacatgccactctcgtcttcgatgtggagctt
ctaaaactggaatga