GENSCAN 1.0 Date run: 5-Nov-116 Time: 15:23:17 Sequence gi568815596r:55767076_56022440 : 255365 bp : 38.47% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 940 935 6 1.05 1.02 Term - 2126 1996 131 0 2 54 42 85 0.026 -2.14 1.01 Init - 18638 18485 154 2 1 87 9 140 0.243 6.19 1.00 Prom - 21527 21488 40 -3.65 2.02 PlyA - 22287 22282 6 1.05 2.01 Sngl - 24656 22395 2262 2 0 49 42 886 0.839 73.18 2.00 Prom - 24749 24710 40 -6.15 3.10 PlyA - 24918 24913 6 1.05 3.09 Term - 26624 26268 357 2 0 51 55 209 0.543 7.43 3.08 Intr - 41718 41593 126 0 0 9 107 69 0.517 0.86 3.07 Intr - 43030 42749 282 1 0 50 84 235 0.879 15.99 3.06 Intr - 45429 45334 96 0 0 72 108 64 0.570 6.09 3.05 Intr - 47871 47706 166 2 1 63 -12 167 0.006 3.24 3.04 Intr - 58544 58381 164 2 2 86 45 123 0.066 5.65 3.03 Intr - 58821 58705 117 0 0 79 60 58 0.459 1.84 3.02 Intr - 59123 58861 263 0 2 63 38 205 0.537 9.28 3.01 Init - 68316 67398 919 0 1 63 -33 289 0.003 8.51 3.00 Prom - 96598 96559 40 -3.15 4.11 PlyA - 96921 96916 6 1.05 4.10 Term - 100159 99998 162 1 0 84 34 183 0.811 9.45 4.09 Intr - 103840 103645 196 0 1 100 91 200 0.999 20.00 4.08 Intr - 104048 103925 124 0 1 82 63 159 0.999 11.52 4.07 Intr - 107990 107871 120 0 0 77 115 7 0.764 1.85 4.06 Intr - 109667 109548 120 0 0 54 93 80 0.939 4.65 4.05 Intr - 114659 114537 123 0 0 75 97 113 0.702 10.54 4.04 Intr - 137090 136908 183 0 0 32 58 110 0.026 1.24 4.03 Intr - 150976 150590 387 2 0 53 97 490 0.968 40.24 4.02 Intr - 151192 151144 49 1 1 90 105 70 0.995 6.23 4.01 Init - 155365 155285 81 1 0 52 103 120 0.558 10.94 4.00 Prom - 155657 155618 40 -4.15 5.06 PlyA - 155906 155901 6 1.05 5.05 Term - 156158 155984 175 1 1 65 53 116 0.780 2.05 5.04 Intr - 156798 156636 163 1 1 57 91 79 0.794 3.21 5.03 Intr - 159400 159289 112 1 1 85 78 -2 0.105 -2.47 5.02 Intr - 170375 170204 172 1 1 88 84 102 0.411 8.82 5.01 Init - 178006 177858 149 1 2 91 20 75 0.371 0.61 5.00 Prom - 178323 178284 40 -3.75 6.06 PlyA - 178672 178667 6 1.05 6.05 Term - 185708 185564 145 1 1 104 47 101 0.552 4.00 6.04 Intr - 191714 191579 136 0 1 53 93 60 0.116 1.71 6.03 Intr - 206980 206875 106 1 1 79 87 68 0.051 4.67 6.02 Intr - 208500 208434 67 2 1 124 75 50 0.546 5.29 6.01 Init - 223097 223042 56 2 2 60 84 64 0.454 4.01 6.00 Prom - 241572 241533 40 -2.95 7.04 PlyA - 242285 242280 6 1.05 7.03 Term - 248901 248814 88 2 1 137 48 42 0.445 1.25 7.02 Intr - 251377 251128 250 2 1 24 44 178 0.640 2.57 7.01 Init - 253599 253593 7 0 1 114 77 0 0.774 2.74 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 58544 58377 168 2 0 86 55 133 0.812 6.70 S.002 Sngl - 68316 67384 933 0 0 63 43 270 0.984 16.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:55767076_56022440|GENSCAN_predicted_peptide_1|94_aa MERRDNGGLDLGNGSGVREKEMGLRAIKEAQLSGFHDLLDVRDEGEGNTAAAMRVHFWHL CHMGKCVRGKAGGAQRDSNALSNTLRADVMAKRV >gi568815596r:55767076_56022440|GENSCAN_predicted_CDS_1|285_bp atggagaggagagacaatggtggcctggacttgggaaatggcagtggtgttagagaaaaa gagatgggtctgagagccattaaggaggcacaattgtcaggattccatgacttactggat gtgagagatgagggagaaggaaacacagctgcagccatgagagttcacttctggcatctt tgccatatgggaaaatgtgtaagaggaaaggcaggtggagcccaacgggactccaatgcc ctatcaaacaccctgagggcagatgttatggccaagcgtgtgtga >gi568815596r:55767076_56022440|GENSCAN_predicted_peptide_2|753_aa MGDFNTPLSTLDRSMRQKVNKDTQELNSALHEADLIDIYRTLHPKTTEYTFFSAPHHTYS KIDHIVGSKVLLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNRATAWKLNNLLLNDYWV HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLRKINESRSWFFERINKIDRPLARLIK KKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKPENPEEMDKFLDTYTLPRLN QEEVGSLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEKLGPCLLKLFQSIEKE GILHNSFYEASILLIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHPD QVGFIPGMQGWFNICKSINVIQHLNRTNNKNHMIISIDAEKAFDKIQQPFMLKTLNKLGI DGMYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLNLISNFSKVSGYKINVQKSQAFL YTNNRQTESQIMSELPFTIASKRIKYLGIQLARDVKDLFKENYKPLLNEIKEDTNKWKNI PCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEKTALKFIWNQNRAHIAKSILS QRNKAGGIMLPDFKLYYKATVTKQHGTGTKTET >gi568815596r:55767076_56022440|GENSCAN_predicted_CDS_2|2262_bp atgggagactttaataccccactgtcaacattagacagatcaatgagacagaaagttaac aaggatacccaggaattgaactcagctctgcatgaagcagacctaatagacatctacaga actctccaccccaaaacaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagtactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc acccaaaaccgcgcaactgcatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaactcttcgaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccactagcaagactaataaag aagaaaagagagaaaaatcaaatagatgcaataaaaaatgacaaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactataaacacctctacgcaaataaa ccagaaaatccagaagaaatggataaattcctcgatacatacaccctcccaagactaaac caggaagaagttggatctctgaatagaccaataacaggctctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag aggtacaaggagaaattgggaccatgccttctgaaactattccaatcaatagaaaaagag ggaatcctccataactcattttatgaggccagcatcctcctgataccaaagccgggcaga gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccctgat caagtaggcttcatccctgggatgcaaggctggttcaacatatgcaaatcaataaatgta atccagcatttaaacagaaccaacaacaaaaaccatatgattatctcaatagatgcagaa aaggcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt gatgggatgtatctcaaaataataagagctatctatgacaaacccacagccaatatcata ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct ctctcaccactcctattcaacatagtgttggaagttctggccagggcaatcaggcaggag aaggaaataaaaggtattcagttaggaaaagaggaagtcaaattgtccctgtttgcagat gacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaacctgata agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta tacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct tcaaagagaataaaatacctaggaatccaacttgcaagggatgtgaaggacctcttcaag gagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacatt ccgtgctcatgggtaggaagaatcaatattgtgaaaatggccatactgcccaaggtaatt tatagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa actgctttaaagttcatatggaaccaaaatagggcacacattgccaagtcaatcctaagc caaaggaacaaagctggaggcatcatgctacctgacttcaaactatactacaaggctaca gtaaccaaacagcatggtactggtaccaaaacagagacatag >gi568815596r:55767076_56022440|GENSCAN_predicted_peptide_3|829_aa MEEDLTSKWNIKKAEVAILVSDKTDFKPTTIKRDKEGHCIMVKGSIQQEELTILNIYAVN TGAPRFIKQVLRDLQRDLESHTIIMGDFNTSLSILDRSMRQKVNKDIQDLNSALHQVDLI DIYRTLHPKSTEYTFFSAPHSTYSKIDHIIGSKALLCKCKRTEITTHCLSDHSAIQLELR IKKLTQNHTTTWKLNNQLLNDYWVNNEMKAEIKMLFETNENKHTTYQNLWDKLKAMCRGK FTALNAHKRKQERSKIDTLTSQLKELEKQQQKNSKASRRQEITKIRAELKEIETQKTFKK SMNPGAVQGQQEQYHCSGSTTAVAVAEGLSVASGSSTSEKCRAAANGNVQSGGGVSVQLA QGRGQLVKSRQLGVHRERRLGSPPYGDFDMLVPQGSKGRTTAVAMGLSGLSVASGSSTPG KHRATTSGNAQLGPEGNKGNTTAASVTEGLWIMSGISSPEKRRAATNGSVQAGAGWLCWG PKSRDPAHQDPQLQVCWSLLRSTPDPVCLSISSGGCRTADIGEPQMLLPDRSSGSFVSEE YPALHWPVLSIPPIPELQHLATASQELLTPTCNLQMCIEIVPNCTKAKNHAIWRLCKQDY KAPRYPVPVLLISSAYLKALHIRPGQLNAALGPDANAAGKRLHLQRSNMALSSNQGFICK SSKQSLITGVVTRWCGLRISGKRLMEDTKNGLIQYPFQSPSKMTSCELQRLGTDKQKDSS YLCRLKCPCLTALKRAVVLPARSWRSENGQTASSSGSLTPEQPNWEAPPSRGRLTPHMAG YSSETKVPEEQSGSSICSSPISAVLQPPLLILRQTGSGVDLSKLQQTCS >gi568815596r:55767076_56022440|GENSCAN_predicted_CDS_3|2490_bp atggaggaagatctaacaagcaaatggaatataaaaaaagcagaggttgcaatcctagtc tctgataaaacggactttaaaccaacaacgatcaaaagagacaaagaaggccattgcata atggtaaagggatcaattcaacaagaagagctaactatcctaaatatatatgcagtcaat acaggagcacccagattcataaagcaagtccttagagacctacaaagagacttagaatcc cacacaataataatgggagactttaacacctcactgtcaatattagacagatcaatgaga cagaaggttaacaaggatatccaggacttgaactcagctctgcaccaagtggacctaata gacatctacagaactctccacccaaaatcaacagaatatacattcttctcagcaccacat agcacttattctaaaattgaccacataattggaagtaaagcactcctttgcaaatgtaaa agaacagaaatcacaacacactgtctctcagaccacagtgcaatccaattagaactcagg attaagaaactcactcaaaaccacacaactacatggaaactgaacaaccagctcctgaat gactactgggtaaataacgaaatgaaggcagaaataaagatgttgtttgaaaccaatgag aacaaacacacaacataccagaatctctgggacaaattaaaagcaatgtgtagagggaaa tttacagcactaaatgcccataagagaaagcaggaaagatctaaaatcgacaccctaaca tcacaattaaaagaactagagaagcaacagcagaaaaattcaaaagctagcagaaggcaa gaaataactaagatcagagcagaactgaaggagatagagacacaaaaaaccttcaaaaaa tcaatgaatccaggagctgtccaagggcagcaggagcagtaccactgcagtggcagtacc actgcagtggcagtggcagaggggctttcagttgcctctgggagctccacttcagagaaa tgcagagctgctgccaatgggaatgttcagtcagggggtggggtgtctgtgcagctggcc caaggtagggggcaactggtgaagagcaggcagttgggggttcacagggagcgcagactg ggatcccctccatatggtgactttgacatgctggtgccacaaggcagcaagggcagaacc actgcagtggcaatggggctgtcggggctgtcggttgcctcagggagctccaccccaggg aaacacagagccactaccagtggaaatgctcagctggggcctgagggcaataagggcaat accactgcagcttcagtgaccgagggactgtggatcatgtctgggatttcctccccagag aaacgcagagctgccaccaacggaagtgttcaagcaggggcagggtggctgtgctggggt cccaagtcaagagatcctgcgcatcaggaccctcagctgcaggtctgttggagtttgctg aggtccactccagaccctgtttgcctgagtatcagcagtggtggctgcagaacagctgat attggtgaaccgcaaatgctgctgcctgatcgttcctctggaagttttgtctcagaggag tacccggccctgcactggcctgttctctccatccctcctatcccggagcttcagcacttg gccactgcctctcaggaactgctcaccccaacctgcaacctccaaatgtgcattgaaatt gtgcccaactgcaccaaagccaagaaccatgcaatctggaggttgtgcaagcaagattac aaggctccaaggtatccagtgcctgtgctgctcatctcttctgcttacctcaaagctttg catattcggcccgggcaactgaatgccgcgttgggaccagatgcaaatgctgcaggaaag cgactacatttgcagagaagcaatatggctttatcttccaatcagggcttcatttgcaag tctagtaaacagtcactgataactggggtggttacccgctggtgtgggttgaggatctct ggaaagagacttatggaagataccaaaaatgggctcattcaatatccattccaatctcct tctaaaatgacttcctgtgaattgcagagactgggcacagacaaacaaaaagacagcagt tacctctgcagacttaaatgtccctgtctgacagctttgaagagagcagtggttctccca gcacgcagctggagatctgaaaatgggcagactgcctcctcaagtgggtccctgaccccc gagcagcctaactgggaggcaccccccagtaggggcagactgacacctcacatggccggg tactcctctgagacaaaagttccagaggaacaatcaggcagcagcatttgcagttcacca atatcagctgttctgcagccaccactgctgatactcaggcaaacagggtctggagtggac ctcagcaaactccaacagacctgcagctga >gi568815596r:55767076_56022440|GENSCAN_predicted_peptide_4|514_aa MLKALFLTMLTLALVKSQDTEETITYTQCTDGYEWDPVRQQCKDIDECDIVPDACKGGMK CVNHYGGYLCLPKTAQIIVNNEQPQQETQPAEGTSGATTGVVAASSMATSGVLPGGGFVA SAAAVAGPEMQTGRNNFVIRRNPADPQRIPSNPSHRIQCAAGYEQSEHNVCQALKYENQS SKEKAKIQGVPARVGARTIFKHTFVEKKQKRSKQLSHTFIFQALLLTLKDRTPYIDECTA GTHNCRADQVCINLRGSFACQCPPGYQKRGEQCVDINECDASNQCAQQCYNILGSFICQC NQGYELSSDRLNCEDIDECRTSSYLCQYQCVNEPGKFSCMCPQGYQVVRSRTCQDINECE TTNECREDEMCWNYHGGFRCYPRNPCQDPYILTPENRCVCPVSNAMCRELPQSIVYKYMS IRSDRSVPSDIFQIQATTIYANTINTFRIKSGNENGEFYLRQTSPVSAMLVLVKSLSGPR EHIVDLEMLTVSSIGTFRTSSVLRLTIIVGPFSF >gi568815596r:55767076_56022440|GENSCAN_predicted_CDS_4|1545_bp atgttgaaagcccttttcctaactatgctgactctggcgctggtcaagtcacaggacacc gaagaaaccatcacgtacacgcaatgcactgacggatatgagtgggatcctgtgagacag caatgcaaagatattgatgaatgtgacattgtcccagacgcttgtaaaggtggaatgaag tgtgtcaaccactatggaggatacctctgccttccgaaaacagcccagattattgtcaat aatgaacagcctcagcaggaaacacaaccagcagaaggaacctcaggggcaaccaccggg gttgtagctgccagcagcatggcaaccagtggagtgttgcccgggggtggttttgtggcc agtgctgctgcagtcgcaggccctgaaatgcagactggccgaaataactttgtcatccgg cggaacccagctgaccctcagcgcattccctccaacccttcccaccgtatccagtgtgca gcaggctacgagcaaagtgaacacaacgtgtgccaagcactcaagtatgaaaatcagtct tctaaggaaaaagcaaaaattcagggagtacctgctagggtaggggccaggactatattc aaacatacatttgtggagaaaaagcaaaaacggagcaaacaactgagccatacctttatc ttccaggcgctgcttcttaccctgaaagacaggaccccctacatagacgagtgcactgca gggacgcacaactgtagagcagaccaagtgtgcatcaatttacggggatcctttgcatgt cagtgccctcctggatatcagaagcgaggggagcagtgcgtagatataaatgaatgtgat gccagcaatcaatgtgctcagcagtgctacaacattcttggttcattcatctgtcagtgc aatcaaggatatgagctaagcagtgacaggctcaactgtgaagacattgatgaatgcaga acctcaagctacctgtgtcaatatcaatgtgtcaatgaacctgggaaattctcatgtatg tgcccccagggataccaagtggtgagaagtagaacatgtcaagatataaatgagtgtgag accacaaatgaatgccgggaggatgaaatgtgttggaattatcatggcggcttccgttgt tatccacgaaatccttgtcaagatccctacattctaacaccagagaaccgatgtgtttgc ccagtctcaaatgccatgtgccgagaactgccccagtcaatagtctacaaatacatgagc atccgatctgataggtctgtgccatcagacatcttccagatacaggccacaactatttat gccaacaccatcaatacttttcggattaaatctggaaatgaaaatggagagttctaccta cgacaaacaagtcctgtaagtgcaatgcttgtgctcgtgaagtcattatcaggaccaaga gaacatatcgtggacctggagatgctgacagtcagcagtatagggaccttccgcacaagc tctgtgttaagattgacaataatagtggggccattttcattttag >gi568815596r:55767076_56022440|GENSCAN_predicted_peptide_5|256_aa MKARRLWNNTFNMPNNNNSQFGILYPTEVSFKISIMKTFSDKQKWSLPPTFWTLGPTPVA CQGLLGLQPQTEGCTVSFHTFEVLRLGLASLLFSLQKAYCGTSPCDHSLPPSHLESAGKE RSFSSPPELCLLLKDRVKVLTPSVSLSSAPLRPPPFPPFLLLLLPPRPLPDFARSDPRGC PPLRTPLAARARSAARPHRQPETEGSVSGTQTDKTKENQSLWAAPPHNLGLLNKAVRRER RGNATLRVCAGGCEKL >gi568815596r:55767076_56022440|GENSCAN_predicted_CDS_5|771_bp atgaaagccagaagattatggaataacaccttcaacatgccaaataacaataactctcaa tttggaatcttatacccaacagaagtatctttcaagataagcatcatgaagacattttca gacaagcaaaaatggagtttaccacccacgttttggactcttggacctacaccagtggct tgccaggggctcttgggccttcagccacagactgaaggctgcactgtcagcttccatact tttgaggttttgagacttggactggcttccttgctcttcagcttgcagaaggcctattgt gggacttcaccttgtgatcattcgctaccaccctcccaccttgagtcagcaggaaaggaa aggagcttttcctctcctcctgagctctgcctcctgctcaaggacagagtcaaagtttta accccttcagtttctctctcctccgccccccttcgccctccccctttccctccctttctc ctcctcctcctgccgccgcggccgctgccggacttcgccagatcagacccacggggctgc cctcccctgcgcactcccctcgctgcccgggcccggagcgcagcgcggccgcacaggcag cctgaaaccgaaggtagcgtgtcggggacccagactgataagacaaaagagaatcagtcg ctttgggctgcccctccacacaacctgggacttttaaacaaagctgtgcgcagagaaagg cgtggaaatgccactttgagagtttgtgctgggggatgtgagaagctctga >gi568815596r:55767076_56022440|GENSCAN_predicted_peptide_6|169_aa MNKHSANTWKFAVNEGESIPIIKLKSQQASKDEVQSVAHDKKTDGSWRMTVDNHKLEQVM TLTAAVVLDMVLLLEQVLGTGEEKVRHNLVPPEVHRRTKETVIEMIHMQGSIWKNRGLSA VRIGETGMSLEARGGKRQKIGDVETAQNHPVPCMPPTIRGCPCTIYGCS >gi568815596r:55767076_56022440|GENSCAN_predicted_CDS_6|510_bp atgaataagcacagtgccaacacttggaagtttgcagtcaatgaaggagaaagcatacct ataattaaactcaagtcccagcaggcatctaaagatgaggtacaaagtgtggcccatgac aagaagacagatggatcctggagaatgacagtggataatcataaacttgagcaagtgatg actctaactgcagctgttgtactagatatggttttattgcttgagcaagttcttggcaca ggagaagagaaagtgagacacaaccttgttcctccagaagttcacagaagaacaaaggag acagtgatagaaatgatacatatgcaaggcagcatttggaaaaatagaggactatccgca gtacgaataggggagacaggcatgagtttggaagcaagaggagggaagagacagaagatt ggagatgtggaaacagcccagaaccatcctgtcccctgcatgcccccgaccatcagaggc tgcccatgtaccatatatggctgttcctga >gi568815596r:55767076_56022440|GENSCAN_predicted_peptide_7|114_aa MPGLSRPSPGTLLTAIANNRSQQEYVRTGAGASWVRLCSEKGPPQSCAKIAHFQVALLPF AIAGFLLLFMQQMFGKICHKALWLLRITVSKINLDPRVPSPWGICLSLNLLSMQ >gi568815596r:55767076_56022440|GENSCAN_predicted_CDS_7|345_bp atgcccggcttgtcccgaccgtctcctgggacgcttttgacggcaattgcaaacaacaga agccagcaggaatatgtaagaacaggtgctggggcctcctgggtacgcctgtgctcagag aaagggcctccacagagctgtgccaagatagcccacttccaagttgccctcttgcccttc gccattgcaggatttttgctccttttcatgcagcagatgtttggaaagatttgccacaaa gcattatggcttctaagaataactgtcagcaagattaatttagatccacgtgttccctct ccctggggcatttgcctttctttgaatcttctctcaatgcagtga