GENSCAN 1.0 Date run: 7-Nov-116 Time: 17:51:22 Sequence gi568815586r:76924018_77164635 : 240618 bp : 38.40% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 Intr - 5608 5519 90 2 0 95 82 43 0.007 3.45 1.06 Intr - 16252 16222 31 1 1 89 78 41 0.013 -0.01 1.05 Intr - 26896 26785 112 0 1 94 58 46 0.115 1.66 1.04 Intr - 38876 38797 80 2 2 0 75 138 0.164 1.13 1.03 Intr - 39518 39162 357 2 0 39 42 237 0.020 8.73 1.02 Intr - 45062 44909 154 1 1 -5 31 155 0.003 -0.35 1.01 Init - 49659 49295 365 0 2 37 13 258 0.013 9.77 1.00 Prom - 58210 58171 40 -3.35 2.04 PlyA - 58471 58466 6 1.05 2.03 Term - 60273 60062 212 1 2 16 35 298 0.938 13.77 2.02 Intr - 60484 60275 210 2 0 36 -24 277 0.798 9.26 2.01 Init - 60652 60631 22 1 1 100 52 55 0.724 3.10 2.00 Prom - 67594 67555 40 -6.05 3.00 Prom + 69363 69402 40 -5.15 3.01 Init + 74888 74978 91 1 1 83 87 126 0.501 12.70 3.02 Intr + 79882 79988 107 2 2 102 42 83 0.024 4.11 3.03 Term + 89734 89799 66 0 0 110 39 126 0.823 6.86 3.04 PlyA + 90350 90355 6 1.05 4.17 PlyA - 92031 92026 6 1.05 4.16 Term - 100168 99998 171 1 0 108 37 162 0.995 9.94 4.15 Intr - 101965 101541 425 2 2 122 94 233 0.976 19.96 4.14 Intr - 104121 103866 256 0 1 60 110 205 0.949 15.99 4.13 Intr - 106315 105814 502 0 1 103 116 371 0.974 33.46 4.12 Intr - 109105 109033 73 2 1 106 98 48 0.987 5.15 4.11 Intr - 110025 109840 186 1 0 63 65 192 0.821 13.14 4.10 Intr - 119217 119048 170 2 2 52 79 179 0.739 12.07 4.09 Intr - 119427 119352 76 1 1 79 88 40 0.774 0.75 4.08 Intr - 120778 120620 159 2 0 69 69 138 0.975 9.04 4.07 Intr - 122311 122021 291 2 0 82 70 293 0.982 23.08 4.06 Intr - 126727 126559 169 1 1 86 76 177 0.999 14.90 4.05 Intr - 132113 131838 276 2 0 65 97 232 0.549 18.59 4.04 Intr - 137247 137071 177 0 0 70 41 115 0.449 4.19 4.03 Intr - 139131 139029 103 2 1 -1 49 165 0.222 2.96 4.02 Intr - 141577 141328 250 2 1 7 86 186 0.469 5.87 4.01 Init - 161183 161108 76 2 1 83 99 48 0.851 6.70 4.00 Prom - 161360 161321 40 -4.25 5.05 PlyA - 162061 162056 6 1.05 5.04 Term - 171492 171220 273 0 0 88 37 256 0.364 14.99 5.03 Intr - 185481 185249 233 1 2 49 86 122 0.295 4.67 5.02 Intr - 190330 190249 82 1 1 0 119 111 0.063 3.69 5.01 Init - 198303 198157 147 0 0 71 95 114 0.707 10.54 5.00 Prom - 202751 202712 40 -7.15 6.00 Prom + 203700 203739 40 -4.35 6.01 Init + 209820 210000 181 2 1 52 69 108 0.876 4.69 6.02 Intr + 212040 212204 165 1 0 84 94 85 0.535 7.71 6.03 Term + 220670 221835 1166 0 2 67 32 367 0.810 20.01 6.04 PlyA + 221972 221977 6 1.05 7.04 PlyA - 222251 222246 6 1.05 7.03 Term - 227000 226845 156 2 0 105 43 47 0.468 -1.15 7.02 Intr - 228899 228821 79 1 1 80 95 57 0.612 4.23 7.01 Init - 232909 232794 116 1 2 51 94 78 0.699 4.43 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 96368 96793 426 1 0 47 50 191 0.911 7.24 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:76924018_77164635|GENSCAN_predicted_peptide_1|397_aa MRGNQKTNSDNMTKQGSLPAPQNHTSSPAMDPKQKEIPDLPENEFRRLVIKLIREAPEKG KAQCKEIQKMIPDVKGEIDSIKKKQSKLQETMDTYIEMQNALENLSNTIEQVEEKFQSSK TSNPINGNPLGGKEVIIRKRNLHTQVYRSTIRNCRIMEPTQMPINQQVDKETMVGIILPD EQQMTCWLQTVRGESVFPFPGVFFHVEKSTTSVILLALQPNLITGVNTVTSLSDKIGKNK RGILMSSSPFFLLVANLIARMSGRSCQKPHGTFHQPQRESLPFIQHCQWDKEDRLWLLTV HTRVPDADAKQHSLERMGREVIHSNFLLAMNICSDSVHQFNLSPNKVLSKPWAIEQTTTD LLSAIIGPQPSRSLQHHLEKELLVGNAKEQKIFENGX >gi568815586r:76924018_77164635|GENSCAN_predicted_CDS_1|1191_bp atgagagggaaccagaaaaccaattctgataatatgacaaaacaaggttccttaccagcc ccccaaaatcatactagttcaccagcaatggatccaaaacaaaaagaaatccctgattta cctgaaaacgaattcaggaggttagttattaaactaatcagggaggcaccagaaaaaggc aaagcccaatgcaaggaaatccaaaaaatgataccagacgtgaagggggaaatagatagc ataaagaaaaaacaatcaaaacttcaagaaacaatggacacatacatagaaatgcaaaat gctctggaaaatctcagcaatacaattgaacaagtagaagaaaaatttcagagctcaaag acaagcaatcccattaatgggaatccactcggaggaaaagaagtcattatacgaaaaaga aacttgcacacgcaagtttatagaagcacaattcgcaattgcagaatcatggaaccaacc caaatgcccatcaatcaacaagtggataaagaaaccatggtgggtatcattctaccagat gagcaacaaatgacgtgctggctgcagactgttagaggggagagtgtctttccctttcct ggcgttttctttcatgttgagaaaagcactaccagtgtaatactgctggctctccagccc aatctcatcacaggagtgaacacagtcacttctctatctgataaaattgggaagaacaaa cgaggtattctcatgtcatcttctccctttttcctcttggtagcaaacctgattgccagg atgagtggtaggagctgccaaaaacctcacgggaccttccaccagccccagagagagagc ctgcccttcatccagcactgccagtgggacaaagaggaccgtctgtggctgctaactgtt cacactcgggtccctgatgctgatgccaagcaacactcactcgagaggatgggcagggaa gtaattcattcaaacttccttctggccatgaacatttgctctgattctgttcatcagttt aacctgtcacccaacaaggtcctgagcaaaccctgggccatagaacaaacaactactgac ctgctttctgccatcatagggcctcagccctccagaagtctgcagcaccaccttgaaaag gagctgctggtagggaatgcaaaagaacagaaaatttttgaaaatggagnn >gi568815586r:76924018_77164635|GENSCAN_predicted_peptide_2|147_aa MAAAKDTQNDLPEWKERGTGDVKLLKHKEKGAIRLLMQRDKTLNICANHYIMRMMELKPK QVVTVPGSGTPMLTSPTIPKPELLAICFLNAENAQKFKTKFEECRKEIEEREKKAGSGKN DHAKKVAKKLEALSVKEETKEDAEEKQ >gi568815586r:76924018_77164635|GENSCAN_predicted_CDS_2|444_bp atggcggccgccaaggacactcagaacgatctcccagaatggaaggagcgaggcactggt gacgtcaagctcctgaagcacaaggagaaaggggccatccgcctcctcatgcagagggac aagaccctgaatatctgtgccaaccactacatcatgcggatgatggagctgaagcccaag caggtagtgaccgtgcctgggtctggaacacccatgctgacttcgccgaccatacccaag ccggagctgctggccatctgcttcctgaatgctgagaatgcacagaaattcaaaacaaag tttgaagaatgcagaaaagagatcgaagagagagaaaagaaagcaggatcaggcaaaaac gatcatgccaaaaaagtggcaaaaaagctagaagctctctcggtgaaggaggagaccaag gaggatgctgaggagaagcaataa >gi568815586r:76924018_77164635|GENSCAN_predicted_peptide_3|87_aa MVHKHKLEAFRDCSLDFGVKQKEGSMPLKEDCYKTMVPQESSKKWNLGERKSSRSRDILL MAAAASPTQCEDDEDEDLYDDPVSFNE >gi568815586r:76924018_77164635|GENSCAN_predicted_CDS_3|264_bp atggtgcataaacataagttggaagcgttcagggactgtagcttggattttggtgtaaag caaaaggaaggctccatgcccctgaaggaagactgctataaaactatggtaccccaagag tctagcaaaaagtggaatttaggagaaaggaagtcttcaagatccagggacattctcctc atggcagcagcagcctcacctactcaatgtgaagatgatgaggatgaagatctttatgat gatccagtttcatttaatgaatag >gi568815586r:76924018_77164635|GENSCAN_predicted_peptide_4|1119_aa MDEAGNHRSQQTNTRTENQTLHVLTRARGGVLAGTTIQAGVALGRSERWRCPDAAGSPPA QGTRRGDLRASLSLPDAAARLLIAAPPSLIGLGPWRSLQRQEASLGKQGQPENTVAARAG SIYRANTLDKGVSHIRSRTEQDDVNKYVQKAAGSLILEFMKGRNQVRTGIVSIQVDFKAL YVGVANNERAAEGCVCTLAVWFENIFVDRSRMAPKTPIKNEPIDLSKQKKFTPERNPITP VKFVDRQQAEPWTPTANLKMLISAASPDIRDREKKKGLFRPIENKDDAFTDSLQLDVVGD SAVDEFEKQRPSRKQKSLGLLCQKFLARYPSYPLSTEKTTISLDEVAVSLGVERRRIYDI VNVLESLHLVSRVAKNQYGWHGRHSLPKTLRNLQRLGEEQKYEEQMAYLQQKELDLIDYK FGERKKDGDPDSQEQQLLDFSEPDCPSSSANSRKDKSLRIMSQKFVMLFLVSKTKIVTLD VAAKILIEESQDAPDHSKFKSASCKRCEVVEMEGHTDLPLAFRYCRQHDRNRVLYLQAKV RRLYDIANVLTSLALIKKVHVTEERGRKPAFKWIGPVDFSSSDEELVDVSASVLPELKRE TYGQIQVCAKQKLARHGSFNTVQASERIQRKVNSEPSSPYREEQGSGGYSLEIGSLAAVY RQKIEDNSQGKAFASKRVVPPSSSLDPVAPFPVLSVDPEYCVNPLAHPVFSVAQTDLQAF SMQNGLNGQVDVSLASAASAVESLKPALLAGQPLVYVPSASLFMLYGSLQEGPASGSGSE RDDRSSEAPATVELSSAPSAQKRLCEERKPQEEDEPATKRQSREYEDGPLSLVMPKKPSD STDLASPKTMGNRASIPLKDIHVNGQLPAAEEISGKATANSLVSSEWGNPSRNTDVEKPS KENESTKEPSLLQYLCVQSPAGLNGFNVLLSGSQTPPTVGPSSGQLPSFSVPCMVLPSPP LGPFPVLYSPAMPGPVSSTLGALPNTGPVNFSLPGLGSIAQLLVGPTAVVNPKSSTLPSA DPQLQSQPSLNLSPVMSRSHSVVQQPESPVYVGHPVSVVKLHQSPVPVTPKSIQRTHRET FFKTPGSLGDPVLKRRERNQSRNTSSAQRRLEIPSGGAD >gi568815586r:76924018_77164635|GENSCAN_predicted_CDS_4|3360_bp atggatgaagctggaaaccatcgttctcagcaaactaacacaagaacagaaaaccaaaca ctgcatgttctcactcgcgccagaggaggtgttttagcggggactacgatccaggctgga gttgcgctcggccggtctgagcgctggcgctgcccggacgccgcggggtccccgccagcc cagggcactcggcgcggggatctgcgcgcctcgctctcccttcccgatgccgccgcccgg ctgctgatcgccgcaccaccttccctcatcggcttgggtccgtggaggtccctgcagagg caggaagcctccttaggaaagcagggccagcctgaaaacacagtggctgctcgagcaggt agtatctaccgtgcaaatacgctggacaaaggggtgagtcacatccgaagcaggacggag caggatgacgtaaataaatatgtgcagaaagcagcaggtagcctgattttggaattcatg aaaggtcgtaatcaggtccgaacaggaattgtcagcatacaggttgattttaaagcttta tacgtgggagtggcaaataatgagagagcggcagagggttgtgtgtgcactcttgcggtt tggtttgaaaatatatttgttgatcgatcaaggatggccccgaagactccaataaaaaat gaaccaattgatttatcgaagcaaaaaaaatttactccagaaagaaatcccattactcca gttaagtttgttgacagacagcaagcggaaccatggacacccacagctaacctgaagatg ctcattagtgctgccagcccagatataagggaccgggagaagaaaaagggactattccga cccattgaaaacaaggacgatgcatttacagattctctacagcttgatgttgttggggac agtgctgtggacgaatttgaaaagcaaaggccaagcagaaaacagaaaagtttaggactc ctgtgccagaagtttctagctcgctatccaagttatcccttgtcaactgagaaaactacc atctccctagatgaagttgctgtcagtcttggtgtggaaaggagacgcatctatgacatt gtaaatgtgctggagtcgctgcatctggtcagccgggtggctaagaatcagtatggctgg catggacggcacagcctgccaaaaaccctgaggaacctccagagactaggagaggagcag aaatatgaagagcaaatggcctacctccaacagaaagagctggacctgatagattataaa tttggagaacgtaaaaaagatggtgatccagattcccaggaacaacagttactggatttc tctgaacccgactgtccctcttcatctgcaaacagtagaaaagacaagtctctgagaatt atgagccagaagtttgtcatgctgttcctcgtctccaaaaccaagattgtcactctggat gtggctgccaaaatactgatagaagaaagccaagatgccccagaccatagtaaatttaaa agtgcctcttgtaagaggtgtgaagttgtggaaatggaaggacacactgaccttccctta gccttcagatactgcaggcagcatgaccgtaatcgtgttttatatcttcaagcaaaggta cgacgcctctatgacatagccaatgttctgaccagcttggctctgataaagaaagtgcat gtaacagaagagcgaggtcgtaaaccagccttcaagtggatcgggcctgtggacttcagc tcaagtgatgaagaactggtggatgtttctgcatctgtcttaccagaattgaaaagagaa acatatggccagattcaagtctgtgcaaaacagaagctggctcgccatggttcttttaac acagttcaggcttctgagaggatccagaggaaagtgaactcagaaccgagcagcccgtac agagaagaacaaggatcaggtggctactctttagaaattggaagcctggcagctgtctat agacagaaaatagaagacaattcacagggaaaagcctttgccagtaagagagtggtgcct ccatcaagcagcttggaccctgttgctcctttccctgtcctctctgttgacccagaatat tgtgttaatcctttagcccacccagtattttctgttgctcagacggacctgcaggcattc tccatgcagaacggtctgaatggacaagtggatgtctcacttgcttctgcagcctctgct gtggagagcctgaagccagcactccttgctggccagcctctagtgtatgtgccctctgcc tcactgttcatgctgtatggaagtctgcaggagggaccagcgtcagggtcagggtcagag agggatgacagaagctcagaagccccagccacagtagagctgtcatctgcaccctcagct cagaagcgcctctgtgaggagaggaaacctcaggaggaggatgagccagccactaaaagg caaagtagggaatatgaagacggcccgctgtcgcttgtcatgcccaagaaaccctcagat tccacagaccttgcctctcccaagactatgggtaacagggcatctatacccctcaaagac attcatgtgaatggccaactccctgctgcagaagagatttcaggaaaggcaacagcaaac tctcttgtttcttctgagtggggaaatccttcaagaaatacagatgttgaaaagccttca aaagaaaatgaaagcaccaaagagccttctttgctacaatatctttgtgtgcagtctcct gcaggattaaatggtttcaatgtacttttatctggcagtcaaaccccccctactgtgggc ccgtcctcaggtcagctgccgtctttcagtgtcccttgcatggtcttaccatctccacct ctgggcccttttcctgttctctattctcctgcaatgccgggcccggtttcttctactctt ggtgctctcccaaacacaggacctgtgaatttcagcttgcctggccttggatcaatagcc cagcttctcgtcggccccacagctgtggttaatccaaagtcgtccacactcccttctgca gaccctcagcttcagagtcagccctcactaaacctaagtccagtgatgtcaaggtcacac agtgtcgtccaacaacctgagtcccccgtttacgtgggacatccagtctcagtagtaaaa ttacatcagtcaccagttccagtgacccccaagagcatccaacgcacacatcgtgagacg tttttcaagacacccggcagccttggagaccctgtcctgaagagaagagaaaggaaccag tcacgaaacaccagctcggcccagaggagactagaaatccccagcggcggcgctgactaa >gi568815586r:76924018_77164635|GENSCAN_predicted_peptide_5|244_aa MAPKRKSSDTGNSDMPKRSCKVLPLSEKVKALDLISKEKNHTLRLLRSMELVARMYKEMD NPETYFSSRSVRLHGPEVAHKNMTSCSNAGPLISSYGTCKIDIGLPLKSSAAELRTLSME YLIDTELPLKSSAAELRTLSMEYFNTLKLKNKAKDRSSLPAMGQSWTENDFDELTEVGFR RSVVTHFSELKEHVLTHCKEAKNLEKRLDEWLTRIARVEKNLNDMVELKTMAQELHDTCT SFNV >gi568815586r:76924018_77164635|GENSCAN_predicted_CDS_5|735_bp atggccccaaagcgcaaaagtagtgatactggcaactcggatatgccaaagagaagctgc aaagtgcttcctttaagtgaaaaggtgaaagcccttgacttaataagtaaagaaaaaaat catacactgaggttgctgagatctatggaattggttgcaaggatgtacaaggaaatggac aacccagaaacttatttcagtagtcgctcagttcgccttcatggacctgaggtagcacat aaaaatatgactagctgcagtaacgcgggtcccctcataagtagctacggtacttgtaag attgatataggacttcctctgaagagctcagctgctgaactgagaacattaagcatggaa tatttgattgatacagaacttcctctgaagagctcagctgctgaactgagaacactaagc atggagtattttaacactctgaaattaaaaaacaaggcaaaggatcgcagctccttgcca gcaatggggcaaagctggacggagaatgactttgatgagctgacagaagtaggcttcaga aggtcagtagtaacacacttctctgagctaaaggagcatgttctaacccattgcaaggaa gctaaaaaccttgaaaagaggttagacgaatggctaactcgaatagcccgtgtagagaag aacttaaatgacatggtggagctgaaaaccatggcacaagaacttcatgacacatgcaca agcttcaatgtctga >gi568815586r:76924018_77164635|GENSCAN_predicted_peptide_6|503_aa MGAEEVWRKNEGAPSRSGEDGFLPAHLRETSKGHTEKFEDREGELLENGGSKKWWSCIWR ETLDKIAGALEKSPLPDQSLCQEAFDIKVRVGVQSKMINNSKCIIGEKNESSEIPVLEVL ARAIRQEKEIKGIQLGKEEVKLSLFADDMIVFLENPIVSAQNLLKLIGNFSKVSGYKINV QKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKED TNKWKNIPCSWVGRINIVKMAILPKVIYRFNAITIKLPMTFFTELEKTTLKFIWNQKRAR IAKSILSQKNKAGGIMLLDFKLYYKATVTKTAWYWYQNRDIDQWNRREPSEITLHIYNYL IFDKPVKNRKWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIK TLAENLGNTIQDIGMGKDFMPKIPKAMATKAKIEKWDLIKLKGFCTSKETTIRVNRQPTE WEKIFAIYSSDQGLISRIYNELK >gi568815586r:76924018_77164635|GENSCAN_predicted_CDS_6|1512_bp atgggggctgaagaagtgtggagaaagaatgagggggcaccaagcagaagtggagaagat ggttttttgcctgctcatttaagagaaacctcaaaagggcacacagaaaaatttgaagat agagaaggagagcttttagagaatggaggctctaaaaaatggtggagctgcatctggaga gaaacattagacaaaattgctggagctttggagaagtcccctcttccagatcagtctctg tgtcaggaagcatttgatataaaggtgagggtgggagttcaatccaagatgataaacaat tctaaatgcatcattggggaaaaaaatgaatcatctgaaattccagtgttggaagttctg gccagggcaatcaggcaggagaaagaaataaagggtattcaattaggaaaagaggaagtc aaattgtccctgtttgcagatgacatgattgtatttctagaaaaccccatcgtctcagcc caaaatctccttaagctgataggcaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gatgtgaaagacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggat acaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatattgtgaaaatg gccatactgcccaaggtcatttatagattcaatgccatcaccatcaagctaccaatgact ttcttcacagaattggaaaaaactactttaaagttcatatggaaccagaaaagagcccgc atcgccaagtcaatcctaagccaaaagaacaaagctggtggcatcatgctacttgacttc aaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagat atagatcaatggaacagaagagagccctcagaaataacgctacatatctacaactatctg atctttgacaaacctgtcaaaaacaggaaatggggaaaggattccctatttaataaatgg tgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacacct tacacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccataaaa accctagcagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatg cctaaaataccaaaagcaatggcaacaaaagccaaaattgagaaatgggatctaattaaa ctaaagggcttctgcacatcaaaagaaactaccatcagagtgaacagacaacctacagaa tgggagaaaatttttgcaatctactcatctgaccaagggctaatatccagaatctacaat gaactcaaataa >gi568815586r:76924018_77164635|GENSCAN_predicted_peptide_7|116_aa MTETTEVEETLKRLSLQDHQWQIRQSLYAELKNLDFIWRENFRVPPPTTPNNRYSLGKTK AAWEDVSKEICVFCVNRILGDMSQNPKLSEFKELKSSLLTIIFLQPPQPPLSNGHS >gi568815586r:76924018_77164635|GENSCAN_predicted_CDS_7|351_bp atgactgaaaccacagaagttgaggaaactctcaagagattaagtctgcaggatcatcag tggcagatcagacagagcctttatgctgagttgaagaatttggactttatatggagggaa aatttcagggtgccccctcctaccacccccaacaacagatacagcttgggaaaaactaaa gctgcctgggaagatgtctccaaagagatatgtgtgttctgcgtaaataggattcttgga gatatgtcccaaaatcccaaattatctgagttcaaggagctaaagagctcccttctcact atcatcttcctgcagcctccccagcccccactcagcaatggccattcctag