GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:04:55

Sequence gi568815587f:76695617_76896675 : 201059 bp : 46.56% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 Intr -   1299   1125  175  0  1   48   80   124 0.522   7.11
 1.07 Intr -   7911   7719  193  2  1  115  100   197 0.998  23.09
 1.06 Intr -   8708   8545  164  2  2   93   86   198 0.992  18.87
 1.05 Intr -  16799  16651  149  0  2   40  100   136 0.472   9.95
 1.04 Intr -  17570  17492   79  2  1   77   19    67 0.370  -2.18
 1.03 Intr -  21026  20930   97  1  1  106   94   126 0.930  15.01
 1.02 Intr -  23615  23473  143  2  2   50   22    57 0.169  -5.55
 1.01 Init -  24383  23952  432  2  0   75   96   331 0.475  26.81
 1.00 Prom -  32122  32083   40                              -3.56

 2.00 Prom +  38618  38657   40                              -2.86
 2.01 Init +  42539  42598   60  1  0   84   58    78 0.591   3.97
 2.02 Intr +  43167  43340  174  2  0  108   54    25 0.434   1.24
 2.03 Term +  50256  50426  171  2  0  109   42    84 0.650   3.73
 2.04 PlyA +  53424  53429    6                               1.05

 3.03 PlyA -  54870  54865    6                               1.05
 3.02 Term -  57454  57246  209  2  2   92   46    78 0.857   1.50
 3.01 Init -  60809  60707  103  2  1  104   53   169 0.989  13.44
 3.00 Prom -  69447  69408   40                              -2.06

 4.00 Prom +  73331  73370   40                              -4.46
 4.01 Init +  73429  73431    3  0  0   60  115     0 0.370  -0.10
 4.02 Intr +  73833  73911   79  2  1   64   66    59 0.222   0.42
 4.03 Intr +  74644  74769  126  2  0   60   92    45 0.248   2.75
 4.04 Intr +  84049  84108   60  2  0   35  111    66 0.315   2.31
 4.05 Term +  87952  88649  698  2  2   41   42   295 0.427  13.92
 4.06 PlyA +  90368  90373    6                               1.05

 5.00 Prom +  92079  92118   40                              -3.76
 5.01 Sngl + 100001 101062 1062  1  0  108   48  1238 0.996 116.96
 5.02 PlyA + 102509 102514    6                               1.05

 6.05 PlyA - 105263 105258    6                               1.05
 6.04 Term - 112203 112114   90  0  0   22   49   115 0.021  -1.28
 6.03 Intr - 123830 123680  151  1  1   47   77   101 0.571   5.06
 6.02 Intr - 126403 126235  169  2  1   81   67    98 0.415   6.00
 6.01 Init - 132505 131626  880  1  1   55   11   298 0.065  13.44
 6.00 Prom - 135175 135136   40                              -3.96

 7.00 Prom + 135974 136013   40                              -4.06
 7.01 Init + 165361 165463  103  0  1   91   65   249 0.891  23.30
 7.02 Intr + 168149 168231   83  0  2   70   98    -1 0.086  -1.54
 7.03 Intr + 182486 182548   63  1  0  106   73    41 0.018   3.31
 7.04 Term + 182625 182747  123  2  0   48   42    78 0.018  -2.42
 7.05 PlyA + 183064 183069    6                               1.05

 8.03 PlyA - 183674 183669    6                               1.05
 8.02 Term - 183902 183744  159  2  0  122   44    27 0.262  -0.36
 8.01 Intr - 187616 187501  116  0  2   65   33   113 0.331   3.67

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:76695617_76896675|GENSCAN_predicted_peptide_1|478_aa
MGVGWQQSGCPGVPSSSPGLENCGLQGRPQWNWACKGDPATSGPSCGAVDALGGAGGLGD
DHPAEPCAWAEGLVDGRLVFLPYDMLLFSLSYCNRSYLSLGTGGPLQEAYDAVLTISQES
GPIDKAFAAARPDGEAAAHLEPEQSLGVVNRFLLLVCVCVYVYVYVYVYVYVYVYVYVYV
YVYVYVYVYVYVLGIHQLQLVWGPHRILLTAQELTFIHPPLNRQMRELRCENVATCLGIF
VISGEPASERELLWTAPELLPGPGRPGRRTLTGDIFSTGIILQEVLTRAHSTAPRDFQWK
FKSINQGKRTSVADSMLWLLEKYSQNLEDLIQEQTEELELKREKTERLLCQMIPPSVAEA
RKMGATVEPGYFDQVTIYFSDIVGFTIISALSEPIEAVGLLNDLYMLFDAVLGSHDLYKV
ETIGDAYMVVWGLPQCNGSQHAAEINNMALDILGSVGDFRKRHAPNVPICIRAGLHSX

>gi568815587f:76695617_76896675|GENSCAN_predicted_CDS_1|1434_bp
atgggggtggggtggcagcagagcgggtgcccgggagttccgagcagctctccagggcta
gagaactgtgggcttcagggcaggccacagtggaactgggcctgcaaaggtgaccctgcc
acctctggccccagttgtggtgctgtggatgcactcggcggtgctggggggcttggagat
gaccaccctgctgagccgtgtgcctgggccgagggcctggtggatgggagattagtcttc
ctgccctatgacatgctgctcttctccctgtcctactgcaaccgctcctacctgtctctc
ggcactggtggacccctgcaggaggcctatgatgcagtgctcaccatcagccaggagtct
ggccccatagacaaggcctttgctgctgcccggcctgatggagaggcggctgcccacctt
gagccagagcagtccctgggtgtggtcaatcgctttctccttcttgtatgtgtatgtgtg
tatgtatatgtatatgtatatgtatatgtatatgtatatgtgtacgtgtacgtgtacgtg
tacgtgtacgtgtacgtgtacgtgtacgtgtatgttttgggcatccaccagctgcagctg
gtgtggggcccccaccggatcctgctgacagcccaggagctcaccttcatccatccaccc
ctgaacagacagatgcgggagctgcggtgtgagaacgtcgccacctgcctgggcattttc
gtgatctctggggaacctgcttcggaaagagagctgctatggacagctcctgagctgctg
ccggggcctgggcgccctgggcggcgcaccctcacaggggacatcttcagcactggcatc
atcctgcaggaggtgctgactcgggcccactctactgctcctcgggacttccagtggaag
ttcaaaagcatcaaccaaggcaagaggaccagtgttgctgactccatgctgtggttgctg
gagaaatattcccagaacctggaggacctgattcaggagcagactgaggaactggagctg
aagagagagaagacagaaaggctgctctgtcagatgattcccccgtctgtggctgaagct
cggaaaatgggggcaactgtggaaccagggtattttgaccaggttaccatatacttcagt
gacattgtgggtttcaccatcatctcagccctgagtgaacccattgaggcggtgggcttg
ctcaacgatctctacatgctgtttgatgctgttctgggcagccatgacctgtataaggtg
gagaccattggggacgcctacatggtggtgtgggggctgcctcagtgcaatggcagtcag
catgcggccgagatcaataacatggctctggatatccttggctctgtgggtgacttccgg
aagaggcatgcacccaacgtgcccatttgcatcagggctggcctgcattcagnn

>gi568815587f:76695617_76896675|GENSCAN_predicted_peptide_2|134_aa
MLPGAILTAAIPTPGRSLALVRKLRPSKSWQDQAQNTGDKESERRKGPGGKMRAGQRVPR
GEKQEERRMGAERAQRAMEDHPHTAPDKLHNPSPYIRVCFWGNPNEDTMEEAEAMVQRRD
NELNLERDCEGGEN

>gi568815587f:76695617_76896675|GENSCAN_predicted_CDS_2|405_bp
atgcttcctggagccatcctcacagctgctattcccactccagggaggagcctggctctg
gtgagaaaactgaggcccagcaaatcatggcaggatcaggctcagaatacaggtgacaaa
gagagcgaaagaagaaagggccctggaggaaagatgagagctggacagagagttccaagg
ggtgagaagcaggaggagagacgaatgggagcggaaagggcgcagcgggctatggaggat
cacccccacactgcccctgataaactacataacccaagtccttatatcagggtctgcttt
tgggggaaccctaatgaagacaccatggaggaggctgaggcaatggtgcagaggagagat
aacgaactgaacctggagagggactgtgagggcggagagaactga

>gi568815587f:76695617_76896675|GENSCAN_predicted_peptide_3|103_aa
MAWQRALLQLAGPSFHAVLLVAGRSDGRKGSLDTDVAHVWLPPNALILEGPDSFQMGAVT
ASFPESSRQAARPSGGAAQGLSLPLGEVPLTSFRVLLCVVRAL

>gi568815587f:76695617_76896675|GENSCAN_predicted_CDS_3|312_bp
atggcctggcagagggccctgctgcagctggcaggaccaagtttccatgcggtgctgctg
gtggctgggaggagtgatggccgcaagggttccttggacacagacgtggctcatgtctgg
cttcctcccaacgccctcatccttgagggacctgacagcttccagatgggtgcggtcact
gcctccttcccggagtccagcaggcaggcagcccggcccagcggaggagctgcccaaggc
ctctccctgcccctgggggaagtccctctcacgtcctttcgcgtccttctctgcgtcgtc
cgggccctctga

>gi568815587f:76695617_76896675|GENSCAN_predicted_peptide_4|321_aa
MPPKALEGKQVVQPWLQPGHQGPKPSTGHSPSMCACVQISSPYQDNGHTGLEPILLTSSS
LNYLLPEPLWLFEKRDLPTDHPQLQYGCSGLGGSGERRPRGEVLRETKATPRGRVGLCAD
TCISRLWVFVDLRSGTRRCRCLCVSVALRGCFAPVSTFAGLRLAVGVGGRIACLRVRGRC
LPTCLGWVSAEGVRTESPTPPHPPAPPPSPRAHDLAPNFSPGSSGNSPSPPHRYPAGGAD
SPPPPPPEVPGQQGAATGLVVGGSAAPGDWLGRGWAERADSPSDPGWEEEASGGSQAPAL
GWGEEGTGRLALPSPRFLERV

>gi568815587f:76695617_76896675|GENSCAN_predicted_CDS_4|966_bp
atgcctcccaaagccttggagggcaagcaagtggtccagccatggctgcagcctggacac
caaggacctaagcccagcacagggcattctccctctatgtgtgcctgcgtccagatttct
tctccttatcaggacaatggtcatactggattagagcccatcctgctgacctcatcttca
ctaaattaccttctccctgagcctctatggctcttcgagaaaagagatcttccaacagat
cacccgcagcttcagtatggctgttcagggctgggcgggagtggggaacgccgacccagg
ggagaggtcttgcgggagacaaaggctacgcccaggggccgtgtgggtctttgtgcggac
acgtgcatctcacgtttgtgggtctttgtggatttgcggtcggggacgcgccgctgtcgg
tgcctttgtgtgtccgtggctttgcgcggctgctttgcgcccgtgtccacgtttgcgggc
ctgcgcttggctgtgggcgtcggtgggcgcatcgcgtgtctccgcgtacgtggtcggtgc
ctgccgacgtgtctgggctgggtgtccgcagagggtgtgcgcaccgagagcccaacgccg
ccccacccccccgctcccccgccgtcccctcgggcccacgacttggctccaaacttttct
cccggttcctcgggcaacagccccagtcctccccaccgctacccggccgggggcgcagac
agccctccccctccccctcctgaggtcccaggacagcaaggggctgcgacgggattggtg
gtcgggggatcggcagcgcctggggactggctgggcaggggctgggcggagcgtgcggac
tccccctcagaccccggctgggaggaagaggcgagcggtgggtcccaggctccggccctg
ggctggggagaggaagggacagggcggctggcgttgccctccccgcgcttcttggaaaga
gtgtga

>gi568815587f:76695617_76896675|GENSCAN_predicted_peptide_5|353_aa
MPWPLLLLLAVSGAQTTRPCFPGCQCEVETFGLFDSFSLTRVDCSGLGPHIMPVPIPLDT
AHLDLSSNRLEMVNESVLAGPGYTTLAGLDLSHNLLTSISPTAFSRLRYLESLDLSHNGL
TALPAESFTSSPLSDVNLSHNQLREVSVSAFTTHSQGRALHVDLSHNLIHRLVPHPTRAG
LPAPTIQSLNLAWNRLHAVPNLRDLPLRYLSLDGNPLAVIGPGAFAGLGGLTHLSLASLQ
RLPELAPSGFRELPGLQVLDLSGNPKLNWAGAEVFSGLSSLQELDLSGTNLVPLPEALLL
HLPALQSVSVGQDVRCRRLVREGTYPRRPGSSPKVALHCVDTRDSAARGPTIL

>gi568815587f:76695617_76896675|GENSCAN_predicted_CDS_5|1062_bp
atgccgtggcccctgctgctgctgctggccgtgagtggggcccagacaacccggccatgc
ttccccgggtgccaatgcgaggtggagaccttcggccttttcgacagcttcagcctgact
cgggtggattgtagcggcctgggcccccacatcatgccggtgcccatccctctggacaca
gcccacttggacctgtcctccaaccggctggagatggtgaatgagtcggtgttggcgggg
ccgggctacacgacgttggctggcctggatctcagccacaacctgctcaccagcatctca
cccactgccttctcccgccttcgctacctggagtcgcttgacctcagccacaatggcctg
acagccctgccagccgagagcttcaccagctcacccctgagcgacgtgaaccttagccac
aaccagctccgggaggtctcagtgtctgccttcacgacgcacagtcagggccgggcacta
cacgtggacctctcccacaacctcattcaccgcctcgtgccccaccccacgagggccggc
ctgcctgcgcccaccattcagagcctgaacctggcctggaaccggctccatgccgtgccc
aacctccgagacttgcccctgcgctacctgagcctggatgggaaccctctagctgtcatt
ggtccgggtgccttcgcggggctgggaggccttacacacctgtctctggccagcctgcag
aggctccctgagctggcgcccagtggcttccgtgagctaccgggcctgcaggtcctggac
ctgtcgggcaaccccaagcttaactgggcaggagctgaggtgttttcaggcctgagctcc
ctgcaggagctggacctttcgggcaccaacctggtgcccctgcctgaggcgctgctcctc
cacctcccggcactgcagagcgtcagcgtgggccaggatgtgcggtgccggcgcctggtg
cgggagggcacctacccccggaggcctggctccagccccaaggtggccctgcactgcgta
gacacccgggattctgctgccaggggccccaccatcttgtga

>gi568815587f:76695617_76896675|GENSCAN_predicted_peptide_6|429_aa
MKDLFKENYKPLLNEIKEDTNKWKNIPSSWRGRINIMKMAILPKVIYRFNAIPIKLPTTF
FTELEKITLKFIWNQKRARITKSILSQKNKAGGIMLPDFKLYYKATVTKTAWYWYQNRDI
DQWNRTEPSEIMPHIYSYLIFDKPEKNKQWGTDSLFNKWCWENWLAICTKLKLDPFLTPY
TKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMATKAKIDKWDLIKL
KSFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRIYNELKQIYKKKTTPSKSRRIFLS
TEQNEKSPMSTSFYTDTATIRFLNLFPTFPPFPLHRTAIVIMARSQRAVGSWNRKHGKAW
PYQIIRMLEVEAFILGQKESSQWAIASGGKTCNWHQCPPKVESSFIGGEDVKVESLVAAE
NRCSSDFCV

>gi568815587f:76695617_76896675|GENSCAN_predicted_CDS_6|1290_bp
atgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggacaca
aacaaatggaagaacattccaagctcatggagaggaagaatcaatatcatgaaaatggcc
atactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaacgactttc
ttcacagaattggaaaaaattactttaaagttcatatggaaccaaaaaagagcccgcatc
actaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctgacttcaaa
ctatactacaaggctacagtaaccaaaacagcctggtactggtaccaaaacagagatata
gatcaatggaacagaacagagccctcagaaataatgccacatatctacagctatctgatc
tttgataaacccgagaaaaacaagcaatggggaacggattccctatttaataaatggtgc
tgggaaaactggctagccatatgtacaaagctgaaactggatcccttccttacaccttac
acaaaaattaattcaagatggattaaagacttaaacgttagacctaaaaccataaaaacc
ctagaagaaaacctaggcattaccattcaggacataggcatgggcaaggacttcatgtct
aaaacaccaaaagcaatggcaacaaaagccaaaatcgacaaatgggatctaattaaatta
aagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacagaatgg
gagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatctacaatgaa
ctcaaacaaatttacaagaaaaaaacaaccccatcaaaaagcagaagaatttttcttagt
acagaacaaaatgaaaagtctcccatgtctacttctttctacacagacacggcaaccatc
cgatttctcaatcttttccccacctttcccccctttccactccacagaactgccattgtc
atcatggcccgttctcaacgagctgttggctcatggaacaggaaacatgggaaagcttgg
ccttatcaaattataaggatgctagaagtcgaggccttcatcctgggacaaaaggaaagc
tcacagtgggccatcgcctctggtgggaaaacttgcaactggcaccagtgcccacctaag
gttgaaagttcttttatcggaggggaagatgtgaaagtggagtccttggtggctgctgaa
aaccgctgctcatcagacttctgcgtctga

>gi568815587f:76695617_76896675|GENSCAN_predicted_peptide_7|123_aa
MAPAADREGYWGPTTSTLDWCEENYSVTWYIAEFLISKRRYYYPHFKDKEIEVLRNNLPK
ITVFAVTDKAAVNILVHLLVHRNVGAKVIAVFAITFNGKTLNYFCNDLIEILEFFGLMTP
LHI

>gi568815587f:76695617_76896675|GENSCAN_predicted_CDS_7|372_bp
atggctccggccgcggaccgagagggctactggggccccacgacctccacgctggactgg
tgcgaggagaactactccgtgacctggtacatcgccgagttcttaatttcaaaaagaagg
tattattatccccatttcaaagataaggagattgaagttctaagaaataatttacctaag
attacagtttttgctgttacagataaggctgctgtgaacattcttgtgcatcttctggta
cacagaaatgttggtgcaaaagtaattgcggtttttgccattactttcaatggcaaaacc
ctcaattacttttgcaatgacctaatagaaattctcgaattttttggtctcatgacccct
ttacacatttaa

>gi568815587f:76695617_76896675|GENSCAN_predicted_peptide_8|91_aa
XKYGKIQDNTKRDYLEAPECKKQTNSNAGYTFQEGELYKVKKRQGCSLMMSFLLNIALEV
LASAIAQAKEIKCIKIGKKEVKLLFEDYMIK

>gi568815587f:76695617_76896675|GENSCAN_predicted_CDS_8|276_bp
naaaaatacggtaaaatccaggataatacaaaaagagattatctggaagcaccggagtgt
aagaagcagacaaattctaatgcagggtacacatttcaagaaggggagctgtacaaagtc
aagaaaaggcaaggatgttcactcatgatgtcatttctactcaacattgccctggaggtc
ctagccagtgcaatagcacaagcaaaagaaataaaatgcataaagattggaaagaaagaa
gtgaaacttctatttgaagattacatgattaaatga