GENSCAN 1.0    Date run: 5-Nov-116    Time: 23:30:54

Sequence gi568815575f:119368516_119571055 : 202540 bp : 45.04% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  11368  11544  177  0  0   32  115   165 0.493  13.72
 1.02 Term +  13238  13270   33  1  0  125   35    22 0.899  -1.81
 1.03 PlyA +  16269  16274    6                               1.05

 2.00 Prom +  23908  23947   40                              -1.96
 2.01 Init +  30889  31163  275  0  2   71   71   303 0.795  21.34
 2.02 Intr +  37945  38186  242  1  2   96   82    46 0.306   1.99
 2.03 Intr +  41675  41847  173  0  2  108   94    43 0.669   6.56
 2.04 Intr +  45738  45762   25  2  1   88   91   -31 0.055  -5.20
 2.05 Intr +  50744  50829   86  0  2   92  121    36 0.421   6.84
 2.06 Term +  57667  57747   81  0  0  105   49    44 0.540  -0.11
 2.07 PlyA +  58143  58148    6                               1.05

 3.02 PlyA -  60044  60039    6                               1.05
 3.01 Sngl -  70569  68578 1992  0  0   44   49   722 0.805  57.97
 3.00 Prom -  70662  70623   40                              -8.36

 4.02 PlyA -  70831  70826    6                               1.05
 4.01 Sngl -  72075  71059 1017  0  0   88   43   564 0.992  48.94
 4.00 Prom -  79113  79074   40                              -5.06

 5.00 Prom +  82661  82700   40                              -6.26
 5.01 Init +  82728  82779   52  2  1   87  100    19 0.568   4.34
 5.02 Intr +  83477  83628  152  0  2   44  100    80 0.316   4.68
 5.03 Term +  84350  84550  201  1  0   98   38    97 0.991   3.09
 5.04 PlyA +  86813  86818    6                               1.05

 6.00 Prom +  99896  99935   40                              -2.46
 6.01 Init + 100001 100111  111  1  0   75   98   267 0.587  26.61
 6.02 Intr + 101146 101632  487  0  1   85  110   443 0.974  38.79
 6.03 Intr + 101858 101998  141  0  0   80  100   144 0.999  15.12
 6.04 Term + 102386 102543  158  0  2  130   38   124 0.990   9.70
 6.05 PlyA + 102784 102789    6                               1.05

 7.03 PlyA - 103594 103589    6                               1.05
 7.02 Term - 121795 121686  110  2  2   85   49    93 0.008   3.77
 7.01 Init - 142369 142288   82  1  1   83   64    77 0.463   5.93
 7.00 Prom - 143334 143295   40                              -3.56

 8.08 PlyA - 143409 143404    6                               1.05
 8.07 Term - 171274 171212   63  1  0  106   41    66 0.767   1.59
 8.06 Intr - 172905 172813   93  0  0   95  105   148 0.998  17.36
 8.05 Intr - 174079 173990   90  1  0  109   87   180 0.996  20.19
 8.04 Intr - 175976 175838  139  1  1   68   86    99 0.998   8.17
 8.03 Intr - 176989 176948   42  0  0   90  106    46 0.939   3.96
 8.02 Intr - 191870 191753  118  0  1  100   94   129 0.874  14.22
 8.01 Init - 196840 196717  124  1  1   93   92   109 0.995  12.13

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:119368516_119571055|GENSCAN_predicted_peptide_1|69_aa
CTIGDIVLLKALPVPRTKHVKHELAEIVFKVGKLVDPVTGKPCAGTTYLESPLSSETTQV
LATINILHS

>gi568815575f:119368516_119571055|GENSCAN_predicted_CDS_1|210_bp
tgcacaattggggatatcgtgcttctcaaagctttacctgttccacgaacaaagcatgtg
aaacatgaactggctgagattgttttcaaagttggaaaactcgtagatcctgtgacagga
aaaccttgtgcaggaactacctatctggagagtccattgagttcggaaaccacccaggta
ttagccacaatcaacatcctgcattcctaa

>gi568815575f:119368516_119571055|GENSCAN_predicted_peptide_2|293_aa
MATWRRDGRLTGGQRLLCAGLAGTLSLSLTAPLELATVLAQVGVVRGHARGPWATGHRVW
RAEGLRALWKGNAVACLRLFPCSAVQLAAYRKFVVLFTDDLGHISQWSSIMAGSLAGMVS
TIVTYPTDLIKTRLIMQNILEPSYRGLLHAFSTIYQQEGFLALYRGVSLTVVGALPFSAG
SLLVYMNLEKIWNGPRDQFSLPQNFANVCLAAAVTQTLSFPFETVKRKMQNIQLFLFIGV
SISSNIKPSVSLLLYTPVDFDPITAHKSCCLFAYDGCVPLEAVMGRLAQWVTV

>gi568815575f:119368516_119571055|GENSCAN_predicted_CDS_2|882_bp
atggctacgtggaggcgggacggccgactgacaggcggccaaaggctgctgtgcgctggg
ctggcggggacgctcagcctcagcctcaccgcgcccctggagctcgccaccgtgctggcc
caggttggcgtcgtgcgaggccacgcccggggaccgtgggccacagggcaccgggtgtgg
cgggcagaggggctccgggccctgtggaaggggaacgcggtggcgtgcctgcgcctcttc
ccctgcagcgccgtgcagctcgccgcctaccgcaaatttgttgtgctgttcacagatgac
ctgggccacatttcccagtggagctccatcatggctgggagtctcgcaggcatggtttcc
accattgtaacatatcctacagacctcatcaaaacccggttgatcatgcagaacatactg
gaaccatcgtacagggggctcctccatgctttttctactatttaccaacaggaagggttc
cttgccctttatcgaggggtttccctcactgttgtaggtgctctcccgttctctgctggc
tcccttcttgtttacatgaacctggagaaaatctggaacggaccccgagatcagttctct
ctcccacagaactttgctaatgtctgtctggctgctgcagtgacccagaccctctccttt
ccctttgagaccgtgaagagaaagatgcagaatattcaactttttctattcattggtgtc
tccatttcatcaaatataaaaccaagtgtgagcctgttgctttataccccagtggacttt
gaccccatcacagctcacaagagctgttgcttatttgcctacgacggctgtgttcctctt
gaggctgtaatgggcagactagctcagtgggtaacagtgtga

>gi568815575f:119368516_119571055|GENSCAN_predicted_peptide_3|663_aa
MGDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS
KIDHIVGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQSRSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAYKRKQERSKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK
KKREKNQIDTIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDTFLDTYTLPRLN
QEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYMEELVPFLLKLFQSIEKE
GILPNSFYEASIMLIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHD
QVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGI
DGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE
KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL
YTNNRQTESQIMGELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEETNGRTFH
AHG

>gi568815575f:119368516_119571055|GENSCAN_predicted_CDS_3|1992_bp
atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac
aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga
actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc
aaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc
actcaaagccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacacc
acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta
aatgcctacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa
atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatcc
aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag
aaaaaaagagagaagaatcaaatagacacaataaaaaatgataaaggggatatcaccacc
gatcccacagaaatacaaactaccatcagagaatactacaaacacctctacgcaaataaa
ctagaaaatctagaagaaatggatacattcctcgacacatacactctcccaagactaaac
caggaagaagttgaatctctgaatagaccaataacaggctctgaaattgtggcaataatc
aatagtttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag
aggtacatggaggaactggtaccattccttctgaaactattccaatcaatagaaaaagag
ggaatcctccctaactcattttatgaggccagcatcatgctgataccaaagccgggcaga
gacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc
ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat
caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta
atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa
aaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt
gatgggacgtatttcaaaataataagagctatctatgacaaacccacagccaatatcata
ctgaatgggcaaaaactggaagcattccctttgaaaaccggcacaagacagggatgccct
ctctcaccgctcctattcaacatagtgttggaagttctggccagggcaatcaggcaggag
aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagac
gacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctgata
agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta
tacaccaacaacagacaaacagagagccaaatcatgggtgaactcccattcacaattgct
tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag
gagaactacaaaccactgctcaaggaaataaaagaggagacaaatggaagaacattccat
gctcatgggtag

>gi568815575f:119368516_119571055|GENSCAN_predicted_peptide_4|338_aa
MGKKQNRKTGNSKTQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ
TKGKEVENFEKNLEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSP
MEDEMNEMKREGKFREKRIKRNEQSLQEMWDYVKRPNLRLIGVPESDVENGTKLENTLQD
IIQENFPNLARQANVQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRV
TLKGKPIRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFIDK
QMLRDFVTTRPALKELLKEALNMERNNWYQPLQNHAKM

>gi568815575f:119368516_119571055|GENSCAN_predicted_CDS_4|1017_bp
atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcgcctctcctcct
ccaaaggaacgcagttcctcaccagcaacagaacaaagctggatggagaatgattttgac
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta
cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcacca
atggaagatgaaatgaatgaaatgaagcgagaagggaagtttagagaaaaaagaataaaa
agaaatgagcaaagcctccaagaaatgtgggactatgtgaaaagaccaaatctacgtctg
attggtgtacctgaaagtgatgtggagaatggaaccaagttggaaaacactctgcaggat
attatccaggagaacttccccaatctagcaaggcaggccaacgttcagattcaggaaata
cagagaacgccacaaagatactcctcgagaagagcaactccaagacacataattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggtcgggtt
accctcaaaggaaagcccatcagactaacagcggatctctcggcagaaaccctacaagcc
agaagagagtgggggccaatattcaacattcttaaagaaaagaattttcaacccagaatt
tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttatagacaag
caaatgttgagagattttgtcaccaccaggcctgccctaaaagagctcctgaaggaagcg
ctaaacatggaaaggaacaactggtaccagccgctgcaaaatcatgccaaaatgtaa

>gi568815575f:119368516_119571055|GENSCAN_predicted_peptide_5|134_aa
MALKVVMMSTGWPKWYKASVSCQAQSPYLPHSGGVDVHFSGAVDCFRQIVKAQGVLGLWN
GLTANLLKIVPYFGIMFSTFEFCKRICLYQNGYILSPLSYKLTPGVDQSLQPQELRELKK
FFKTRKPKPKKPTL

>gi568815575f:119368516_119571055|GENSCAN_predicted_CDS_5|405_bp
atggctctgaaggtggtgatgatgtccacggggtggccaaaatggtataaagcttctgtt
tcctgtcaggctcagagcccctacctcccacacagtggaggagtagatgtccatttctca
ggagcagtggactgcttccggcagatagtgaaggcccagggggtcctggggctctggaat
ggattgacagccaatttactgaagatagttccatattttggaattatgtttagcaccttt
gagttctgcaagagaatctgtctttatcaaaatggttacattctgtctccactgagctat
aaattgaccccaggagtcgatcagagtttgcagccccaggaattacgagaattaaagaag
ttcttcaaaacgagaaaaccgaagcctaaaaaaccaactctataa

>gi568815575f:119368516_119571055|GENSCAN_predicted_peptide_6|298_aa
MTDAAVSFAKDFLAGGVAAAISKTAVAPIERVKLLLQVQHASKQITADKQYKGIIDCVVR
IPKEQGVLSFWRGNLANVIRYFPTQALNFAFKDKYKQIFLGGVDKRTQFWLYFAGNLASG
GAAGATSLCFVYPLDFARTRLAADVGKAGAEREFRGLGDCLVKIYKSDGIKGLYQGFNVS
VQGIIIYRAAYFGIYDTAKGMLPDPKNTHIVISWMIAQTVTAVAGLTSYPFDTVRRRMMM
QSGRKGTDIMYTGTLDCWRKIARDEGGKAFFKGAWSNVLRGMGGAFVLVLYDEIKKYT

>gi568815575f:119368516_119571055|GENSCAN_predicted_CDS_6|897_bp
atgacagatgccgctgtgtccttcgccaaggacttcctggcaggtggagtggccgcagcc
atctccaagacggcggtagcgcccatcgagcgggtcaagctgctgctgcaggtgcagcat
gccagcaagcagatcactgcagataagcaatacaaaggcattatagactgcgtggtccgt
attcccaaggagcagggagttctgtccttctggcgcggtaacctggccaatgtcatcaga
tacttccccacccaggctcttaacttcgccttcaaagataaatacaagcagatcttcctg
ggtggtgtggacaagagaacccagttttggctctactttgcagggaatctggcatcgggt
ggtgccgcaggggccacatccctgtgttttgtgtaccctcttgattttgcccgtacccgt
ctagcagctgatgtgggtaaagctggagctgaaagggaattccgaggcctcggtgactgc
ctggttaagatctacaaatctgatgggattaagggcctgtaccaaggctttaacgtgtct
gtgcagggtattatcatctaccgagccgcctacttcggtatctatgacactgcaaaggga
atgcttccggatcccaagaacactcacatcgtcatcagctggatgatcgcacagactgtc
actgctgttgccgggttgacttcctatccatttgacactgttcgccgccgcatgatgatg
cagtcagggcgcaaaggaactgacatcatgtacacaggcacgcttgactgctggcggaag
attgctcgtgatgaaggaggcaaagcttttttcaagggtgcatggtccaatgttctcaga
ggcatgggtggtgcttttgtgcttgtcttgtatgatgaaatcaagaagtacacataa

>gi568815575f:119368516_119571055|GENSCAN_predicted_peptide_7|63_aa
MDSSDSDKVIMEHEDPNVIFESFGKLSVGELSSCLYLAGLGLKPLSPITAASILADADPW
DQP

>gi568815575f:119368516_119571055|GENSCAN_predicted_CDS_7|192_bp
atggattcctctgactcagacaaggtcatcatggaacatgaagacccaaatgtcatcttt
gagtcctttggtaaactatcagtcggggagctgagcagctgcctgtacctggctgggctg
ggcctgaaaccactcagccccatcacagcagcctccatccttgccgatgcagacccatgg
gaccagccctga

>gi568815575f:119368516_119571055|GENSCAN_predicted_peptide_8|222_aa
MPKVVSRSVVCSDTRDREEYDDGEKPLHVYYCLCGQMVLVLDCQLEKLPMRPRDRSRVID
AAKHAHKFCNTEDEETMYLRRPEGIERQYRKKCAKCGLPLFYQSQPKNAPVTFIVDGAVV
KFGQGFGKTNIYTQKQEPPKKVMMTKRTKDMGKFSSVTVSTIDEEEEEIEAREVADSYAQ
NAKVIEKQLERKGMSKRRLQELAELEAKKAKMKGTLIDNQFK

>gi568815575f:119368516_119571055|GENSCAN_predicted_CDS_8|669_bp
atgccgaaagtagtgtctcggtcagtagtctgctctgacactcgggaccgggaggaatat
gacgacggcgagaagcccctccatgtttactactgtttgtgcggccagatggtcctagtg
ctggactgccagttagagaaattgcccatgaggccccgggaccggtcccgtgtgattgat
gctgccaaacatgcccataagttttgtaacacagaagatgaggagactatgtatctgcgg
agacctgaaggcattgaacgacagtacaggaagaaatgtgcaaagtgtggactgccgctc
ttctaccaatcccagccaaagaatgctcctgttaccttcattgtggatggagcagtagtc
aagtttggccagggctttgggaaaacgaacatatatactcagaaacaagagcctcctaag
aaggtgatgatgaccaaacggaccaaagacatgggcaagttcagttctgtcaccgtgtct
accattgatgaagaggaagaggagattgaggctagggaagttgctgactcatatgcacag
aatgccaaagtgattgaaaaacagctggagcgcaaaggcatgagcaagaggcgactgcaa
gagctggctgaattggaagccaagaaagcgaaaatgaaggggaccttgattgacaaccag
ttcaaataa