GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:33:26 Sequence gi568815590f:81180596_81384564 : 203969 bp : 38.03% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 1048 1043 6 1.05 1.02 Term - 22402 21615 788 2 2 11 41 304 0.095 10.50 1.01 Init - 23864 23528 337 2 1 74 -1 357 0.191 22.89 1.00 Prom - 25712 25673 40 -3.05 2.02 PlyA - 26610 26605 6 1.05 2.01 Sngl - 40961 40587 375 2 0 72 47 134 0.626 3.59 2.00 Prom - 46235 46196 40 -3.35 3.03 PlyA - 46486 46481 6 1.05 3.02 Term - 47515 47340 176 2 2 70 48 80 0.479 -0.86 3.01 Init - 51367 51238 130 1 1 59 110 104 0.963 10.06 3.00 Prom - 54283 54244 40 -2.55 4.00 Prom + 54529 54568 40 -5.65 4.01 Init + 70241 70417 177 1 0 57 42 165 0.435 8.11 4.02 Term + 74419 74637 219 0 0 56 39 350 0.345 22.96 4.03 PlyA + 74751 74756 6 1.05 5.00 Prom + 82998 83037 40 -3.55 5.01 Init + 85730 85837 108 1 0 79 111 -21 0.126 -0.43 5.02 Intr + 95428 95563 136 0 1 49 82 104 0.302 5.12 5.03 Intr + 99888 100079 192 1 0 32 66 234 0.706 14.14 5.04 Intr + 102771 102943 173 1 2 74 82 153 0.910 12.04 5.05 Intr + 103278 103379 102 2 0 68 103 74 0.983 6.35 5.06 Term + 120770 120898 129 1 0 67 48 130 0.128 4.10 5.07 PlyA + 121373 121378 6 1.05 6.00 Prom + 124488 124527 40 -7.75 6.01 Init + 125593 125797 205 0 1 61 74 101 0.329 5.06 6.02 Intr + 125823 125909 87 1 0 94 27 106 0.219 4.12 6.03 Intr + 129679 129830 152 2 2 42 99 90 0.159 4.46 6.04 Term + 136328 136414 87 1 0 95 48 112 0.865 4.58 6.05 PlyA + 136616 136621 6 1.05 7.04 PlyA - 136792 136787 6 1.05 7.03 Term - 138379 138170 210 1 0 53 42 104 0.032 -1.39 7.02 Intr - 147763 146843 921 1 0 -34 86 389 0.412 17.00 7.01 Init - 148875 147838 1038 0 0 46 41 433 0.544 28.43 7.00 Prom - 148968 148929 40 -6.15 8.02 PlyA - 149137 149132 6 1.05 8.01 Sngl - 150109 149708 402 1 0 49 44 300 0.977 17.72 8.00 Prom - 153217 153178 40 -5.35 9.00 Prom + 154010 154049 40 -6.95 9.01 Init + 154092 154194 103 2 1 44 88 33 0.482 -0.65 9.02 Intr + 156621 156745 125 1 2 57 44 142 0.512 6.08 9.03 Intr + 157478 157584 107 1 2 89 84 116 0.948 9.39 9.04 Term + 170489 170588 100 2 1 90 49 89 0.407 1.82 9.05 PlyA + 173174 173179 6 1.05 10.04 PlyA - 174011 174006 6 1.05 10.03 Term - 180682 180536 147 1 0 56 32 134 0.669 1.52 10.02 Intr - 182831 182678 154 1 1 56 77 79 0.464 2.75 10.01 Init - 185238 185072 167 0 2 86 42 104 0.721 4.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 103919 103972 54 1 0 115 42 63 0.968 0.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590f:81180596_81384564|GENSCAN_predicted_peptide_1|374_aa MEQSWTENDFEELREEGFRRLVITNFSKLKEDVRTHRKEAKNLGKRLDEWLTRINSVEKS LNDLLELKTMAQELCDACTSFSSRFNQVEERVSVIEDQMNEMKQEEKFRGKKNRSMRQKV NKDIQELKSALHQADLIDIYRTLHPKSTEYIFFSAPHHTDSTTDHIVGSKALLSKCKRTE ILTNCLSDHSAIKLELRIKKLIQNHSTTWKLNNLLLNDYGVHNEIKTEIKMFFETNENKD TTYQNLWDTFKAVCRGKFIALNARKRKQERSKIDTLTSQLKDLEKQEQTLSKASRRQEIT KIRAELKEIETQKTLQKINESRSWSFENINKIDRPLIRLIKKEKEKNQIDTIKNGKGDIT TNPTEMQTTIREYY >gi568815590f:81180596_81384564|GENSCAN_predicted_CDS_1|1125_bp atggaacaaagctggacagagaatgactttgaggagttgagagaagaaggcttcagacga ttggtaataacaaatttctccaagctaaaggaagatgttcgaacccatcgcaaagaagct aaaaaccttggaaaaagactagacgaatggctaactagaataaacagtgtagagaaatcc ttaaatgacctgttggagctgaaaaccatggcacaagaactatgtgacgcatgcacaagc ttcagtagccgatttaatcaagtggaagaaagggtatcagtgatcgaagatcagatgaat gaaatgaagcaagaagagaagtttagaggaaaaaagaacagatcaatgagacagaaagtt aacaaggatatccaggaattgaagtcagctctgcaccaagcagacctaatagacatctac agaactctccaccccaaatcaacagaatatatattcttctcagcaccacatcacactgat tccacaactgaccacatagttggaagtaaagcactcctcagcaaatgtaaaagaacagaa attttaacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaa ctcattcaaaaccactcaactacatggaaactgaacaacctgctcctgaatgactacggg gtacataacgaaataaagacagagataaagatgttctttgaaaccaatgagaacaaagac acaacataccagaatctctgggacacatttaaagcagtgtgtagagggaaatttatagca ctaaatgcccgcaaaagaaagcaggaaagatcgaaaattgacaccctaacatcacaatta aaagatctagagaagcaagagcaaacactttcaaaagctagcagaaggcaagaaataact aagatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaa tccaggagctggtcttttgaaaatatcaacaaaattgatagaccactaataagactaata aagaaggaaaaagagaagaatcaaatagacacaataaaaaatggtaaaggggatatcacc accaatcccacagaaatgcaaactaccatcagagaatactattaa >gi568815590f:81180596_81384564|GENSCAN_predicted_peptide_2|124_aa MVLEVLARAIRPEKETKVIQISKKEVKLSLIASDMIIYLENPKDSSKILLDLINEFSKVS GYKINVHKSVALLSTNRDQAENQTKNSIPFTTAAKNKILRNIPNQGGERLLQEKLQNTAE INHR >gi568815590f:81180596_81384564|GENSCAN_predicted_CDS_2|375_bp atggtactggaagtcctagccagagcaatcaggccagagaaggaaacaaaggtcatccaa ataagtaaaaaagaagtcaaactgtcactgatcgccagcgatatgatcatatacctagaa aaccctaaagactcatccaaaatcctcctagatctgatcaatgaattcagtaaagtttca ggatacaaaatcaatgtacacaaatcagtagcactgctatccaccaacagggaccaagct gagaatcaaaccaagaactcaattccctttacaacagctgcaaaaaataaaatacttagg aatatacctaaccaaggaggtgaaagacttctacaagaaaaactacaaaacactgctgaa ataaatcatagatga >gi568815590f:81180596_81384564|GENSCAN_predicted_peptide_3|101_aa MHPDAKEAGKQWGQRFYVELTLKNRKEAAETELGAGACSRGGAGLSRTWVLSLLSFGLAQ DKEQDLDSSTKFSWAKVYDPPNKTRYPIFKDENLVLLPSGQ >gi568815590f:81180596_81384564|GENSCAN_predicted_CDS_3|306_bp atgcatcctgacgccaaggaagcagggaagcagtggggtcaaaggttctatgttgaacta actctgaaaaacagaaaggaagctgctgagacagagcttggggctggagcgtgctcacga ggtggagcaggtctgagcaggacatgggttcttagcttgttatcctttgggctggctcag gacaaggaacaggatctggatagtagcaccaaattctcttgggcaaaggtttatgatcca cctaataagacaaggtatccaatattcaaagatgaaaatttggtcctgctccccagtgga cagtag >gi568815590f:81180596_81384564|GENSCAN_predicted_peptide_4|131_aa MSASEPVPDDKEEDIEEAVPENKWTLDNPVEGFQLFKIAFDFSYSMDTSMIWSLKLKETL LAYPNPIDPLNGEDAAMYLQRPEEYKQKIKEYIQKYTTEQALKEQEEGTRDSSSESSISD FSKDEAQDMEL >gi568815590f:81180596_81384564|GENSCAN_predicted_CDS_4|396_bp atgagtgcttctgagccagtgccagatgataaggaagaagacatagaagaagcagtgcca gaaaacaaatggacattagacaatccagtagaaggtttccagttatttaagattgctttt gacttctcttatagcatggacacttctatgatatggtcactgaaactaaaggaaacattg ttggcctatcctaaccccatagatcctctcaatggtgaagatgcagccatgtacctccaa agaccagaagaatacaagcagaaaattaaagagtacatccagaaatacaccacggagcag gcgctgaaagagcaggaagagggtaccagggacagctcatcggagagctctatatctgac ttttccaaagatgaggcccaggatatggagttgtag >gi568815590f:81180596_81384564|GENSCAN_predicted_peptide_5|279_aa MNIFVHDSWCKYSTLIDNTKEISKVVYQFTIHQPGLRSAQNQLHAGFTALCMTLRSYRVA LPTAKVGGPEIGSRKHLQSDLAPPSAGRRYKAAAGAGCLTARCHADADPSLHASPPAPTM ATVQQLEGRWRLVDSKGFDEYMKELGVGIALRKMGAMAKPDCIITCDGKNLTIKTESTLK TTQFSCTLGEKFEETTADGRKTQTVCNFTDGALVQHQEWDGKESTITRKLKDGKLVVRIG GMLITAAEVLEHTSDTNGLSNNGIFSTGGKYMVELVDLQ >gi568815590f:81180596_81384564|GENSCAN_predicted_CDS_5|840_bp atgaacatttttgtacatgactcttggtgtaaatattcaactttaatagataataccaaa gagatttccaaagtggtttatcaatttactatccaccagccaggcttgagaagtgcacaa aatcagctgcatgctggttttacagctctgtgcatgactctgaggagttatagggtagct cttcccactgccaaagtaggaggaccagagataggatcaagaaagcatcttcagtcggat ttagcgccgcccagcgcgggccgccgttataaagcagccgccggcgccgggtgcctcaca gcacgctgccacgccgacgcagacccctctctgcacgccagcccgcccgcacccaccatg gccacagttcagcagctggaaggaagatggcgcctggtggacagcaaaggctttgatgaa tacatgaaggagctaggagtgggaatagctttgcgaaaaatgggcgcaatggccaagcca gattgtatcatcacttgtgatggtaaaaacctcaccataaaaactgagagcactttgaaa acaacacagttttcttgtaccctgggagagaagtttgaagaaaccacagctgatggcaga aaaactcagactgtctgcaactttacagatggtgcattggttcagcatcaggagtgggat gggaaggaaagcacaataacaagaaaattgaaagatgggaaattagtggtgcgtatagga ggaatgctgataacagctgcagaagttctggagcatacttcagatacgaatggtctaagc aataatggaatatttagtactggagggaagtacatggtagagctggtagacctgcaatga >gi568815590f:81180596_81384564|GENSCAN_predicted_peptide_6|176_aa MDRLRWQTGGRTSLQLLLRWTEQRVETHIVNFCSKNYCRNIPGKLRESTDPLKVLDHYCR LPEMLLKNWRLVVWGKFSALVTGCLEIDLVLLQGHSGILEVLARAIRQEKAMKGIQIGKE EVKLSLFADDMIVNLENPKASFKKLLKQPNPCVEIRTRTLQNVIISGDRAFKEVIK >gi568815590f:81180596_81384564|GENSCAN_predicted_CDS_6|531_bp atggacagattaagatggcagacaggaggcaggactagcttgcagctgttgctcagatgg acagagcagcgtgtggagactcacattgtgaacttttgctccaagaactactgcaggaac ataccaggaaagctgagagaatccacagaccctttgaaggtactggatcactactgcagg ctccctgagatgctgctgaaaaactggaggctcgtggtctggggcaagttctcagccctg gtcactggctgcttggaaatagacctggtgctgttgcagggccacagtgggatactggaa gtcctagccagagcaatcagacaagaaaaagcaatgaagggcattcaaattggtaaagag gaagtcaaactttcactgtttgctgatgatatgatcgtaaacttagaaaaccctaaagct tcattcaaaaagctcctaaaacagccaaacccatgtgttgaaatccgaacccgcacactt cagaatgtgattatctctggagatagggccttcaaagaggtgattaagtga >gi568815590f:81180596_81384564|GENSCAN_predicted_peptide_7|722_aa MEDFNTPLSTLDRSTRQKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNCSTTWKLNNLLLNDYWV HNEMKEEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK ELEKQEQTHSKVSRRQEITKIRAELKEIETQKTLQKINESKSWFFERINKIDRPLARLIK KKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYVNKLENLEEMDKFLDTYTLPRLN QEEVESLNRPITGAEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELHHPDTKAGQRHTKK ENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHIN RTKDKNHMIISIDAEKAFDKVQQPFMLKTLNKLGIDGMYLKIIRAIYDKPTANIILNGQK LEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVY LENPIISAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIK YLGIQLTRDVKDLFKENYKPPLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKFYVSDLK PSIRKDYFKKLLGAKSSKMSQTSGSVQQHFSNINVLTNHLEELAKMQIEFSRSGAESEFL PF >gi568815590f:81180596_81384564|GENSCAN_predicted_CDS_7|2169_bp atggaagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac aaggatacccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc actcaaaactgctcaactacatggaaactgaacaacttgctcctgaatgactactgggta cataatgaaatgaaggaagaaataaagatgttctttgaaaccaacgagaacaaagacaca acataccagaatctctgggacacattcaaagcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagttagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aagagctggttttttgaaaggatcaacaaaattgatagaccgctagcaagactaataaag aaaaaaagagagaagaatcaaatagacgcaataaaaaatgataaaggggatatcaccacc gatcccacagaaatacaaactaccatcagagaatactacaaacacctctatgtaaataaa ctagaaaatctagaagaaatggataaattccttgacacatacaccctcccaagactaaac caggaagaagttgaatctctgaatagaccaataacaggagctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagctgaattctaccag aggtacaaggaggaactgcatcatcctgataccaaagccgggcagagacacacaaaaaaa gagaattttagaccaatatccttgatgaacattgatgcaaaaatcctcaataaaatactg gcaaatcgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatc cctgggatgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcatataaac agaaccaaagacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaa gtacaacaacccttcatgctaaaaactctcaataaattaggtattgatgggatgtatctc aaaataataagagctatctatgacaaacccacagccaatatcatactgaatgggcaaaaa ctggaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactccta ttcaacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaataaagggt attcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgtatat ctagaaaaccccatcatctcagcccaaaatctccttaagctgataagcaacttcagcaaa gtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaacaacaga caaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagagaataaaa tacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactacaaacca ccgctcaaggaaataaaggaggatacaaacaaatggaagaacattccatgctcatgggta ggaagaatcaatatcgtgaaaatggccatactgcccaagttctatgtttcagatttgaaa ccctctataaggaaagactatttcaagaaacttttaggtgccaaatcaagtaaaatgtca cagacttcaggttcagtacagcaacacttctcaaacatcaatgtgctcacaaatcacctg gaggaacttgctaaaatgcagatcgaattcagtaggtctggggcggaatctgagtttctg cctttttaa >gi568815590f:81180596_81384564|GENSCAN_predicted_peptide_8|133_aa MELKTKAQELCEECRSLRSRHDQLEERVSAMEDEMNEMKREGKFREKRIKRNEQSLQEIW DYVKRPNLRLIGVPESDGENGTKLENTAGYYPGELPQSSKAGQRSDSGNTENATKILLEK SNSKTHNCQIHQS >gi568815590f:81180596_81384564|GENSCAN_predicted_CDS_8|402_bp atggagctgaaaaccaaggctcaagaactatgtgaagaatgcagaagcctcaggagccga cacgatcaactggaagaaagggtatcagcgatggaagatgaaatgaatgaaatgaagcga gaagggaagtttagagaaaaaagaataaaaagaaacgagcaaagcctccaagaaatatgg gactatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgatggggagaat ggaaccaagttggaaaacactgcaggatattatccaggagaacttccccaatctagcaag gcaggccaacgttcagattcaggaaatacagagaacgccacaaagatactcctcgagaag agcaactccaagacacataattgtcagattcaccaaagttga >gi568815590f:81180596_81384564|GENSCAN_predicted_peptide_9|144_aa MAYHHYLQYHHYPIAFASCLTVTKSFFTGRTSLRVLEPSVGGRVEEEEAEEGRRKGRTKS PVNVPSTHRTPPRNRQESAEVFTAACLPAVSDSRSGVMMTKLDGEWKILDTRMTFCLKSS SRESSLCTPDQRAKQRAAVTQPGD >gi568815590f:81180596_81384564|GENSCAN_predicted_CDS_9|435_bp atggcctatcatcactatcttcagtatcatcattatcccattgcatttgcatcatgctta acagtaacaaagagctttttcactggtcgcacaagtctcagagttctggagccgtccgtg ggaggaagagtggaggaagaggaggcggaagaaggaaggaggaagggccgtaccaagtca ccggtgaacgttcccagcacgcacaggactcctcccagaaacagacaggaatctgctgag gtgttcacagctgcctgcctgcctgccgtgtcagatagtagaagtggtgtgatgatgacc aagttggatggtgaatggaagattctcgacaccagaatgaccttctgcctgaaaagttca tccagagagtcatcactttgtacccctgatcagcgtgctaagcaaagagcagctgtgact cagcctggtgactga >gi568815590f:81180596_81384564|GENSCAN_predicted_peptide_10|155_aa MDAAGGHYPKRINARTENQILNVLTYKWELNIGYTWTQRRKQQTLVLARGWRVEGDTESI PLSLTQLLSKLELLTPLADNSLSPGLLIDNIAPLVRVPRCMQQKLTVSVYHLMTERPFQR RGFTRKSFAVLKKPLDFPTYSCPNEDKRFQKYDLI >gi568815590f:81180596_81384564|GENSCAN_predicted_CDS_10|468_bp atggatgcagctggaggccattatcctaaacgaattaatgcaagaacagaaaaccaaata ctgaatgtcctcacttataagtgggagctaaacattgggtacacatggacacaaagaagg aaacagcagacactggtgcttgcaagagggtggagggtggaaggagatactgagagcatt cctctatcacttactcagttattgtcaaagcttgagttactgacacccctggctgacaac agtttaagtcctggcttactcattgataacattgctcctctagtcagagttcctagatgc atgcaacagaaactgacagtgagtgtctatcacttaatgacagaaaggccctttcagagg agaggatttacccgaaagagctttgctgttctgaagaagcccctggatttccccacatat tcctgtccaaacgaggacaagagattccaaaagtatgatttaatataa