GENSCAN 1.0    Date run: 5-Nov-116    Time: 00:06:18

Sequence gi568815590f:124439187_124649889 : 210703 bp : 43.63% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   9105   9144   40                              -2.86
 1.01 Sngl +  11769  13088 1320  2  0  103   44   551 0.865  48.29
 1.02 PlyA +  13650  13655    6                               1.05

 2.00 Prom +  28469  28508   40                              -0.96
 2.01 Init +  35924  36104  181  1  1   64   87   347 0.473  31.44
 2.02 Intr +  46645  46830  186  2  0  143   27   121 0.989  11.16
 2.03 Term +  47731  48458  728  2  2   49   45   440 0.723  29.44
 2.04 PlyA +  48686  48691    6                               1.05

 3.05 PlyA -  49341  49336    6                               1.05
 3.04 Term -  61913  61904   10  1  1  107   41     7 0.368  -4.43
 3.03 Intr -  65161  65085   77  1  2  123   84    64 0.921   7.81
 3.02 Intr -  69502  69417   86  2  2   52   92    24 0.649  -1.26
 3.01 Init -  69831  69798   34  0  1   56   97    56 0.471   2.23
 3.00 Prom -  79736  79697   40                              -4.16

 4.00 Prom +  89951  89990   40                               0.74
 4.01 Init + 100001 100101  101  1  2   98   94   100 0.778  11.34
 4.02 Intr + 103901 104093  193  2  1   28  100   224 0.986  17.09
 4.03 Intr + 107814 107927  114  2  0  112   92    64 0.998   9.84
 4.04 Term + 110575 110706  132  0  0  137   43   100 0.976   8.49
 4.05 PlyA + 110771 110776    6                               1.05

 5.16 PlyA - 111121 111116    6                               1.05
 5.15 Term - 114500 113806  695  0  2   37   39   527 0.680  36.43
 5.14 Intr - 116718 116556  163  0  1   69  102   206 0.992  19.65
 5.13 Intr - 117219 117046  174  0  0   83   26   231 0.983  16.54
 5.12 Intr - 118663 118495  169  0  1   45   76    46 0.406  -0.95
 5.11 Intr - 119646 119390  257  0  2   95   39   179 0.660   9.94
 5.10 Intr - 123806 123596  211  1  1  116  110   266 0.970  30.52
 5.09 Intr - 126573 126476   98  0  2  113  109    11 0.994   4.51
 5.08 Intr - 127992 127885  108  0  0   93   96    49 0.989   6.68
 5.07 Intr - 129350 129193  158  0  2   94   87   165 0.996  16.73
 5.06 Intr - 145975 145901   75  2  0   68   88    80 0.928   5.49
 5.05 Intr - 150525 150434   92  2  2   35   87    75 0.363   1.74
 5.04 Intr - 152049 151965   85  1  1  136   78    25 0.541   5.18
 5.03 Intr - 156777 156659  119  2  2   59   56    31 0.185  -2.89
 5.02 Intr - 160615 160488  128  1  2   70   97    71 0.398   5.68
 5.01 Init - 166124 165948  177  2  0   51   61   131 0.905   5.96
 5.00 Prom - 168175 168136   40                              -5.96

 6.00 Prom + 170304 170343   40                              -2.36
 6.01 Init + 176750 177103  354  1  0   77   63   168 0.381   9.67
 6.02 Intr + 187634 187748  115  1  1   33  105    80 0.644   4.22
 6.03 Term + 192703 192803  101  2  2  101   49    83 0.923   3.99
 6.04 PlyA + 193310 193315    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:124439187_124649889|GENSCAN_predicted_peptide_1|439_aa
MERESGKPVAVVAVVTEPWFTQRYREYLQRQKLFDTQHRVEKMPDGSVALPVLGETLPEQ
HLQELRNRVAPGSPCMLTQLPDPVPSKRAQGCSPAQKLCLEVSRWVEGRGVKWSAELEAD
LPRSWQRHGNLLLLSEDCFQAKQWKNLGPELWETVALALGVQRLAKRGRVSPDGTRTPAV
TLLLGDHGWVEHVDNGIRYKFDVTQCMFSFGNITEKLRVASLSCAGEVLVDLYAGIGYFT
LPFLVHAGAAFVHACEWNPHAVVALRNNLEINGVADRCQIHFGDNRKLKLSNIADRVILG
LIPSSEEGWPIACQVLRQDAGGILHIHQNVESFPGKNLQALGVSKVEKEHWLYPQQITTN
QWKNGATRDSRGKMLSPATKPEWQRWAESAETRIATLLQQVHGKPWKTQILHIQPVKSYA
PHVDHIVLDLECCPCPSVG

>gi568815590f:124439187_124649889|GENSCAN_predicted_CDS_1|1320_bp
atggagagagaaagtgggaagcccgtggctgttgtcgcagttgtgactgagccttggttt
acccagcgatacagagaatatctccagaggcagaaactctttgatacacagcaccgtgtg
gaaaagatgccggatggctcggtggcgctaccggtgctgggagagacgcttccagagcag
cacctgcaggagctgaggaatcgtgttgccccaggcagtccctgtatgctcacgcagctc
ccggatcctgttccttcgaagagggcccagggttgttcacctgcccaaaaattgtgtctt
gaggtgagtcgctgggtggagggtcggggagtcaagtggtcagccgagttggaggctgat
ttgccccgatcatggcaacggcatggtaatctcttgttgctgagtgaagactgtttccaa
gccaagcagtggaaaaatctgggaccggaactctgggagaccgttgccttggcacttggc
gtccagcgtttggcaaaacgagggcgggtatcaccggatggtactcgaactccagcagtg
acactgctgctgggtgaccatggctgggtagagcatgtggataatggtatccgttataag
tttgacgtgacccagtgtatgttctcctttggaaacatcactgagaagcttcgagtggca
tcgttgtcctgtgctggagaagtgctggtggatctctatgcagggattggttattttaca
ttgcctttcctagttcatgctggtgctgccttcgtccatgcttgtgagtggaatccccat
gctgtagttgctctgagaaataaccttgagatcaatggagtagcagatcggtgccaaata
cactttggagataacagaaaactgaagctctcaaatattgcagatagggtgatcctgggg
ctgattcccagctctgaagaaggctggcccattgcctgccaagtgttaaggcaggatgct
ggaggcattttgcatatccaccaaaatgtggaatctttcccagggaagaatcttcaggct
cttggagtcagcaaagtagagaaagagcattggctgtatcctcagcaaattaccaccaac
caatggaaaaatggagctaccagggattctaggggaaaaatgctgtcaccagccaccaag
ccagagtggcaaaggtgggcagaatctgcagaaactcgaatcgccactcttcttcagcag
gtgcatgggaaaccatggaagacacaaattctgcacatccaaccagtgaaatcctatgct
ccccatgtggatcacatagtcctggatctggaatgctgcccctgtccttcagttggctag

>gi568815590f:124439187_124649889|GENSCAN_predicted_peptide_2|364_aa
MAAVGPPQQQVRMAHQQVWAALEVALRVPCLYIIDAIFNSYPDSSQSRFCIVLQIFLRLF
GVFASSIVLILSQRSLFKFYTYSSAFLLAATSVLVNYYASLHIDFYGAYNTSAFGIELLP
RKVTAFCVELCLKVIVSLTVYTLFMIDGYYNVLWEKLDDYVYYVRSTGSIIEFIFGVVMF
GNGAYTMMFESGSKIRAFMMCLHAYFNIYLQAKNGWKTFMNRRTAVKKINSLPEIKGSRL
QEINDVCAICYHEFTTSARITPCNHYFHALCLRKWLYIQDTCPMCHQKVYIEDDIKDNSN
VSNNNGFIPPNETPEEAVREAAAESDRELNEDDSTDCDDDVQRERNGVIQHTGAAAEEFN
DDTD

>gi568815590f:124439187_124649889|GENSCAN_predicted_CDS_2|1095_bp
atggcggccgtggggcccccgcagcagcaggtgcggatggcccatcagcaggtctgggcg
gcgctcgaagtggcgctccgggtgccctgcctttacatcatcgacgccatcttcaactcc
tacccggattccagccaaagccggttctgcatcgtgctccagatcttcctccggctcttt
ggtgtatttgcatccagtattgttctgatcttgtcacaacgatcacttttcaagttttac
acgtacagctcagcctttctgttagctgcaacttcagtgttggtgaattattatgcttct
ttgcacattgacttctatggtgcctacaacacgtcagcttttggaattgagctgcttcct
cgaaaagttacagcattttgtgtggaactgtgcttaaaagtaattgtttctctcactgtt
tatacgttattcatgattgatggctactataatgtcctctgggaaaagcttgacgattat
gtctactacgttcgttcaacaggcagtattattgaatttatatttggagttgtaatgttt
ggaaatggggcttacactatgatgtttgagtcgggaagtaaaattcgggcttttatgatg
tgcctacatgcatattttaacatctacttacaagccaaaaatggctggaagacatttatg
aatcgtaggactgctgtgaagaaaattaattcacttcctgaaataaaagggagccgctta
caagaaataaatgatgtatgtgcaatctgctatcatgagtttacaacatctgctcgtatt
acaccgtgtaatcattatttccatgcactttgccttcggaaatggctgtacattcaagat
acttgtccaatgtgccatcagaaagtatacatcgaagatgatatcaaggataattcaaat
gtatctaacaacaatggatttattccacccaatgaaactccagaggaagctgtaagagaa
gctgctgctgaatctgacagggaattgaacgaagatgacagtacagattgtgatgatgat
gttcaaagagaaagaaatggagtgattcagcacacaggcgcagcagctgaagaatttaat
gatgatactgactga

>gi568815590f:124439187_124649889|GENSCAN_predicted_peptide_3|68_aa
MQYEALLGFSLDILKNSLNCQNKQNYQCFFIVETHMLNFWVHSFDGTKEAAAALIDLDLY
IGFNGCCF

>gi568815590f:124439187_124649889|GENSCAN_predicted_CDS_3|207_bp
atgcagtatgaagctctcctgggcttctcgcttgatattttgaaaaacagtttgaactgt
cagaacaaacaaaattaccaatgtttcttcattgtcgaaactcacatgctgaatttttgg
gtgcattcatttgatggtaccaaggaagcagcagctgctttgattgacttggatctttat
ataggatttaatggttgctgtttctga

>gi568815590f:124439187_124649889|GENSCAN_predicted_peptide_4|179_aa
MAFLASGPYLTHQQKVLRLYKRALRHLESWCVQRDKYRYFACLMRARFEEHKNEKDMAKA
TQLLKEAEEEFWYRQHPQPYIFPDSPGGTSYERYDCYKVPEWCLDDWHPSEKAMYPDYFA
KREQWKKLRRESWEREVKQLQEETPPGGPLTEALPPARKEGDLPPLWWYIVTRPRERPM

>gi568815590f:124439187_124649889|GENSCAN_predicted_CDS_4|540_bp
atggcgttcttggcgtcgggaccctacctgacccatcagcaaaaggtgttgcggctttat
aagcgggcgctacgccacctcgagtcgtggtgcgtccagagagacaaataccgatacttt
gcttgtttgatgagagcccggtttgaagaacataagaatgaaaaggatatggcgaaggcc
acccagctgctgaaggaggccgaggaagaattctggtaccgtcagcatccacagccatac
atcttccctgactctcctgggggcacctcctatgagagatacgattgctacaaggtccca
gaatggtgcttagatgactggcatccttctgagaaggcaatgtatcctgattactttgcc
aagagagaacagtggaagaaactgcggagggaaagctgggaacgagaggttaagcagctg
caggaggaaacgccacctggtggtcctttaactgaagctttgccccctgcccgaaaggaa
ggtgatttgcccccactgtggtggtatattgtgaccagaccccgggagcggcccatgtag

>gi568815590f:124439187_124649889|GENSCAN_predicted_peptide_5|902_aa
MTCGFCIRPQLLRWQELASSSRSDLASAGASFCAHAAEADHGNGGFMVADVGETAGISQA
AWSVPAIQVCGKEETDGNTAAKEPQDRFGGHLGEEGVTYLNRSGKHSPEASPAASSPCLC
PIPEAATAKGDGVSIMGLDRSCGTREIGSALTRMCMRHRSIEAKLRQFSSALIDCLINPL
QEQMEEWKKVANQLDKDHAKEYKKARQEIKKKSSDTLKLQKKAKKGRGDIQPQLDSALQD
VNDKYLLLEETEKQAVRKALIEERGRFCTFISMLRPVIEEEISMLGEITHLQTISEDLKS
LTMDPHKLPSSSEQVILDLKGSDYSWSYQTPPSSPSTTMSRKSSVCSSLNSVNSSDSRSS
GSHSHSPSSHYRYRSSNLAQQAPVRLSSVSSHDSGFISQDAFQSKSPSPMPPEAPNQNSS
SSASSEASETCQSVSECSSPTSVSSGSTMGAWVSTEKVTARVAATAAPLSLVPLPAAQTA
LTAAATWKLRIDTKSGAPLGDRGLSSESHVGPTGAGLFPHCLPASRLLPRVTSVHLPDYA
HYYTIGPGMFPSSQIPSWKDWAKPGPYDQPLVNTLQRRKEKREPDPNGGGPTTASGPPAA
AEEAQRPRSMTVSAATRPGEEMEACEELALALSRGLQLDTQRSSRDSLQCSSGYSTQTTT
PCCSEDTIPSQDYDYFSVSGDQEADQQEFDKSSTIPRNSDISQSYRRMFQAKRPASTAGL
PTTLGPAMVTPGVATIRRTPSTKPSVRRGTIGAGPIPIKTPVIPVKTPTVPDLPGVLPAP
PDGPEERGEHSPESPSVGEGPQGVTSMPSSMWSGQASVNPPLPGPKPSIPEEHRQAIPES
EAEDQEREPPSATVSPGQIPESDPADLSPRDTPQGEDMLNAIRRGVKLKKTTTNDRSAPR
FS

>gi568815590f:124439187_124649889|GENSCAN_predicted_CDS_5|2709_bp
atgacatgtggtttctgcatcaggccacagctgcttcgctggcaggagctggcgtcatcg
agtcgctcagacctggcttcagcaggagcatctttctgtgcccatgcagcagaggcagac
catggcaatggggggttcatggtggcagatgtcggggagacagctggcatttcccaggct
gcgtggagtgtgcctgccatccaagtgtgtgggaaagaggagacagatgggaatacagct
gccaaggagccccaagaccgttttggaggtcatttgggggaggagggggtcacttacctc
aatagatcaggaaaacattccccagaggcctccccagcagcctcctctccatgtctttgt
cccatccctgaagcagccactgccaaaggggatggagttagcatcatgggcctagaccga
tcatgtgggaccagggagattggatctgctctcaccaggatgtgcatgaggcacagaagc
attgaagccaagctgaggcagttttcgagcgctttaattgattgtctgataaacccactt
caagaacagatggaagaatggaagaaagtggccaaccagctggataaagaccacgcaaaa
gaatataagaaagcccgccaagagataaaaaagaagtcctcggatacgctgaaactgcag
aagaaagcaaaaaaagggagaggtgatatccagcctcagttggacagtgctctccaagat
gtcaatgataagtatctcttattggaagaaacagaaaagcaggctgtccggaaggctttg
attgaagaacgtggccgattctgtaccttcatctctatgctgcggccagtgattgaagaa
gaaatctcaatgctaggggaaataacccaccttcagaccatctcggaagatctaaaaagc
ctgaccatggaccctcacaaactgccctcctcaagtgaacaggtgattctggacttgaaa
ggttctgattacagctggtcgtatcagacgccaccctcttcccccagcaccaccatgtcc
agaaagtccagtgtctgcagcagcctgaacagtgtcaacagcagtgactcccggtccagc
ggctcccactcgcattcccccagctcacattaccgctaccgcagctccaacctggcccag
caggctcctgtgaggctgtccagcgtgtcctcccatgactcaggattcatatcccaggat
gccttccagtccaagtcaccatcccccatgccgccagaggcccccaaccagaactcgtcc
agctcggcctcctccgaagcctcggaaacctgccagtcagtgagcgagtgcagctccccc
acctctgtcagctcgggctccaccatgggtgcctgggtgtccacagagaaggtgaccgcc
cgggtggcagccacagcggccccactctccctggtccccctgcctgctgctcagacagcc
ctcacagccgcggcaacctggaagctcagaatcgacaccaagagtggggctcctcttggg
gacagaggtttatcaagtgagtcccacgtggggcccacgggtgcaggccttttccctcat
tgcctgcctgcctcccgcctgctccctcgggtcacctctgtccaccttccagactacgct
cattattacaccattgggcccggcatgttcccgtcatctcagatccctagctggaaggac
tgggctaagcctgggccctatgaccagcctctggtgaacaccctgcagcgccgcaaagag
aagcgagaaccggaccccaacgggggaggacccactaccgccagcggcccacctgcagca
gctgaggaggctcagagaccacggagcatgactgtatcggctgccaccaggcctggtgag
gagatggaggcttgtgaggagctggccctggccctgtctcggggcctgcagctggacacc
cagaggagcagccgggactcgcttcagtgctccagcggctacagcacccagacaaccacc
ccctgctgctctgaggacaccatcccttcccaagattatgattatttctctgtaagtggt
gaccaggaggcagatcagcaggagttcgacaagtcctccaccattccaagaaacagcgac
atcagccagtcctaccgacggatgttccaagccaagcgtccagcctcaactgctggcctc
cccaccaccctgggacctgctatggtcactccaggggttgcaactatccgacggacccct
tccaccaagccttctgtccgccggggaaccattggagctggtcccatccccatcaagaca
cccgtgatccctgtcaagaccccaaccgtcccagacctcccaggggtgttgccagcccct
ccagatgggccagaagagcggggggagcacagccctgagtcgccatctgtgggtgagggc
ccccaaggtgtcaccagcatgccctcctcaatgtggagcggccaagcttccgttaaccct
ccacttccaggcccgaagcccagtatccctgaggagcacagacaggcaattccagaaagt
gaagctgaagaccaggaacgggaacccccaagtgccactgtctccccaggccagattcca
gagagtgaccctgcagacctgagcccaagggatactccacaaggagaagacatgctgaac
gccatccgaaggggcgtgaaactgaagaagaccacgacaaacgatcgctcagcccctcgc
ttttcttag

>gi568815590f:124439187_124649889|GENSCAN_predicted_peptide_6|189_aa
MAIMAAGPHGLRPSGFSCFLLSITLPFATISGDTSTDPHGSGPPPPPTEPRVQVHDAKFR
AQINQTNTFNMNMQTLCCDVRRGLTSYASACNRQDSTATNCKGEPMFICTGFAGKLLETK
GKGNKNHISMVTVTEPDIRCPNTRKGLPRQVLFTAAATGITDLSFGPAADDGGCVIWFLT
ESKEKEEKS

>gi568815590f:124439187_124649889|GENSCAN_predicted_CDS_6|570_bp
atggctattatggcagccgggccccatggccttcgaccttcaggattctcctgcttcctc
ctcagcatcactcttccctttgccaccatcagtggtgacaccagcacagatccacacggc
tctgggccacctccgcctccaactgaaccaagagtgcaagttcatgatgccaaattcagg
gcacagataaaccaaaccaacacattcaacatgaacatgcaaacgctctgctgtgatgtc
agaagaggcctaacatcctacgcttcagcatgcaacagacaagattccactgcaacaaac
tgcaaaggagagcctatgtttatttgcactggatttgcagggaagctgctggaaacaaaa
ggaaaaggcaacaagaatcacatctccatggtgacggtcacagagcctgatatacgctgc
cccaacactaggaaaggtctcccaaggcaggtcctgttcactgcagctgctactggaata
acagatctgagctttgggcctgctgctgatgatggtggttgtgtcatttggtttttaaca
gaaagcaaagagaaagaggagaaaagttaa