GENSCAN 1.0 Date run: 5-Nov-116 Time: 01:37:03 Sequence gi568815586r:10506278_10713532 : 207255 bp : 37.09% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 58 1044 987 0 0 58 49 793 0.852 68.99 1.02 PlyA + 1690 1695 6 1.05 2.02 PlyA - 3224 3219 6 1.05 2.01 Sngl - 11949 11206 744 0 0 59 42 213 0.589 9.74 2.00 Prom - 12039 12000 40 -6.15 3.02 PlyA - 12200 12195 6 1.05 3.01 Sngl - 31549 31100 450 1 0 27 53 303 0.939 16.66 3.00 Prom - 37939 37900 40 -6.45 4.03 PlyA - 38467 38462 6 1.05 4.02 Term - 45174 45069 106 2 1 53 45 168 0.532 5.90 4.01 Init - 56001 55949 53 0 2 68 98 2 0.092 -0.12 4.00 Prom - 58697 58658 40 -3.75 5.00 Prom + 63976 64015 40 -4.95 5.01 Sngl + 64823 64993 171 1 0 48 49 211 0.937 7.78 5.02 PlyA + 66227 66232 6 1.05 6.09 PlyA - 67069 67064 6 1.05 6.08 Term - 74848 74419 430 0 1 80 44 201 0.033 8.69 6.07 Intr - 75153 75081 73 1 1 56 80 35 0.010 -2.95 6.06 Intr - 89650 89414 237 2 0 21 87 104 0.281 0.26 6.05 Intr - 91932 91766 167 2 2 96 91 76 0.537 7.38 6.04 Intr - 101659 101577 83 1 2 77 106 56 0.948 3.82 6.03 Intr - 103664 103554 111 2 0 60 72 134 0.988 8.66 6.02 Intr - 104403 104345 59 1 2 117 107 29 0.998 5.38 6.01 Init - 107255 107162 94 2 1 82 93 54 0.977 5.89 6.00 Prom - 108053 108014 40 -7.25 7.10 PlyA - 110386 110381 6 1.05 7.09 Term - 114071 113867 205 1 1 106 49 274 0.970 21.16 7.08 Intr - 118582 118374 209 1 2 85 9 80 0.187 -3.25 7.07 Intr - 121447 121364 84 1 0 107 99 82 0.977 10.30 7.06 Intr - 123397 123216 182 2 2 128 88 131 0.991 15.77 7.05 Intr - 125031 124768 264 1 0 90 93 214 0.999 18.66 7.04 Intr - 127847 127713 135 0 0 58 98 97 0.983 7.32 7.03 Intr - 128409 128290 120 1 0 40 103 77 0.159 3.95 7.02 Intr - 135780 135708 73 0 1 94 61 5 0.028 -3.54 7.01 Init - 136049 135972 78 2 0 84 110 14 0.412 4.31 7.00 Prom - 144983 144944 40 -4.05 8.03 PlyA - 145005 145000 6 1.05 8.02 Term - 155274 155071 204 0 0 49 32 159 0.212 2.79 8.01 Init - 167970 167677 294 0 0 42 39 228 0.569 10.53 8.00 Prom - 169537 169498 40 -5.05 9.03 PlyA - 170886 170881 6 1.05 9.02 Term - 173822 173742 81 2 0 90 34 116 0.375 3.11 9.01 Init - 189967 189782 186 1 0 65 60 127 0.129 6.71 9.00 Prom - 193845 193806 40 -3.95 10.07 PlyA - 194606 194601 6 1.05 10.06 Term - 195076 195011 66 1 0 104 41 37 0.650 -2.44 10.05 Intr - 195857 195683 175 1 1 94 84 186 0.927 17.82 10.04 Intr - 197871 197774 98 0 2 109 77 166 0.578 15.49 10.03 Intr - 199971 199900 72 0 0 56 102 41 0.174 0.98 10.02 Intr - 203837 203631 207 2 0 59 98 300 0.924 26.45 10.01 Intr - 207056 206934 123 2 0 103 63 295 0.999 28.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 74949 74419 531 0 0 70 44 277 0.872 15.35 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:10506278_10713532|GENSCAN_predicted_peptide_1|328_aa MLNGAAVMDAALLLIAGNESCPQPQTSEHLAAIEIMKLKHILILQNKIDLVKERQAKEQY EQILAFVQGTVAEGAPIIPISAQLKYNIEVVCEYIVKKIPVPPRDFTSEPRLIVIRSFDV NKPGCEVDDLKGGVAGGSILKGVLKVGQETEVRPGIVSKDSEGKLMCKSIFSKIVSLFAE HNDLQYAAPGGLIGVGTKIDPTLCRADRMVGQILGAVGALPEIFTELEISYFLLRRLLGV RTEGDKKAAKVQKLSKNEVLMVNIGSLSTGGRVSAVKADLGKIVLTNPVCTEVGEKIALS RRVEKHWRLIGWGQIRRGVTIKPTVDDD >gi568815586r:10506278_10713532|GENSCAN_predicted_CDS_1|987_bp atgctgaacggtgcagcagtgatggatgcagctcttctgttgatagctggtaatgaatct tgccctcagcctcagacatctgaacacctggctgctatagagatcatgaaactgaagcat attttgattctacaaaataaaattgatttggtaaaagaaaggcaggctaaagaacaatac gagcagatccttgcgtttgtccaaggtacagtagcagagggagctcccattattccaatt tcggctcagctgaaatacaatattgaagttgtttgtgagtacatagtaaagaaaattcca gtacccccaagagactttacttcagagccccggcttattgttattagatcttttgatgtc aacaaacctggctgtgaagttgatgaccttaagggaggtgtagctggtggtagtatccta aaaggagtattaaaggtgggccaggagacagaagtaagacctggtattgtttccaaagat agtgaaggaaaactcatgtgtaaatcaatcttttccaaaattgtttcactttttgcggag cataatgatctgcaatatgctgctccaggcggtcttattggagttggaacaaaaattgac cccactttgtgccgggctgacagaatggtggggcaaatacttggtgcagtcggagcttta cctgagatattcacagaattggaaatttcctatttcctgcttagacggcttctaggtgta cgcactgaaggagacaagaaagcagcaaaggttcaaaagctgtctaagaatgaagtgctc atggtgaacataggatccctgtcgacaggagggagagttagtgctgtcaaggccgatttg ggcaaaattgttttgaccaatccagtgtgcacagaggtaggagaaaaaattgcccttagc cgaagagttgaaaaacactggcgtttaattggttggggtcagataagaagaggagtgaca atcaagccaacagtagatgatgactga >gi568815586r:10506278_10713532|GENSCAN_predicted_peptide_2|247_aa MGDLNTPLSTLDRSMRQKVNKYTQELNSALHQVDLIDIYRTLHPKSTEYTFFSAPHHTYS KIDHIVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNHSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNKNKDTTYQNLWNTFKAVCTGKFIALNAHKRKQERSKIDNLTSQLK ELEKQEQTHSKASRRQEITKIRAELKEIEAQKTLQKINGSRSWFFERVNKIDRLLARLIK KKREKIK >gi568815586r:10506278_10713532|GENSCAN_predicted_CDS_2|744_bp atgggagaccttaacaccccactgtcaacattagacagatcaatgagacagaaagttaac aagtatacccaggaattgaactcagctctgcaccaagtggacctaatagacatctacaga actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaactc actcaaaaccactcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagacaca acataccagaatctctggaacacattcaaagcagtgtgtacagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaattgacaacctaacatcacaattaaaa gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagaggcacaaaaaacccttcaaaaaattaatggatcc aggagctggttttttgaaagggtcaacaaaattgatagattgctagcaagactaataaag aagaaaagagagaaaatcaaatag >gi568815586r:10506278_10713532|GENSCAN_predicted_peptide_3|149_aa MVQAPSLGSFPHAVESVGVQKSRIKVWESPPGFQRLYGDTWMSKQKFAAGEGPSWRTSAR AVWKGNVGLKPPHRVSTGALPSGAVRKGPPSSKPQNGRSTNGLHGVPGKAADTQCQPVKA GGRLYPAKPQGWSYPRPWKPTSCISVTWM >gi568815586r:10506278_10713532|GENSCAN_predicted_CDS_3|450_bp atggtgcaagctccaagccttggcagctttccacatgctgttgagtctgtgggtgtacag aagtcaagaataaaggtttgggaatctccacctggatttcagaggctctatggagacacc tggatgtccaagcagaagtttgctgcaggggagggaccttcctggagaacctctgctagg gcagtgtggaagggaaatgtgggattgaagcccccacacagagtctccactggggcacta cctagtggagctgtgagaaaagggccaccgtcctccaaacctcagaatggtagatccacc aatggcttgcatggtgtgcctggaaaagctgcagacactcagtgccagcctgtgaaagca ggagggaggctataccctgcaaagccacaggggtggagctacccaagaccatggaaaccc acctcttgcatcagtgtgacctggatgtga >gi568815586r:10506278_10713532|GENSCAN_predicted_peptide_4|52_aa MVHTQDFTLALLELSNTSGVLEPLHLSPNGIRCAARTRFTGSMGAFVKIVRR >gi568815586r:10506278_10713532|GENSCAN_predicted_CDS_4|159_bp atggtacacactcaagattttaccttagcattattggaattgtcaaataccagtggtgtc ctggagcccctgcacctgtctccaaatggtatccgctgtgccgccaggacgcgcttcact ggcagcatgggtgcctttgtgaagattgtgaggcgctag >gi568815586r:10506278_10713532|GENSCAN_predicted_peptide_5|56_aa MCNKKSRGQETEGCRQLNSIINAIRCLNPSNLLRLDMWTTLWPHIAAIAPDQYSWQ >gi568815586r:10506278_10713532|GENSCAN_predicted_CDS_5|171_bp atgtgtaataaaaagtctagaggtcaagagactgagggctgtcggcagctcaacagcatc atcaatgccattcgctgtctaaatccttctaatcttctgcgtttggacatgtggacaacc ttgtggccacacatagctgccatagccccagatcagtattcatggcaatga >gi568815586r:10506278_10713532|GENSCAN_predicted_peptide_6|417_aa MAVASDFYLRYYVGHKGKFGHEFLEFEFRPDGKLRYANNSNYKNDVMIRKEAYVHKSVME ELKRIIDDSEITKEDDALWPPPDRVGRQELEIVIGDEHISFTTSKIGSLIDVNQSKNHPI SGFECKTYLLTLKMNDQGEIYSTLRFLQSPSESQNRLRPDDTQRPGKTDDKVFQCIQEKH QRQEILRNCSEKYIMQNDNYLKEQILTNKTLKFDVLKNSFQQKKELDSRLIQKNRCHREN EIVFKVLQNTGLKPQNFAVSVIALKGGPSSVCCSRALIAPFYRVLIGPFLQSADWCVYKP LARHRVLIGAFLQSADWCIYKPLARHRELIGAFLQSADWCIYKPLARHRELIGAFLHSTD WCVSKPLARHRVLIGAFLQSTDWCIYKPLARHRELIGAFLQSADWCVYKPLARQSTD >gi568815586r:10506278_10713532|GENSCAN_predicted_CDS_6|1254_bp atggctgtggctagcgatttctacctgcgctactacgtagggcacaagggcaagtttggg cacgagtttctggagttcgaatttcggccggacggaaagcttagatatgccaacaacagc aattacaaaaatgatgtcatgatcagaaaagaggcttatgtgcacaagagtgtaatggaa gaactgaagagaattattgatgacagtgaaattacaaaagaagatgatgctttgtggcct ccccctgatagggttggccgacaggagcttgaaattgtaattggagatgagcacatatct tttaccacatcaaaaataggttctcttattgatgtaaatcagtcaaaaaaccatcctatt tcaggatttgaatgcaaaacttaccttcttactctaaagatgaatgatcagggagagatt tattcaaccctgagatttttgcagtctccttcagagtcacagaatagattaaggcctgat gatactcaaaggcctgggaaaactgatgacaaagtctttcagtgtattcaagaaaaacat caacggcaggaaattctaagaaactgtagtgaaaagtacatcatgcaaaatgacaactac ttaaaagagcagattttgacaaataagactttaaaatttgacgttctcaaaaatagcttt cagcagaaaaaggaactggattcacgccttatacaaaagaacagatgtcatagagaaaat gagatcgtttttaaagttttgcaaaatacaggattgaaaccacagaacttcgcagttagt gttatagctcttaaaggtggcccatccagtgtttgttgctcccgagcactgattgctcca ttttacagagtgctgattggtccgtttttacagagtgctgattggtgtgtttacaaacct ttagctagacacagagtgctgattggtgcatttttacagagtgctgattggtgcatttac aaacctttagctagacacagagagctgattggtgcatttttacagagtgctgattggtgc atttacaaacctttagctagacacagagagctgattggtgcatttttacacagtactgat tggtgcgtttccaaacctttagctagacacagagtgctgattggtgcatttttacagagt actgattggtgcatttacaaacctttagctagacacagagagctgattggtgcattttta cagagtgccgattggtgcgtttacaaacctttagccagacagagcactgattga >gi568815586r:10506278_10713532|GENSCAN_predicted_peptide_7|449_aa MNDRNEIQMEAKLQSLTIIAQEILCRFFITLRRHARFLLTKLGRQGMARSGITHSCAVCI LCGPSREGDSPVAMGMTRMLLECSLSDKLCVIQEKQYEVIIVPTLLVTIFLILLGVILWL FIREQRTQQQRSGPQGIAPVPPPRDLSWEAGHGGNVALPLKETSVENFLGATTPALAKLQ VPREQLSEVLEQICSGSCGPIFRANMNTGDPSKPKSVILKALKEPAGLHEVQDFLGRIQF HQYLGKHKNLVQLEGCCTEKLPLYMVLEDVAQGDLLSFLWTCRRDVMTMDGLLYDLTEKQ VYHIGKQVLLALEFLQEKHLFHGDVAARNILMQSDLTAKLCGLGLAYEVYTRGAISSTQT IPLKWLAPERLLLRPASIRADVYSIMKSCWRWREADRPSPRELRLRLEAAIKTADDEAVL QVPELVVPELYAAVAGIRVESLFYNYSML >gi568815586r:10506278_10713532|GENSCAN_predicted_CDS_7|1350_bp atgaatgataggaatgagattcaaatggaagccaaactccaaagtcttaccattatagca caggaaattctatgcaggttctttattacccttaggagacatgcacgtttcctgctcact aaactaggaaggcaaggaatggcaaggtcaggaattactcacagctgtgctgtgtgcatt ctctgtgggcctagcagggaaggggacagccctgtggcaatgggcatgacacggatgctc ctggaatgcagtctcagtgacaagttgtgtgtcatccaggagaagcagtatgaagtgatt atcgtcccaactttgttggttactatcttcctcatccttcttggggtcatcctgtggctt tttatcagagaacaaagaactcaacagcagcgttctggacctcaaggcattgcccctgtt cctccacctagggacctaagctgggaagcaggacatggaggaaatgtggctttgccactt aaggagacatccgtggaaaactttctgggagctaccacacctgccctggctaagctgcag gtgccgcgggagcaactctctgaagttctggagcagatttgcagtggtagctgtgggccc atctttcgagccaatatgaacactggggacccttctaagcccaagagtgttattctcaag gctttaaaagaaccagctgggctccatgaggtacaagatttcttagggcgaatccaattc catcaatacctggggaaacacaaaaacctggtgcagctggaaggctgctgcactgaaaag ctgccactctatatggtgttggaggatgtggcccagggggacctgctcagctttctctgg acctgtcggcgggatgtgatgactatggatggtcttctctatgatctcacagaaaaacaa gtatatcacatcggaaagcaggtccttttggcgctggaattcctgcaggagaagcatttg ttccatggggatgtggcagccaggaatattctgatgcaaagtgatctcactgctaagctc tgtggattaggcctggcttatgaagtttacacccgaggggccatctcctctactcaaacc atacctctcaagtggcttgccccagaacggcttctcctgagacctgctagcatcagagca gatgtgtacagtatcatgaagtcctgctggcgctggcgtgaggctgaccgcccctcacct agagagctgcgcttgcgcctagaagctgccattaaaactgcagatgacgaggctgtgtta caagtaccagagttggtggtacctgaactgtatgcagctgtggccggcatcagagtggag agcctcttctacaactatagcatgctttga >gi568815586r:10506278_10713532|GENSCAN_predicted_peptide_8|165_aa MSIDQQQVQPLQLQQRFPVARVWRVPETSEPADFPSREPARTPSFAGSHAPQWEGGTEKH FLEAGTPCPTQPCHYFPSDWAGAIGNRSDQRRDTVVNGVIVGIKTSIKEVCVKDKYQLRN SRAGWIPKQIASLGLLAGLCSMQMCEESGFCARGGRDDTDQEYNI >gi568815586r:10506278_10713532|GENSCAN_predicted_CDS_8|498_bp atgagtatagatcagcaacaagtccagcctttacagttgcagcagaggtttcctgtggct cgagtgtggcgagtcccggaaacctcggagcccgcagacttcccttcgcgggagcccgcc cgaactccatcctttgccggcagccacgccccgcagtgggaaggagggactgaaaagcat ttccttgaggctggcacaccttgccctacccaaccctgtcattatttcccctccgactgg gccggtgccatcggaaaccggagtgaccagaggagggacacggtagtgaacggggtaata gtgggaatcaaaaccagtatcaaggaagtttgtgttaaagacaaatatcagttaaggaat agcagagcagggtggattccaaagcagattgcttcacttggtttattggctggactgtgc tccatgcagatgtgtgaagagtctgggttttgtgcaagaggaggaagagatgacacggat caagaatataacatctaa >gi568815586r:10506278_10713532|GENSCAN_predicted_peptide_9|88_aa MTGVLVRSNTRDMRAQGGGHLKKLQEEGRPSASHGEASVNQTCQHLDLGLPTSRPEKRNF CRGEYSLLEETDKETTNEKRSGTEKKEG >gi568815586r:10506278_10713532|GENSCAN_predicted_CDS_9|267_bp atgactggtgttcttgtgaggagcaacaccagagacatgcgtgcacaggggggtggccat ttgaagaagttgcaagaggaaggaaggccatctgcaagccacggagaggcctcagtgaac caaacctgccagcaccttgatcttggacttccaacctccagacctgagaaaaggaatttc tgtcgtggagaatatagcctattagaagaaacagacaaggaaactacaaatgagaaacgc tctggtaccgagaagaaagaaggatag >gi568815586r:10506278_10713532|GENSCAN_predicted_peptide_10|246_aa GAEAANVTGPDGVPVEGSRYAADRRRYRRGYYGRRRGPPRNYAGEEEEEGSGSSEGFDPP ATDRQFSGARNQLRRPQYRPQYRQRRFPPYHVGQTFDRRSRVLPHPNRIQSYPWSLPYPL PHQQLLKPLNGQIKAGEIGEMKDGVPEGAQLQGPVHRNPTYRPRYRSRGPPRPRPAPAVG EAEDKENQQATSGPNQPSVRRGYRRPYNYRRRPRPPNAPSQDGKEAKAGEAPTENPAPPT QQSSAE >gi568815586r:10506278_10713532|GENSCAN_predicted_CDS_10|741_bp ggtgcagaagctgccaatgtgactggcccggatggagttcctgtggaagggagtcgttac gctgcagatcggcgccgttacagacgtggctactatggaaggcgccgtggccctccccgg aattacgctggggaggaggaggaggaagggagcggcagcagtgaaggatttgacccccct gccactgataggcagttctctggggcccggaatcagctgcgccgcccccagtatcgccct cagtaccggcagcggcggttcccgccttaccacgtgggacagacctttgaccgtcgctca cgggtcttaccccatcccaacagaatacagagttacccctggtctctcccttacccgtta cctcaccaacaacttctaaagccattaaatgggcagatcaaggctggtgagattggagag atgaaggatggagtcccagagggagcacaacttcagggaccggttcatcgaaatccaact taccgcccaaggtaccgtagcaggggacctcctcgcccacgacctgccccagcagttgga gaggctgaagataaagaaaatcagcaagccaccagtggtccaaaccagccgtctgttcgc cgtggataccggcgtccctacaattaccggcgtcgcccgcgtcctcctaacgctccttca caagatggcaaagaggccaaggcaggtgaagcaccaactgagaaccctgctccacccacc cagcagagcagtgctgagtaa