GENSCAN 1.0	Date run: 4-Nov-116	Time: 08:50:20

Sequence gi568815596r:11346480_11565879 : 219400 bp : 44.99% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   5148   5641  494  2  2   92  119   635 0.988  61.63
 1.02 Intr +   7124   7130    7  2  1  112   81     0 0.038  -4.56
 1.03 Intr +  18667  18810  144  0  0   64   89   126 0.410  10.78
 1.04 Intr +  25346  25397   52  1  1   50  103    56 0.076   1.78
 1.05 Term +  25443  25594  152  1  2  112   42    77 0.104   3.57
 1.06 PlyA +  25932  25937    6                               1.05

 2.03 PlyA -  27833  27828    6                              -0.45
 2.02 Term -  28266  28127  140  1  2   76   55   104 0.547   4.03
 2.01 Init -  42225  42171   55  0  1   55   93    54 0.603   4.11
 2.00 Prom -  43291  43252   40                              -4.06

 3.04 PlyA -  44542  44537    6                               1.05
 3.03 Term -  47661  47539  123  0  0   99   42    79 0.749   2.78
 3.02 Intr -  49355  49203  153  2  0   89   58    90 0.746   6.37
 3.01 Init -  56234  56151   84  2  0   73  -17   215 0.135   8.33
 3.00 Prom -  63468  63429   40                              -1.96

 4.03 PlyA -  67310  67305    6                               1.05
 4.02 Term -  76377  76232  146  1  2  121   43    72 0.729   4.07
 4.01 Init -  78444  78375   70  0  1   93  115    40 0.491   8.41
 4.00 Prom -  79133  79094   40                              -5.56

 5.08 PlyA -  80652  80647    6                               1.05
 5.07 Term -  86212  86196   17  2  2  114   55     2 0.344  -2.10
 5.06 Intr -  92322  92155  168  1  0   74   68   101 0.345   6.62
 5.05 Intr - 101295 101148  148  0  1  105   94    81 0.677  10.21
 5.04 Intr - 105327 105108  220  2  1   64  -11   121 0.309  -1.90
 5.03 Intr - 107319 107103  217  1  1   66   76   138 0.683   7.96
 5.02 Intr - 110754 110700   55  0  1  101  100     5 0.784   1.55
 5.01 Init - 119400 119293  108  0  0   85  107   141 0.454  15.96
 5.00 Prom - 122674 122635   40                              -3.86

 6.02 PlyA - 123474 123469    6                               1.05
 6.01 Sngl - 136903 136280  624  1  0   74   47   291 0.782  17.81
 6.00 Prom - 144684 144645   40                              -2.06

 7.04 PlyA - 144853 144848    6                               1.05
 7.03 Term - 146504 146487   18  2  0   87   52     4 0.191  -4.98
 7.02 Intr - 146967 146778  190  2  1   68   53    86 0.437   2.69
 7.01 Init - 148524 148472   53  0  2   95  101    17 0.811   4.33
 7.00 Prom - 150638 150599   40                               0.34

 8.04 PlyA - 150838 150833    6                               1.05
 8.03 Term - 159873 159734  140  1  2   55   50    83 0.444  -0.67
 8.02 Intr - 162085 161861  225  2  0   22   77   154 0.058   5.66
 8.01 Init - 176065 175957  109  1  1   86   71    55 0.144   4.06
 8.00 Prom - 185533 185494   40                              -3.96

 9.04 PlyA - 185653 185648    6                               1.05
 9.03 Term - 195989 195823  167  0  2  104   42   160 0.327  11.08
 9.02 Intr - 198093 198066   28  0  1  118   67    14 0.344  -0.01
 9.01 Init - 198537 198535    3  0  0  113   22     0 0.289  -4.10
 9.00 Prom - 199974 199935   40                              -2.46

10.00 Prom + 202925 202964   40                              -0.46
10.01 Init + 210136 210292  157  0  1   57   81   225 0.752  18.77
10.02 Intr + 215984 216103  120  0  0  107  101    84 0.928  12.07

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:11346480_11565879|GENSCAN_predicted_peptide_1|282_aa
MVNPTVFFDISVSGKPLGHVSFRIFADKLSKTAQNFRALSTGEKGLDYKGSCFHRIIPGF
MCQGGDFTYNNGTGGKSLYGEKFDDENFILKRIRPGILSMAKAGPNTNGSQFFICTAKTE
WLIGKHVVFGKVKEGMNIVEAMEPFGSRNGKTSKKITIADCGQLWILADPSIDDLLLLAV
SRVASLKPPEQRPGIFSEQQPTRQLEETPGLFCSQLQVPGQEAEPGATEPATGHFSPPPS
LLTLSAITPKEGRFLECPACHPRDSPQGTHCWIQQVCPSASP
>gi568815596r:11346480_11565879|GENSCAN_predicted_CDS_1|849_bp
atggtcaaccccactgtgttcttcgacatcagtgtcagtggcaagcccttgggccatgtc
tccttcaggatatttgcagacaagctttcaaagacagcacaaaactttcgtgctctgagc
actggagagaaaggacttgattataagggttcctgctttcacagaattattccagggttt
atgtgtcagggtggtgacttcacgtacaataatggcactggtggcaagtccctctatggg
gagaaatttgatgatgagaacttcatcctgaagcgtatacgtcctggcatcttgtccatg
gcaaaagctggacccaacacgaatggctcccagtttttcatctgcactgccaagactgag
tggttgattggcaagcatgtggtctttggcaaggtgaaagagggcatgaatattgtggag
gccatggagccctttgggtccaggaatggcaagaccagcaagaagatcaccattgctgat
tgtggacaactctggattctggctgacccctccatcgatgacctgcttctgctggctgtg
tccagggtggcctccctgaagccaccggagcagaggcctggtatcttctcagagcagcag
ccaaccaggcagctggaggaaacccctggtctcttttgcagccaactccaagttccagga
caagaggcagaaccaggtgccaccgagcctgccacagggcacttctcaccccctccatcc
ctgctgacgctgagtgctatcacccccaaggaaggacgcttcctggaatgcccagcctgc
cacccaagagactcccctcagggcacccactgctggatccaacaggtctgccccagtgcc
agcccctag
>gi568815596r:11346480_11565879|GENSCAN_predicted_peptide_2|64_aa
MTDSLTTESPEARMALTAGRKNSRTGEEGHSRNATETSRERCTVSASHERNELQEETQGS
TAAG
>gi568815596r:11346480_11565879|GENSCAN_predicted_CDS_2|195_bp
atgacagactcgctgaccacagaaagcccagaagctagaatggccctcacagctgggaga
aagaacagcagaactggggaagaaggacattccagaaatgccacagagacctcaagagag
cgctgcactgtgtctgcctcacacgagagaaacgagctgcaggaagagacccagggctcc
acggccgctggatga
>gi568815596r:11346480_11565879|GENSCAN_predicted_peptide_3|119_aa
MVVVMVVVVAVMAAAVVMAMAAAAVVVVEPSCIPNSIPGSAYQKTWIDTPGPETLANISH
ATRARQDRCPRPPKGRLTLILESSGFLALSLFVDPPAASTTQIPAGKARGRLNRKKKPL
>gi568815596r:11346480_11565879|GENSCAN_predicted_CDS_3|360_bp
atggtggtggtgatggtggtggtggtggcggtgatggcagcggcagtggtgatggcgatg
gcggcagcggcggtggtggtggtggaaccctcttgcatccccaattccattcctggatct
gcttatcagaagacctggattgatactccaggcccagagacccttgccaacatcagtcat
gctaccagggcccgtcaggaccgctgcccacgtcctcccaaaggccggctcacactgatt
ctggaatcttcaggcttcttggcgctcagcctgttcgttgatccccctgctgcttctacc
acccagatcccagcagggaaagccagagggagactaaataggaagaagaagcctctctga
>gi568815596r:11346480_11565879|GENSCAN_predicted_peptide_4|71_aa
MATVHSYAGSPTYPAIGHIHGKRGNHSTNGDNIAREDVLPGQEKHAPSYVPCPPFLFIFV
YLFVTECGNRK
>gi568815596r:11346480_11565879|GENSCAN_predicted_CDS_4|216_bp
atggccactgttcactcatatgcgggctcccccacatacccggcaattggacacattcat
ggtaagcgcggcaaccattctaccaacggagataatattgccagagaagacgttttacca
gggcaggagaaacatgcaccatcttacgtcccttgccctccatttctcttcatatttgtg
tacctgtttgtaactgaatgtggtaatagaaaatag
>gi568815596r:11346480_11565879|GENSCAN_predicted_peptide_5|310_aa
MSQQRPARKLPSLLLDPTEETVRRRCRDPINVEGLLPSKIRINLEDNVQYVSMRKALKVK
RPRFDVSLVYLTRKFMDLVRSAPGGILDLNKVATKLGVRKRRVYDITNVLDGIDLVEKKS
KNHIRWIGSDLSNFGAVPQQKKLQEELSDLSAMEDALDELIKDCAQQLFELTDDKENERY
PLSLYLFKISASKTDYLSSRDSITVHIRSTNGPIDVYLCEVEQGQTSNKRSEGVGTSSSE
STHPEGPEEEDTGQCLETVLIIMTGVLLASRSGGRDAAKPSVQDKPSDGHLVQRANGAEA
KRPKLGLKVA
>gi568815596r:11346480_11565879|GENSCAN_predicted_CDS_5|933_bp
atgagtcagcagcggccggcgaggaagttacccagtctcctcctggacccgacggaggag
acggttcgccgtcggtgccgagaccccatcaacgtggagggcctgctgccatcaaaaata
aggattaatttagaagataatgtacaatatgtgtccatgagaaaagctctaaaagtgaag
agacctcgttttgatgtatcgctggtttatttaactcgaaaatttatggatcttgtcaga
tctgctcccgggggtattcttgacttaaacaaggttgcaacgaaactgggagtccgaaag
cggagagtgtatgacatcaccaatgtcttagatggaatcgacctcgttgaaaagaaatcc
aagaaccatattagatggataggatctgatcttagcaattttggagcagttccccaacaa
aagaagctacaggaggaactttctgacttatcagcaatggaagatgctttggatgagtta
attaaggattgtgctcagcagctgtttgagttaacagatgacaaagaaaatgaaagatat
cctttaagtctatacctttttaaaatttctgcttccaaaacagattacttaagtagtaga
gactctatcacagtgcacataaggagcaccaacggacctatcgatgtctatttgtgtgaa
gtggagcagggtcagaccagtaacaaaaggtctgaaggtgtcgggacctcttcatctgag
agcactcatccagaaggccctgaggaagaggacactgggcaatgtctggagacagttttg
attatcatgactggagtgctgttggcatctaggagtggaggacgggatgctgctaaacct
agtgtacaggataaacccagcgacggtcacctggtccagcgtgccaatggtgctgaggcc
aagagacctaagttaggactgaaggtagcttga
>gi568815596r:11346480_11565879|GENSCAN_predicted_peptide_6|207_aa
MCQAPGASGALSLSHTHTRAHTDTRVHMHARTRALTHAHTCTHAHPYTHTHAHPTHTRAH
TYTRAHTHAQAHGSGSTQVPTRCQGRSRTPSTSFPGPGPGSARVRRGAPSPPGAECPPRA
APHSPAPPLRPLPPGGTAPTSGPLSSGRRGPAPERPPPTEASPAYRLSSRPQRQAPHPDL
APAPATARGARQSPDLRNRGQRGGLDP
>gi568815596r:11346480_11565879|GENSCAN_predicted_CDS_6|624_bp
atgtgccaagcgccaggagcctcgggtgctctctctctctcacacacacacacgcgcgcg
cacaccgatacacgcgtacacatgcacgcacgcacacgcgcactcacacacgcacacacg
tgcacacacgcacacccatacacgcacacgcacgcacaccccacacacacgcgcgcccac
acctatacccgcgcgcacacgcacgcacaagcgcacggcagcggcagcacccaggtccct
acacgctgccagggccgatcccggacaccttccacctccttcccgggaccagggccagga
agtgcccgcgtccggcggggcgcaccttccccgccgggcgccgagtgcccgccccgagca
gccccgcacagccccgcgcccccgctccgcccgctgccgcccggagggaccgctcctacc
tcgggccctttgtcctccggccgccgcggccccgctcctgagcgcccgccgccgacggag
gctagcccagcctacaggctgagctcgcggccgcagcgccaggccccgcacccggacctg
gccccggccccggccacggcgcggggcgctcggcaaagcccggatctccggaaccgcggc
cagcgcggaggactcgacccctga
>gi568815596r:11346480_11565879|GENSCAN_predicted_peptide_7|86_aa
MGTEQEMWRAMKTEHHQRSVAASDSHRSSNPIVNCAYKGSRLHAPYENLIPDDLRWNSFT
LPHTLSVEKLSSMKLDPGAKKDISSC
>gi568815596r:11346480_11565879|GENSCAN_predicted_CDS_7|261_bp
atgggaacagagcaggagatgtggagagccatgaaaactgaacaccaccaacgatcagtg
gcagcatcagattctcataggagctcaaaccctattgtgaactgcgcctacaagggatcc
aggttgcacgctccttatgagaatctaatacctgatgatctgaggtggaatagtttcacc
ctcccccacaccctgtctgtggaaaaattgtcttccatgaaactggaccctggtgccaaa
aaggatatctccagctgctga
>gi568815596r:11346480_11565879|GENSCAN_predicted_peptide_8|157_aa
MICRFLLRVLQQCAGSSTKTSIQATVVTDPAKFNFLTSHVKGEWPGKDVRKDAKQTFATL
GESQKGAQPEAAKACAKPVLLKTHAGRCSLLENEGSGKYSNASLMCPDAEDGYGHKPSPQ
INHLTLAQLHGNYLLAHYQTTLLKDRKFQHLYSTKNT
>gi568815596r:11346480_11565879|GENSCAN_predicted_CDS_8|474_bp
atgatttgtaggttcctgctgcgagtcctgcagcagtgtgcagggtccagcacaaaaacc
agtattcaggcgacagttgtgacagacccagccaagttcaacttcctaacaagccacgta
aaaggggagtggccaggcaaagatgtcaggaaggatgcaaaacaaacctttgccacactg
ggggaaagccagaaaggagcccagcctgaggctgccaaagcgtgtgcaaagcccgttctg
ctgaagacccacgcagggcgctgttctttgttggaaaatgaaggatctgggaaatattcc
aacgccagcctcatgtgtcccgatgcagaggatggctatggccacaagcccagtccccag
ataaaccatttgaccctggctcagctgcatggtaactacctgttggcacattaccaaacc
acgcttctcaaggacaggaaattccagcatctatactcaaccaagaacacatga
>gi568815596r:11346480_11565879|GENSCAN_predicted_peptide_9|65_aa
MNRTAKSLPTEPRTVSYGRVDLQGNGIDSADDPTLQILLDHHPGHPKLQEQMQPGAFGVS
GLKYR
>gi568815596r:11346480_11565879|GENSCAN_predicted_CDS_9|198_bp
atgaatagaacagccaaaagtttgcccacagagcccagaacagtttcttacgggagagtg
gatctgcaagggaatggcattgactctgccgatgatcccacgctgcagatcctactcgac
caccacccaggccaccccaagctccaagagcagatgcagccgggggccttcggggtctca
gggctcaagtacagatag
>gi568815596r:11346480_11565879|GENSCAN_predicted_peptide_10|93_aa
MGNSYAGQLKTTRFEEVLHNSIEASLRSNNLVPRPIFSQLYLEAEQQLAALEGGSRVDNE
EEEEEGEGGLETNGPPNPFQLHPLPEGCCTTDX
>gi568815596r:11346480_11565879|GENSCAN_predicted_CDS_10|279_bp
atgggaaattcttacgctggacagctgaagacgacacgctttgaagaggtcttgcacaat
tccatcgaggcatccctgcggtccaacaacctggtgcccaggcccatcttttcccagctg
tacctggaagctgagcagcagcttgccgctctagaaggtggtagccgagtggacaatgag
gaagaggaagaagagggagaaggagggctggaaacaaatggccccccaaaccctttccag
ctgcaccctctgcctgaaggatgctgtaccacagacgnn