GENSCAN 1.0	Date run: 4-Nov-116	Time: 01:06:11

Sequence gi568815582f:77091486_77298305 : 206820 bp : 41.08% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2147   2271  125  2  2   88  116    92 0.495  11.31
 1.02 Term +  21197  21402  206  0  2   42   42   164 0.246   3.75
 1.03 PlyA +  21815  21820    6                               1.05

 2.07 PlyA -  22208  22203    6                               1.05
 2.06 Term -  41743  41678   66  1  0   71   39   106 0.261   0.96
 2.05 Intr -  50720  50641   80  0  2   71   55   106 0.018   3.95
 2.04 Intr -  56820  56769   52  0  1   94   91    57 0.083   4.16
 2.03 Intr -  58772  58629  144  2  0   49   32   124 0.262   2.46
 2.02 Intr -  61461  61361  101  1  2   54  108    53 0.784   2.81
 2.01 Init -  63560  63509   52  2  1  103  108    22 0.795   7.12
 2.00 Prom -  64486  64447   40                              -6.35

 3.00 Prom +  64842  64881   40                              -3.35
 3.01 Init +  83928  83967   40  2  1   62   81    57 0.244   2.80
 3.02 Intr +  96190  96325  136  2  1   79   53    66 0.165   0.91
 3.03 Intr +  99991 100148  158  1  2   97   80   210 0.418  19.83
 3.04 Intr + 101966 102292  327  0  0   56   93   370 0.742  29.05
 3.05 Intr + 102850 103669  820  2  1  110   96   847 0.999  77.25
 3.06 Intr + 104050 104197  148  1  1   61  106   202 0.999  18.62
 3.07 Intr + 104430 104621  192  2  0   96   20    83 0.543   1.07
 3.08 Term + 106623 106823  201  2  0  103   44   217 0.999  15.21
 3.09 PlyA + 108133 108138    6                               1.05

 4.00 Prom + 114455 114494   40                              -2.85
 4.01 Init + 116566 116650   85  0  1   57  -60   124 0.150  -4.47
 4.02 Intr + 116725 116784   60  2  0  129   92    52 0.828   7.49
 4.03 Intr + 116884 117054  171  2  0   41   95    78 0.713   2.79
 4.04 Intr + 117512 117659  148  0  1   32   63    70 0.203  -2.73
 4.05 Intr + 117932 117986   55  2  1   86   94    63 0.289   4.76
 4.06 Intr + 119728 119791   64  0  1   77   68    68 0.615   1.07
 4.07 Intr + 120072 120418  347  1  2   85  -17   211 0.419   4.49
 4.08 Intr + 120645 120732   88  2  1   88   80   130 0.910  10.82
 4.09 Intr + 120797 120884   88  0  1   89   80   103 0.999   7.71
 4.10 Intr + 121089 121161   73  0  1  125   75    94 0.992  10.29
 4.11 Term + 121990 122190  201  0  0   61   49   179 0.965   7.71
 4.12 PlyA + 122690 122695    6                               1.05

 5.00 Prom + 125776 125815   40                              -6.95
 5.01 Init + 127957 127978   22  0  1   82   95    21 0.023   2.13
 5.02 Intr + 128913 129074  162  1  0   78   20    97 0.009   0.93
 5.03 Intr + 131107 131151   45  2  0   88   92    60 0.179   3.86
 5.04 Intr + 133365 133497  133  1  1   34   97   110 0.132   5.28
 5.05 Intr + 134481 134584  104  0  2  -18   70    94 0.101  -4.10
 5.06 Intr + 136250 136365  116  0  2    0   98   146 0.785   6.05
 5.07 Term + 136620 136799  180  2  0   98   44   132 0.980   6.43
 5.08 PlyA + 137339 137344    6                               1.05

 6.06 PlyA - 137383 137378    6                               1.05
 6.05 Term - 140108 140040   69  2  0   64   39    74 0.600  -2.94
 6.04 Intr - 140236 140150   87  1  0   37   94    82 0.059   2.95
 6.03 Intr - 144913 144685  229  0  1   57   85   117 0.060   5.25
 6.02 Intr - 146335 146251   85  2  1   72   40    73 0.041  -1.14
 6.01 Init - 153622 153484  139  1  1   57  105    96 0.920   8.55
 6.00 Prom - 163334 163295   40                              -4.45

 7.02 PlyA - 164091 164086    6                               1.05
 7.01 Sngl - 165272 164643  630  2  0   43   42   187 0.513   5.53
 7.00 Prom - 167283 167244   40                              -5.35

 8.06 PlyA - 167500 167495    6                               1.05
 8.05 Term - 174294 174065  230  1  2   42   52   108 0.012  -1.59
 8.04 Intr - 197926 197779  148  1  1  122   83    82 0.909   9.99
 8.03 Intr - 199993 199781  213  1  0   85   94    78 0.952   6.09
 8.02 Intr - 201773 201591  183  2  0   38   64   173 0.916   8.96
 8.01 Init - 203638 203438  201  1  0   28   92   105 0.517   3.82

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 48464 48513 50 1 2 72 75 60 0.855 3.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:77091486_77298305|GENSCAN_predicted_peptide_1|110_aa XYKKVESGACVSRKMTRAFILTSIAIGPQIPHGSCAQINAAEEYSGNAAVRFPIQRIKKL AAILSLGLILANQPQCYRNICGKSMQKGNKVQALIPMNSKPTARIILPVM >gi568815582f:77091486_77298305|GENSCAN_predicted_CDS_1|333_bp nnctacaagaaggtagaaagtggggcatgtgtcagcaggaaaatgactcgggcttttata ttgacaagcattgccattgggccgcaaattccccatggcagctgtgctcagatcaatgct gctgaagaatatagcgggaatgccgctgtccggtttccaatccaaagaattaagaaactg gcagctatcctatctcttgggctcattcttgcaaaccaaccacaatgctacagaaacata tgtgggaagtccatgcaaaaaggaaacaaggtccaggcactcatccccatgaattctaag ccaacagcgaggatcatattgccagtcatgtga >gi568815582f:77091486_77298305|GENSCAN_predicted_peptide_2|164_aa MGSGPSIPQHTVVSAVPSPTLEAGDSEGPNYNMRGVLHSLNRRTDIKTIFQGSSIFSKFI VFVKATQAASGAAISIWSPCSLAMPRSLLWDFGALFPLEAPWLLRAEPAEKNEHSTAATA AANLYPNRAARKNPDLLDGVVKASTKCEDDEDKDLYDDPFPLND >gi568815582f:77091486_77298305|GENSCAN_predicted_CDS_2|495_bp atgggctcgggcccctctatcccccaacacacagtcgtatcagcagtcccaagtcctacg ctagaagctggggattcagaggggccaaactataatatgcgaggagtgctacactctctg aacagaagaacagatattaaaacaatattccaaggaagtagcattttcagcaaattcatc gtctttgtcaaggccactcaggcagcgagtggtgcagccatcagcatctggagcccatgc tctctggccatgcccagaagcctcctatgggactttggtgcccttttccccctggaggcc ccatggctgcttcgtgctgagccagcagaaaaaaatgagcacagcacagctgctactgca gcagccaatctgtatcctaacagagcagcaagaaagaacccagatctccttgatggtgtt gtcaaggcatctactaaatgtgaagatgatgaggataaagacctttatgatgatccattt ccgcttaatgactag >gi568815582f:77091486_77298305|GENSCAN_predicted_peptide_3|673_aa MQSDLMDFGDSEEVNDSSFEQVWDQALVKNEHLTMGCQVIAQPEISIMGWVLDTHLAIRD VQMEVGGDTAAPAPGGAEDLEDTQFPSEEAREGGGVHAVPPDPEDEGLEETGSKDKDQPP SPSPPPQSEALSSTSRLWSPAAPENSPTCSPESSSGGQGGDPSDEEWRSQRKHVFVLSEA GKPIYSRYGSVEALSATMGVMTALVSFVQSAGDAIRAIYAEDHKLVFLQQGPLLLVAMSR TSQSAAQLRGELLAVHAQIVSTLTRASVARIFAHKQNYDLRRLLAGSERTLDRLLDSMEQ DPGALLLGAVRCVPLARPLRDALGALLRRCTAPGLALSVLAVGGRLITAAQERNVLAECR 
LDPADLQLLLDWVGAPAFAAGEAWAPVCLPRFNPDGFFYAYVARLDAMPVCLLLLGTQRE AFHAMAACRRLVEDGMHALGAMRALGEAASFSNASSASAPAYSVQAVGAPGLRHFLYKPL DIPDHHRQLPQFTSPELEAPYSREEERQRLSDLYHRLHARLHSTSRPLRLIYHVAEKETL LAWISNQDSVFDFPEAPLTTTAVPADSPVAQSMCLEAPHSSPAHCDSMSPQHSSDYRRGP LLCPPPGVTSKFELYTCLSPLVTKAGAILVVTKLLRWVKKEEDRLFIRYPPKYSTPPATS TDQAAHNGLFTGL >gi568815582f:77091486_77298305|GENSCAN_predicted_CDS_3|2022_bp atgcagagtgatttaatggactttggagactcggaagaagttaatgattcttcttttgaa caggtttgggaccaggccttggtaaagaatgagcacctaactatgggatgtcaggtgatt gcacagcctgagatttcgatcatgggctgggtgttggatactcacctagctataagggat gtgcagatggaggtcggaggagacactgctgccccggcccccgggggcgcggaggacttg gaggacacgcagttccccagtgaggaagctagagaaggtggaggggttcacgcggtcccg ccggatcccgaagacgagggcctggaggaaacaggatccaaggacaaggaccagccaccc agcccatcaccaccgccccagtcagaggccctgtcaagcacctctcggctctggagtcct gcagcccctgagaatagtcccacatgtagccctgagagtagctctggaggccagggcggg gaccccagtgatgaggagtggcgcagccagcggaagcatgtgtttgtgctgagtgaggct ggcaagcccatctactcgcggtatggtagtgtggaggcgctgtcggctaccatgggtgta atgaccgccctggtgtcctttgtgcagagtgcgggagatgccatccgtgccatctacgct gaggaccacaagctggtgttcctacaacagggcccactgttgctcgtggccatgtcacgg acttctcagtcagcagcccagctgcggggggagctgctagctgtgcacgcacagatcgtg agcacacttacacgtgcaagtgtcgcccgcatcttcgcacacaagcagaactatgacctc cgccgcctgctggctggttcagagcgcacactggaccgacttctggacagtatggagcag gacccaggagccctgctcctgggtgccgtgcgctgtgtgccccttgcccgcccgctgcga gacgcactaggtgcgctcctccgacgttgcacagcgcctggcctggcgctgtcagtgctg gcagtaggcggtcgacttataacagcagcccaggagcgaaatgtgctggccgagtgccgg ctggacccagctgacctgcagttgctgctcgactgggtgggtgcaccagcctttgcggcg ggtgaggcttgggcacctgtgtgcctgccccgcttcaaccctgatggttttttctacgcc tacgtggcccgcctggatgctatgcctgtctgcctgctgctgcttggcacccaacgtgaa gccttccatgccatggccgcctgccggcgcctggttgaagatgggatgcatgcccttggt gccatgcgtgcccttggggaggctgccagcttctctaatgcctcatcagccagtgctcct gcctacagcgtgcaggctgtcggggcgccgggcctccggcacttcctgtataagccgctg gacatccctgaccaccaccgccaactgccccagtttaccagccctgagctagaggccccc tacagcagagaggaggagcggcagcggctgtcggacctgtaccaccgcctgcatgctcgt 
ctccacagcacctcccgacccctgcgcctcatttaccacgtggctgagaaggagacacta ctggcctggattagcaaccaggacagtgtctttgactttccagaggccccattaactacc actgctgtgcctgccgattccccagtagcccaatccatgtgtttggaggcaccccattca tcaccagctcattgtgactccatgagtccccagcactcctcagactacaggcgtggtccc ttgctgtgccctccacctggggtgacctccaaattcgagctctatacctgcctcagccct ctggtgaccaaggcaggtgcaatcttggtagtgaccaaactcctgcgctgggtgaagaaa gaggaggaccggctcttcattcgttacccacccaagtactccacaccaccagccacctct acggaccaagctgcccataatggcttgttcactggactctga >gi568815582f:77091486_77298305|GENSCAN_predicted_peptide_4|459_aa MQALQGALAELNTQQALNICRLIDSSRHKGSLEPQIEDLISRINDLQQGLFEDDCAMSYL RENSKARDYTHTVSFSLLGPAKKKSSEELRETHSLWEALHRELDSWLYYTFCVAFPLYEV ALGMAMGCLEAESVIVFSSEWRESAPRGGLGQKARGTEDPPDALPREGKRGSEVGCQRTA GGSDGPAQGPLGIPLANGSPGKAMVRSFLHLANVSGCTGQHLRQGGRVADQRVHVFAVPS WLGTVEAPRKLVARSWNFSGKASKQVTSQRRGEGSEGRSRAGIGAEGRPWGWTVPETVEE PQIGEWRMRRMLEQRLAREIRALERSKEQLLSESEPPAPGRLVRAKLREVERRLHSPPEV EGAMAVNDGLKAELEIFGEQVRSAPEVGAGEGEATSDLLNVSATLEAGIWDPNPKAPSSR LRPLTEHVLTKQAFADCLLGTRHCSKHWILNSAVSDGLT >gi568815582f:77091486_77298305|GENSCAN_predicted_CDS_4|1380_bp atgcaagcccttcaaggggctctggcagagctcaacacacagcaggcgctcaatatatgc cggttaattgatagcagcagacacaagggaagcctggagccacagatagaggacctgatt agccggattaatgatcttcagcaagggttgtttgaagatgactgtgcaatgtcatatttg cgagagaacagcaaggctagagactacactcatacagtgtctttttcccttcttggccca gcaaagaagaaatccagtgaggaactgagagagacccacagtctctgggaggccctgcat agggaattagactcctggctgtactacaccttctgtgtggctttccccttatatgaagtt gcactggggatggcaatggggtgcctggaggcagaaagtgtcattgtgttttccagtgaa tggagagaaagtgcacctagaggaggtcttgggcaaaaagcaagaggcactgaggatcct ccagatgcactgccaagagaaggaaagcgaggctcagaggttggatgtcagaggacagct ggaggatctgatgggccagcacaaggacctctgggaattccactggcaaatggcagccca gggaaagccatggtcaggtcattccttcatttagctaatgtcagtggttgcaccggtcag catttgaggcagggagggagggtagcagatcaaagagtccacgtctttgctgtcccaagc tggttggggacggtggaggcaccaagaaagttagtggctagatcgtggaacttcagtggg aaggcctctaagcaagtgacatctcaacgcagaggtgaaggaagtgagggaagaagccgt gcaggtatcggtgcagaaggaaggccctggggctggacggtgcctgaaaccgttgaggag 
ccgcagattggggaatggaggatgaggagaatgctggagcagcgactggcccgggagatc cgtgccctggagagaagcaaggagcagctgctctcggagagtgagcctcccgcgccaggg aggctggtgcgcgccaagctgcgggaggtggagcggcggctgcactcgccgcctgaggtc gagggcgccatggcggtgaatgacgggctgaaggcggagctggagatattcggggagcag gtccggagcgcccccgaggtcggggccggcgagggagaggcaacctcagatctgttaaat gtgtccgccactctggaggccggcatttgggacccaaatccaaaggcacccagcagcagg ctgcggccattgactgagcatgtactgactaagcaggcatttgccgattgcctgctgggt accaggcactgttctaagcactggattttaaacagcgcggtcagcgacggcctcacctag >gi568815582f:77091486_77298305|GENSCAN_predicted_peptide_5|253_aa MQAHPNADTIQYKKCPDDQYQNSQRIYCLPKDVNFLAFEGPLGMSLACTALPLCPYCSGP WDGPSIDEVMLRNTGVVKTSSPHVAVCNIAHLFLIPLNPNPTYPTAADCDKTQWPTPESC SFGTPVYLLSPTDSQTALHATINSGATKCPESLSEVESAATLDGASSITRGNPTGKAAGN ERARASMDDGWDVKLCIVVASSWVVTQIEATKVYFMNGYFLRSLAKVTSSQCGRESLKSL LPNPMPSQAPLDT >gi568815582f:77091486_77298305|GENSCAN_predicted_CDS_5|762_bp atgcaggcacatcctaatgcagatactattcagtataagaagtgtccagatgatcaatat cagaatagtcagagaatttactgtcttccaaaggatgtgaacttcttggcttttgaaggc cctttggggatgtctctggcctgcacagccctgcctctctgtccttactgcagtggtcct tgggatgggccctcaatagatgaggtgatgctccgcaacaccggtgtggttaaaacttcc tctccccatgtggctgtttgcaatatagcccacttgttcctcatcccactgaacccaaac ccaacatatcccacagctgctgactgtgataaaacccagtggccaacaccagagtcgtgt agctttggaacccctgtgtatctgctgtcacccactgattctcaaactgcactacatgct acaatcaatagcggtgctacaaagtgcccggaatccctttcagaagtagaatcagcagcc acgttggatggggcaagctccataaccagaggcaatccaacagggaaagcagcaggtaac gagcgtgccagagccagtatggatgatggttgggatgttaagctctgcattgttgttgct tcctcatgggttgtaacccagatagaagccacaaaagtatatttcatgaatgggtacttt ttacggtctctggctaaagtcaccagcagccagtgtgggagagagagcttgaagtccctg ctaccaaacccaatgccaagtcaagcccccctggatacctga >gi568815582f:77091486_77298305|GENSCAN_predicted_peptide_6|202_aa MSRKTTWDENTWFECGRRSVSSSIKIIGCHQKQEHPLPILKDTGEQAHVDLPDSFVLLII KALQPAVICVKKTPGAVTPTWTVTGRSWLVGAHVGRGAETGGSAKKASEPARLSAQPRSR SALWCALNILHGVTRRLGPLGASLSRRCHLLTYNDTMGKDHTRMTDAYAITLSRGPATKH MPASELSKITFDDMTVGNDSTM >gi568815582f:77091486_77298305|GENSCAN_predicted_CDS_6|609_bp 
atgtcaagaaaaaccacctgggatgaaaacacttggtttgaatgtggaagaaggagtgta agttcatcaatcaaaataataggttgtcaccagaaacaggagcaccctttgccaatcctc aaggacactggagaacaagctcatgttgatttaccagatagctttgttttgctgattata aaggccttgcagccagcagtcatctgtgtgaaaaagactcctggagcagtcacacctacg tggacagtaacagggcgctcttggttagtaggtgcacatgtgggtcggggagcagagaca ggaggctccgcaaagaaagcctcggagccagcccggctcagtgcccagccgcgaagccgt agcgccctctggtgtgccctgaacatattgcatggcgtcacaaggcggttgggaccgctt ggagcatctctgagccgacgctgccatctcctgacctataatgacacaatgggaaaagac catacccgaatgactgatgcttatgcaataacattaagcagagggcctgccactaagcac atgcctgcaagtgaattgagcaaaattacatttgacgacatgactgttgggaatgatagc accatgtag >gi568815582f:77091486_77298305|GENSCAN_predicted_peptide_7|209_aa MNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVINHINRTNNKNHM IISIDAEKAFDKIQQPFMLKTLNKLGIVGTYLKIIRAIYDKPTANIILNGQKLEAFPLKT GTRQGCPLSPLLFNIVLEVLGRAIRQEKEIKDIQLGKEEVKLSLFADDMIVYIENPIISA QNLLKVISNFGSLRIQNQCVKITSIPIHQ >gi568815582f:77091486_77298305|GENSCAN_predicted_CDS_7|630_bp atgaacattgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaaacttatccaccatgatcaagttggcttcatccctgggatgcaaggctggttcaac atatgcaaatcaataaatgtaatcaatcacataaacagaaccaacaacaaaaaccacatg attatctcaatagatgcagaaaaggcctttgacaaaattcaacagcctttcatgctaaaa actctcaataagctaggtattgttggaacctatctcaaaataataagagctatttatgac aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaacc ggcacaagacagggatgtcctctctcaccactcctattcaacatagtgttggaagttctg ggcagggcaatcaggcaagagaaagaaataaaggatattcaattaggaaaagaggaagtc aaattgtccctgtttgcagatgacatgattgtatatatagaaaaccccatcatctcagcc caaaatctccttaaggtgataagcaacttcggcagtctgaggatacaaaatcaatgtgta aaaatcacaagcattcctatacaccaataa >gi568815582f:77091486_77298305|GENSCAN_predicted_peptide_8|324_aa MPGEWSTCSKACAGGQQSRKIQCVQKKPFQKEEAVLHSLCPVSTPTQVQACNSHACPPQW SLGPWSQCSKTCGRGVRKRELLCKGSAAETLPESQCTSLPRPELQEGCVLGRCPKNSRLQ WVASSWSECSATCGLGVRKREMKCSEKGFQGKLITFPERRCRNIKKPNLDLEETCNRRAC PAHPVYNMVAGWYSLPWQQCTVTCGGGVQTRSVHCVQQGRPSSSCLLHQKPPVLRACNTN FCPAPEKRVLEILAMRQEKEIRGIKIGIEKVKLSLFADDMILYVENLIVSAQKLKLISNF 
IKSKDTKSVSKNHKHSYIPTTDKQ >gi568815582f:77091486_77298305|GENSCAN_predicted_CDS_8|975_bp atgccaggtgaatggagtacatgcagcaaggcctgtgctggaggccagcagagccgaaag atccagtgtgtgcaaaagaagcccttccaaaaggaggaagcagtgttgcattctctctgt ccagtgagcacacccactcaggtccaagcctgcaacagccatgcctgccctccacaatgg agccttggaccctggtctcagtgttccaagacctgtggacgaggggtgaggaagcgtgaa ctcctctgcaagggctctgccgcagaaaccctccccgagagccagtgtaccagtctcccc agacctgagctgcaggagggctgtgtgcttggacgatgccccaagaacagccggctacag tgggtcgcttcttcgtggagcgagtgttctgcaacctgtggtttgggtgtgaggaagagg gagatgaagtgcagcgagaagggcttccagggaaagctgataactttcccagagcgaaga tgccgtaatattaagaaaccaaatctggacttggaagagacctgcaaccgacgggcttgc ccagcccatccagtgtacaacatggtagctggatggtattcattgccgtggcagcagtgc acagtcacctgtgggggaggggtccagacccggtcagtccactgtgttcagcaaggccgg ccttcctcaagttgtctgctccatcagaaacctccggtgctacgagcctgtaatacaaac ttctgtccagctcctgaaaagagagtattagaaattctggccatgagacaagagaaagaa ataaggggtattaaaataggaatagagaaagtcaaattgtctttgtttgcagatgacatg atcctttatgtagaaaaccttatcgtctcagcccaaaagcttaagctgataagcaacttc atcaaatctaaggatacaaaatcagtgtccaaaaatcacaagcattcctatataccaaca acagacaagcagtga