GENSCAN 1.0 Date run: 6-Nov-116 Time: 05:50:26 Sequence gi568815591f:43483244_43724839 : 241596 bp : 40.87% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 949 1067 119 0 2 54 72 104 0.216 3.74 1.02 Intr + 7078 7162 85 1 1 69 84 36 0.134 0.20 1.03 Intr + 8832 8937 106 2 1 53 71 27 0.118 -3.63 1.04 Intr + 9841 9937 97 2 1 114 115 88 0.188 12.25 1.05 Intr + 17456 17539 84 2 0 68 75 87 0.855 3.42 1.06 Intr + 17970 18079 110 0 2 81 92 95 0.914 8.21 1.07 Intr + 21528 21656 129 1 0 95 22 65 0.417 0.35 1.08 Intr + 23894 24014 121 0 1 117 78 80 0.835 8.53 1.09 Intr + 24775 24888 114 1 0 124 60 170 0.998 16.44 1.10 Intr + 25726 25878 153 1 0 99 91 154 0.888 15.07 1.11 Intr + 29991 30184 194 0 2 37 26 119 0.084 -1.19 1.12 Term + 30775 30887 113 2 2 86 40 93 0.177 2.04 1.13 PlyA + 33173 33178 6 1.05 2.00 Prom + 34949 34988 40 -2.45 2.01 Init + 37988 38026 39 1 0 67 70 49 0.088 1.24 2.02 Intr + 40117 40325 209 0 2 61 34 106 0.068 -0.55 2.03 Intr + 48599 48754 156 2 0 113 69 99 0.111 8.70 2.04 Intr + 57920 58018 99 2 0 109 93 108 0.930 11.71 2.05 Intr + 58626 58755 130 0 1 49 72 150 0.915 9.28 2.06 Intr + 67202 67348 147 1 0 45 78 175 0.738 11.71 2.07 Intr + 68979 69093 115 2 1 58 105 218 0.999 19.60 2.08 Intr + 71349 71547 199 1 1 108 92 156 0.857 15.49 2.09 Term + 74803 74989 187 1 1 42 42 118 0.043 -1.42 2.10 PlyA + 75011 75016 6 1.05 3.04 PlyA - 75647 75642 6 -3.64 3.03 Term - 76529 76360 170 0 2 43 33 205 0.034 7.56 3.02 Intr - 86416 86325 92 0 2 125 55 54 0.037 4.52 3.01 Init - 94708 94269 440 1 2 54 33 259 0.016 12.63 3.00 Prom - 96550 96511 40 -7.15 4.00 Prom + 97008 97047 40 -6.25 4.01 Init + 100001 100206 206 1 2 79 94 133 0.012 9.46 4.02 Intr + 100413 100514 102 0 0 18 86 145 0.012 5.67 4.03 Intr + 112658 112870 213 2 0 95 92 139 0.372 11.91 4.04 Intr + 125013 125157 145 0 1 68 116 149 0.991 15.06 4.05 Intr + 136354 136480 127 0 1 53 106 146 0.851 12.23 4.06 Intr + 140519 140645 127 0 1 52 80 81 0.509 2.52 4.07 Term + 141275 141599 325 2 1 65 53 381 0.996 25.55 4.08 PlyA + 143524 143529 6 1.05 5.03 PlyA - 144387 144382 6 1.05 5.02 Term - 152354 152105 250 1 1 45 39 198 0.774 4.79 5.01 Init - 153854 153733 122 2 2 93 103 180 0.750 19.71 5.00 Prom - 157008 156969 40 -7.65 6.00 Prom + 157495 157534 40 -6.35 6.01 Init + 159874 160241 368 0 2 87 44 199 0.636 12.24 6.02 Term + 160289 160385 97 2 1 110 42 80 0.602 2.06 6.03 PlyA + 161207 161212 6 1.05 7.06 PlyA - 161292 161287 6 -0.45 7.05 Term - 162156 161966 191 1 2 50 49 208 0.812 9.73 7.04 Intr - 164391 164292 100 0 1 58 106 72 0.158 4.76 7.03 Intr - 165409 165357 53 2 2 92 90 25 0.035 0.81 7.02 Intr - 202454 202340 115 2 1 83 92 27 0.153 1.70 7.01 Init - 212199 212080 120 0 0 53 83 118 0.705 8.04 7.00 Prom - 218638 218599 40 -3.45 8.04 PlyA - 219178 219173 6 1.05 8.03 Term - 222038 221916 123 2 0 79 37 131 0.000 4.50 8.02 Intr - 232884 232532 353 1 2 37 72 272 0.386 14.62 8.01 Init - 234720 234585 136 0 1 83 25 136 0.591 7.15 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:43483244_43724839|GENSCAN_predicted_peptide_1|474_aa LCDSQSLWGLIPRLQDRITKRPINYIELRGAGKAGRKDGRMELSPPFLVLVLSEDHPILT PFFDQLNSASEVSRNRGASLLARPGHSLVAAIRSQHQHESLPLAYNDKIVAFLRQPNIFE MLQERQPSLARNHTLREKIHYIRTEGNHGLEKLSCDADLVILLSLFEEEIMSYVPLQAAF HPGYSFSPRCSPCSSPQNSPANDVLFYLTEKLKPSSRNFFKGLPTRLANLYTSSLSSLEV TKGGLQRASARAPSPYRRDFEAKLRNFYRKLEAKGFGQGPGKIKLIIRRDHLLEGTFNQV MAYSRKELQRNKLYVTFVGEEGLDYSGPSREFFFLLSQELFNPYYGLFEYSANDTYTVQI SPMSAFVENHLECPCKVSDKCTCPLRSLHSGTLLSTTRECCLLLKEGLLLLLTLHVVLQR PLPGKALPKDINGETPDIPMAKPNHEPEDRGSPDTATHSGQSLSHRAGQKKMGN >gi568815591f:43483244_43724839|GENSCAN_predicted_CDS_1|1425_bp ctgtgtgattctcagagtctgtggggcttgattccacgtcttcaggaccgcataactaag agacccattaactacatagagctaagaggagcaggaaaggcaggacggaaggatggcaga atggagctctctcctccctttctggttctggttctgagtgaggaccaccccattctcaca ccattctttgatcaattaaactctgcctcagaagtttctagaaacagaggagcctcttta ctggccaggccaggacacagcttagtagctgctattcgaagccaacatcaacatgagtca ttgccactggcatataatgacaagattgtggcatttcttcgccagccaaacatttttgaa atgctgcaagagcgtcagccaagcttagcaagaaaccacacactcagggagaaaatccat tacattcggactgagggtaatcacgggcttgagaagttgtcctgtgatgcggatctggtc attttgctgagtctctttgaagaagagattatgtcctacgtccccctgcaggctgccttc caccctgggtatagcttctctccccgatgttcaccctgttcttcacctcagaactcccca gcaaatgacgtcctgttttacctcactgagaagttgaagccatccagcagaaacttcttc aaaggcctgcccacaaggctagcaaacctgtacacatcttctctgtcatctctggaggtc acgaaaggaggtttacagagagccagtgcaagagccccttccccctaccgaagagacttt gaggccaagctccgcaatttctacagaaaactggaagccaaaggatttggtcagggtccg gggaaaattaagctcattattcgccgggatcatttgttggagggaaccttcaatcaggtg atggcctattcgcggaaagagctccagcgaaacaagctctacgtcacctttgttggagag gagggcctggactacagtggcccctcgcgggagttcttcttccttctgtctcaggagctc ttcaacccttactatggactctttgagtactcggcaaatgatacttacacggtgcagatc agccccatgtccgcatttgtagaaaaccatcttgagtgcccttgcaaagtcagtgataaa tgcacatgtcccttaagatctttacactctgggactctgctctccacgacgcgtgaatgc tgcttgttactgaaggaaggcctcctccttcttctcacactgcacgtggtcctgcagcgg ccacttccaggcaaggcactgcccaaagacatcaatggagaaacccctgacatcccaatg gccaaacccaaccatgaaccagaggacagaggctctccagatactgcaacccacagtggc cagtccctaagtcacagagcaggacagaaaaagatgggaaattag >gi568815591f:43483244_43724839|GENSCAN_predicted_peptide_2|426_aa MNGSKALIKGLERELGFVGPLEGCKAFEETMSEEREILNYKREQKVDGRRAQRRQGKTID SPSEDISFERESAPAGELLEPGRSPNAASINIQPWVRLSICTNFLHDLVQMPGFKCHLHP DGPTVTSVVRPCARRFRFSGRILGLALIHQYLLDAFFTRPFYKALLRLPCDLSDLEYLDE EFHQSLQWMKDNNITDILDLTFTVNEEVFGQVTERELKSGGANTQVTEKNKKEYIERMVK WRVERGVVQQTEALVRGFYEVVDSRLVSVFDARELELVIAGTAEIDLNDWRNNTEYRGGY HDGHLVIRWFWAAVERFNNEQRLRLLQFVTGTSSVPYEGFAALRGSNGLRRFCIEKWGKI TSLPSVWGSEQFPPMRTGDGTLGPGGSIWKELKEVAKFLYFEQELKGGQRKGSEGKRRAE ARSESK >gi568815591f:43483244_43724839|GENSCAN_predicted_CDS_2|1281_bp atgaatgggagtaaggcccttataaaaggccttgagagagagcttggctttgtggggccg ctagaaggatgcaaggcctttgaggagacaatgtcagaagagagagaaatcttgaattat aagagagagcaaaaagtggatggaagaagagcccagaggaggcagggaaagacaatagat agcccaagtgaagacattagctttgaaagagagtcagccccagcaggagaattgcttgaa cccgggagatccccgaatgctgcatcaattaacattcagccctgggtccgcctctccatc tgcacaaacttcctccatgatcttgtccagatgcctggcttcaaatgccatctgcatcct gacggccccacagtgacatctgtagtgcggccctgtgcccggaggttcaggtttagcggt cgcatcctgggtctggctctgatccatcagtaccttcttgacgctttcttcacgaggccc ttctacaaggcactcctgagactgccctgtgatttgagtgacctggaatatttggatgag gaattccaccagagtttgcagtggatgaaggacaacaacatcacagacatcttagacctc actttcactgttaatgaagaggtttttggacaggtcacggaaagggagttgaagtctgga ggagccaacacacaggtgacggagaaaaacaagaaggagtacatcgagcgcatggtgaag tggcgggtggagcgcggcgtggtacagcagaccgaggcgctggtgcgcggcttctacgag gttgtagactcgaggctggtgtccgtgtttgatgccagggagctggagctggtgatagct ggcaccgcggaaatcgacctaaatgactggcggaataacactgagtaccggggaggttac cacgatgggcatcttgtgatccgctggttctgggctgcggtggagcgcttcaataatgag cagaggctgagattactgcagtttgtcacgggaacatccagcgtgccctacgaaggcttc gcagccctccgtgggagcaatgggcttcggcgcttctgcatagagaaatgggggaaaatt acttctctccccagcgtgtggggcagtgagcaatttccacccatgagaactggagatggg acactaggacccggaggcagcatctggaaagagcttaaggaggtggcaaagtttctttac tttgaacaagagctcaaaggaggacagaggaaaggctcagaaggaaagagaagggcagaa gctaggtcagagtcgaagtag >gi568815591f:43483244_43724839|GENSCAN_predicted_peptide_3|233_aa MLVIDEQRKWFPEMESTPVEDAVNTVEMTTNDLEYYINLVDKMVAILIPILKEVLLIEAR GRQRPRQIGKGPRRISNSPHKCLHQMFCADKGTCTGDLPEHARSELEAHKHWGNGVEPPG IRVLCRGGAWLRSLDKEQKNPASLLCGPALMVIAQQQEREVKLSRASTLHASACVMLGVV LTAPSACLPHSRAPNQALPGGVGQWTARNQGLEDFYADPGLVTYQLYDTGKVN >gi568815591f:43483244_43724839|GENSCAN_predicted_CDS_3|702_bp atgcttgttatagatgagcaaagaaagtggtttcctgagatggaatctactcctgttgaa gatgccgtaaacactgttgaaatgacaacaaacgatttagaatattacataaacttagtt gataaaatggtggcaattttgattccaattttgaaggaagtcctactgatagaggcaaga ggcagacaaaggcctaggcagatagggaagggtccccggagaatctccaactcgccccac aagtgtttacatcagatgttttgtgcagataagggaacctgcacaggggacttgcctgag catgcccgcagtgaactggaggcccacaagcactgggggaatggggtggagccaccagga attcgtgtcttatgcaggggaggagcctggctaaggagcctagataaggagcaaaaaaat cctgcatcactactgtgtgggcctgctctcatggtgatagcacagcaacaagagagagag gtcaagctcagccgtgcaagtactttacatgcctctgcttgtgttatgctgggtgtggtt ctgacagcaccgagtgcctgcttacctcattcccgtgcaccaaaccaagcattgcccggt ggtgtggggcagtggacagcaagaaaccagggcttggaagacttctatgctgaccccggt ctagtcacttaccagctttatgacactggaaaagtcaattaa >gi568815591f:43483244_43724839|GENSCAN_predicted_peptide_4|414_aa MIPLEKPGSGGSSPGATSGSGRAGRGLSGPCRPPPPPQARGLLTEIRAVVRTEPFQDGYS LCPGRELGSKRVVVTRWRSRRRGHFPQPFRSLWRFEWLFNERRGKFAVVRKCIKKDSGKE FAAKFMRKRRKGQDCRMEIIHEIAVLELAQDNPWVINLHEVYETASEMILVLEYAAGGEI FDQCVADREEAFKEKDVQRLMRQILEGVHFLHTRDVVHLDLKPQNILLTSESPLGDIKIV DFGLSRILKNSEELREIMGTPEYVGNDKQETFLNISQMNLSYSEEEFDVLSESAVDFIRT LLVKKPEDRATAEECLKHPWLTQSSIQEPSFRMEKALEEANALQEGHSVPEINSDTDKSE TKESIVTEELIVVTSYTLGQCRQSEKEKMEQKAISKRFKFEEPLLQEIPGEFIY >gi568815591f:43483244_43724839|GENSCAN_predicted_CDS_4|1245_bp atgatccctttggagaagccaggcagcggcggctcctccccaggcgccacctcaggctcg ggccgggcaggccggggtctgagcgggccgtgccggccgccgccgccgccccaggcccgc gggctgctgacagagatacgcgccgtggtgcgcaccgagcccttccaggacggctacagc ctgtgcccgggccgggagctgggcagcaagcgagtggtggtaactcgatggcggtcccgg aggcgcggacactttcctcagccctttcgttctctgtggcgtttcgaatggctatttaac gagagaagggggaaatttgcagtggtgagaaaatgtataaagaaagattctgggaaagaa tttgctgcaaagttcatgagaaaaagaagaaaaggccaagattgtcggatggaaataatt catgagattgctgtacttgaactagcacaagacaatccttgggtcattaatttacatgaa gtttatgagactgcatcagaaatgatcttagttctggaatatgctgctgggggtgaaatc tttgaccagtgtgttgcagacagagaagaagcctttaaagaaaaagatgttcaaagactt atgcgacagattttagaaggtgttcactttttacacactcgtgatgtagttcatcttgat ttgaagcctcagaatattctgttgacaagtgaatctccattgggtgacattaagattgtt gattttggcctttcaagaatattgaagaacagtgaagagctccgagaaattatgggtacc cctgaatatgtgggcaatgataaacaagaaacattcttaaacatctcacagatgaattta agttattctgaggaagaatttgatgttttgtctgagtcggctgttgatttcatcaggaca cttttagttaagaaacctgaagatcgagccactgctgaagaatgtctaaagcacccctgg ttgacacagagcagtattcaagagccttctttcaggatggaaaaggcactagaagaagca aatgccctccaagaaggtcattctgtgcctgaaattaattcggataccgacaaatcagaa accaaggaatccattgtaaccgaagagttaattgtagttacttcatatactctaggacaa tgcagacagtctgaaaaagagaaaatggagcaaaaggccatttccaaacgatttaaattt gaggaacctttgctacaagaaattccaggagaatttatctactga >gi568815591f:43483244_43724839|GENSCAN_predicted_peptide_5|123_aa MMESTTCKWKAQHEERPQSGQMGAVLQKSEAQFGQRVDERRCDCVACEWKWHAGLLSKSF KCQPLVHAVHFAFVPKIGNVPDGGYFKGLGARTADPQWDTEHKGDINPYKPLRWGAICYP SLT >gi568815591f:43483244_43724839|GENSCAN_predicted_CDS_5|372_bp atgatggagagcacaacatgcaagtggaaagcacaacatgaagaaaggccccagagcggg cagatgggagctgtgctccagaaatcagaagcccagtttggacagcgtgtagatgagcgc aggtgtgactgtgtggcttgtgagtggaagtggcatgcaggacttctgagtaaaagtttt aagtgccagcccttggttcatgctgtccactttgccttcgtccctaagattggtaatgtt ccagatggtggctacttcaagggcttgggggctcggacagctgacccacagtgggacacg gagcataagggagacataaacccttacaagcccctaagatggggggctatttgttacccc agcttaacctag >gi568815591f:43483244_43724839|GENSCAN_predicted_peptide_6|154_aa MAKAPGMWGSPGRRPLSTTGAKVKAGQDKAPTKTTMALPYSHWGSWSTRSWMLHQPAPAK APLAHNSSVLQLGSVLRETQQPDLPSGFSKFSGLLGSSQVTKGKEATDRLRHTMGAIRWP GTEMSSLTQCRCRPSTHVHAGHWTITSDPPAPDT >gi568815591f:43483244_43724839|GENSCAN_predicted_CDS_6|465_bp atggccaaggccccagggatgtggggaagcccaggaaggcgccctctcagcaccactggg gccaaagtgaaagctggacaggacaaagctcccaccaaaaccaccatggccttaccctac tctcactggggttcctggagcaccagatcatggatgctgcatcaaccagccccagcaaag gcgcctttggcacacaattcctcagtgctacagctgggctctgtgctgagagaaacgcaa cagccagacctacccagtggcttttccaagttctcaggattgctggggtctagccaagtg accaaaggcaaggaagcaacagacaggctccgccacaccatgggagcaatccggtggcct ggcacagagatgtcctctctcacgcagtgtcgctgccgtccttccactcacgtccatgct ggacactggaccatcacctctgacccacccgctcctgatacatga >gi568815591f:43483244_43724839|GENSCAN_predicted_peptide_7|192_aa MWITGGRRFLAERIPNKTLMSDMLIELQGDQCDWIRGSKEDSDSDRFNTGEGLHSLILQL SQFYSSLSKSFEISRRISRQLAFDDFQESCAMMWQKYAGSRRSMPLGARILFHGVFYAGG FAIVYYLIQKFHSRALYYKLAVEQLQSHPEAQEALGPPLNIHYLKLIDRENFVDIVDAKV MCFPCEPQAGAC >gi568815591f:43483244_43724839|GENSCAN_predicted_CDS_7|579_bp atgtggataactggaggaagaagattcctggcagagagaataccaaataagaccctcatg tctgacatgctcatagaactgcaaggagatcagtgtgactggattagagggagtaaggag gactctgattctgacagatttaacactggagagggtcttcatagcctgattctccagctc tctcaattctactctagtctttccaaatcatttgaaatcagcagaagaattagcagacaa cttgcctttgatgattttcaagagagttgtgctatgatgtggcaaaagtatgcaggaagc aggcggtcaatgcctctgggagcaaggatccttttccacggtgtgttctatgccgggggc tttgccattgtgtattacctcattcaaaagtttcattccagggctttatattacaagttg gcagtggagcagctgcagagccatcccgaggcacaggaagctctgggccctcctctcaac atccattatctcaagctcatcgacagggaaaacttcgtggacattgttgatgccaaggta atgtgctttccctgcgaaccgcaggctggtgcctgctga >gi568815591f:43483244_43724839|GENSCAN_predicted_peptide_8|203_aa MVEGKEEQVLSYIDGSRQRENEKDAKWKPLIKPSDLVRLIHYHKKIRFHAVDKDIPETGQ FTKERGLIGLTVPCGWGSLTIMAEGKEEQVLSYMDGSRQRENEEDAKAETPDKTIRSLEA IHYRKTAPMIRLSPTGSLPQHVGIMGVRFRMRFEWGHRAKSYQACMCVGAYCCPTRVLLL AAAIGVLLPVDREHLSSFRAVGT >gi568815591f:43483244_43724839|GENSCAN_predicted_CDS_8|612_bp atggtggaaggcaaagaggagcaagtcctgtcttacatagatggcagcaggcaaagagag aatgagaaagatgcaaagtggaaacccttgataaaaccatcagatcttgtgagacttatt cactaccacaagaaaatccgttttcatgctgttgataaagacatacctgagactgggcaa tttacaaaagaaagaggtttaattggacttacagttccatgtggctggggaagcctcaca atcatggcagaaggcaaggaggagcaagtcctatcttacatggatggcagcaggcaaaga gagaatgaggaggatgcaaaagcggaaacccctgataaaaccatcagatctcttgaggct attcactatcggaaaactgcccccatgattcgattatctcccactgggtccctcccacaa cacgtgggaattatgggagtacgattcaggatgagatttgagtggggacacagagccaag tcatatcaggcatgcatgtgcgtgggagcctattgctgccccaccagagttcttttgctg gcagctgccatcggagtgttgttgccagtggaccgggaacatctcagctccttcagagca gtaggtacttaa