GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:05:51 Sequence gi568815590r:96131058_96333228 : 202171 bp : 42.67% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 844 839 6 1.05 1.10 Term - 10576 10406 171 1 0 101 48 125 0.920 6.64 1.09 Intr - 14467 13533 935 2 2 112 -4 838 0.473 66.98 1.08 Intr - 16177 16068 110 0 2 92 40 147 0.961 9.31 1.07 Intr - 21072 20972 101 0 2 59 55 131 0.993 4.89 1.06 Intr - 21438 21163 276 0 0 95 78 157 0.978 12.19 1.05 Intr - 22144 22101 44 2 2 62 84 32 0.590 -2.76 1.04 Intr - 22533 22444 90 1 0 91 36 115 0.590 5.65 1.03 Intr - 27273 26968 306 1 0 56 50 216 0.958 10.10 1.02 Intr - 28462 28169 294 2 0 46 4 217 0.891 5.16 1.01 Init - 29635 29230 406 1 1 102 97 254 0.981 22.50 1.00 Prom - 42056 42017 40 -5.15 2.00 Prom + 43199 43238 40 -7.55 2.01 Init + 46459 46592 134 0 2 85 100 54 0.536 5.96 2.02 Term + 48343 48511 169 1 1 44 38 168 0.932 3.77 2.03 PlyA + 49000 49005 6 1.05 3.00 Prom + 51036 51075 40 -7.35 3.01 Init + 53168 53386 219 1 0 93 28 174 0.419 10.58 3.02 Intr + 53988 54145 158 2 2 19 81 130 0.246 3.29 3.03 Term + 57086 57254 169 2 1 51 35 135 0.249 0.87 3.04 PlyA + 57661 57666 6 1.05 4.02 PlyA - 57722 57717 6 1.05 4.01 Sngl - 60539 60198 342 2 0 60 43 193 0.805 7.78 4.00 Prom - 60821 60782 40 -5.95 5.04 PlyA - 62170 62165 6 1.05 5.03 Term - 100883 100678 206 0 2 62 49 235 0.581 13.55 5.02 Intr - 102164 102099 66 0 0 8 94 99 0.453 0.46 5.01 Init - 104473 104455 19 1 1 91 131 4 0.774 5.49 5.00 Prom - 108372 108333 40 -8.55 6.05 PlyA - 108385 108380 6 1.05 6.04 Term - 108628 108434 195 1 0 133 33 100 0.777 5.43 6.03 Intr - 113023 112862 162 1 0 72 95 42 0.596 2.55 6.02 Intr - 114874 114803 72 1 0 48 97 81 0.672 3.68 6.01 Init - 115348 115250 99 1 0 62 98 59 0.718 4.61 6.00 Prom - 116238 116199 40 -5.35 7.00 Prom + 119202 119241 40 -6.35 7.01 Init + 119515 119669 155 0 2 82 15 265 0.865 18.11 7.02 Term + 122367 122493 127 0 1 143 48 42 0.834 2.47 7.03 PlyA + 122787 122792 6 -0.45 8.08 PlyA - 123628 123623 6 1.05 8.07 Term - 124182 124076 107 1 2 62 39 61 0.378 -3.81 8.06 Intr - 126057 125905 153 1 0 60 97 186 0.953 15.82 8.05 Intr - 127468 127300 169 1 1 74 86 13 0.753 -1.70 8.04 Intr - 130754 130444 311 0 2 58 103 126 0.026 6.21 8.03 Intr - 131200 131109 92 0 2 133 68 65 0.030 7.72 8.02 Intr - 135280 135199 82 2 1 80 97 88 0.307 6.68 8.01 Init - 144815 144758 58 2 1 67 70 110 0.462 8.62 8.00 Prom - 146121 146082 40 -8.15 9.00 Prom + 150188 150227 40 -6.65 9.01 Init + 154129 154348 220 0 1 65 65 91 0.427 3.34 9.02 Intr + 155965 156089 125 2 2 110 80 107 0.516 11.48 9.03 Intr + 162408 162533 126 2 0 94 17 66 0.114 0.06 9.04 Intr + 164041 164199 159 0 0 71 94 60 0.259 4.16 9.05 Intr + 178500 178565 66 2 0 92 92 37 0.470 2.68 9.06 Term + 185185 185289 105 0 0 108 42 116 0.875 6.43 9.07 PlyA + 185613 185618 6 1.05 10.04 PlyA - 189008 189003 6 1.05 10.03 Term - 194618 194442 177 2 0 -6 37 195 0.229 1.70 10.02 Intr - 197060 196896 165 2 0 3 48 164 0.323 3.14 10.01 Init - 199187 198996 192 2 0 74 17 134 0.439 3.91 10.00 Prom - 199819 199780 40 -1.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 130608 130444 165 0 0 56 103 181 0.802 14.08 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:96131058_96333228|GENSCAN_predicted_peptide_1|910_aa MDTPRVLLSAVFLISFLWDLPGFQQASISSSSSSAELGSTKGMRSRKEGKMQRAPRDSDA GREGQEPQPRPQDEPRAQQPRAQEPPGRGPRVVPHEYMLSIYRTYSIAEKLGINASFFQS SKSANTITSFVDRGLARRRSSLYAIPEGWWGSGKSLVPSVGTPACVGVCGNPCGPFLPPA LICSVASALKKTSHHLWSQKSGKDPQASETKLLKERDNNVEGDRKGREKKERKTSPPPSP PLRCMPPILEIEEGLHVHPVWGGLCSRAKASRLAGPRSTSTATTPECGGGLGMRAVHSGR RRCMTTRTASSKALSPPRLRVLRKCEERQFVDWEGAFVEDLVAPLGSRVTYRSTVQGRPP LANPPETKGVSLKRLGAPDRTGPVVRSCPHKRTPKWGKKRILRPLRAALPARESFGLAQG LSLALTPAEEETCKKGWILDVAGVEVICSPGTVGGTAVVTGPHRTEKNRDDEHLSSTHFW NLVERSGPQGLKSRELDFKGSQEKDRLGHVRKSEPASATVSPEYKLGVAEAPEPVPPGYI WTDDLSHTPLRRQKYLFDVSMLSDKEELVGAELRLFRQAPSAPWGPPAGPLHVQLFPCLS PLLLDARTLDPQGAPPAGWEVFDVWQGLRHQPWKQLCLELRAAWGELDAGEAEARARGPQ QPPPPDLRSLGFGRRVRPPQERALLVVFTRSQRKNLFAEMREQLGSAEAAGPGAGAEGSW PPPSGAPDARPWLPSPGRRRRRTAFASRHGKRHGKKSRLRCSKKPLHVNFKELGWDDWII APLEYEAYHCEGVCDFPLRSHLEPTNHAIIQTLMNSMDPGSTPPSCCVPTKLTPISILYI DAGNNVVYKQYEDMIIYLLSDWSLNIDVGNVQLATILRVGQLAEEEMVQRPAALPEVLEC VNCNCWHKGS >gi568815590r:96131058_96333228|GENSCAN_predicted_CDS_1|2733_bp atggatactcccagggtcctgctctcggccgtcttcctcatcagttttctgtgggatttg cccggtttccagcaggcttccatctcatcctcctcgtcgtccgccgagctgggttccacc aagggcatgcgaagccgcaaggaaggcaagatgcagcgggcgccgcgcgacagtgacgcg ggccgggagggccaggaaccacagccgcggcctcaggacgaaccccgggctcagcagccc cgggcgcaggagccgccaggcaggggtccgcgcgtggtgccccacgagtacatgctgtca atctacaggacttactccatcgctgagaagctgggcatcaatgccagctttttccagtct tccaagtcggctaatacgatcaccagctttgtagacaggggactagccaggcgacgttcc tctctctacgctattcctgagggctggtgggggtctgggaaatccctggtaccctctgtg ggaacgcctgcatgtgtgggtgtctgcggaaacccttgcggccccttcttgccaccggcg ttgatctgcagcgtggccagtgccctcaagaaaacctcgcaccacctctggtcccagaaa tcagggaaggatccacaggcatcggaaactaagctcctaaaggagagagacaataatgtg gaaggagacaggaaaggaagggagaaaaaggaaaggaagacctccccacccccatccccg ccactgcgctgtatgcccccgatcctggagatagaggagggattgcatgtgcacccagtc tggggaggcctctgttcccgagccaaggcctccagactagcaggaccgagatccacaagt accgctacaaccccggaatgcggtggagggctggggatgcgagcggtccacagcgggagg agacgctgcatgaccacccggacagcatcttcaaaggcgctgtctcctccgcgtctccgg gttctcagaaaatgcgaagaaaggcagttcgttgactgggaaggcgccttcgtggaggac ctggttgctcccctgggctcccgggtgacttaccggtcgacagtgcagggaaggccgccc ctagccaaccccccagagacaaaaggagtttccttaaagcgccttggagctcctgatagg actggccctgtggtcagatcgtgcccccacaaaaggacaccaaagtggggaaagaaacga atccttaggcctctcagggctgcactgccagccagggagtcatttgggttagcacaaggg ctgtcactggccttgacccctgcagaagaggaaacctgcaagaagggctggatccttgat gtggctggggtggaggtcatctgttcacctggcactgtgggagggacagcagtggtgaca gggcctcacaggactgagaagaatcgggatgatgagcacctttcctccactcacttctgg aatcttgtggagagaagtggaccacaaggactgaagtccagagaactggatttcaaagga agtcaagagaaggacaggttggggcatgtgcgtaagagtgaaccagccagtgctacggtc agccccgagtacaaactgggtgtggcagaggctccagagccagtgcccccgggctacatc tggactgacgatctctcgcacactcctctccggagacagaagtatttgtttgatgtgtcc atgctctcagacaaagaagagctggtgggcgcggagctgcggctctttcgccaggcgccc tcagcgccctgggggccaccagccgggccgctccacgtgcagctcttcccttgcctttcg cccctactgctggacgcgcggaccctggacccgcagggggcgccgccggccggctgggaa gtcttcgacgtgtggcagggcctgcgccaccagccctggaagcagctgtgcttggagctg cgggccgcatggggcgagctggacgccggggaggccgaggcgcgcgcgcggggaccccag caaccgccgcccccggacctgcggagtctgggcttcggccggagggtgcggcctccccag gagcgggccctgctggtggtattcaccagatcccagcgcaagaacctgttcgcagagatg cgcgagcagctgggctcggccgaggctgcgggcccgggcgcgggcgccgaggggtcgtgg ccgccgccgtcgggcgccccggatgccaggccttggctgccctcgcccggccgccggcgg cggcgcacggccttcgccagtcgccatggcaagcggcacggcaagaagtccaggctacgc tgcagcaagaagcccctgcacgtgaacttcaaggagctgggctgggacgactggattatc gcgcccctggagtacgaggcctatcactgcgagggtgtatgcgacttcccgctgcgctcg cacctggagcccaccaaccacgccatcatccagacgctgatgaactccatggaccccggc tccaccccgcccagctgctgcgtgcccaccaaattgactcccatcagcattctatacatc gacgcgggcaataatgtggtctacaagcagtacgaggacatgatcatctacctgctctct gattggtccctaaatattgacgtggggaatgtgcagttagctacaatccttagggttggt cagctggcggaggaggaaatggtacagaggccagcagcattgcctgaagttttggagtgt gtgaattgtaactgctggcataagggcagctga >gi568815590r:96131058_96333228|GENSCAN_predicted_peptide_2|100_aa MVLFLSSDTVDKKSAKSCQQPWLKPLRESWCSVRENGGHLLERLRLKKTMHFQWKDPDGC MSSLWWATFKQKLWEQCRTGDGPRAAAVLQRMSICCLALR >gi568815590r:96131058_96333228|GENSCAN_predicted_CDS_2|303_bp atggtcctttttctctctagtgacacagttgacaagaagtcagcaaagagctgccagcaa ccttggctgaagcccctcagagaaagctggtgctccgtcagagagaatgggggccacctc ttagaaaggctaagacttaaaaagacaatgcatttccagtggaaggaccccgatggctgc atgtcgtccctgtggtgggccactttcaaacagaagctctgggaacaatgccggaccggg gatggtcccagagctgctgcggtgctgcagagaatgagcatctgctgcttggctctgaga tga >gi568815590r:96131058_96333228|GENSCAN_predicted_peptide_3|181_aa MEKLNNAPNIARLVSGRAWIHADSVTPGLLAPADEAVVIEQLSSSTIFTSETISPTAPYK VNAFLPLLFCVPFTAGRNKGSGARTQNSSPKSRAPDRARAEVSPQKKVHLEDERQGLAWE AATKARPESHGNSLSDIPGKLRGHPRKGLKLIRAIYSRRNVLHDRSGGDCHHTQMRHASP Q >gi568815590r:96131058_96333228|GENSCAN_predicted_CDS_3|546_bp atggagaagttaaataatgcccccaatattgcacggctagtaagcggcagggcctggatc cacgctgactctgtgactccggggctcctcgctcctgccgatgaggcggtagtcattgag cagctgtcttctagcacaatattcacttctgaaacgatctccccgacagccccctacaaa gttaatgcatttcttccactccttttctgtgtaccctttactgcaggaagaaacaaaggg tccggagcaaggacacagaacagcagccccaagtcgagagccccggacagggcaagggca gaagtgtctcctcagaagaaggtgcacctggaagacgaaaggcagggtttggcttgggaa gcagcaactaaagcaaggccagaatctcatgggaattcactttcagacattcctgggaag cttcgaggacatccaaggaaaggtctgaaattgattagagcaatatattccaggagaaat gttttgcacgaccgttctggaggagactgtcatcacactcagatgagacatgcaagccca cagtaa >gi568815590r:96131058_96333228|GENSCAN_predicted_peptide_4|113_aa MGKDFMTKTPKAMATKAKIDEWDLIKLKSFCTAKETIIRVNRQPTEWEKNFEIYPSDKRL ISRIYKELKEIYRKNNNNNNKPHQKVGEGYEQTLLKKRHLCGQQTYEKKAHHH >gi568815590r:96131058_96333228|GENSCAN_predicted_CDS_4|342_bp atgggaaaagacttcatgactaaaacaccaaaagcaatggcaacaaaagccaaaattgac gaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactatcatcagagtg aacaggcaacctacagaatgggagaaaaattttgaaatctatccatctgacaaaaggcta atatccagaatctacaaagaacttaaagaaatttacaggaaaaacaacaacaacaacaac aaaccccatcaaaaagtgggcgaaggatatgaacagacgcttctcaaaaaaagacattta tgcggccaacaaacatacgaaaaaaaagctcatcatcactga >gi568815590r:96131058_96333228|GENSCAN_predicted_peptide_5|96_aa MAGKQAASGKWLDGIRKWYYNAAGFNKLGLMRDDTIYEDEDVKEAIRRLPENLYNDRMFR IKRALDLNLKHQILPKEQWTKYEEVAQLYLPDSVAH >gi568815590r:96131058_96333228|GENSCAN_predicted_CDS_5|291_bp atggctggtaagcaggccgcatcaggcaagtggctggatggtattcgaaaatggtattac aatgctgcaggattcaataaactggggttaatgcgagatgatacaatatacgaggatgaa gatgtaaaagaagccataagaagacttcctgagaacctttataatgacaggatgtttcgc attaagagggcactggacctgaacttgaagcatcagatcttgcctaaagagcagtggacc aaatatgaagaggtagcacagctttatctacctgacagtgttgctcattaa >gi568815590r:96131058_96333228|GENSCAN_predicted_peptide_6|175_aa MVRKAPFLLNFSVERLDNRLGFFQKELELSVKKTRDLVVRLPRLLTGSLEPVKENMKVYR LELGFKHNEIQHMITRIPKMLTANKMKLTETFDFVHNVMSIPHHIIVKFPQVFNTRLFKV KERHLFLTYLGRAQYDPAKPNYISLDKLVSIPDEIFCEEIAKASVQDFEKFLKTL >gi568815590r:96131058_96333228|GENSCAN_predicted_CDS_6|528_bp atggtcagaaaagcaccatttttgctgaacttttcagtggaaagactggataacagattg ggattttttcagaaagaacttgaacttagtgtgaagaagactagagatctggtagttcgt ctcccaaggctgctaactggaagtctggaacccgtgaaagaaaatatgaaggtttatcgt cttgaacttggttttaaacataacgaaattcaacatatgatcaccagaatcccaaagatg ttaactgcaaataaaatgaaacttaccgagacgtttgattttgtgcacaatgtgatgagc attccccaccacatcattgtcaagttcccacaggtatttaatacaaggctgtttaaggtc aaagaaagacacttgtttcttacctatttaggaagagcacagtatgatccagcaaaacct aactacatctctttggacaaactagtatctattcctgatgaaatattttgtgaagagatt gccaaagcatcagtacaggactttgaaaaattcttaaaaacgctttag >gi568815590r:96131058_96333228|GENSCAN_predicted_peptide_7|93_aa MVATCSHPATWEAEAEEEEEEEEEEEEEEEEEEEEEEGGGGGGEGEGEKKNGCLPLAEAA KKPAGIGALVHRGWPPGIQIGQKTGLRSKQRKN >gi568815590r:96131058_96333228|GENSCAN_predicted_CDS_7|282_bp atggtggccacctgtagtcacccagctacttgggaggctgaggcagaagaagaagaagaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaaggaggagga ggagggggggagggggagggggagaagaagaatgggtgcctcccactggctgaagctgcc aagaagccagctggcataggagccctggtgcataggggttggcctcctgggatacagatc gggcagaaaacaggtctcagaagcaaacagagaaaaaactag >gi568815590r:96131058_96333228|GENSCAN_predicted_peptide_8|323_aa MVMALSDDKEEGSRSRDTEVLNRGEMKHRQIDDHLSTQIRRKATSSSPWTRPPLGWAAPP GKGEVHEADDGEAEQGDAEASLPLEEFSDSKSTGLQRNSEPEKGSRGPWLALFRARSPLS PRVTRHMVSPTPGAAHVGPGARLRCVGRAEEPACGAAGRDVRPSELVGPGVTRYGKQASP QWGFKTYRTSSLWNSSQSTSSSSQENNSAQSSLLPSMNEQSQKTQNISSFDSELFLEELD ELPPLSPMQPISEEEAIQIIADPPLPPASFTLRDYVDHSETLQKLVLLVTTLFRSSFVTE WLQEPPILIVFSSAVLALISFQC >gi568815590r:96131058_96333228|GENSCAN_predicted_CDS_8|972_bp atggtcatggcattgagtgatgacaaagaagaaggcagcagatcacgggatacagaagtt ctaaatcgaggagaaatgaagcatagacaaattgatgatcacttaagtacacaaatccgc agaaaggcaacgtcttctagcccttggacgcgccccccgctcggctgggccgccccacct ggtaaaggcgaagtacatgaggctgacgatggtgaagctgagcagggtgatgctgaggca tctctaccactcgaggaattttcagactccaagtccacaggacttcagaggaatagcgag ccggagaagggtagccggggcccctggctggctcttttcagagcacgtagcccgctgtct ccccgcgtaacccggcacatggtctcgcccactccgggcgcggcccacgtgggcccggga gcgcggctgcgctgtgtagggcgggctgaggagccggcgtgcggcgcggcgggacgggac gtgcggcccagcgagttggtcggtcccggggtcacccgctacgggaagcaggcctcgcca cagtggggatttaagacttacaggacttcctccttatggaatagttcccagtctactagc tcaagtagtcaggagaataattctgcccaaagcagtctgcttccttccatgaatgaacag tcacagaagacacaaaatatatccagctttgattctgagctgtttctagaagaactggat gaattgcctccattgtctccaatgcagccaatttcagaggaagaggctattcagattatt gcagaccctccattgccaccagcttcattcacacttcgagactatgtggatcattctgag actctgcagaagttggttcttctagtcactaccttgtttaggtcctcatttgtcactgag tggctgcaggagcctcctatcttgatagtcttctcctcagcagtattggctctgatatct ttccagtgttag >gi568815590r:96131058_96333228|GENSCAN_predicted_peptide_9|266_aa MPSLKLSYKEGSKPAKILGEGSIPGKRKSCQNGPQLETTSRPVWPQYSAPWEKSRREVRE SSGTDVIMGTVGQGLSVLYFLFLVFLLFLNFEQVKSLMYWLDPNLRYATREADVMASALH CLQLKPVPPGSQGSACMQLSGCFKLQCPVALPGQGPQEYAVNCHVITWERIISHFDIFAF GHFWGWAMKALLIRSYGLCWTISITWELTETVLRLPHRHTVQARRNTMLGVWSSVQPAEG SRMFITMLFTIAKTWNPPSAHQWWTG >gi568815590r:96131058_96333228|GENSCAN_predicted_CDS_9|801_bp atgccaagtctgaaactgtcatacaaagaaggatccaagcctgcaaagattctgggggag gggagcatcccaggcaagaggaaaagctgccaaaatggcccacagctagaaaccacttcc aggccagtgtggccccagtatagtgcaccctgggagaagagtaggagagaagtcagagag tcatcgggcacagatgtaatcatggggactgtgggccagggactcagtgtgctctacttc ctgttcctggtattcctactcttcctgaatttcgagcaggttaaatctctaatgtattgg ctagatccaaatcttcgatacgccacaagggaagcagatgtcatggcttcagctctacac tgtcttcagttaaagcctgttcctcctggttcccagggttctgcctgcatgcagctctct ggctgctttaaactgcaatgccctgtggcccttccaggacaaggacctcaggagtatgct gtgaactgccatgtgatcacctgggagaggattatcagccactttgatatttttgcattt ggacatttctggggctgggccatgaaggccttgctgatccgtagttacggtctctgctgg acaatcagtattacctgggagctgactgagacagtactacgcttacctcaccgacacaca gtgcaagcgcgtaggaacacaatgctgggtgtttggagttctgttcagccagcagaaggt tctcgaatgttcatcaccatgctgttcacaatagcaaagacatggaatccacctagtgcc catcagtggtggactggataa >gi568815590r:96131058_96333228|GENSCAN_predicted_peptide_10|177_aa MPYRQRKVVKAWKRKTNDAAQKSSISQNFRVSVGINAGPFNAFLPSGRRLQNGWTWQRSD LKGRAVAAAAPSSGKQQASGERQVLIITIVTQRVCCDCQIYFTCVQRGPFYTCPSKRKPQ HGEWIWRAEWQENGCPDSRRGSGNGEEMELSCIVKLTDLSDWLIRRVMEDRKQPPGF >gi568815590r:96131058_96333228|GENSCAN_predicted_CDS_10|534_bp atgccgtacagacagaggaaagtggtgaaagcctggaagagaaaaacaaatgatgctgca caaaaaagttcaatttcccagaacttcagggtttcagtgggaatcaatgcaggaccgttc aatgcatttctgccgtctggacgccggttacaaaacggctggacgtggcagagatcagac ctcaaggggagggcagtggcagctgccgctccctccagcggcaaacagcaggcaagtgga gaacgacaggtgcttatcatcacaatcgtcacacaaagggtctgttgtgattgccaaatc tacttcacctgcgtacaaagaggcccattttacacttgtcccagcaaacggaagccgcag catggagaatggatttggagggcagaatggcaagaaaatgggtgtcctgacagccgcaga ggcagtgggaatggagaggagatggagttgagctgcattgtcaaattaacagatcttagt gactggctgattcgaagagtgatggaggacaggaagcagcctccaggtttttag