GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:17:35 Sequence gi568815592r:129057526_129259116 : 201591 bp : 35.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7616 7725 110 1 2 47 63 81 0.016 1.24 1.02 Intr + 23569 23687 119 1 2 73 59 66 0.094 1.39 1.03 Intr + 30418 30458 41 2 2 79 82 72 0.787 2.72 1.04 Intr + 40648 40890 243 0 0 93 94 272 0.970 25.07 1.05 Intr + 45378 45570 193 2 1 35 28 125 0.252 -0.66 1.06 Intr + 66989 67047 59 0 2 110 68 63 0.563 4.18 1.07 Intr + 67365 67418 54 2 0 70 110 20 0.358 0.66 1.08 Intr + 86376 86555 180 2 0 56 89 240 0.997 20.04 1.09 Intr + 89407 89523 117 0 0 74 48 100 0.564 4.34 1.10 Intr + 91454 91571 118 1 1 75 95 76 0.997 6.12 1.11 Intr + 93150 93284 135 1 0 98 5 84 0.524 0.72 1.12 Term + 94094 94233 140 0 2 47 55 107 0.795 0.44 1.13 PlyA + 95418 95423 6 1.05 2.04 PlyA - 96380 96375 6 1.05 2.03 Term - 101166 99998 1169 1 2 90 49 1223 0.232 109.60 2.02 Intr - 107228 107181 48 0 0 83 100 65 0.253 4.93 2.01 Init - 111424 110251 1174 1 1 60 40 335 0.442 19.58 2.00 Prom - 111480 111441 40 -12.13 3.03 PlyA - 111628 111623 6 1.05 3.02 Term - 113230 111802 1429 0 1 42 31 626 0.742 41.68 3.01 Init - 114770 114385 386 2 2 88 44 481 0.978 40.06 3.00 Prom - 115369 115330 40 -8.55 4.00 Prom + 115925 115964 40 -5.25 4.01 Init + 118976 118979 4 1 1 49 80 0 0.184 -4.49 4.02 Intr + 120181 120341 161 2 2 82 83 143 0.969 11.99 4.03 Intr + 132680 132820 141 1 0 94 80 121 0.741 11.53 4.04 Intr + 135155 135328 174 1 0 76 106 232 0.994 23.01 4.05 Intr + 170023 170179 157 0 1 75 30 120 0.000 3.56 4.06 Term + 175435 175694 260 2 2 -2 50 211 0.618 3.03 4.07 PlyA + 175952 175957 6 1.05 5.00 Prom + 188357 188396 40 -2.25 5.01 Init + 190717 190761 45 0 0 54 81 48 0.533 1.43 5.02 Intr + 192587 192688 102 1 0 99 65 109 0.996 9.15 5.03 Intr + 194559 194770 212 2 2 127 98 121 0.656 13.79 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 168424 167756 669 1 0 49 34 239 0.855 10.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815592r:129057526_129259116|GENSCAN_predicted_peptide_1|502_aa MKDKNHMIISIDAEKAFDKIQHCLMIKAVNKLAIEGMDMDEAGNHHSQQTIARTGNQTPH VLTHRWETNNENTWTQEGDLAGSRDNSGGKVFQIAYVIVKAANSPRPGNWILERSLDDVE YKPWQYHAVTDTECLTLYNIYPRTGPPSYAKDDEVICTSFYSKIHPLENGEYTLIPMLLI FGVQASVCGNYESPTHAVLESNFCPPNTAGLSKELLSLSAMCLDRKMLTGQKLSQRSRAQ ATGTKYAKAYAASWALCWIDVRTQEGRANAREQIHISLINGRPSADDPSPELLEFTSARY IRLRFQRIRTLNADLMMFAHKDPREIDPIVTRRDFSLDCFLQYYYSVKDISVGGMCICYG HARACPLDPATNKSRCECEHNTCGDSCDQCCPGFHQKPWRAGTFLTKTECEGKRGGRISK LKNKHFKNAEIQDGDLSHKAIHKNHTFSNEYIPESLRGLRKLMIKGEEGAGTSYMVQERE EKVNGEEPLIKPSDLVRTHSLS >gi568815592r:129057526_129259116|GENSCAN_predicted_CDS_1|1509_bp atgaaggacaaaaatcatatgatcatctcaatagatgcagaaaaagcatttgacaaaatt caacattgtttaatgataaaagctgtcaacaaattagctatagaaggaatggatatggat gaagctggaaaccatcattctcagcaaactatagcaaggacaggaaaccaaacaccgcat gttctcactcataggtgggaaacgaacaatgagaacacttggacacaggagggggatttg gcagggtcacgggacaatagtggagggaaggtgttccagatcgcgtatgtgattgtgaag gcagctaactccccccggcctggaaactggattttggaacgctctcttgatgatgttgaa tacaagccctggcagtatcatgctgtgacagacacggagtgcctaacgctttacaatatt tatccccgcactgggccaccgtcatatgccaaagatgatgaggtcatctgcacttcattt tactccaagatacaccccttagaaaatggagagtatacgctcattcccatgctgcttatc tttggagtccaagccagcgtgtgtgggaattacgaaagtcccacccatgcagtcctggaa tccaacttttgtcctcctaatacagcagggctgtcaaaagagctgctcagtctctcagcc atgtgtctggacaggaaaatgctcacaggacaaaagctgtcacaacgaagtagagcccag gcaactggcaccaaatatgccaaagcttatgctgcttcctgggctctgtgttggattgat gtcagaacacaggagggaagggcaaatgcaagggagcagattcacatctctttaatcaat gggagaccaagtgccgatgatccttctccagaactgctagaatttacctccgctcgctat attcgcctgagatttcagaggatccgcacactgaatgctgacttgatgatgtttgctcac aaagacccaagagaaattgaccccattgtcaccagaagagatttctctctggattgcttt ttgcagtattactactcggtcaaggatatttcagttggagggatgtgcatctgctatggt catgccagggcttgtccacttgatccagcgacaaataaatctcgctgtgagtgtgagcat aacacatgtggcgatagctgtgatcagtgctgtccaggattccatcagaaaccctggaga gctggaacttttctaactaaaactgaatgtgaagggaagaggggaggtcgtatttccaaa ctgaaaaacaaacattttaaaaacgcagaaattcaagatggggatcttagccataaagcc atacataaaaatcacactttcagcaatgagtacattcctgaaagcttaagaggcctcagg aaacttatgatcaaaggggaagagggagcaggcacatcttacatggtacaggagagagag gagaaagtgaatggggaagagccccttataaaaccatcagatcttgtacgaactcactca ctatcatga >gi568815592r:129057526_129259116|GENSCAN_predicted_peptide_2|796_aa MSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIV KMAILPKVIYRFNAIPIKLPMPFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLP DFKLYYKSTVTKTAWYWYQNRDIDQWNRTEPSEITPHMYNYLIFDKPEKNKQWGKDSLFN KWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGSTIQDIGMGKD FMSKTPKAMATKDKIDKWDLIKLKSFCTAKETTIRVNRQPTKWEKIFATYSSDKGLISRI YNELKQIYKKKTNNPINKWAKDMNRHFSKEDIYAAKKHMKKCSSSLAIREMQIKTTMRYH LTPVRMAIIKKSGNNRCWRGCGEIGTLLHTVGDSRKFSGKQPLETEQGPFFDGSIRWLVL LISMAVCIIAMIIFSSCFCYKHYCKSISSRRRYNRDLEQDEAFIPVGKSLKDLIDQSQSS GSGSGLPLLVQRTIAKQIQMVRQVGKGRYGEVRMGKWHGERVAVKVFFTTEEASWFRETE IYQTVLMRHENILGFIAADIKGTGSWTQLYLITDYHENGSLYDFLKCATLDTRALLKLAY SAACGLCHLHTETYGTQGKPAIAHRDLKSKNILIKKNGSCCIADLGLAVKFNSDTNEVDV PLNTRVGTKRYLAPEVLDESLNKNHFQPYIMADIYSFGLIIWEMARRCITGGIVEDYQLP YYNMVPSDPSYEDMREVVCVKRLRPIVSNRWNSDECLRAVLKLMSECWAHNPAPRLTALR IKKTLAKMVESQDVKI >gi568815592r:129057526_129259116|GENSCAN_predicted_CDS_2|2391_bp atgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaa gaggatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcgtg aaaatggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctacca atgcctttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaaga gcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacct gacttcaaactatactacaagtctacagtaaccaaaacagcatggtactggtaccaaaac agagatatagatcaatggaacagaacagagccctcagaaataacgccgcatatgtacaac tacctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctgtttaat aaatggtgctgggagaactggctagccatatgtagaaagctgaaactggatcccttcctt acaccttatacaaaaatcaattcaagatggattaaagacttaaacgttagacctaaaacc ataaaaaccctagaagaaaacctaggcagtaccattcaggacataggcatgggcaaggac ttcatgtctaaaacaccaaaagcaatggcaacaaaagacaaaattgacaaatgggatcta attaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacct acaaaatgggagaaaatttttgcaacctactcatctgacaaagggctaatatccagaatc tacaatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaacaagtgggcg aaggacatgaacagacacttctcaaaagaagacatttatgcagccaaaaaacacatgaaa aaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccat ctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacaggtgctggagagga tgtggagaaataggaacacttttacacactgttggagattctaggaaattcagtggaaag caaccgctggaaactgaacaaggtccgttttttgatggcagcattcgatggctggttttg ctcatttctatggctgtctgcataattgctatgatcatcttctccagctgcttttgttac aaacattattgcaagagcatctcaagcagacgtcgttacaatcgtgatttggaacaggat gaagcatttattccagttggaaaatcactaaaagaccttattgaccagtcacaaagttct ggtagtgggtctggactacctttattggttcagcgaactattgccaaacagattcagatg gtccggcaagttggtaaaggccgatatggagaagtacggatgggcaaatggcatggcgaa agagtggcggtgaaagtattctttaccactgaagaagccagctggtttcgagaaacagaa atctaccaaactgtgctaatgcgccatgaaaacatacttggtttcatagcggcagatatt aaaggtacaggttcctggactcagctctatttgattactgattaccatgaaaatggatct ctctatgacttcctgaaatgtgctacgctggacaccagagccctgcttaaattggcttat tcagctgcctgtggtctgtgccacctgcacacagaaacgtatggcacccaaggaaagccc gcaattgctcatcgagacctaaagagcaaaaacatcctcatcaagaaaaatgggagttgc tgcattgctgacctgggccttgctgttaaattcaacagtgacacaaatgaagttgatgtg cccttgaataccagggtgggcaccaaacgctacctggctccagaagtgctggacgaaagc ctgaacaaaaaccacttccagccctacatcatggctgatatctacagcttcggcctaatc atttgggagatggctcgtcgttgtatcacaggagggatcgtggaagactaccaattgcca tattataacatggtaccgagtgatccgtcatacgaagatatgcgtgaggttgtgtgtgtc aaacgtttgcggccaattgtgtctaatcggtggaacagtgatgaatgtctacgagcagtt ttgaagctaatgtcagaatgctgggcccacaatccagcccccagactcacagcgttgaga attaagaagacgcttgccaagatggttgaatcccaagatgtaaaaatctga >gi568815592r:129057526_129259116|GENSCAN_predicted_peptide_3|604_aa MGKKQNRKTGNSKKQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNYSELREDIQ TKGKEVENFEKNVEECITRITNTEKCLKELMELKTKARELREECRSLRSRCDQLEERVSA MEDEMNEMKQINETESQQEVNKDTQELNTALHQADLINIYRTLHPKSTEYTFFSAPHHTY SKIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELRIKNLTQNCSTTWKLNNLLLNDYW VHNEMKAEIKMFFETHENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQL KELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERVNKIDRPLARLI KKKREKNLIDAIKNDKGDITTDPTEIQTKIREYYKHLYANKLENLEEMDKFLDTYTLPRL NQEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEK EGILPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILSKILAKRIQQHIKKLIHH DQVGFIPGMQGWFNIHKSINVIQHINRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLG IDGT >gi568815592r:129057526_129259116|GENSCAN_predicted_CDS_3|1815_bp atggggaaaaaacagaacagaaaaactggaaactctaaaaagcagagtgcctctcctcct ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaacgactttgac gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa accaaaggcaaagaagttgaaaactttgaaaaaaatgtagaagaatgtataactagaata accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagccgatgcgatcaactggaagaaagggtatcagca atggaagatgaaatgaatgaaatgaaacagatcaacgagacagaaagtcaacaagaagtc aacaaggatacccaggaattgaacacagctctgcaccaagcggacctaataaacatctac agaactctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctat tccaaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagaa attataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaat ctcactcaaaactgctcaactacatggaaactgaacaacctgctcctgaatgactactgg gtacataacgaaatgaaggcagaaataaagatgttctttgaaacccatgagaacaaagac acaacataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagca ctaaatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaatta aaagaactagaaaagcaggagcaaacacattcaaaagctagcagaaggcaagaaataact aaaatcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaa tccaggagctggttttttgaaagggtcaacaaaattgatagaccgctagcaagactaata aagaaaaaaagagagaagaatctaatagatgcaataaaaaatgataaaggggatatcacc acggatcccacagaaatacaaactaaaatcagagaatactacaaacacctctatgcaaat aaactagaaaatctagaagaaatggataaattcctcgacacatacactctcccaagacta aaccaggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtggcaata atcaatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctac cagaggtacaaggaggaactggtaccattccttctgaaactattccaatcaatagaaaaa gagggaatcctccctaactcattttatgaggccagcatcattctgataccaaagccaggc agagacacaaccaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaa atcctcagtaaaatactggcaaaacgaatccagcagcacatcaaaaagcttatccaccat gatcaagtaggcttcatccctgggatgcaaggctggttcaatatacacaaatcaataaat gtaatccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgca gaaaaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggt attgatgggacgtaa >gi568815592r:129057526_129259116|GENSCAN_predicted_peptide_4|298_aa MSLAPGSCHCKTGFGGVSCDRCARGYTGYPDCKACNCSGLGSKNEDPCFGPCICKENVEG GDCSRCKSGFFNLQEDNWKGCDECFCSGVSNRCQSSYWTYGKIQDMSGWYLTDLPGRIRV APQQDDLDSPQQISISNAEARQALPHSYYWSAPAPYLGNKSGPSAAGLLEFAGGPLQTLF AWVSAAEAAEQQILLNSKCCRLIVPLEASSQREPGKEHWLTNSKEMGTSVPQPQEMNSAN NLNEQGDRFSSRATRKECIHADTIILVILAKGDPYWNIRLQNCIPTRMHMLEYSPTEP >gi568815592r:129057526_129259116|GENSCAN_predicted_CDS_4|897_bp atgagtttggcacctggatcctgtcattgcaaaactggttttggaggtgtgagctgtgat cggtgtgccaggggctacactggctacccggactgcaaagcctgtaactgcagtgggtta gggagcaaaaatgaggatccttgttttggcccctgtatctgcaaggaaaatgttgaagga ggagactgtagtcgttgcaaatccggcttcttcaatttgcaagaggataattggaaaggc tgcgatgagtgtttctgttcaggggtttcaaacagatgtcagagttcctactggacctat ggcaaaatacaagatatgagtggctggtatctgactgaccttcctggccgcattcgagtg gctccccagcaggacgacttggactcacctcagcagatcagcatcagtaacgcggaggcc cggcaagccctgccgcacagctactactggagcgcgccggctccctatctgggaaacaaa tcaggaccctcagctgcaggtctgttggagtttgctggaggtccactccagaccctgttt gcctgggtatcagcagcggaggctgcagaacagcagatattgctgaacagcaaatgttgc cgcctgatcgttcctctggaagcttcgtctcagagggaacctggaaaagagcattggctg acaaacagcaaggaaatgggaacctcagttccacaaccacaagaaatgaattctgccaac aacctgaatgaacaaggagatagattctcctctagagccaccagaaaggaatgtatccat gccgacaccatcattttagtcattttagctaaaggagacccatattggaatattcgccta cagaactgtattccaacaaggatgcatatgttggaatattcacctacagaaccatga >gi568815592r:129057526_129259116|GENSCAN_predicted_peptide_5|120_aa MEKLSSTKPMPDAKKLPAVGGQLTFTISYDLEEEEEDTERVLQLMIILEGNDLSISTAQD EVYLHPSEEHTNVLLLKEESFTIHGTHFPVRRKEFMTVLANLKRVLLQITYSFGMDAIFS >gi568815592r:129057526_129259116|GENSCAN_predicted_CDS_5|360_bp atggaaaaattgtcttccacgaaacccatgcctgatgccaaaaagctcccagcagtagga ggacagttgacatttaccatatcatatgaccttgaagaagaggaagaagatacagaacgt gttctccagcttatgattatcttagagggtaatgacttgagcatcagcacagcccaagat gaggtgtacctgcacccatctgaagaacatactaatgtattgttacttaaagaagaatca tttaccatacatggcacacattttccagtccgtagaaaggaatttatgacagtgcttgcg aatttgaagagagtcctcctacaaatcacatacagctttgggatggatgccatcttcagn