GENSCAN 1.0 Date run: 4-Nov-116 Time: 17:02:20 Sequence gi568815588f:123054918_123262829 : 207912 bp : 44.71% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 8108 7925 184 2 1 32 36 157 0.028 4.26 1.05 Intr - 18558 18325 234 0 0 -30 72 235 0.058 7.99 1.04 Intr - 30747 30616 132 0 0 -3 89 178 0.958 9.64 1.03 Intr - 34137 33985 153 0 0 53 119 52 0.803 5.07 1.02 Intr - 44059 43979 81 1 0 69 75 81 0.402 4.73 1.01 Init - 45942 45817 126 0 0 91 42 33 0.296 -0.81 1.00 Prom - 52149 52110 40 -1.56 2.03 PlyA - 52324 52319 6 1.05 2.02 Term - 57141 56993 149 1 2 54 38 117 0.539 1.36 2.01 Init - 63346 63166 181 1 1 83 0 238 0.349 11.85 2.00 Prom - 68537 68498 40 -4.26 3.00 Prom + 72656 72695 40 -5.86 3.01 Init + 79516 80098 583 0 1 54 64 310 0.839 20.55 3.02 Intr + 81033 81533 501 1 0 54 81 484 0.020 37.25 3.03 Term + 82141 82814 674 2 2 83 54 1200 0.986 110.02 3.04 PlyA + 85737 85742 6 1.05 4.00 Prom + 85815 85854 40 -5.46 4.01 Init + 93032 93144 113 1 2 44 58 141 0.121 4.43 4.02 Intr + 93461 93729 269 2 2 47 82 242 0.139 16.68 4.03 Term + 94653 95206 554 1 2 116 47 879 0.999 81.08 4.04 PlyA + 95736 95741 6 1.05 5.00 Prom + 96990 97029 40 -7.16 5.01 Init + 99830 100199 370 1 1 75 17 498 0.782 36.56 5.02 Intr + 102812 102963 152 0 2 106 102 29 0.970 5.98 5.03 Intr + 105490 105648 159 0 0 29 72 160 0.990 8.68 5.04 Intr + 107319 107496 178 2 1 101 103 103 0.999 12.59 5.05 Term + 107695 107915 221 2 2 70 48 174 0.540 8.80 5.06 PlyA + 108954 108959 6 1.05 6.03 PlyA - 108990 108985 6 1.05 6.02 Term - 116958 116615 344 1 2 32 49 308 0.603 15.97 6.01 Init - 148661 148544 118 2 1 54 64 103 0.723 4.89 6.00 Prom - 153936 153897 40 -1.26 7.00 Prom + 157741 157780 40 -5.86 7.01 Init + 164377 164483 107 0 2 81 37 131 0.154 5.49 7.02 Term + 171019 171151 133 1 1 56 53 110 0.188 1.86 7.03 PlyA + 172348 172353 6 1.05 8.03 PlyA - 176176 176171 6 1.05 8.02 Term - 185687 185451 237 2 0 67 48 279 0.887 17.97 8.01 Init - 190598 190578 21 2 0 58 111 8 0.212 -1.45 8.00 Prom - 196313 196274 40 -4.26 9.00 Prom + 196331 196370 40 -4.06 9.01 Sngl + 199313 201040 1728 1 0 30 41 1256 0.717 110.00 9.02 PlyA + 203140 203145 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 5385 5449 65 2 2 37 116 60 0.818 4.32 S.002 Intr - 72341 72213 129 2 0 64 103 83 0.861 8.29 S.003 Term + 80380 80537 158 2 2 112 44 45 0.891 0.60 S.004 Init + 81134 81533 400 1 1 104 81 557 0.967 51.43 S.005 Init + 93462 93729 268 2 1 83 82 233 0.855 19.50 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:123054918_123262829|GENSCAN_predicted_peptide_1|304_aa MTCRRMSGEGSFSLEGLSQNYSQKALETYSQDMPQRSSLSRQSGNDLSILHRHSLAEPRE THEDVKDSKHQGFSRWALVIFQQQHFPRVAPQSVQQDDGLCPLIVPEGPRPVSWHLKGAL TGLGAVKKSEDDIIEGDLAVSEIKEEGGCEPDEDPKRMCCQEAQVRQANSLSPPQTVHGT SRGQGSFLNPLLFVESTCSLGLRHRCGLHGPTEDVPESLDVAVTGECIYSGVFGALPFMD SKITKHDNEDGGMSRADSVDDQEGGRADNMDTLDKGMIHVLIGTEQNCARFHHATQNDVQ FKTX >gi568815588f:123054918_123262829|GENSCAN_predicted_CDS_1|912_bp atgacctgcaggagaatgagtggagagggctctttctctctcgaggggctttcacagaac tactcccagaaggccctggagacctactctcaggacatgcctcagcggtcaagtctttcc agacagagtggcaatgatcttagcatccttcaccggcacagccttgccgagcccagggag acgcatgaagacgtgaaggactccaagcatcaaggcttctctcggtgggccctggtcata ttccaacagcagcatttccccagagtggctccccagagtgttcaacaggacgatggcctg tgtcccctcattgttccagaaggccccaggcccgtctcatggcatctgaagggggctctg actgggttaggagcagtgaagaagagtgaggatgatatcatagagggtgacttggcagtt agtgagatcaaggaggagggtggctgcgagccagatgaggaccccaagaggatgtgctgc caggaagcacaggtaagacaagcaaacagcctgagtcccccacaaacagtgcatggaacc agcagaggccagggctccttcctcaaccccttgctcttcgtggagtccacctgctccctg ggcctcagacacagatgtggcctgcatgggcccacggaggacgttccagagtccctcgat gtggccgtcacaggggagtgcatctactcaggagtgtttggggcattacccttcatggat tcaaagatcaccaagcatgacaatgaagatggcgggatgtcaagagcagacagtgttgat gaccaagaagggggtagagcagacaacatggatacgctggacaaagggatgattcacgtt cttattgggacggagcagaattgtgcaagatttcatcatgctactcagaacgacgtgcaa tttaaaacttnn >gi568815588f:123054918_123262829|GENSCAN_predicted_peptide_2|109_aa MVSAVQLGACLPARAGGRLAAVLLPVQALGGGSGGEAVTDFECPDPSNTPFGACFRCSRI ARIVKLFDFSQPGTSGGVESASGKSAIGFLFREAKIHATQHNLSIFISI >gi568815588f:123054918_123262829|GENSCAN_predicted_CDS_2|330_bp atggtttcggccgtccagcttggggcctgcctgccagctcgggcgggtggccgacttgct gcagtgctgctcccagtccaggcactgggtggcggctctggtggtgaagcagtcactgac ttcgaatgtcccgatccatccaacactccgtttggggcgtgcttccggtgttcccgcata gctaggattgtcaagctctttgattttagccaaccaggtacaagtggaggtgttgaaagc gcctctggaaagtctgctataggatttctgttccgtgaagccaaaattcatgctacccag cacaaccttagtatcttcatctctatttaa >gi568815588f:123054918_123262829|GENSCAN_predicted_peptide_3|585_aa MSLKCLPHGLLVRTGELFLKPVEWNVLSLSRSAKVQCGSLRTSLPSRRAADAAACRGGGS YAAAPSVHLAPIKALVRQLQTPAGRRDFLGESAVVGVTDSELIREPWAPAARAEGCPSES SRRCREAVAPAAPPSDARVLAGSARIRTQAHPLLVPGEGSLFGFPGLKAKATRISESEDA VRGLTPGFPYYRLDVEFRLWSDFRPLACSPRRPPVPLPPSRGPGGEGTMPEPGPDAAGTA SAQPQPPPPPPPAPKESPFSIKNLLNGDHHRPPPKPQPPPRTLFAPASAAAAAAAAAAAA AKGALEGAAGFALSQVGDLAFPRFEIPAQRFALPAHYLERSPAWWYPYTLTPAGGHLPRP EASEKALLRDSSPASGTDRDSPEPLLKADPDHKELDSKSPDEIILEESDSEESKKEGEAA PGAAGASVGAAAATPGAEDWKKGAESPEKKPACRKKKTRTVFSRSQVFQLESTFDMKRYL SSSERAGLAASLHLTETQVKIWFQNRRNKWKRQLAAELEAANLSHAAAQRIVRVPILYHE NSAAEGAAAAAAGAPVPVSQPLLTFPHPVYYSHPVVSSVPLLRPV >gi568815588f:123054918_123262829|GENSCAN_predicted_CDS_3|1758_bp atgagtcttaaatgtctgcctcatgggcttctcgtgcgaaccggggagctgtttttaaag ccagtggagtggaatgtcctttcactaagccgttctgcaaaggtccagtgtggttctcta aggacttcgctgccgtcgagacgcgcagcggatgccgctgcttgccgcggcggaggaagc tacgccgctgcgccttcggttcaccttgcccccattaaagcgttagttagacagctccaa acgccagctggccggagagacttcctgggagaaagtgctgtcgtcggggtcacagactcg gagttgatccgcgagccttgggcaccggcggccagagcagaggggtgcccttccgagagc agccgtcggtgccgagaagctgttgccccagcagccccaccgagtgacgcgcgggttttg gccgggtcagcgcgcatacgcacacaggcacacccactcctggtcccgggggaaggctca ctcttcggttttccagggctaaaggccaaggccacgcggatttctgagtccgaagacgca gtgcgaggacttacacctggattcccttactatcggctggatgttgagttccgtttatgg tctgatttccggcctctcgcctgctcgccccgccgcccgcctgtcccgctccctccctcc cggggacccggaggagaggggaccatgccggaacccgggccggacgctgccggcaccgcc agcgcacagccccaaccgccgccgccccccccacccgctcccaaggagtccccgttctcc atcaagaacctgctcaacggagaccaccaccggccgccccctaagcctcagccgccccca cggacgctcttcgcgccagcctcggctgccgccgccgccgccgctgccgctgccgcggcg gccaagggggccctggagggcgccgcgggcttcgcgctctcgcaggtgggcgacctggct ttccctcgctttgagatcccggcgcagaggtttgccctgcccgcgcactacctggagcgc tccccagcctggtggtacccctacaccctgacccccgccggcggccacctcccgcgacct gaagcctcggagaaggccttgctgagagactcctcccccgcctccggcacagaccgcgac tctccggagccactgctcaaggccgaccccgatcacaaggagctggactccaagagcccg gacgagatcattctggaggagagcgactccgaggaaagcaaaaaggaaggcgaagcggcg ccaggcgcggccggggcgagcgtaggggcggcggcggccactccgggcgcagaagactgg aagaagggcgctgaaagtccagagaagaagccggcgtgccgcaagaagaagacgcgcaca gtcttctcgcgcagccaggtcttccagctcgagtccaccttcgacatgaagcgctatctg agcagctcggagcgagccggcctggccgcgtccctgcacctcaccgagacgcaggtcaag atctggttccagaaccgccgcaacaagtggaagcggcagctggcggcggagctggaggcg gccaacctgagccatgccgcggcgcagcgcatcgtgcgggtgcccatcctctaccacgag aactcggcggccgagggcgcggcggctgcagccgcgggggccccggtgccagtcagccag ccgctgctcaccttcccgcaccccgtctactactcgcacccggtggtctcttccgtgccg ctgctacggccggtctga >gi568815588f:123054918_123262829|GENSCAN_predicted_peptide_4|311_aa MPARPSARSGGAPVLCAWPERFEIFRGRSAALRLAAFGMGSKEDAGKGCPAAGGVSSFTI QSILGGGPSEAPREPVGWPARKRSLSVSSEEEEPDDGWKAPACFCPDQHGPKEQGPKHHP PIPFPCLGTPKGSGGSGPGGLERTPFLSPSHSDFKEEKERLLPAGSPSPGSERPRDGGAE RQAGAAKKKTRTVFSRSQVYQLESTFDMKRYLSSSERACLASSLQLTETQVKTWFQNRRN KWKRQLSAELEAANMAHASAQTLVSMPLVFRDSSLLRVPVPRSLAFPAPLYYPGSNLSAL PLYNLYNKLDY >gi568815588f:123054918_123262829|GENSCAN_predicted_CDS_4|936_bp atgcccgccaggccctcggcgcgctctgggggcgcaccggtgctgtgtgcctggccggag cgttttgaaattttccggggccgctcggcggcgctgcgattggccgcgttcgggatgggc agcaaagaagatgcgggcaaggggtgtccggcggccggtggcgtctccagcttcaccatc cagtccatcctgggcgggggcccctcggaggcaccgcgggagcccgtcggctggccagcc aggaagcgcagcctgtccgtgtcctcggaggaggaggagccggacgacggctggaaggcg cccgcctgcttctgcccagaccagcacggccctaaggagcagggccccaagcaccatccc cccatcccttttccttgcctgggtacccccaagggcagcggaggctcgggcccgggcggc ttggagcgcacgcctttcctctctccttcgcactcggactttaaagaagagaaagagagg ctcctgcccgcgggctcgccctcgccggggtccgagcggccgcgggacggcggcgctgag cggcaggccggcgcggccaagaagaagacgcgcaccgtcttttcgcgcagccaggtgtac cagctcgagtccaccttcgacatgaagcgctacctgagcagctcggagcgcgcctgcctc gcctccagcctgcagctcacggagacccaggtaaagacttggttccagaaccgccgcaac aagtggaagcggcagctctcggctgagctggaggcggccaacatggcgcacgcgtcggcg cagactctggtgagcatgccgctggtgttccgggacagttcgctgctgcgcgtgccggtg ccgcgctcgctcgcctttcccgcgccgctctactacccgggaagcaacctctcggcctta cctctctacaacctatacaacaagctcgactactga >gi568815588f:123054918_123262829|GENSCAN_predicted_peptide_5|359_aa MGRVRTLAGECSAQAQAQSLLAVVLSAPPSGGTPSARLSVRSPSPRDPWGLWAPVLQMTG SNEFKLNQPPEDGISSVKFSPNTSQFLLVSSWDTSVRLYDVPANSMRLKYQHTGAVLDCA FYVENLVGTHDAPIRCVEYCPEVNVMVTGSWDQTVKLWDPRTPCNAGTFSQPEKVYTLSV SGDRLIVGTAGRRVLVWDLRNMGYVQQRRESSLKYQTRCIRAFPNKQGYVLSSIEGRVAV EYLDPSPEVQKKKYAFKCHRLKENNIEQIYPVNAISFHNIHNTFATGGSDGFVNIWDPFN KKRLCQFHRYPTSIASLAFSNDGTTLAIASSYMYEMDDTEHPEDGIFIRQVTDAETKPK >gi568815588f:123054918_123262829|GENSCAN_predicted_CDS_5|1080_bp atggggcgagtccggaccttggcgggcgagtgctcggcgcaggcgcaagcgcagagtctc ctcgcggtcgtcctctcggcccctccctctggggggacccccagtgccaggctgtcagtg cgcagccccagcccgcgggacccctggggactctgggcgcctgttctgcagatgaccggt tctaacgagttcaagctgaaccagccacccgaggatggcatctcctccgtgaagttcagc cccaacacctcccagttcctgcttgtctcctcctgggacacgtccgtgcgtctctacgat gtgccggccaactccatgcggctcaagtaccagcacaccggcgccgtcctggactgcgcc ttctacgtagaaaatcttgttgggacccatgatgcccctatcagatgtgttgaatactgt ccagaagtgaatgtgatggtcactggaagttgggatcagacagttaaactgtgggatccc agaactccttgtaatgctgggaccttctctcagcctgaaaaggtatataccctctcagtg tctggagaccggctgattgtgggaacagcaggccgcagagtgttggtgtgggacttacgg aacatgggttacgtgcagcagcgcagggagtccagcctgaaataccagactcgctgcata cgagcgtttccaaacaagcagggttatgtattaagctctattgaaggccgagtggcagtt gagtatttggacccaagccctgaggtacagaagaagaagtatgccttcaaatgtcacaga ctaaaagaaaataatattgagcagatttacccagtcaatgccatttcttttcacaatatc cacaatacatttgccacaggtggttctgatggctttgtaaatatttgggatccatttaac aaaaagcgactgtgccaattccatcggtaccccacgagcatcgcatcacttgccttcagt aatgatgggactacgcttgcaatagcgtcatcatatatgtatgaaatggatgacacagaa catcctgaagatggtatcttcattcgccaagtgacagatgcagaaacaaaacccaagtga >gi568815588f:123054918_123262829|GENSCAN_predicted_peptide_6|153_aa MIPDSKTDSGPWMLGYLLPRALLICHSLGLQFVLSFRNGNDKRRNNGRAKKGHGHVQPIC CTNCAQCVPKDKAIKKFVIRNIVEAAAVRDISEAGIFDAYVLPKLYVKLHYCVSCAIHNK VVRNQSCEARKDRTPPPRFRPAGAAPPPLPKPI >gi568815588f:123054918_123262829|GENSCAN_predicted_CDS_6|462_bp atgatacctgattccaagaccgactctggtccctggatgctgggctacctccttcctagg gccctgctgatctgccactcactgggcttgcagtttgtgctgtccttccgtaatgggaat gacaaaagaaggaacaacggtcgtgccaaaaagggccatggccacgtgcagcctatttgc tgcactaactgtgcccaatgcgtgcccaaggacaaggccattaagaaattcgtcattcga aacatagtggaggccgcagcagtcagggacatttctgaagcgggcatcttcgatgcctat gtgcttcccaagctgtatgtgaagctacattactgtgtgagttgtgcaattcacaacaaa gtagtcaggaatcaatcttgtgaagcccgcaaggaccgaacacccccaccccgatttaga cctgcgggtgctgccccacctcccctaccaaagcccatataa >gi568815588f:123054918_123262829|GENSCAN_predicted_peptide_7|79_aa MMFFLFAMEVGWSCRFCSLILDHGSTLVLVLTSLQGFPKSSPSSSKFYKSPGQGQNATSR FAKTARVTFTPVHKFLISI >gi568815588f:123054918_123262829|GENSCAN_predicted_CDS_7|240_bp atgatgttctttctcttcgcaatggaagtaggatggagctgccgattctgctccctgatc ctggaccacggctccacccttgtcctggtgttgacatcccttcagggattccctaaatca tctccctcaagttcaaagttctacaaatctccagggcaggggcaaaatgccaccagtcgc tttgctaaaacagcaagagtcacctttactccagttcacaagttcctcatctccatctga >gi568815588f:123054918_123262829|GENSCAN_predicted_peptide_8|85_aa MSSFLMQVIHTARQIPGGPSKGPRAKVIPVEISGTWIHKIPLEILSLAFDDDDDDDDDDD DAETWGLPRLYLLGILGKGAHQNIF >gi568815588f:123054918_123262829|GENSCAN_predicted_CDS_8|258_bp atgagctcttttctgatgcaggtgatacacacagccagacaaatcccaggtggcccatcc aaaggaccaagagcaaaagttattccagttgagatcagtggaacttggattcacaaaatc cccctggagatcctctctctggcctttgatgatgatgatgatgatgatgatgatgatgat gacgcagaaacatgggggcttccaaggctgtacctcctaggcatcctggggaaaggagcc catcagaacatcttctag >gi568815588f:123054918_123262829|GENSCAN_predicted_peptide_9|575_aa MLMLGGFAFSVELDLLLKTDPEPNLLSPLTQPLSPSHHYYHHHHHHHHTTTTTIITIIIT PPPPPSSLSSSHHHHHHHHTTTTTIITIIIITTPPPSPPPPSHHHHHCRHHYHHHHTTTT IITPPPPPPSSSLSSSSSHRHHHHHHHHTTTTTTTVTITIIIITPPPPPPSSLSSSSSPH HHHHHHHHHTTTTTVTTTTTTITPPPPPPSSSLSSSSSHHHHHHHHHTTTTTTVITIIII TPPPSPSSHHHHHHHHHHHTTTTITPPPPSLSSPSSHHHHHHHHHHHHTTTTITITIITP PPPPLPSPSSHHHHHHHHTTTTITIITPPPSLPSPSSHHHHHHYHHHTTTIITPPPPSYT ITINTPPPPSYTITIITPPPPSLPSPSSHHHHQHYHHHHHTTTTTITITIITLPPPLPSP SSHYHHHYHHHHHTTTTNITITIITPPPPPLPSPPSHHHHHHHHHHTTTTTITIITPPPP PLPSPSLHHPHHYHHHHHTTTTTITITIITPPPSLPSPSLHHHHQYTTTTITITIITSPP PSHHHYNYHHHYHITTIIITIINIIPPPSSSHPLP >gi568815588f:123054918_123262829|GENSCAN_predicted_CDS_9|1728_bp atgctgatgttaggtggatttgcattctctgtcgaactggatctcttgctcaaaactgac cctgagcctaaccttctttccccacttacacagccattgtcaccatcgcaccactactat catcatcaccaccaccatcatcacaccaccaccaccaccatcatcactatcatcatcaca ccaccaccaccaccatcatcactatcatcatcacaccatcaccaccaccatcatcacacc accaccaccaccatcatcactatcatcatcatcaccacaccaccaccatcaccaccacca ccatcacaccaccaccaccactgtcgtcatcactatcatcatcatcacaccaccaccacc atcatcacaccaccaccaccaccaccgtcgtcatcactatcatcatcatcatcacaccgc caccatcaccaccaccaccatcacaccaccaccaccaccaccaccgtcaccatcactatc atcatcatcacaccaccaccaccaccaccatcatcactatcatcatcatcatcaccacac caccaccatcaccaccaccaccatcacaccaccaccaccactgtcaccaccaccaccacc accatcacaccaccaccaccgccaccgtcgtcatcactatcatcatcatcatcacaccac caccatcaccaccaccatcacaccaccaccaccactactgtcatcactatcatcatcatc acaccaccaccatcaccatcatcacaccaccaccaccatcaccatcaccatcatcacacc accaccaccatcacaccaccaccaccatcattatcatcaccatcatcacaccaccaccac caccatcaccatcaccatcatcacaccaccaccaccattaccatcaccatcatcacacca ccaccaccaccattaccatcaccatcatcacaccaccaccatcaccatcatcacaccacc accaccatcaccatcatcacaccaccaccatcattaccatcaccatcatcacaccaccac caccaccattaccatcatcacaccaccaccatcatcacaccaccaccaccatcatatacc atcaccatcaacacaccaccaccaccatcatataccatcaccatcatcacaccgccacca ccatcattaccatcaccatcatcacaccaccaccaccaacattaccatcaccatcatcac accaccaccaccaccattaccatcaccatcatcacactaccaccaccattaccatcacca tcatcacactaccaccatcattaccatcaccatcatcacaccaccaccaccaacattacc atcaccatcatcacgccaccaccaccaccattaccatcaccaccatcacaccaccaccac caccatcaccatcatcacaccaccaccaccaccatcaccatcatcacaccaccaccacca ccattaccatcaccatcattacaccacccccaccattaccatcaccatcatcacaccacc accaccaccattaccatcaccatcatcacacccccaccatcattaccatcaccatcatta caccaccaccatcagtacaccaccaccaccattaccatcaccatcatcacatcaccacca ccatcacaccatcactataattaccaccaccattatcacatcaccaccattatcatcacc atcatcaacatcataccaccaccatcatcatcacatcctcttccttga