GENSCAN 1.0 Date run: 3-Nov-116 Time: 08:01:52 Sequence gi568815596r:226695013_226898738 : 203726 bp : 39.92% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 27472 27511 40 -3.45 1.01 Init + 46939 47173 235 0 1 67 19 205 0.369 9.85 1.02 Term + 52424 52533 110 0 2 82 36 136 0.870 5.49 1.03 PlyA + 52624 52629 6 1.05 2.00 Prom + 56922 56961 40 -3.05 2.01 Sngl + 74246 74578 333 1 0 43 43 346 0.311 21.37 2.02 PlyA + 75016 75021 6 1.05 3.11 PlyA - 75738 75733 6 1.05 3.10 Term - 99869 99745 125 0 2 100 45 95 0.812 3.97 3.09 Intr - 103748 100006 3743 1 2 60 29 3472 0.263 324.90 3.08 Intr - 104482 104275 208 2 1 73 27 209 0.212 10.51 3.07 Intr - 113633 113570 64 2 1 77 95 64 0.004 3.37 3.06 Intr - 140630 140436 195 2 0 26 22 191 0.020 4.99 3.05 Intr - 141206 141034 173 0 2 91 56 166 0.045 12.44 3.04 Intr - 144549 144421 129 1 0 81 110 131 0.949 14.35 3.03 Intr - 153736 153624 113 0 2 88 107 -18 0.017 -0.80 3.02 Intr - 156392 156286 107 2 2 67 63 34 0.123 -3.11 3.01 Init - 156955 156818 138 1 0 56 103 145 0.980 12.99 3.00 Prom - 163280 163241 40 -6.95 4.00 Prom + 167014 167053 40 -5.55 4.01 Init + 169682 170114 433 1 1 65 110 127 0.877 9.02 4.02 Term + 173262 173350 89 1 2 108 41 106 0.779 4.74 4.03 PlyA + 174331 174336 6 1.05 5.03 PlyA - 174515 174510 6 1.05 5.02 Term - 183563 183464 100 1 1 103 43 105 0.982 4.12 5.01 Init - 183811 183645 167 1 2 36 78 105 0.824 3.45 5.00 Prom - 186641 186602 40 -3.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 140748 140685 64 2 1 24 50 170 0.813 3.18 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596r:226695013_226898738|GENSCAN_predicted_peptide_1|114_aa MTTDLWIDCETLIKGSSQLKALVLAFELVAQKEEAAFSWSSAINYGHIADTKRRAKSREP LKSGTWDNLPVLYKGQLGRGYNKKTAIVNQEEGTPNQGLWCLVLGLTRLQNCEK >gi568815596r:226695013_226898738|GENSCAN_predicted_CDS_1|345_bp atgaccactgacctatggatagactgtgaaacgcttattaaaggaagttcccagcttaaa gccttggttctagcatttgagttagtagcacagaaggaggaagctgcattctcctggtcc agcgctattaactatggtcatattgctgatacaaaaagaagagctaaatccagagagcct ctaaagtctgggacatgggataatctccccgttctttacaagggacagctgggaagaggc tacaataagaagacggccatcgtaaaccaggaagagggcacaccaaaccagggtctgtgg tgccttgttcttggacttacccgcctccagaactgtgagaaataa >gi568815596r:226695013_226898738|GENSCAN_predicted_peptide_2|110_aa MMKLKTTKAISLALPKMVTGSAVHMDTELRTVQVQIISDRQGNRSCCKTARSVFRKQLDL WKTRPRQCRPAAPQGTTEPFRGCPSSQPDMDGRSRHADVVLGFRRVSWEA >gi568815596r:226695013_226898738|GENSCAN_predicted_CDS_2|333_bp atgatgaaactgaagacaactaaggccatcagccttgccctgccaaagatggtcactggc agtgcagtacacatggacactgaactcaggacagtccaggttcaaataatcagtgaccgg caaggaaacagaagctgctgcaagactgcccggagtgtcttcagaaagcaattagatcta tggaagactagaccaaggcaatgcaggcctgcagcaccccaaggaaccacagagccattt cgaggctgtccttccagccaaccagacatggatggaagatctagacatgcagatgttgtt ttaggtttccgacgcgtttcatgggaggcttag >gi568815596r:226695013_226898738|GENSCAN_predicted_peptide_3|1664_aa MAEELKGKTSQQKATNTTATNAKLYYTDDWLLRVKDPKELHEHWFQAQLDHQVALLHTEL RNSNSSLKTENLSPSVVSALGGKEFLILKLVYTQSPTKFTNSPKLPFKSPYWLLSLVASE FHVQLPGIRGSLGTPPNIAAKDQEPVPFNLTVLKASRLLSFTAPREAPTSAQQPQPGRQL PQPGETARDAAHNQRQQPDQTDAETHALYDPPASPRLRAHVLCREASSFRAPAQPEAGEL LPGRVRYRLLGGGWERHSPASGAIADNRRAAYRRVPEPASKARSRTLTFEIQPPRCEKSK SHGETGSAVAARYRFAWKSHFLHPPRWARMGAAEDAPAGGGSSSSSSSSNSNSRSAAVSA TELVFGRLVAAGTVGGVGGGGGSMASPPESDGFSDVRKVGYLRKPKSMHKRFFVLRAASE AGGPARLEYYENEKKWRHKSSAPKRSIPLESCFNINKRADSKNKHLVALYTRDEHFAIAA DSEAEQDSWYQALLQLHNRAKGHHDGAAALGAGGGGGSCSGSSGLGEAGEDLSYGDVPPG PAFKEVWQVILKPKGLGQTKNLIGIYRLCLTSKTISFVKLNSEAAAVVLQLMNIRRCGHS ENFFFIEVGRSAVTGPGEFWMQVDDSVVAQNMHETILEAMRAMSDEFRPRSKSQSSSNCS NPISVPLRRHHLNNPPPSQVGLTRRSRTESITATSPASMVGGKPGSFRVRASSDGEGTMS RPASVDGSPVSPSTNRTHAHRHRGSARLHPPLNHSRSIPMPASRCSPSATSPVSLSSSST SGHGSTSDCLFPRRSSASVSGSPSDGGFISSDEYGSSPCDFRSSFRSVTPDSLGHTPPAR GEEELSNYICMGGKGPSTLTAPNGHYILSRGGNGHRCTPGTGLGTSPALAGDEAASAADL DNRFRKRTHSAGTSPTITHQKTPSQSSVASIEEYTEMMPAYPPGGGSGGRLPGHRHSAFV PTRSYPEEGLEMHPLERRGGHHRPDSSTLHTDDGYMPMSPGVAPVPSGRKGSGDYMPMSP KSVSAPQQIINPIRRHPQRVDPNGYMMMSPSGGCSPDIGGGPSSSSSSSNAVPSGTSYGK LWTNGVGGHHSHVLPHPKPPVESSGGKLLPCTGDYMNMSPVGDSNTSSPSDCYYGPEDPQ HKPVLSYYSLPRSFKHTQRPGEPEEGARHQHLRLSTSSGRLLYAATADDSSSSTSSDSLG GGYCGARLEPSLPHPHHQVLQPHLPRKVDTAAQTNSRLARPTRLSLGDPKASTLPRAREQ QQQQQPLLHPPEPKSPGEYVNIEFGSDQSGYLSGPVAFHSSPSVRCPSQLQPAPREEETG TEEYMKMDLGPGRRAAWQESTGVEMGRLGPAPPGAASICRPTRAVPSSRGDYMTMQMSCP RQSYVDTSPAAPVSYADMRTGIAAEEVSLPRATMAAASSSSAASASPTGPQGAAELAAHS SLLGGPQGPGGMSAFTRVNLSPNRNQSAKVIRADPQGCRRRHSSETFSSTPSATRVGNTV PFGAGAAVGGGGGSSSSSEDVKRHSSASFENVWLRPGELGGAPKEPAKLCGAAGGLENGL NYIDLDLVKDFKQCPQECTPEPQPPPPPPPHQPLGSGESSSTRRSSEDLSAYASISFQKQ PEDRRMHLHSQGTQDAHPTLTSGRESNKHVGAATGGLFRLRIPK >gi568815596r:226695013_226898738|GENSCAN_predicted_CDS_3|4995_bp atggctgaagaactaaaaggtaaaacaagccaacaaaaagccaccaacaccaccgcaaca aacgccaagctttactacactgacgactggctgttgagggtaaaagatccaaaggaacta catgagcattggtttcaggctcagttggatcatcaagttgctctgctgcacacggagctc aggaactctaattctagtcttaagactgaaaatctttctccatctgttgtctctgcattg gggggaaaagagttcttaattctcaagctagtctacactcagagtcccaccaaatttacc aactcaccaaaattaccatttaagagtccctactggttactgtctttagtggcttctgag tttcacgtccagctccccggaatccgaggttccctaggcacgcctcctaatatcgctgcc aaggaccaagagcctgtgcccttcaacctcactgttcttaaagcttctcggctcctgagt tttacagccccccgggaggcgccgacctccgcccagcagccccagcccggccgccagctc ccgcagcccggggaaacggcgagagatgccgctcacaaccaacgacagcagccggaccaa acagacgcggaaactcacgctctctacgacccgcctgcgtccccacgtcttcgcgcgcac gttctctgccgggaagcttcgtccttccgggccccagcgcagccggaggctggggagctg ctgcctggaagggtcagataccgcctgttaggcggagggtgggaaagacactctcctgct tcaggagctatcgcagacaaccggagggcggcttatcgcagagtcccagagccagccagc aaggcaaggtcgcggacactcacctttgaaatccagcctccacgctgtgaaaagtccaag tcacatggagagaccggatctgcagtggctgcccggtatcgtttcgcatggaaaagccac tttctccacccgccgagatgggcccggatgggggctgcagaggacgcgcccgcgggcggc ggcagcagcagcagcagcagcagcagcaacagcaacagccgcagcgccgcggtctctgcg actgagctggtatttgggcggctggtggcggctgggacggttgggggcgttggtggtggc ggtggcagcatggcgagccctccggagagcgatggcttctcggacgtgcgcaaggtgggc tacctgcgcaaacccaagagcatgcacaaacgcttcttcgtactgcgcgcggccagcgag gctgggggcccggcgcgcctcgagtactacgagaacgagaagaagtggcggcacaagtcg agcgcccccaaacgctcgatcccccttgagagctgcttcaacatcaacaagcgggctgac tccaagaacaagcacctggtggctctctacacccgggacgagcactttgccatcgcggcg gacagcgaggccgagcaagacagctggtaccaggctctcctacagctgcacaaccgtgct aagggccaccacgacggagctgcggccctcggggcgggaggtggtgggggcagctgcagc ggcagctccggccttggtgaggctggggaggacttgagctacggtgacgtgcccccagga cccgcattcaaagaggtctggcaagtgatcctgaagcccaagggcctgggtcagacaaag aacctgattggtatctaccgcctttgcctgaccagcaagaccatcagcttcgtgaagctg aactcggaggcagcggccgtggtgctgcagctgatgaacatcaggcgctgtggccactcg gaaaacttcttcttcatcgaggtgggccgttctgccgtgacggggcccggggagttctgg atgcaggtggatgactctgtggtggcccagaacatgcacgagaccatcctggaggccatg cgggccatgagtgatgagttccgccctcgcagcaagagccagtcctcgtccaactgctct aaccccatcagcgtccccctgcgccggcaccatctcaacaatcccccgcccagccaggtg gggctgacccgccgatcacgcactgagagcatcaccgccacctccccggccagcatggtg ggcgggaagccaggctccttccgtgtccgcgcctccagtgacggcgaaggcaccatgtcc cgcccagcctcggtggacggcagccctgtgagtcccagcaccaacagaacccacgcccac cggcatcggggcagcgcccggctgcaccccccgctcaaccacagccgctccatccccatg ccggcttcccgctgctcgccttcggccaccagcccggtcagtctgtcgtccagtagcacc agtggccatggctccacctcggattgtctcttcccacggcgatctagtgcttcggtgtct ggttcccccagcgatggcggtttcatctcctcggatgagtatggctccagtccctgcgat ttccggagttccttccgcagtgtcactccggattccctgggccacaccccaccagcccgc ggtgaggaggagctaagcaactatatctgcatgggtggcaaggggccctccaccctgacc gcccccaacggtcactacattttgtctcggggtggcaatggccaccgctgcaccccagga acaggcttgggcacgagtccagccttggctggggatgaagcagccagtgctgcagatctg gataatcggttccgaaagagaactcactcggcaggcacatcccctaccattacccaccag aagaccccgtcccagtcctcagtggcttccattgaggagtacacagagatgatgcctgcc tacccaccaggaggtggcagtggaggccgactgccgggacacaggcactccgccttcgtg cccacccgctcctacccagaggagggtctggaaatgcaccccttggagcgtcgggggggg caccaccgcccagacagctccaccctccacacggatgatggctacatgcccatgtcccca ggggtggccccagtgcccagtggccgaaagggcagtggagactatatgcccatgagcccc aagagcgtatctgccccacagcagatcatcaatcccatcagacgccatccccagagagtg gaccccaatggctacatgatgatgtcccccagcggtggctgctctcctgacattggaggt ggccccagcagcagcagcagcagcagcaacgccgtcccttccgggaccagctatggaaag ctgtggacaaacggggtagggggccaccactctcatgtcttgcctcaccccaaaccccca gtggagagcagcggtggtaagctcttaccttgcacaggtgactacatgaacatgtcacca gtgggggactccaacaccagcagcccctccgactgctactacggccctgaggacccccag cacaagccagtcctctcctactactcattgccaagatcctttaagcacacccagcgcccc ggggagccggaggagggtgcccggcatcagcacctccgcctttccactagctctggtcgc cttctctatgctgcaacagcagatgattcttcctcttccaccagcagcgacagcctgggt gggggatactgcggggctaggctggagcccagccttccacatccccaccatcaggttctg cagccccatctgcctcgaaaggtggacacagctgctcagaccaatagccgcctggcccgg cccacgaggctgtccctgggggatcccaaggccagcaccttacctcgggcccgagagcag cagcagcagcagcagcccttgctgcaccctccagagcccaagagcccgggggaatatgtc aatattgaatttgggagtgatcagtctggctacttgtctggcccggtggctttccacagc tcaccttctgtcaggtgtccatcccagctccagccagctcccagagaggaagagactggc actgaggagtacatgaagatggacctggggccgggccggagggcagcctggcaggagagc actggggtcgagatgggcagactgggccctgcacctcccggggctgctagcatttgcagg cctacccgggcagtgcccagcagccggggtgactacatgaccatgcagatgagttgtccc cgtcagagctacgtggacacctcgccagctgcccctgtaagctatgctgacatgcgaaca ggcattgctgcagaggaggtgagcctgcccagggccaccatggctgctgcctcctcatcc tcagcagcctctgcttccccgactgggcctcaaggggcagcagagctggctgcccactcg tccctgctggggggcccacaaggacctgggggcatgagcgccttcacccgggtgaacctc agtcctaaccgcaaccagagtgccaaagtgatccgtgcagacccacaagggtgccggcgg aggcatagctccgagactttctcctcaacacccagtgccacccgggtgggcaacacagtg ccctttggagcgggggcagcagtagggggcggtggcggtagcagcagcagcagcgaggat gtgaaacgccacagctctgcttcctttgagaatgtgtggctgaggcctggggagcttggg ggagcccccaaggagccagccaaactgtgtggggctgctgggggtttggagaatggtctt aactacatagacctggatttggtcaaggacttcaaacagtgccctcaggagtgcacccct gaaccgcagcctcccccacccccaccccctcatcaacccctgggcagcggtgagagcagc tccacccgccgctcaagtgaggatttaagcgcctatgccagcatcagtttccagaagcag ccagaggaccggagaatgcacttacattctcagggcacacaagatgctcaccccacactg acatctggcagagagtcaaacaaacatgtaggagcagccacaggagggctttttcgtttg agaattcccaagtga >gi568815596r:226695013_226898738|GENSCAN_predicted_peptide_4|173_aa MQRRSRGINTGLILLLSQIFHVGINNIPPVTLATLALNIWFFLNPQKPLYSSCLSVEKCY QQKDWQRLLLSPLHHADDWHLYFNMASMLWKGINLERRLGSRWFAYVITAFSVLTGVVYL LLQFAVAEFMDEPDFKRSCAVGFSEKDLEIRKVKRLINKQFVVDTSLVRIQAP >gi568815596r:226695013_226898738|GENSCAN_predicted_CDS_4|522_bp atgcaacggagatcaagagggataaatactggacttattctactcctttctcaaatcttc catgttgggatcaacaatattccacctgtcaccctagcaactttggccctcaacatctgg ttcttcttgaaccctcagaagccactgtatagctcctgccttagtgtggagaagtgttac cagcaaaaagactggcagcgtttactgctctctccccttcaccatgctgatgattggcat ttgtatttcaatatggcatccatgctctggaaaggaataaatctagaaagaagactggga agtagatggtttgcctatgttatcaccgcattttctgtacttactggagtggtatacctg ctcttgcaatttgctgttgccgaatttatggatgaacctgacttcaaaaggagctgtgct gtaggtttctcagagaaggacttggaaattcggaaggttaagagacttatcaacaaacag tttgtggtagacaccagccttgtgaggattcaggccccctga >gi568815596r:226695013_226898738|GENSCAN_predicted_peptide_5|88_aa MVAITTSTEQVSPNLVGPKKRFLGVTDSVGQEFRRSTRGGRSLLHNVSSLSRENWSLVTS GKLYAFDMGSRDKHFNSQGKKLLVSEAM >gi568815596r:226695013_226898738|GENSCAN_predicted_CDS_5|267_bp atggttgccattaccacttctacagaacaagtctccccaaatctagtgggaccaaagaaa cgttttcttggagtcacagattctgtgggtcaggaattcagaaggagcacacggggtggt cggtctctgctccataatgtcagcagcctcagcagagaaaactggagcctggtgacatcg ggaaagctgtatgcttttgacatgggctccagagacaagcatttcaactcacaaggcaag aaactgcttgtttcagaagcaatgtga