GENSCAN 1.0    Date run: 8-Nov-116    Time: 04:52:52

Sequence gi568815580f:59800178_60002750 : 202573 bp : 42.22% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -     31     26    6                                 1.05
 1.03 Term -   1918   1824   95  2  2   72   39    61 0.067  -3.39
 1.02 Intr -   2172   1963  210  1  0   65  101    89 0.091   5.86
 1.01 Init -   7795   7723   73  1  1   48   97    66 0.349   4.78
 1.00 Prom -  10608  10569   40                                -6.15

 2.00 Prom +  14111  14150   40                                -5.15
 2.01 Sngl +  14826  15077  252  2  0   50   54   225 0.463  10.14
 2.02 PlyA +  15200  15205    6                                 1.05

 3.00 Prom +  20889  20928   40                                -3.65
 3.01 Init +  23585  23648   64  1  1   74   91    33 0.357   2.38
 3.02 Intr +  23896  24164  269  2  2   32   45   294 0.642  15.83
 3.03 Intr +  24560  24841  282  1  0  -19  -43   360 0.704   8.89
 3.04 Term +  24926  25087  162  1  0   15   37   222 0.699   6.75
 3.05 PlyA +  25135  25140    6                                 1.05

 4.00 Prom +  25188  25227   40                                -4.55
 4.01 Init +  44031  44144  114  2  0   67   81   111 0.807   8.56
 4.02 Term +  53095  53208  114  0  0   57   53    91 0.179   0.09
 4.03 PlyA +  53484  53489    6                                 1.05

 5.16 PlyA -  53519  53514    6                                 1.05
 5.15 Term -  54532  54235  298  0  1   35   38   161 0.216  -0.45
 5.14 Intr -  57617  57368  250  0  1   36   62   237 0.104  11.47
 5.13 Intr -  64251  64134  118  0  1   40   61   106 0.043   2.22
 5.12 Intr -  85543  85365  179  2  2   89   17    80 0.008  -0.28
 5.11 Intr -  97539  97486   54  1  0   64   90    50 0.142   0.83
 5.10 Intr -  99707  99558  150  0  0  109   59   131 0.917  11.51
 5.09 Intr - 100069  99839  231  2  0   62  113   252 0.916  21.92
 5.08 Intr - 100415 100264  152  1  2   53  -26   218 0.214   5.79
 5.07 Intr - 118476 118233  244  1  1   66   27   155 0.100   2.83
 5.06 Intr - 121534 121483   52  1  1   46  108    20 0.650  -2.64
 5.05 Intr - 121806 121674  133  2  1   47   94    72 0.778   3.43
 5.04 Intr - 123221 123115  107  2  2   66   61    76 0.777   0.79
 5.03 Intr - 124778 124581  198  2  0   50   33   166 0.826   5.93
 5.02 Intr - 129211 129136   76  0  1   85   81    60 0.635   3.60
 5.01 Init - 132999 132701  299  0  2   53   48   195 0.783   8.74
 5.00 Prom - 134866 134827   40                                -6.45

 6.00 Prom + 146730 146769   40                                -7.65
 6.01 Init + 152059 152096   38  0  2   62  116    43 0.185   4.03
 6.02 Intr + 164621 164666   46  2  1  107   64    34 0.031   0.39
 6.03 Term + 169373 169966  594  1  0   -3   48   611 0.723  41.54
 6.04 PlyA + 169968 169973    6                                -6.47

 7.00 Prom + 170114 170153   40                               -13.11
 7.01 Sngl + 170325 171197  873  2  0    3   49   493 0.992  32.59
 7.02 PlyA + 171711 171716    6                                 1.05

 8.06 PlyA - 172678 172673    6                                 1.05
 8.05 Term - 174776 174657  120  2  0   36   53   121 0.522   0.89
 8.04 Intr - 179566 179460  107  2  2   56   75    65 0.351   1.01
 8.03 Intr - 181237 181138  100  1  1   60   50    97 0.497   1.86
 8.02 Intr - 186362 186178  185  0  2   68   92   204 0.472  17.39
 8.01 Init - 187122 187014  109  0  1   50   88    72 0.611   3.83

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:59800178_60002750|GENSCAN_predicted_peptide_1|125_aa
MGVDYAIGVVVPQETREKQGWAALGKRHERSARPAGIRVQKPVSSCIYPDFRDPATGPPR
RLADSTVKILLDLTQSETFCHDEGQEQEIYQACSVNILFTCGKNQSTSVVTYHLIDVYTP
IVMGK

>gi568815580f:59800178_60002750|GENSCAN_predicted_CDS_1|378_bp
atgggggtggactatgcaattggtgttgtggtgccacaggaaacaagagaaaaacaaggg
tgggcagctcttggaaagagacatgagcgctctgcaaggccagccgggatcagggtccag
aagcctgtgtcatcctgtatctacccagacttcagggaccctgccactggcccacccaga
aggctggcagattctactgtcaaaatattgttggatctgactcaaagtgaaactttctgt
catgacgaagggcaagaacaggaaatttaccaagcttgcagcgtgaatattctttttaca
tgtggcaaaaatcagtccacttccgttgtaacataccacctgatagatgtctatacacct
attgtgatgggcaaataa

>gi568815580f:59800178_60002750|GENSCAN_predicted_peptide_2|83_aa
MNMNSFKMCEETKKEDDMGSRKWDDDALGMATKDIPRMMGNSRVIVQQALELLVQIGVVG
QTAPKGMSSKKFQKGKRKKEKRN

>gi568815580f:59800178_60002750|GENSCAN_predicted_CDS_2|252_bp
atgaatatgaactcctttaagatgtgtgaggaaaccaagaaagaagatgacatgggatcc
aggaaatgggatgatgatgcactcggaatggcaactaaggacattccaagaatgatggga
aattccagggtgattgtacagcaggccttggagctactcgtccagatcggagtagtagga
caaaccgctccaaaagggatgtcttcaaaaaaatttcaaaaaggaaaaagaaagaaagaa
aaaagaaattga

>gi568815580f:59800178_60002750|GENSCAN_predicted_peptide_3|258_aa
MAQGLLSSLSIELTAPSLVLPDHPAFGWLSEMFHGGSRQSGLQADAADPQVPPREGHGAD
GQETMMLKAIQGHLENSRALEKLLPRIQGNVGFVFTKEDLTEIRDMLVASKASSLKPLGT
TEILSDVQLIKTGVKVGASEATLLNMRNISPFSFGLVIQQVFDNGSIYNTEVLDITEEIL
NPHFLEGVRNIANVCLQIGYPAVASFKVFLVDPYAFAAATPVATATTAAPSAAAASAKIE
AKEELEELDEDTGFGLFD

>gi568815580f:59800178_60002750|GENSCAN_predicted_CDS_3|777_bp
atggctcagggcctgctgtcttctctttctattgagctcacagcgccatctcttgtcctc
ccagatcatccagcttttggatggttatccgaaatgtttcatggtgggagcagacaatct
gggctccaagcagatgcagcagatcctcaagtccctccgagggaaggccatggtgctgat
gggcaagaaaccatgatgctcaaggccatccaagggcacctggaaaacagccgagctctg
gagaaactgttgcctcgtatccagggaaatgtgggctttgtgttcaccaaggaggacctc
actgagatcagggacatgctggtagccagtaaggcatcatcactaaaacctttgggcacc
actgaaatcctgagtgatgtgcagctgatcaagactggagtcaaagtgggagccagcgaa
gccacactgctgaacatgcggaacatctctcccttctcctttgggctggtcatccagcag
gtgtttgacaatggcagcatctacaatactgaagtgcttgacatcacagaggaaattctg
aatcctcacttcctggagggtgtccgcaacattgccaatgtctgtctgcagattggttat
ccagctgttgcctccttcaaggtgttcttggttgatccatatgcctttgcggctgctacc
cctgtggccactgccaccacagctgctccttctgctgctgcagcctcagctaagattgaa
gccaaggaagagttagaggaattggatgaggatacaggatttggtctctttgactaa

>gi568815580f:59800178_60002750|GENSCAN_predicted_peptide_4|75_aa
MVCPELIHEESDSECGTLCTRAVLDSSKESLLSKTESKFSGYPRVGLPDGDGYVTPPAAI
SCTVLTFCSSDHNKN

>gi568815580f:59800178_60002750|GENSCAN_predicted_CDS_4|228_bp
atggtttgtcctgaacttattcatgaggaatcagactcagaatgtggaacactctgcaca
agagctgtcctggactcttcaaaagagtcattgctcagcaaaacagaaagcaagttctca
ggatatccccgggttggccttcctgatggtgatggttatgtgactccacctgctgctatc
tcttgtactgtgctaacattttgctctagtgatcataacaagaattga

>gi568815580f:59800178_60002750|GENSCAN_predicted_peptide_5|846_aa
MLASMGVVAKEMKYLSSPYPQHAFHTRIKSLMTVNFQAQLWLPNEAGPTGDGDWGGDTHG
CPSFSPHLPRFIQKQPWGTGRRLLQEHFSCKSLVPVIPSRDIEAFHLHADLILNTLPSLV
IIIGQIESHSEELRARVSTYESGEDTIQCITVTWKPKYGATRWKDPGSPDGPLEERRLED
LSTRNNSIRLFRGCGYRVKQNSSQRPNKVCGLPDTGVLLQTKCGLVRNSQRSAQTNTKQH
EELFSTAGIPENLSRRQEVLPGHGFTGWVGWTWKLTSGEGVTFHHLSATHKLSHASCRQG
NFKPKGAMSVTRSTVHPHLAFLSAEINGKCGRPCVFLGIWAINCGMAILYERLSALVCVS
VSERHLDYTIDSRLHLHLFLVAPGKGKWRGGKRGGKELFAPGETQLHPNRKSGLVPRRTR
PGRYLLEPARGSVERSCAPSSQASPPVPPPTQLQSPGTSASNWSTSDSCRTRAEHEQSCR
QGCWDRVSGRAEMPTTHDGVMGADAGRGRGSTPLSATCPDTLPTRGGADPRRGAQSARPR
NYFLTKRGNCKFSLQGGKERKEVGRGSQELCPIGALELWAKGGLGPAQIPWAALAPKRHS
SSTYQALSVLTLLLSDINSDSLMQEKNTSHTTEGEYTTEQRQSQKLPRALVHCYVDELLL
KMAAGDFCHPLRMPQSPVMRGKGKLQSGDSFSPCLDDTEESTEGSTLYSWFHPTHVHLWL
PGEKSPVHRDRNGFCHRVGKRGEVADERDVIHCAWVGGLHCLKAPPDDANVQPRLRRPAP
DQGSRTRRASESQDGLLAGLHSQVSSAVVQVWGLRICISNMLPGGPGSSWSQRAHYEEHF
DPVYAS

>gi568815580f:59800178_60002750|GENSCAN_predicted_CDS_5|2541_bp
atgttggcatctatgggagttgtggccaaggaaatgaaatatctgagctcaccttatccc
cagcatgcatttcataccaggatcaaatcactaatgaccgtgaatttccaggctcaacta
tggctgcctaatgaggctggccccactggggatggggactggggaggggacacacacggg
tgcccaagcttcagtccacatctccctcggttcatccagaagcagccatggggcacagga
agacgtctgcttcaagagcatttcagctgcaaaagtctggttcctgtaattccctcccgg
gatattgaggcattccacttgcatgcggaccttatcctcaacactctgccaagtctggtc
atcataataggtcagatagagtcacattctgaggaactgcgggctagagtttcaacatat
gaatctggggaagatacaattcagtgcataacagtgacctggaagccaaaatatggagcc
acaagatggaaggatcctggatccccagatggccccttggaagagagaagactagaagac
ctctcaactaggaacaacagcattagactttttagaggctgcggctatcgggtgaaacag
aactcctcacagagaccaaacaaggtttgtgggttaccagacactggtgttctcttacag
acaaagtgtggtcttgtcagaaatagccaaagatcagcacaaacaaacacaaagcagcat
gaagaactgttttctactgctggaataccagaaaatctgagcaggagacaagaagtgttg
ccaggacatggctttactggttgggttggttggacatggaaattgacatctggagaagga
gtgactttccaccatctctcagcaacacacaagctgagtcatgccagctgccggcagggc
aattttaagcccaaaggggccatgtccgtcacccgatccacagtacacccacacctagct
ttcctgtctgcagagataaacggcaaatgcgggaggccatgtgtctttctgggtatctgg
gccatcaactgtggaatggccattttatatgagcgtttatcagccttggtttgtgtatca
gtgtcggagagacacctggattatacaatagacagccgcctacacttgcacttgttcctc
gtggccccggggaagggcaagtggcgaggaggaaagagaggaggaaaggagctttttgcc
cctggtgaaacgcagctgcatcccaatcgcaaatccggcttggtccctcgccgtacccgc
ccgggtcggtacctgctggagcccgcgcggggctcggttgagcgttcttgcgcgccttct
tcccaggcatctccgccggtgccgccgcccactcagctacagagcccgggaacctcagcc
tccaactggagcacctcggacagctgcaggacgcgagctgaacacgaacagtcctgcagg
cagggatgctgggatcgggtgtccggacgcgcggagatgccaactacacacgacggcgtt
atgggagcggacgcgggacgcgggcggggaagtaccccgctgagcgcgacctgcccggac
acgctcccgacccgcggcggcgccgacccgcggcggggggcgcagagcgctcggccccga
aattacttccttacaaaacgagggaattgcaaattcagcttgcaaggaggaaaagagcgg
aaagaagtcgggcgagggtcccaggaactctgccctattggagctctggagctttgggct
aaaggaggcttgggtccagcacagattccctgggcggctttggctcccaagagacacagc
agctctacgtaccaggcactgtcagttttgacattgttgctgtcagatattaactctgat
tccttaatgcaggaaaaaaacacttcccatacaactgaaggtgaatacacaacagaacaa
cgacagtcccagaagcttccccgggccttggtgcattgttatgtagatgagcttttgcta
aagatggctgcaggggatttctgtcaccccctgcggatgccacagtcccctgtgatgagg
ggaaaggggaaactgcaaagtggggacagcttcagcccctgccttgatgacactgaggag
tctacagaggggtctacgctatattcctggtttcatccaactcatgttcatctgtggcta
cctggggaaaagagcccagttcacagggacagaaatggtttctgccatcgcgttggcaag
agaggggaagtggctgatgaaagagatgtgattcattgtgcctgggttggaggcctgcat
tgtttgaaagctcccccagacgatgctaacgtacagcccaggttgaggaggccagctcca
gaccaggggtcccgcactcggcgtgcatcagaatcacaggacggcttgcttgctgggctc
catagccaagtttctagtgcagtagttcaggtatggggcctaagaatctgcatttctaac
atgctcccaggtggtccagggagctcctggtcccagagagcacactatgaggagcacttt
gatcctgtctatgccagttag

>gi568815580f:59800178_60002750|GENSCAN_predicted_peptide_6|225_aa
MAFEEDPWERGKSNAALSPYSFHENSANGRELDAAAQPEGQLLREVRVLGVPFIPRARVD
AWLVHTVAVGSADEAHGLLGAAAASSTGGAGASVDGGSQAVQGGGGDPRAARSGPLDAGE
EEKAPAEPTAQVADAGGCASEENEVLREKHEAVDHSSQREENEERVSALKENSLQQNNDD
ENKIAEKPDWEAEKTSESRNERHLNGADTSFFLSGRLIPVAFITA

>gi568815580f:59800178_60002750|GENSCAN_predicted_CDS_6|678_bp
atggcctttgaggaagatccatgggaaagagggaaaagcaatgcagctctctcgccttat
tctttccatgaaaattctgctaatggccgggagctggacgctgccgcgcagcccgagggc
cagctgctccgggaggtgcgcgtgctcggggtccccttcatccctcgcgcccgggtggat
gcgtggctggtgcacaccgtggctgtcgggagcgcggacgaggcccacgggctgctcggc
gccgccgccgcctcgtccaccggaggagctggcgccagcgtggacggcggcagccaggct
gtgcaggggggcggcggggatccccgagcggctcggagtggtcccttggacgccggggaa
gaggagaaggcacccgcggaaccgacggctcaggtggcggacgctggcggatgtgcgagc
gaggagaacgaggtactaagagaaaagcacgaagctgtggatcatagttcccagcgtgag
gaaaatgaagaaagggtgtcagccctgaaggagaactcacttcagcagaataatgatgat
gaaaacaaaatagcagagaaacctgactgggaggcagaaaagacctctgaatctagaaat
gagagacatctgaatggggcagatacttctttctttctctctggaagacttattccagtt
gctttcatcacagcctga

>gi568815580f:59800178_60002750|GENSCAN_predicted_peptide_7|290_aa
MSLATEDNFDPIDVSQLFDEPDSDSGLSLDSSHNSTSVIKFNSSHSVCDEGAIGYCTDRD
SSSHHDLEGAVGGYYPEPSKLCHLDQSDSGFHGDLTFQHIFHNHTYHLQPSAPESTSEPF
SWPGKSQKIRSRYLEDTDRNLSRDEWSAKALRIPFSVDEIVGMPVDSFNSMLSRYYLTDL
QVSLIRDIRRRGKNKVSAQNCRKRKLDIILNLEDDVCNLQAKKETLREQAQCNKAINIMK
QKVHDLYHDIFSRLRDDQGRPVNPNHYALQCTHDGSILIVPKELVASGHK

>gi568815580f:59800178_60002750|GENSCAN_predicted_CDS_7|873_bp
atgtcattggccacagaagacaacttcgatccgatcgatgtttctcagctttttgatgaa
ccagattctgattctggcctttctttagattcaagtcacaatagtacctctgtcatcaag
tttaattcctctcactctgtgtgtgatgaaggtgctataggttattgtactgaccgtgac
tctagttcccatcatgacttagaaggtgctgtaggcggctactacccagaacccagtaag
ctttgtcacttggatcagagtgattctggtttccatggggatcttacatttcaacacata
tttcataaccacacttaccacttacagccaagtgcaccagaatctacttctgaacctttt
tcgtggcctgggaagtcacagaagataaggagtaggtaccttgaagacacagatagaaac
ttgagccgtgatgaatggagtgctaaagctttgcgtatccctttttctgtagatgaaatt
gtcggcatgcctgttgattctttcaatagcatgttaagtaggtattatctgacagaccta
caagtctcacttatccgtgatatcagacgaagagggaaaaataaagtttctgcgcagaac
tgtcgtaaacgcaaattggacataattttgaatctagaagatgatgtatgtaacttgcaa
gcaaagaaggaaactcttagagagcaagcacaatgtaacaaagctattaacattatgaaa
cagaaagtgcatgacctttatcatgacatttttagtagattaagagatgaccaaggtagg
ccagtcaatccaaaccactatgctcttcagtgtacccatgatggaagtatcttgatagta
cccaaagaactggtggcctcaggccacaaatag

>gi568815580f:59800178_60002750|GENSCAN_predicted_peptide_8|206_aa
MAPPPNTVTLVLVSTYEFGGNINIQAIVPRNLECQEGPGGNGSVIKEEKMGVSRKKQGET
MAYKDKVEHTWPGRIPPQSLSAPNFSEERDLLVLVLTKQHLAVTPLSMAPSDPKVTTPSL
HKTTLNLHAIKIKCLNRAPFSGDFPKSKVILDDWYQTFMFQQMHSGKRYEQRENGRKRAS
IKIKERCETNKAVSKQSWLPAILDGK

>gi568815580f:59800178_60002750|GENSCAN_predicted_CDS_8|621_bp
atggccccacctccaaatactgtcactctggttttggtttcaacctatgaatttggaggg
aatataaacattcaggctatagtaccaaggaaccttgagtgtcaagaaggccctggaggg
aatgggtctgtcattaaagaagaaaagatgggcgtgagtaggaaaaagcaaggagaaacc
atggcctacaaggacaaggtggaacatacgtggccaggccgaatcccacctcagtctctc
agtgcccccaacttcagcgaggaacgtgacctgctggtactcgtcctcacaaagcaacac
ctggcagttacccccctttccatggcacccagtgatccaaaagttactaccccttccctg
cataaaacgacccttaatctgcatgcaattaaaattaagtgcctaaacagagctccattc
agtggggatttcccaaaaagtaaagtgatattggatgactggtaccagacttttatgttc
cagcagatgcatagcggtaagagatatgagcaacgagaaaatgggcgaaaaagagccagc
atcaagattaaagagagatgtgagactaacaaggcagtgagcaagcaatcctggctgcct
gctatccttgatggcaaatga