GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:26:50 Sequence gi568815590r:126456460_126657389 : 200930 bp : 42.34% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5355 5575 221 2 2 75 64 147 0.806 9.24 1.02 Term + 7137 7248 112 0 1 70 44 115 0.990 2.35 1.03 PlyA + 8548 8553 6 1.05 2.03 PlyA - 11013 11008 6 1.05 2.02 Term - 16279 16154 126 1 0 90 39 152 0.959 7.80 2.01 Init - 23601 23599 3 0 0 108 81 0 0.318 1.35 2.00 Prom - 26671 26632 40 -2.95 3.07 PlyA - 27302 27297 6 1.05 3.06 Term - 30827 30421 407 0 2 1 42 252 0.156 6.26 3.05 Intr - 35053 34929 125 0 2 41 51 98 0.005 0.71 3.04 Intr - 48261 48157 105 2 0 104 93 2 0.101 0.71 3.03 Intr - 48907 48682 226 2 1 73 98 86 0.276 4.22 3.02 Intr - 59373 59176 198 1 0 95 66 122 0.862 9.10 3.01 Init - 62377 62323 55 1 1 77 95 54 0.932 6.50 3.00 Prom - 66646 66607 40 -7.35 4.00 Prom + 67528 67567 40 -5.55 4.01 Init + 76832 76988 157 1 1 78 94 69 0.842 6.62 4.02 Intr + 80568 80659 92 1 2 55 103 33 0.650 0.29 4.03 Intr + 83882 83911 30 1 0 115 87 48 0.485 4.81 4.04 Term + 91058 91156 99 1 0 113 48 29 0.163 -1.45 4.05 PlyA + 94926 94931 6 1.05 5.05 PlyA - 96006 96001 6 1.05 5.04 Term - 100934 99998 937 1 1 43 45 1326 0.008 114.39 5.03 Intr - 101698 101575 124 2 1 50 100 28 0.007 -1.08 5.02 Intr - 102449 102311 139 2 1 47 78 111 0.001 5.12 5.01 Init - 115841 115770 72 2 0 64 113 44 0.344 5.62 5.00 Prom - 120724 120685 40 -7.95 6.02 PlyA - 122118 122113 6 1.05 6.01 Sngl - 125009 124788 222 2 0 75 49 202 0.772 9.90 6.00 Prom - 125801 125762 40 -6.85 7.03 PlyA - 125809 125804 6 -0.45 7.02 Term - 126573 126362 212 1 2 46 42 184 0.340 6.07 7.01 Init - 139840 139765 76 1 1 80 84 60 0.334 6.10 7.00 Prom - 158587 158548 40 -3.15 8.00 Prom + 162657 162696 40 -4.65 8.01 Init + 162866 162977 112 1 1 25 65 80 0.050 -0.16 8.02 Intr + 174610 174895 286 2 1 102 28 171 0.222 7.98 8.03 Intr + 178530 178829 300 0 0 37 53 173 0.281 3.62 8.04 Term + 179388 179556 169 0 1 15 41 140 0.230 -1.63 8.05 PlyA + 180428 180433 6 1.05 9.04 PlyA - 182760 182755 6 1.05 9.03 Term - 191160 190928 233 1 2 67 36 131 0.460 1.45 9.02 Intr - 196355 196217 139 2 1 14 61 123 0.399 1.32 9.01 Intr - 200296 200193 104 2 2 110 97 36 0.674 5.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 100930 99998 933 1 0 76 45 1321 0.950 122.80 S.002 Term + 102306 102516 211 0 1 89 42 137 0.895 4.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:126456460_126657389|GENSCAN_predicted_peptide_1|110_aa MAALPEVIYRFNAIPIKLPLAFFTELEKTILNFIWNQEKARIAKTILGKKNKAGGIMLPD FKLYYKATVTKTTCILKEALVTGKLHVGSNEVMHIVVSIRLSPHGEVFRC >gi568815590r:126456460_126657389|GENSCAN_predicted_CDS_1|333_bp atggccgcactgcctgaagtaatttataggttcaatgccatccccatcaagctaccactg gctttcttcacagaattggaaaaaactattttaaacttcatatggaaccaagaaaaagcc cgcatagccaagacaatcctgggcaagaagaataaagctggaggcatcatgctacctgac ttcaaactgtactacaaggctacagtaaccaaaacaacatgcatcctcaaagaagctctg gtaacagggaagctgcatgtaggatctaatgaagttatgcacattgtggtgtccataagg cttagccctcatggagaagtcttcagatgttga >gi568815590r:126456460_126657389|GENSCAN_predicted_peptide_2|42_aa MSEWSPISTKRTGLSNFKSSGSKVSESTGDDNNNDDDDDVKQ >gi568815590r:126456460_126657389|GENSCAN_predicted_CDS_2|129_bp atgagtgaatggtctcctatatccactaaacggactggattatctaatttcaaaagcagt ggttccaaagtttcagagtcaaccggggatgataataataatgatgatgatgatgatgtg aagcaatag >gi568815590r:126456460_126657389|GENSCAN_predicted_peptide_3|371_aa MEADAGCDALLTVESLPNACFHRLTSTPFDSCTFNNQSDGVIYTVNLQPVLEKGSKKEAP LVPTMPQKKQVKDNRLWSGAPRIGVSRDTQTWEYFEVQEPSRTPRLFPSFERGGNQTLEE WSNVIKATEPLRCGAKTSSHSLVLFPPLPCCHSTYAGGKSFTPCRLLLPHGGPQLFCVLK GTFLSSQLTFHVITSFWCMLAGFGITWMLESHPTDSEPSWICLVSGNVFCLVAESTDHTS SPAMDSNQDEISELPEKEFQSIVKLIKKEPEKSNVQLKEIKNVIQDMKRKFFSEIGSIHK KQSQLLEIKDTLRETQNALESLSNRIKQAEERTLELKDKAFDLTQSTKDKRILKNEQNLQ EVWGYAECPNL >gi568815590r:126456460_126657389|GENSCAN_predicted_CDS_3|1116_bp atggaggcagatgctggctgtgatgctctcttgactgttgaaagccttccaaatgcatgt tttcacagactaacttctacaccttttgattcttgcacttttaataatcagtcggatgga gtaatttatacagttaacttacagccagtgttggaaaaagggtcaaagaaagaggcccct ttggtgccaaccatgccacagaagaagcaagtaaaagataacaggttgtggagcggggca cctagaataggtgtctccagagatacacagacttgggaatactttgaggtgcaggagcct tctagaacacccaggttatttccatcatttgagagaggagggaaccaaaccctggaggag tggagcaatgtgatcaaggccacagagccactcagatgcggagctaagaccagcagccac agccttgtcctcttcccaccccttccctgttgccattccacatatgctggaggaaaaagt ttcactccttgcaggcttctgcttccacatggtggaccccagttgttttgtgttcttaag ggcacctttttgtcatcacagctgacgtttcatgtgatcaccagcttttggtgcatgctg gcaggctttggcattacttggatgctggaatcacaccccacagactcagagccctcctgg atctgccttgtatctgggaatgtcttctgtttggttgctgaaagcactgatcacactagc tcaccagcaatggattcaaaccaagacgaaatctctgaattgccagaaaaagaatttcag tcgattgttaagctaatcaagaaggaaccagagaaaagtaatgttcaacttaaagaaatc aaaaacgtgatacaggatatgaaaagaaaattcttcagtgaaataggtagcatacataaa aaacaatcacaacttctggaaatcaaggacacacttagagaaacacaaaatgcactggaa agtctcagcaatagaattaaacaagcagaagaaagaacattagagctcaaagacaaggct ttcgatttaacccaatccaccaaagacaaaagaattttaaaaaatgaacaaaacctccaa gaagtttggggctatgctgaatgtccaaacctatga >gi568815590r:126456460_126657389|GENSCAN_predicted_peptide_4|125_aa MEVLEKQSIQSISVAKLSHMIPCNHHKHLIKKMLLQAHFSDEETASSKVRMAIPTSFLVL EHTTFIFLVLFLPELHLPTPSRHPCEEVPSAMIGCFNKSGSINAIQIMRHATCWSRRGFR VLSMG >gi568815590r:126456460_126657389|GENSCAN_predicted_CDS_4|378_bp atggaagttttggagaaacaaagcattcagtccatctcagtggctaaactctctcacatg attccatgtaatcatcacaaacatttaattaagaagatgttacttcaggcccacttttca gatgaggaaactgcatcttcaaaagttagaatggctatacccacttcatttctggtcctc gaacacaccaccttcatcttcttggtgctctttctcccagaacttcaccttcctacccct tctcgtcatccctgtgaagaggtgccttctgccatgattggttgcttcaataaaagtgga agcataaatgctattcagataatgcgtcatgccacttgttggagcagaagagggttccga gtactctccatggggtga >gi568815590r:126456460_126657389|GENSCAN_predicted_peptide_5|423_aa MAMQDARSPSKNVIIFILLQKLRKFPTTNSENTSRMTHSSSCPGEESFLVTFRARNMFQE RKETTVPPSPASLPPAPGEEGLPSLHLQTRPRSLGARQGALPQAPRPSPLRGVMGNQVEK LTHLSYKEVPTADPTGVDRDDGPRIGVSYIFSNDDEDVEPQPPPQGPDGGGLPDGGDGPP PPQPQPYDPRLHEVECSVFYRDECIYQKSFAPGSAALSTYTPENLLNKCKPGDLVEFVSQ AQYPHWAVYVGNFQVVHLHRLEVINSFLTDASQGRRGRVVNDLYRYKPLSSSAVVRNALA HVGAKERELSWRNSESFAAWCRYGKREFKIGGELRIGKQPYRLQIQLSAQRSHTLEFQSL EDLIMEKRRNDQIGRAAVLQELATHLHPAEPEEGDSNVARTTPPPGRPPAPSSEEEDGEA VAH >gi568815590r:126456460_126657389|GENSCAN_predicted_CDS_5|1272_bp atggcaatgcaagatgcccgatcaccaagcaaaaatgtaattattttcattctgctccag aaacttagaaagtttcccactactaactctgaaaacacctctaggatgacgcactcttcg agctgtcctggtgaagaatcctttcttgtgacctttagggcaagaaatatgttccaggaa agaaaagaaacgactgttcctccttctccagcctcgctgccgcccgcgccgggagaggag ggactgccgagcctacaccttcaaactcgccctagaagtctcggggcgcggcagggagcc cttccccaggccccacggccttcacctctccgcggcgtgatgggcaaccaggtggagaaa ttgacccacctaagttacaaggaagttcccacggccgacccgactggcgtggaccgggac gacgggccccgcattggggtctcctacattttctccaatgacgatgaggacgtggagccg cagccgccgcctcaggggccagatggcggcggcttgcccgacggtggggacgggccgccg ccgccccagccgcagccctacgatccgcggctgcacgaggtggaatgctccgtgttctac cgggacgaatgcatctaccagaagagcttcgcgccgggctcggcggcgctgagtacctac acgcccgagaacctgctcaacaagtgcaagccgggcgatctggtggagttcgtgtcgcag gctcagtacccgcactgggccgtatatgtgggtaacttccaggtggtgcacctgcaccgg ctggaggtgattaacagcttcctgactgacgccagccagggccgtcgcggccgcgtggtc aacgatctgtaccgctacaagccgctaagctccagcgccgtggtgcgcaacgcgctggcg cacgtgggtgccaaggagcgcgagctgagctggcgcaactcggagagtttcgccgcctgg tgccgctacggcaagcgcgagttcaagatcggcggcgagctgcgcatcggcaagcagccc taccggctgcagattcagctgtcggcgcagcgcagccacacgctcgagttccagagtcta gaggacctgatcatggagaagcgacgcaacgaccagatcgggcgcgcggccgtgctgcag gagctcgccacgcacctgcacccggcggagccggaggagggcgacagcaacgtggcgcgg actacgccgcctcccgggcgcccccctgcgcccagctccgaggaggaggacggagaggca gtggcacactga >gi568815590r:126456460_126657389|GENSCAN_predicted_peptide_6|73_aa MQEQWLAFSPNGSGNGHLAGSETQWTPCQIRRGGSQQRVCDGGKQQWWTESKSSAQAITN RDQKSVQLQDLIE >gi568815590r:126456460_126657389|GENSCAN_predicted_CDS_6|222_bp atgcaggaacaatggctagcctttagcccaaacgggagtggcaatgggcacctcgctgga tcagaaacacagtggacaccctgccagatccggaggggtggaagtcagcagcgggtctgt gacggtggcaaacagcagtggtggacagagagcaaaagctcagctcaagccataacaaac cgggaccagaagagtgtgcagttgcaagatttaattgagtga >gi568815590r:126456460_126657389|GENSCAN_predicted_peptide_7|95_aa MRTSIQPHGEAHMKRNEVYHQKPAPSGIGRSTAKMPGPPGSLEMEPSTFMDIEFSLEEWQ YLDTAQRNLYGNATLMDYRNLVFLGIAVSKPDLIT >gi568815590r:126456460_126657389|GENSCAN_predicted_CDS_7|288_bp atgaggacatccattcagccccatggagaggcccacatgaagagaaatgaagtctaccac caaaaaccagcaccatcaggtattgggagatccacagctaagatgccaggaccccctgga agcctagaaatggaaccatcgacatttatggacatagaattttctctggaggagtggcaa tacctggacactgcacagcggaatttatacggaaatgcgacgttaatggactacagaaac ctggtcttcctgggtattgctgtctctaagccagacctgatcacttga >gi568815590r:126456460_126657389|GENSCAN_predicted_peptide_8|288_aa MAELSRPKDFAFPHIPLDLLTPANCDDVLRASSVALGETITERRKQGTKPCPCDITVHDP RNWKPLKAKEYARISIQWFFNKKRNLAEQEFQLTTWLFRWRVCLRLFECGTKGAQRSHPS FPETVPPYLIIDWVVSVTSTSFISHELEALSLRPPPTWPLSFWDSPQSSWHIPLVIPAAQ LHCGSGQESSVCKLGPSLRAGQHSSGRGTLASQPSRARASVQLRDGAKADWSGTPYHAFK QVLHLAQRPRKGSSPLRLEKKATAKKQAININVPGKEEERKDTGAQEE >gi568815590r:126456460_126657389|GENSCAN_predicted_CDS_8|867_bp atggcagaactgtctagacccaaagattttgcttttcctcatattccgctcgaccttctg acccctgcaaactgtgatgatgtgttgagggcatcatcagtggctttgggagaaacaata acagagagaaggaagcagggaacaaagccatgcccgtgtgacatcactgtgcatgatcct agaaactggaaacctcttaaagccaaagagtatgcacggatatctattcagtggttcttt aacaagaagagaaacctggcagaacaggagtttcaactcaccacgtggcttttcaggtgg agggtctgtttgcgattatttgagtgtggaacaaagggtgcacagagaagtcaccccagt tttcctgaaactgtgcctccttacctcattattgactgggtggtgtctgtcacttccaca tcatttatctcccatgaacttgaagctctgtccctccgccctcctccaacatggccactg tcattttgggattcaccccagtcttcctggcacatccctctggtgattcctgccgcgcag ctgcactgtggctcagggcaggagagcagcgtgtgcaaactggggccatctctgagggct ggccagcacagttctggccgtgggactctggcttcacagccctctcgtgccagagcttct gtgcagctcagagatggtgccaaggcagactggagtggcacgccttaccacgccttcaaa caagtgctccatctggcacaaagaccaagaaaaggttccagtcccttgaggttggagaag aaagcaacggcaaagaaacaagctattaatattaatgttccaggaaaagaagaagagaga aaggacacgggcgcccaggaagaatga >gi568815590r:126456460_126657389|GENSCAN_predicted_peptide_9|158_aa XLMKKERDRERKREKKLDHKAWSEWVLKREGLNSKQAYPFHAAKGKGVCGGKRKKENVSF HLASELAAQQENKASANKEDENVRPPSWGEHSGLQQPDSLLMKMQVLVIRNLAIVHGNVG SDRMQAEHQHPMPLPSWFWLDFESDPEMVPLDLEFLFG >gi568815590r:126456460_126657389|GENSCAN_predicted_CDS_9|477_bp nctctgatgaagaaagagagagacagagagagaaaaagagagaaaaaacttgatcacaag gcttggagtgaatgggtgttgaagagagaagggcttaatagtaagcaagcatatcctttt catgcagccaagggcaagggtgtctgtgggggcaagagaaagaaggaaaatgtgagtttc cacttagccagtgagttagcggctcagcaagagaacaaagctagtgcaaacaaagaagac gagaatgtcaggcccccatcatggggtgagcacagtgggctccagcagcctgacagtctc ctcatgaagatgcaggtactggtcatcagaaatttggccattgtgcatggaaatgttggt tctgacagaatgcaggcagagcatcagcatcccatgccactcccatcttggttctggctt gactttgaaagtgatcctgaaatggttccattggatctggaattcctttttggataa