GENSCAN 1.0 Date run: 3-Nov-116 Time: 11:28:20 Sequence gi568815597r:20400693_20608024 : 207332 bp : 48.53% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2872 3070 199 2 1 80 76 81 0.461 4.51 1.02 Intr + 5136 5205 70 0 1 59 51 58 0.502 -1.62 1.03 Term + 5981 6103 123 1 0 115 49 35 0.603 0.68 1.04 PlyA + 6451 6456 6 1.05 2.00 Prom + 6832 6871 40 -1.86 2.01 Init + 10851 10990 140 2 2 45 49 153 0.959 4.71 2.02 Intr + 12477 12595 119 0 2 59 70 75 0.628 2.91 2.03 Intr + 12613 12695 83 2 2 20 110 35 0.590 -1.74 2.04 Intr + 12964 13095 132 0 0 110 55 62 0.761 5.94 2.05 Intr + 31607 31741 135 1 0 1 94 86 0.010 1.26 2.06 Intr + 41521 41577 57 0 0 68 110 9 0.004 0.18 2.07 Term + 62726 63142 417 1 0 -43 38 996 0.983 76.78 2.08 PlyA + 64049 64054 6 1.05 3.03 PlyA - 64931 64926 6 1.05 3.02 Term - 65071 65000 72 1 0 112 55 33 0.738 0.31 3.01 Init - 66204 66145 60 0 0 98 72 56 0.579 6.40 3.00 Prom - 79035 78996 40 -3.96 4.08 PlyA - 79513 79508 6 1.05 4.07 Term - 83027 82957 71 0 2 115 35 147 0.999 10.20 4.06 Intr - 84840 84522 319 0 1 44 84 530 0.525 43.73 4.05 Intr - 98170 98060 111 1 0 49 78 74 0.022 3.08 4.04 Intr - 100727 100022 706 1 1 101 26 1031 0.031 89.66 4.03 Intr - 101497 101377 121 2 1 24 76 145 0.907 6.45 4.02 Intr - 102617 102530 88 2 1 98 95 34 0.998 4.64 4.01 Init - 107332 107213 120 1 0 108 86 232 0.839 23.19 4.00 Prom - 116734 116695 40 -5.96 5.00 Prom + 116882 116921 40 -8.96 5.01 Init + 120428 120434 7 1 1 61 114 10 0.226 1.37 5.02 Intr + 122050 122118 69 2 0 62 58 80 0.108 1.55 5.03 Term + 125643 126052 410 1 2 -70 42 443 0.096 19.28 5.04 PlyA + 126125 126130 6 -0.45 6.02 PlyA - 126234 126229 6 1.05 6.01 Sngl - 133596 133366 231 0 0 82 37 165 0.361 5.91 6.00 Prom - 144564 144525 40 -3.56 7.00 Prom + 148698 148737 40 -4.06 7.01 Init + 148786 148902 117 0 0 66 70 41 0.366 0.30 7.02 Intr + 152189 152240 52 1 1 81 105 47 0.291 4.18 7.03 Term + 152262 153271 1010 1 2 75 54 1845 0.527 172.07 7.04 PlyA + 154294 154299 6 1.05 8.00 Prom + 171242 171281 40 -3.26 8.01 Init + 188438 188591 154 1 1 110 98 110 0.691 14.00 8.02 Intr + 204236 204347 112 0 1 95 109 177 0.583 19.94 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100727 99998 730 1 1 101 38 1058 0.965 95.05 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:20400693_20608024|GENSCAN_predicted_peptide_1|130_aa XGCLDQDAQMRPVTSIPPEKPGRWTPSLLMEKLKPDRPRSLPKSQGKECQRENWTKSCPS LDLSLPSTWMHMDLHFYTATCVQVYKPMHTEDICASSYQQLYKTTNPSSKEENTPWNEKP PPARLALKSF >gi568815597r:20400693_20608024|GENSCAN_predicted_CDS_1|393_bp nttggatgtcttgaccaggatgcacagatgagacctgtgacatccatccccccagagaag ccaggcaggtggacaccatcattgttgatggagaagctgaagccagacagacctagaagt ttgcccaagtcccagggcaaggagtgccagcgtgagaactggaccaagtcatgtccctct ctggacctcagtctccccagcacgtggatgcacatggatctgcatttttacacagctaca tgtgtgcaggtgtacaagcccatgcacacagaggatatctgtgcctcaagctatcagcag ttatacaagaccacaaacccaagtagcaaggaggaaaacacaccctggaatgagaagccc cctcctgctcgtttagcacttaagagtttttga >gi568815597r:20400693_20608024|GENSCAN_predicted_peptide_2|360_aa MWLSWEASPSLPGAGGASCLQVDPSPIIPDDTVTQQLLNKGTISPDGGKTKIQCVQGLSK VQLRAGGELGPVPRSQDAARAPSVTPWQASLAFDFPSHLASPALDANRTLPCTQPLSDAA ALAGVTAHTMTRGLLHSHLEASDPGIKDPEGILGKEGNLPGETDTAVPENSFDKQGQQKR GLVKALVLLSPSLEMPRRYFFLQVSKCIFPNLHKQVLVTIFQLLIFIIITTITTTIIIIT TTNIITTITIIIITTTITTIITNIIITTTITISTNNSIITNIIITTITIISITISVTIIT TIAIITTTTTTIIIIITIIITITTTTTIITTTITSTTTIIIIIIIIILFSPKTTLESGCC >gi568815597r:20400693_20608024|GENSCAN_predicted_CDS_2|1083_bp atgtggctgagttgggaagcctcaccctccctgcctggggctgggggagcttcctgtctg caagttgacccttccccaatcatccctgatgacacagtcactcagcaactcctgaacaag ggcaccatctccccagatggaggaaaaacgaagatccaatgtgtccaaggattgtccaag gtccaactaagagctggtggcgagctgggaccagtgcccaggtctcaggatgctgcccgg gcaccatctgtcactccttggcaggcctccctggcctttgactttcccagccacttggcc agccctgctcttgatgccaaccgcaccctaccctgcacacagcctctctctgatgcagcc gccctggccggggtcacagctcacaccatgacacggggcctcctgcacagccatcttgag gcatctgacccaggaatcaaagacccagaagggattctggggaaagagggaaacctccca ggggagactgatactgctgtccctgagaactcatttgacaagcaagggcagcaaaagcgt ggcctagtaaaggccttggtgctcctgtccccttctctggaaatgcctaggagatacttc ttcctccaggtctccaaatgcatctttcccaatcttcacaagcaagtccttgtaaccatt ttccagctgttaatcttcatcatcattaccaccatcaccaccaccatcatcatcatcacc accaccaacatcatcaccaccatcaccatcatcatcattaccaccaccatcaccaccatc atcaccaacatcatcatcaccaccaccatcaccataagcaccaacaacagcatcatcacc aacatcatcatcaccaccatcaccatcatcagcatcaccatttctgtcaccatcataacc accatcgccataatcaccaccaccaccactaccataatcatcataattaccatcattatc accatcaccaccaccaccaccattatcaccaccaccatcacaagcaccaccaccatcatc atcatcatcatcatcatcatcttgtttagccccaaaaccaccttggaaagtggatgctgt taa >gi568815597r:20400693_20608024|GENSCAN_predicted_peptide_3|43_aa MRTNWNPCLSLVTSNVDAAGTLGCPLLLGTCQPSAPKAPVPLR >gi568815597r:20400693_20608024|GENSCAN_predicted_CDS_3|132_bp atgaggacaaactggaacccgtgtctgtctctggtgacctccaacgtcgatgccgctggg acccttggctgtcccctgcttttgggcacctgccagccgtcagcacctaaagctcctgtg cctttaagatga >gi568815597r:20400693_20608024|GENSCAN_predicted_peptide_4|511_aa MESGGRPSLCQFILLGTTSVVTAALYSVYRQKARVSQELKGAKKVHLGEDLKSILSEAPG KCVPYAVIEGAVRSVKETLNSQFVENCKGVIQRLTLQEHKMVWNRTTHLWNDCSKIIHQR TNTVPFDLVPHEDGVDVAVRVLKPLDSVDLGLETVYEKFHPSIQSFTDVIGHYISGERPK GIQETEEMLKVGATLTGVGELVLDNNSVRLQPPKQGMQYYLSSQDFDSLLQRQESSVRLW KVLALVFGFATCATLFFILRKQYLQRQERLRLKQMQEEFQEHEAQLLSRAKPEDRESLKS ACVVCLSSFKSCVFLECGHVCSCTECYRALPEPKKCPICRQAITRCGASEFLSANRNTHK RCVPQVHRGELVATLEEQAKVGRRPPASRRRHRRSSPRAVPGRPPPAPAPLTLSAAGGDA GGGGAAAEPPDATMSEVLPYGDEKLSPYGDGGDVGQIFSCRLQDTNNFFGAGQNKRPPKL GQIGRSKRVVIEDDRIDDVLKNMTDKAPPGV >gi568815597r:20400693_20608024|GENSCAN_predicted_CDS_4|1536_bp atggagagcggagggcggccctcgctgtgccagttcatcctcctgggcaccacctctgtg gtcaccgccgccctgtactccgtgtaccggcagaaggcccgggtctcccaagagctcaag ggagctaaaaaagttcatttgggtgaagatttaaagagtattctttcagaagctccagga aaatgcgtgccttatgctgttatagaaggagctgtgcggtctgttaaagaaacgcttaac agccagtttgtggaaaactgcaagggggtaattcagcggctgacacttcaggagcacaag atggtgtggaatcgaaccacccacctttggaatgattgctcaaagatcattcatcagagg accaacacagtgccctttgacctggtgccccacgaggatggcgtggatgtggctgtgcga gtgctgaagcccctggactcagtggatctgggtctagagactgtgtatgagaagttccac ccctcgattcagtccttcaccgatgtcatcggccactacatcagcggtgagcggcccaaa ggcatccaagagaccgaggagatgctgaaggtgggggccaccctcacaggggttggcgaa ctggtcctggacaacaactctgtccgcctgcagccgcccaaacaaggcatgcagtactat ctaagcagccaggacttcgacagcctgctgcagaggcaggagtcgagcgtcaggctctgg aaggtgctggcgctggtttttggctttgccacatgtgccaccctcttcttcattctccgg aagcagtatctgcagcggcaggagcgcctgcgcctcaagcagatgcaggaggagttccag gagcatgaggcccagctgctgagccgagccaagcctgaggacagggagagtctgaagagc gcctgtgtagtgtgtctgagcagcttcaagtcctgcgtctttctggagtgtgggcacgtt tgttcctgcaccgagtgctaccgcgccttgccagagcccaagaagtgccctatctgcaga caggcgatcacccggtgtggggccagtgaattcctcagtgctaacaggaacactcacaag agatgtgttccccaagtccatagaggagaactggtggccacactagaggaacaggcaaaa gttgggcggcgtcccccggcctctcgccggcgccaccgccgcagcagcccgcgggccgtc cccggccggccgcccccggccccagcgccgctgaccctgtccgccgcgggcggggacgcg ggcggaggaggcgccgcggcggagcccccggacgcgaccatgtcggaggtgctgccctac ggcgacgagaagctgagcccctacggcgacggcggcgacgtgggccagatcttctcctgc cgcctgcaggacaccaacaacttcttcggcgccgggcagaacaagcggccgcccaagctg ggccagatcggccggagcaagcgggttgttattgaagatgataggattgatgacgtgctg aaaaatatgaccgacaaggcacctcctggtgtctaa >gi568815597r:20400693_20608024|GENSCAN_predicted_peptide_5|161_aa MAASVPENDRNPVFIKEPAGHLKAAGGQVQIVQSEKNLAATKGIPHLVTNDAGTIRYPGP LIKVNDIIQIDLDTGKSTDVIKFDTGNLWMVTGGANLGRIGVITNRERHPGSFDVVHVKD ANSNSFATWLSSIFVIGKGNKTQGKGIHLTTAEETDKRLAA >gi568815597r:20400693_20608024|GENSCAN_predicted_CDS_5|486_bp atggcggcctccgttcctgagaacgacaggaaccccgtctttatcaaggagcctgctggt cacctgaaagctgccggaggccaagtacaaattgtgcaaagtgagaaaaatcttgcggct acaaaaggaatccctcatctggtaaccaatgatgctggcaccatccgctaccctggtccc ctcatcaaggtgaatgacatcattcagattgatttggatactggcaagagtaccgatgtc atcaagtttgatactggtaacctgtggatggtgactggaggtgcgaacctgggaagaatt ggtgtgattaccaacagagagaggcaccctggatcttttgatgtggttcatgtgaaagat gccaacagcaacagctttgccacctggctttccagcatttttgttattggcaagggcaac aaaacccaaggaaagggtatccacctcaccactgctgaagagacagacaagagactggca gcctaa >gi568815597r:20400693_20608024|GENSCAN_predicted_peptide_6|76_aa MVAQVKNFGINLNSSLSFIPFNQSRPIHISSISNCTRDLTRSHHVCCSYLVHATIISYVE QCSNLLPTLPDLSRRP >gi568815597r:20400693_20608024|GENSCAN_predicted_CDS_6|231_bp atggttgctcaggtcaagaactttggcatcaatctcaactcttctctttccttcataccc tttaaccagtccagacctattcacatcagctctatctccaactgcacccgagatctgact cgttctcaccatgtctgctgctcctacctggtccacgctaccatcatctcttacgtggag cagtgcagcaacctcctccctaccctaccggatctgtcccgcagaccctag >gi568815597r:20400693_20608024|GENSCAN_predicted_peptide_7|392_aa MGIEATSPGLRKVGQAPERVETFNVQHTCTYGHLISSLLGGGLLRAFPPPVSRQDAAPFA AAAMLPWRRNKFVLVEDEAKCKAKSLSPGLAYTSLLSSFLRSCPDLLPDWPLERLGRVFR SRRQKVELNKEDPTYTVWYLGNAVTLHAKGDGCTDDAVGKIWARCGPGGGTKMKLTLGPH GIRMQPCERSAAGGSGGRRPAHAYLLPRITYCTADGRHPRVFAWVYRHQARHKAVVLRCH AVLLARAHKARALARLLRQTALAAFSDFKRLQRQSDARHVRQQHLRAGGAAASVPRAPLR RLLNAKCAYRPPPSERSRGAPRLSSIQEEDEEEEEDDAEEQEGGVPQRERPEVLSLAREL RTCSLRGAPAPPPPAQPRRWKAGPRERAGQAR >gi568815597r:20400693_20608024|GENSCAN_predicted_CDS_7|1179_bp atgggaatagaagccacatcaccaggattaaggaaagtggggcaggctccagagagagta gagaccttcaatgtgcagcacacatgtacctatggccacttgatctctagcctgctgggc ggtggacttctgcgcgccttccctcccccggtctcccgacaggacgccgcccctttcgcc gccgccgcgatgctgccctggagacgtaacaaattcgtgctggtggaggacgaggccaag tgcaaggcgaagagcctgagtccggggctcgcctacacgtcgctgctctccagcttcctg cgctcctgcccggacctgctgcccgactggccgctggagcgcttgggccgtgtgttccgc agccggcgccagaaagtggagctcaacaaggaggacccgacctacaccgtgtggtacctg ggcaacgccgtcaccctgcacgccaagggcgacggctgcaccgacgacgccgtgggcaag atctgggctcgctgcgggcctggcgggggcactaagatgaagctgacgctggggccgcac ggcatccgcatgcagccgtgcgagcgcagcgccgccgggggttcggggggccgcaggccg gcgcacgcctacctgctgccgcgcatcacctactgcacggcggacgggcgccacccgcgc gtcttcgcctgggtctaccgccaccaggcgcgccacaaggccgtggtgctgcgctgccac gctgtgctgctggcgcgggcgcacaaggcgcgcgccctggcccgcctgctccgccagacc gcgctggcggccttcagcgacttcaagcgcctgcagcgccagagcgacgcgcgccacgtg cgccagcagcatctccgcgctgggggcgccgccgcctcggtgccccgcgccccactgcgc cgcctgctcaatgccaagtgcgcctaccggccgccgccgagcgagcgcagccgcggggcg ccgcgcctcagcagcatccaggaggaggacgaggaggaggaggaggacgacgcggaggag caagagggaggagtcccccagcgcgagcggccggaggtgctcagcctggcccgggagctg aggacgtgcagcctgcggggcgccccggcgcccccgccgcccgcgcagccccgccgctgg aaggccggccccagggagcgggcgggccaggcgcgctga >gi568815597r:20400693_20608024|GENSCAN_predicted_peptide_8|89_aa MAQKRPACTLKPECVQQLLVCSQEAKKSAYCPYSHFPVGAALLTQEGRIFKGCNIENACY PLGICAERTAIQKAVSEGYKDFRAIAIAS >gi568815597r:20400693_20608024|GENSCAN_predicted_CDS_8|267_bp atggcccagaagcgtcctgcctgcaccctgaagcctgagtgtgtccagcagctgctggtt tgctcccaggaggccaagaagtcagcctactgcccctacagtcactttcctgtgggggct gccctgctcacccaggaggggagaatcttcaaagggtgcaacatagaaaatgcctgctac ccgctgggcatctgtgctgaacggaccgctatccagaaggccgtctcagaagggtacaag gatttcagggcaattgctatcgccagn