GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:29:57 Sequence gi568815597r:20383652_20585379 : 201728 bp : 48.45% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 769 764 6 -1.95 1.02 Term - 3330 3259 72 0 0 137 36 65 0.617 4.11 1.01 Init - 6137 6027 111 2 0 75 78 93 0.439 5.23 1.00 Prom - 12308 12269 40 -3.16 2.00 Prom + 12760 12799 40 -7.26 2.01 Init + 14494 14650 157 0 1 78 56 165 0.875 12.38 2.02 Intr + 16725 16858 134 1 2 79 -4 116 0.549 1.86 2.03 Intr + 17647 17707 61 0 1 58 110 21 0.385 -0.39 2.04 Intr + 19913 20111 199 0 1 80 76 81 0.383 4.51 2.05 Intr + 22177 22246 70 1 1 59 51 58 0.400 -1.62 2.06 Term + 23022 23144 123 2 0 115 49 35 0.525 0.68 2.07 PlyA + 23492 23497 6 1.05 3.00 Prom + 23873 23912 40 -1.86 3.01 Init + 27892 28031 140 0 2 45 49 153 0.960 4.71 3.02 Intr + 29518 29636 119 1 2 59 70 75 0.628 2.91 3.03 Intr + 29654 29736 83 0 2 20 110 35 0.590 -1.74 3.04 Intr + 30005 30136 132 1 0 110 55 62 0.761 5.94 3.05 Intr + 48648 48782 135 2 0 1 94 86 0.010 1.26 3.06 Intr + 58562 58618 57 1 0 68 110 9 0.004 0.18 3.07 Term + 79767 80183 417 2 0 -43 38 996 0.983 76.78 3.08 PlyA + 81090 81095 6 1.05 4.03 PlyA - 81972 81967 6 1.05 4.02 Term - 82112 82041 72 2 0 112 55 33 0.738 0.31 4.01 Init - 83245 83186 60 1 0 98 72 56 0.579 6.40 4.00 Prom - 96076 96037 40 -3.96 5.08 PlyA - 96554 96549 6 1.05 5.07 Term - 100068 99998 71 1 2 115 35 147 0.999 10.20 5.06 Intr - 101881 101563 319 1 1 44 84 530 0.525 43.73 5.05 Intr - 115211 115101 111 2 0 49 78 74 0.022 3.08 5.04 Intr - 117768 117063 706 2 1 101 26 1031 0.031 89.66 5.03 Intr - 118538 118418 121 0 1 24 76 145 0.907 6.45 5.02 Intr - 119658 119571 88 0 1 98 95 34 0.998 4.64 5.01 Init - 124373 124254 120 2 0 108 86 232 0.839 23.19 5.00 Prom - 133775 133736 40 -5.96 6.00 Prom + 133923 133962 40 -8.96 6.01 Init + 137469 137475 7 2 1 61 114 10 0.226 1.37 6.02 Intr + 139091 139159 69 0 0 62 58 80 0.108 1.55 6.03 Term + 142684 143093 410 2 2 -70 42 443 0.096 19.28 6.04 PlyA + 143166 143171 6 -0.45 7.02 PlyA - 143275 143270 6 1.05 7.01 Sngl - 150637 150407 231 1 0 82 37 165 0.361 5.91 7.00 Prom - 161605 161566 40 -3.56 8.00 Prom + 165739 165778 40 -4.06 8.01 Init + 165827 165943 117 1 0 66 70 41 0.366 0.30 8.02 Intr + 169230 169281 52 2 1 81 105 47 0.291 4.18 8.03 Term + 169303 170312 1010 2 2 75 54 1845 0.527 172.07 8.04 PlyA + 171335 171340 6 1.05 9.03 PlyA - 171935 171930 6 1.05 9.02 Term - 180293 180070 224 0 2 71 38 85 0.014 -1.12 9.01 Init - 193200 193143 58 0 1 65 94 55 0.442 5.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 117768 117039 730 2 1 101 38 1058 0.965 95.05 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:20383652_20585379|GENSCAN_predicted_peptide_1|60_aa MVSACRKLSTALGPAVRAAALVRWQSLSAAAASTASAVARCQSGTSRVAMGTTERLRKLP >gi568815597r:20383652_20585379|GENSCAN_predicted_CDS_1|183_bp atggtgagtgcttgcagaaagctgtcaacagctcttgggccggctgtgcgcgctgcagct ctggtacggtggcagtcactcagtgctgccgcggcctccactgcttcggcggtggcccgc tgccagtctggtaccagccgggttgccatgggaaccactgagcggcttaggaagctgcca tag >gi568815597r:20383652_20585379|GENSCAN_predicted_peptide_2|247_aa MKGENIAQFLAMAGTSDMQNLQQRGRYLGKKKRDPRNDNEDGNDDEDDVGDEASDLPTVP TSFAQAAWGSQGLKDTDPTPAVSIPLAAVGTVPSDTVINRDTGAQRGTVSCSSKLAAVGC LDQDAQMRPVTSIPPEKPGRWTPSLLMEKLKPDRPRSLPKSQGKECQRENWTKSCPSLDL SLPSTWMHMDLHFYTATCVQVYKPMHTEDICASSYQQLYKTTNPSSKEENTPWNEKPPPA RLALKSF >gi568815597r:20383652_20585379|GENSCAN_predicted_CDS_2|744_bp atgaaaggagagaacatagcccagtttctggcaatggctggtacctcagacatgcagaat ctccagcagaggggcaggtatctgggaaagaaaaagagagaccccaggaatgataatgag gatggcaatgatgatgaagatgatgttggtgatgaagcctcagacctacccacggttccc acaagctttgctcaagctgcctggggctcccagggtctgaaggacacagatcccacccca gctgtgagcatccccctggctgccgtgggcacagtgccttctgacacagtgataaacagg gatactggggcccagagagggactgtgtcctgctcaagcaagctggcagcagttggatgt cttgaccaggatgcacagatgagacctgtgacatccatccccccagagaagccaggcagg tggacaccatcattgttgatggagaagctgaagccagacagacctagaagtttgcccaag tcccagggcaaggagtgccagcgtgagaactggaccaagtcatgtccctctctggacctc agtctccccagcacgtggatgcacatggatctgcatttttacacagctacatgtgtgcag gtgtacaagcccatgcacacagaggatatctgtgcctcaagctatcagcagttatacaag accacaaacccaagtagcaaggaggaaaacacaccctggaatgagaagccccctcctgct cgtttagcacttaagagtttttga >gi568815597r:20383652_20585379|GENSCAN_predicted_peptide_3|360_aa MWLSWEASPSLPGAGGASCLQVDPSPIIPDDTVTQQLLNKGTISPDGGKTKIQCVQGLSK VQLRAGGELGPVPRSQDAARAPSVTPWQASLAFDFPSHLASPALDANRTLPCTQPLSDAA ALAGVTAHTMTRGLLHSHLEASDPGIKDPEGILGKEGNLPGETDTAVPENSFDKQGQQKR GLVKALVLLSPSLEMPRRYFFLQVSKCIFPNLHKQVLVTIFQLLIFIIITTITTTIIIIT TTNIITTITIIIITTTITTIITNIIITTTITISTNNSIITNIIITTITIISITISVTIIT TIAIITTTTTTIIIIITIIITITTTTTIITTTITSTTTIIIIIIIIILFSPKTTLESGCC >gi568815597r:20383652_20585379|GENSCAN_predicted_CDS_3|1083_bp atgtggctgagttgggaagcctcaccctccctgcctggggctgggggagcttcctgtctg caagttgacccttccccaatcatccctgatgacacagtcactcagcaactcctgaacaag ggcaccatctccccagatggaggaaaaacgaagatccaatgtgtccaaggattgtccaag gtccaactaagagctggtggcgagctgggaccagtgcccaggtctcaggatgctgcccgg gcaccatctgtcactccttggcaggcctccctggcctttgactttcccagccacttggcc agccctgctcttgatgccaaccgcaccctaccctgcacacagcctctctctgatgcagcc gccctggccggggtcacagctcacaccatgacacggggcctcctgcacagccatcttgag gcatctgacccaggaatcaaagacccagaagggattctggggaaagagggaaacctccca ggggagactgatactgctgtccctgagaactcatttgacaagcaagggcagcaaaagcgt ggcctagtaaaggccttggtgctcctgtccccttctctggaaatgcctaggagatacttc ttcctccaggtctccaaatgcatctttcccaatcttcacaagcaagtccttgtaaccatt ttccagctgttaatcttcatcatcattaccaccatcaccaccaccatcatcatcatcacc accaccaacatcatcaccaccatcaccatcatcatcattaccaccaccatcaccaccatc atcaccaacatcatcatcaccaccaccatcaccataagcaccaacaacagcatcatcacc aacatcatcatcaccaccatcaccatcatcagcatcaccatttctgtcaccatcataacc accatcgccataatcaccaccaccaccactaccataatcatcataattaccatcattatc accatcaccaccaccaccaccattatcaccaccaccatcacaagcaccaccaccatcatc atcatcatcatcatcatcatcttgtttagccccaaaaccaccttggaaagtggatgctgt taa >gi568815597r:20383652_20585379|GENSCAN_predicted_peptide_4|43_aa MRTNWNPCLSLVTSNVDAAGTLGCPLLLGTCQPSAPKAPVPLR >gi568815597r:20383652_20585379|GENSCAN_predicted_CDS_4|132_bp atgaggacaaactggaacccgtgtctgtctctggtgacctccaacgtcgatgccgctggg acccttggctgtcccctgcttttgggcacctgccagccgtcagcacctaaagctcctgtg cctttaagatga >gi568815597r:20383652_20585379|GENSCAN_predicted_peptide_5|511_aa MESGGRPSLCQFILLGTTSVVTAALYSVYRQKARVSQELKGAKKVHLGEDLKSILSEAPG KCVPYAVIEGAVRSVKETLNSQFVENCKGVIQRLTLQEHKMVWNRTTHLWNDCSKIIHQR TNTVPFDLVPHEDGVDVAVRVLKPLDSVDLGLETVYEKFHPSIQSFTDVIGHYISGERPK GIQETEEMLKVGATLTGVGELVLDNNSVRLQPPKQGMQYYLSSQDFDSLLQRQESSVRLW KVLALVFGFATCATLFFILRKQYLQRQERLRLKQMQEEFQEHEAQLLSRAKPEDRESLKS ACVVCLSSFKSCVFLECGHVCSCTECYRALPEPKKCPICRQAITRCGASEFLSANRNTHK RCVPQVHRGELVATLEEQAKVGRRPPASRRRHRRSSPRAVPGRPPPAPAPLTLSAAGGDA GGGGAAAEPPDATMSEVLPYGDEKLSPYGDGGDVGQIFSCRLQDTNNFFGAGQNKRPPKL GQIGRSKRVVIEDDRIDDVLKNMTDKAPPGV >gi568815597r:20383652_20585379|GENSCAN_predicted_CDS_5|1536_bp atggagagcggagggcggccctcgctgtgccagttcatcctcctgggcaccacctctgtg gtcaccgccgccctgtactccgtgtaccggcagaaggcccgggtctcccaagagctcaag ggagctaaaaaagttcatttgggtgaagatttaaagagtattctttcagaagctccagga aaatgcgtgccttatgctgttatagaaggagctgtgcggtctgttaaagaaacgcttaac agccagtttgtggaaaactgcaagggggtaattcagcggctgacacttcaggagcacaag atggtgtggaatcgaaccacccacctttggaatgattgctcaaagatcattcatcagagg accaacacagtgccctttgacctggtgccccacgaggatggcgtggatgtggctgtgcga gtgctgaagcccctggactcagtggatctgggtctagagactgtgtatgagaagttccac ccctcgattcagtccttcaccgatgtcatcggccactacatcagcggtgagcggcccaaa ggcatccaagagaccgaggagatgctgaaggtgggggccaccctcacaggggttggcgaa ctggtcctggacaacaactctgtccgcctgcagccgcccaaacaaggcatgcagtactat ctaagcagccaggacttcgacagcctgctgcagaggcaggagtcgagcgtcaggctctgg aaggtgctggcgctggtttttggctttgccacatgtgccaccctcttcttcattctccgg aagcagtatctgcagcggcaggagcgcctgcgcctcaagcagatgcaggaggagttccag gagcatgaggcccagctgctgagccgagccaagcctgaggacagggagagtctgaagagc gcctgtgtagtgtgtctgagcagcttcaagtcctgcgtctttctggagtgtgggcacgtt tgttcctgcaccgagtgctaccgcgccttgccagagcccaagaagtgccctatctgcaga caggcgatcacccggtgtggggccagtgaattcctcagtgctaacaggaacactcacaag agatgtgttccccaagtccatagaggagaactggtggccacactagaggaacaggcaaaa gttgggcggcgtcccccggcctctcgccggcgccaccgccgcagcagcccgcgggccgtc cccggccggccgcccccggccccagcgccgctgaccctgtccgccgcgggcggggacgcg ggcggaggaggcgccgcggcggagcccccggacgcgaccatgtcggaggtgctgccctac ggcgacgagaagctgagcccctacggcgacggcggcgacgtgggccagatcttctcctgc cgcctgcaggacaccaacaacttcttcggcgccgggcagaacaagcggccgcccaagctg ggccagatcggccggagcaagcgggttgttattgaagatgataggattgatgacgtgctg aaaaatatgaccgacaaggcacctcctggtgtctaa >gi568815597r:20383652_20585379|GENSCAN_predicted_peptide_6|161_aa MAASVPENDRNPVFIKEPAGHLKAAGGQVQIVQSEKNLAATKGIPHLVTNDAGTIRYPGP LIKVNDIIQIDLDTGKSTDVIKFDTGNLWMVTGGANLGRIGVITNRERHPGSFDVVHVKD ANSNSFATWLSSIFVIGKGNKTQGKGIHLTTAEETDKRLAA >gi568815597r:20383652_20585379|GENSCAN_predicted_CDS_6|486_bp atggcggcctccgttcctgagaacgacaggaaccccgtctttatcaaggagcctgctggt cacctgaaagctgccggaggccaagtacaaattgtgcaaagtgagaaaaatcttgcggct acaaaaggaatccctcatctggtaaccaatgatgctggcaccatccgctaccctggtccc ctcatcaaggtgaatgacatcattcagattgatttggatactggcaagagtaccgatgtc atcaagtttgatactggtaacctgtggatggtgactggaggtgcgaacctgggaagaatt ggtgtgattaccaacagagagaggcaccctggatcttttgatgtggttcatgtgaaagat gccaacagcaacagctttgccacctggctttccagcatttttgttattggcaagggcaac aaaacccaaggaaagggtatccacctcaccactgctgaagagacagacaagagactggca gcctaa >gi568815597r:20383652_20585379|GENSCAN_predicted_peptide_7|76_aa MVAQVKNFGINLNSSLSFIPFNQSRPIHISSISNCTRDLTRSHHVCCSYLVHATIISYVE QCSNLLPTLPDLSRRP >gi568815597r:20383652_20585379|GENSCAN_predicted_CDS_7|231_bp atggttgctcaggtcaagaactttggcatcaatctcaactcttctctttccttcataccc tttaaccagtccagacctattcacatcagctctatctccaactgcacccgagatctgact cgttctcaccatgtctgctgctcctacctggtccacgctaccatcatctcttacgtggag cagtgcagcaacctcctccctaccctaccggatctgtcccgcagaccctag >gi568815597r:20383652_20585379|GENSCAN_predicted_peptide_8|392_aa MGIEATSPGLRKVGQAPERVETFNVQHTCTYGHLISSLLGGGLLRAFPPPVSRQDAAPFA AAAMLPWRRNKFVLVEDEAKCKAKSLSPGLAYTSLLSSFLRSCPDLLPDWPLERLGRVFR SRRQKVELNKEDPTYTVWYLGNAVTLHAKGDGCTDDAVGKIWARCGPGGGTKMKLTLGPH GIRMQPCERSAAGGSGGRRPAHAYLLPRITYCTADGRHPRVFAWVYRHQARHKAVVLRCH AVLLARAHKARALARLLRQTALAAFSDFKRLQRQSDARHVRQQHLRAGGAAASVPRAPLR RLLNAKCAYRPPPSERSRGAPRLSSIQEEDEEEEEDDAEEQEGGVPQRERPEVLSLAREL RTCSLRGAPAPPPPAQPRRWKAGPRERAGQAR >gi568815597r:20383652_20585379|GENSCAN_predicted_CDS_8|1179_bp atgggaatagaagccacatcaccaggattaaggaaagtggggcaggctccagagagagta gagaccttcaatgtgcagcacacatgtacctatggccacttgatctctagcctgctgggc ggtggacttctgcgcgccttccctcccccggtctcccgacaggacgccgcccctttcgcc gccgccgcgatgctgccctggagacgtaacaaattcgtgctggtggaggacgaggccaag tgcaaggcgaagagcctgagtccggggctcgcctacacgtcgctgctctccagcttcctg cgctcctgcccggacctgctgcccgactggccgctggagcgcttgggccgtgtgttccgc agccggcgccagaaagtggagctcaacaaggaggacccgacctacaccgtgtggtacctg ggcaacgccgtcaccctgcacgccaagggcgacggctgcaccgacgacgccgtgggcaag atctgggctcgctgcgggcctggcgggggcactaagatgaagctgacgctggggccgcac ggcatccgcatgcagccgtgcgagcgcagcgccgccgggggttcggggggccgcaggccg gcgcacgcctacctgctgccgcgcatcacctactgcacggcggacgggcgccacccgcgc gtcttcgcctgggtctaccgccaccaggcgcgccacaaggccgtggtgctgcgctgccac gctgtgctgctggcgcgggcgcacaaggcgcgcgccctggcccgcctgctccgccagacc gcgctggcggccttcagcgacttcaagcgcctgcagcgccagagcgacgcgcgccacgtg cgccagcagcatctccgcgctgggggcgccgccgcctcggtgccccgcgccccactgcgc cgcctgctcaatgccaagtgcgcctaccggccgccgccgagcgagcgcagccgcggggcg ccgcgcctcagcagcatccaggaggaggacgaggaggaggaggaggacgacgcggaggag caagagggaggagtcccccagcgcgagcggccggaggtgctcagcctggcccgggagctg aggacgtgcagcctgcggggcgccccggcgcccccgccgcccgcgcagccccgccgctgg aaggccggccccagggagcgggcgggccaggcgcgctga >gi568815597r:20383652_20585379|GENSCAN_predicted_peptide_9|93_aa MVFSQVVSVGLIEKQIVEAEGELNTSLRSQAARGNILADIYWPLTAYLAPWATPNPMVVI PHVCFPKEIRRHPRAQPAIWACVYQWNFGRRLM >gi568815597r:20383652_20585379|GENSCAN_predicted_CDS_9|282_bp atggtattcagtcaagtagtcagtgtgggcctcattgagaagcaaatagttgaagcagaa ggagaacttaatacatctctaaggagtcaagctgccaggggaaacatccttgcagacatc tactggccactgacagcataccttgctccctgggccacacctaaccccatggttgtcatc ccccatgtatgcttccccaaggagatcaggagacatcccagggcccagccagccatctgg gcctgcgtgtatcagtggaattttggcagaaggttgatgtaa