GENSCAN 1.0 Date run: 8-Nov-116 Time: 05:06:07 Sequence gi568815593r:160303288_160515294 : 212007 bp : 43.13% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.07 PlyA - 1541 1536 6 -0.45 1.06 Term - 2302 2201 102 1 0 67 45 110 0.643 2.88 1.05 Intr - 8685 8571 115 2 1 76 86 87 0.979 7.75 1.04 Intr - 9336 9077 260 0 2 15 66 103 0.065 -2.94 1.03 Intr - 10253 10122 132 2 0 95 77 34 0.059 3.84 1.02 Intr - 18199 18116 84 1 0 72 45 111 0.067 5.22 1.01 Init - 20655 20650 6 0 0 69 99 0 0.393 0.17 1.00 Prom - 22615 22576 40 -3.56 2.00 Prom + 26447 26486 40 -4.76 2.01 Init + 27169 27305 137 0 2 102 41 129 0.495 7.22 2.02 Term + 28298 28445 148 2 1 129 47 21 0.331 -0.53 2.03 PlyA + 31098 31103 6 1.05 3.00 Prom + 34821 34860 40 -3.46 3.01 Init + 37540 37594 55 0 1 105 51 3 0.428 -0.25 3.02 Intr + 39124 39201 78 2 0 84 40 69 0.433 1.22 3.03 Intr + 39242 39638 397 0 1 101 60 217 0.383 13.84 3.04 Term + 44158 44218 61 1 1 80 42 41 0.043 -4.02 3.05 PlyA + 44280 44285 6 1.05 4.03 PlyA - 44297 44292 6 1.05 4.02 Term - 46494 45881 614 1 2 115 42 1142 0.982 106.95 4.01 Init - 51724 51481 244 1 1 93 116 310 0.948 30.20 4.00 Prom - 55367 55328 40 -5.46 5.04 PlyA - 56262 56257 6 1.05 5.03 Term - 60566 60458 109 1 1 82 41 70 0.385 -0.32 5.02 Intr - 63329 63186 144 1 0 23 46 123 0.413 1.00 5.01 Init - 67475 67225 251 2 2 79 96 202 0.498 17.04 5.00 Prom - 84479 84440 40 -4.66 6.02 PlyA - 84986 84981 6 1.05 6.01 Sngl - 92203 90419 1785 1 0 61 43 465 0.977 33.82 6.00 Prom - 92739 92700 40 -5.96 7.14 PlyA - 96933 96928 6 1.05 7.13 Term - 100177 99998 180 1 0 69 43 214 0.997 12.61 7.12 Intr - 101269 101153 117 1 0 112 86 237 0.999 26.56 7.11 Intr - 101593 101522 72 1 0 84 107 15 0.858 2.60 7.10 Intr - 103342 103181 162 1 0 53 45 170 0.962 9.37 7.09 Intr - 104328 104189 140 1 2 48 86 113 0.996 7.28 7.08 Intr - 104526 104459 68 2 2 75 84 1 0.998 -3.05 7.07 Intr - 104781 104684 98 0 2 74 80 97 0.999 6.31 7.06 Intr - 105173 105042 132 2 0 124 106 206 0.999 26.84 7.05 Intr - 109232 109164 69 2 0 88 87 37 0.842 2.98 7.04 Intr - 110333 110169 165 2 0 27 85 201 0.994 13.86 7.03 Intr - 110692 110612 81 1 0 57 107 52 0.934 3.83 7.02 Intr - 111185 111032 154 1 1 69 87 76 0.995 5.67 7.01 Init - 112007 111838 170 2 2 62 106 207 0.892 18.91 7.00 Prom - 112566 112527 40 -3.26 8.00 Prom + 116099 116138 40 -7.96 8.01 Init + 119026 119116 91 0 1 75 89 71 0.991 6.55 8.02 Intr + 119422 119606 185 2 2 37 115 56 0.884 2.71 8.03 Intr + 120950 121043 94 1 1 106 35 57 0.948 1.74 8.04 Intr + 124428 124586 159 1 0 102 100 80 0.660 10.56 8.05 Intr + 128955 129022 68 1 2 51 85 16 0.203 -3.78 8.06 Intr + 130127 130255 129 1 0 116 70 26 0.116 4.49 8.07 Intr + 135454 135612 159 0 0 78 59 48 0.042 1.08 8.08 Intr + 184102 184209 108 0 0 50 90 73 0.022 4.18 8.09 Intr + 200812 200887 76 0 1 95 57 13 0.015 -2.01 8.10 Term + 206267 206451 185 0 2 103 47 128 0.550 7.91 8.11 PlyA + 208976 208981 6 1.05 9.02 PlyA - 209492 209487 6 1.05 9.01 Term - 211898 211776 123 2 0 100 48 88 0.929 4.38 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 9333 9077 257 0 2 30 66 171 0.879 3.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:160303288_160515294|GENSCAN_predicted_peptide_1|232_aa MQPWKLRKPLIEGGFVASRWPKESSTLLLAVCLYVQESSQKLKALQSQLFTECSLNTWCS DYPFTRIISFNFQQRMREARPPIPPAREGGPRRPGACEPRRDPSARRGAATAGASRAAEA AVGSAERLGRGSNRRQAPEGTAGSALGGARRSQPPTPREDAVRPESRVASDSGAAYAMMD EPWWEGRVASDVHCTLREKFVFWRSPTDVLSLNHPEKPAAMDVGLAVGIGKG >gi568815593r:160303288_160515294|GENSCAN_predicted_CDS_1|699_bp atgcagccctggaagctacggaagcccctcatagaaggtggctttgtggcatccaggtgg cccaaggaaagcagcaccctgctgctggcagtgtgtctttatgtgcaggaaagcagtcag aaactaaaggctcttcagtcacagttgtttactgagtgttcacttaatacctggtgtagt gattatccctttacacgcattatctcctttaattttcagcagcgcatgcgcgaggcccgg cccccaattcccccagcgagggagggaggcccccggcggccgggagcctgcgagccgcgg cgggacccgagcgcacgcaggggcgcggcgacggcgggggctagtcgggctgcggaggcg gccgtcgggagtgcggagcgcctcggacgagggtccaaccgccggcaggcaccagagggc acggctggctcggcactgggaggggcccggcgctcgcagccccccacgcccagagaggat gcggtgcgccctgagagccgggtagcctcggatagcggcgctgcgtacgcgatgatggat gagccgtggtgggaagggcgcgtcgcctcggacgtccactgcaccctgcgcgagaagttt gttttctggcgctctccaacggatgtgctctctctaaaccatcctgagaagccagccgct atggacgtgggccttgctgtgggcattggaaagggctga >gi568815593r:160303288_160515294|GENSCAN_predicted_peptide_2|94_aa MGLKVSWAPLSSRLPLVTAILFSIIVPVKPSGFKPHVFKVHLLATRFHSKVNQWNQSFAS TLNPLDFLITCSIHLVNLQPRFNTAACSAPDPVH >gi568815593r:160303288_160515294|GENSCAN_predicted_CDS_2|285_bp atgggcctaaaggtgtcgtgggctcctctctcctctcggctcccccttgtcactgccatt ctcttctccatcattgttcctgtaaaaccttcaggctttaaacctcatgtcttcaaagtg cacctcctggccaccagattccattccaaagtcaatcagtggaatcaatcatttgcatct acccttaaccctcttgactttctcataacttgcagcatccacttggtcaaccttcaaccc aggttcaatacagctgcctgttctgcaccagatcctgtgcactga >gi568815593r:160303288_160515294|GENSCAN_predicted_peptide_3|196_aa MAKPLLYKKDKNYLDMVAAPDTLKGVDQILANSQQHPVLAVPTVGLVCVRGKMVGRVRRY SPEDSALPDWPRELTQYYQGPSGSICQFQKFSHLGESLLAAELVNHKVKKKSKLEGNSEI TESSQRDGYSLQQLSQVFYELSAYESQCGCTTDSAGEFPKIPNVQAVLQTNCIRVTWGAA GDPNPHVNLIGPESLS >gi568815593r:160303288_160515294|GENSCAN_predicted_CDS_3|591_bp atggcaaaaccccttctctacaaaaaggacaaaaattatctggacatggtggcagctcct gacactctcaaaggagtggatcagatcctggccaacagccagcagcatccagtcctggca gtgccaactgtgggcttggtctgcgtcagagggaagatggtgggcagggtgcggagatac agccctgaagactctgctctccctgactggccacgggagctgacacagtattaccaaggt ccctcaggctccatctgtcagttccagaaattcagtcaccttggagaaagccttctggca gcagaactggtgaatcataaagtcaaaaaaaaatccaagctagaagggaactcagagatc actgagtccagccagagagatggctattccctgcaacagctctctcaagtgttttacgag ctatctgcctatgagtctcaatgtggctgcaccacagactcagctggggagtttccaaaa attcccaatgtccaggctgtactccagactaattgcattagagtcacttggggtgctgca ggggaccccaacccacacgtcaatctgattgggccagaaagcctttcctga >gi568815593r:160303288_160515294|GENSCAN_predicted_peptide_4|285_aa MIPWVLLACALPCAADPLLGAFARRDFRKGSPQLVCSLPGPQGPPGPPGAPGPSGMMGRM GFPGKDGQDGHDGDRGDSGEEGPPGRTGNRGKPGPKGKAGAIGRAGPRGPKGVNGTPGKH GTPGKKGPKGKKGEPGLPGPCSCGSGHTKSAFSVAVTKSYPRERLPIKFDKILMNEGGHY NASSGKFVCGVPGIYYFTYDITLANKHLAIGLVHNGQYRIRTFDANTGNHDVASGSTILA LKQGDEVWLQIFYSEQNGLFYDPYWTDSLFTGFLIYADQDDPNEV >gi568815593r:160303288_160515294|GENSCAN_predicted_CDS_4|858_bp atgatcccctgggtgctcctggcctgtgccctcccctgtgctgctgacccactgcttggc gcctttgctcgcagggacttccggaaaggctcccctcaactggtctgcagcctgcctggc ccccagggcccacccggccccccaggagccccagggccctcaggaatgatgggacgaatg ggctttcctggcaaagacggccaagatggacacgacggcgaccggggggacagcggagag gaaggtccacctggccggacaggtaaccggggaaagccaggaccaaagggcaaagccggg gccattgggcgggctggcccccgtggccccaagggggtcaacggtacccccgggaagcat ggcacaccaggcaagaaggggcccaagggcaagaagggggagccaggcctcccaggcccc tgcagctgtggcagtggccataccaagtcagctttctcggtggcagtgaccaagagctac ccacgggagcggctgcccatcaagtttgacaagattctgatgaacgagggtggccactac aatgcttccagcggcaagttcgtctgcggcgtgcctgggatctactacttcacctacgac atcacgctggccaacaagcacctggccatcggcctggtgcacaacggccagtaccgcatc cggacctttgatgccaacaccggcaaccacgatgtggcctcaggctccaccatcctggct ctcaagcagggtgacgaagtttggctgcagatcttctactcagagcagaacgggctcttc tatgacccttactggacagacagcctctttacgggcttcctaatctatgccgaccaggat gaccccaacgaggtatag >gi568815593r:160303288_160515294|GENSCAN_predicted_peptide_5|167_aa MHRHTRVVISTASFFQTAAKKRMLRCGPAHVTGRKALPPMTGWENYAWGRRSARLLPLRK AGTRSPAESFFAPDAPGRGGQPRGYTIASAIITAFIAIDAKVCKCSAPHGVTVTATLQMG KQQTQVYPGLPSCCILLSHGSPWSPRGRKQLQTMPKLDGFVFSRSRA >gi568815593r:160303288_160515294|GENSCAN_predicted_CDS_5|504_bp atgcatcgacacacccgagttgtcatcagcactgcaagttttttccaaacagctgctaaa aagagaatgctgcgctgcgggccagcccacgtgaccgggaggaaggctctcccgcccatg acgggatgggaaaactatgcctggggccgacgctctgcccggctgctgccgctgaggaaa gccgggacgcggagccccgccgagagcttctttgctccggacgcccctggacgtggcggg cagccgcgaggctatacaattgcatcagccattataactgctttcattgccattgatgcc aaagtttgcaaatgttcagctcctcacggggtcactgttacggccactttacagatgggg aaacagcagacccaggtctatccaggtttacccagctgctgcattcttctctcccatggc agtccttggtctccacgtggcagaaaacagctgcagacaatgccaaagcttgatggcttt gtcttcagccgctccagagcctga >gi568815593r:160303288_160515294|GENSCAN_predicted_peptide_6|594_aa MSKKRKWDDDYVRYWFTCTTEVDGTQRPQCVLCNSVFSNADLRPSKLSDHFNRQHGGVAG HDLNSLKHMPAPSDQSETLKAFGVASHEDTLLQASYQFAYLCAKEKNPHTVAEKLVKPCA LEIAQIVLGPDAQKKLQQVPLSDDVIHSRIDEMSQDILQQVLEDIKASPLKVGIQLAETT DMDDCSQLMAFVRYIKEREIVEEFLFCEPLQLSMKGIDVFNLFRDFFLKHKIALDVCGSV CTDGASSMLGENSEFVAYVKKEIPHIVVTHCLLNPHALVIKTLPTKLRDALFTVVRVINF IKGRAPNHRLFQAFFEEIGIEYSVLLFHTEMRWLSRGQILTHIFEMYEEINQFLHHKSSN LVDGFENKEFKIHLAYLADLFKHLNELSASMQRTGMNTVSAREKLSAFVRKFPFWQKRIE KRNFTNFPFLEEIIVSDNEGIFIAAEITLHLQQLSNFFHGYFSIGDLNEASKWILDPFLF NIDFVDDSYLMKNDLAELRASGQILMEFETMKLEDFWCAQFTAFPNLAKTALEILMPFAT TYLCELGFSSLLHFKTKSRSCFNLSDDIRVAISKKVPRFSDIIEQKLQLQQKSL >gi568815593r:160303288_160515294|GENSCAN_predicted_CDS_6|1785_bp atgtcgaagaaacgcaaatgggatgatgactatgttcgctactggttcacctgtacaacg gaagtagatggaactcagcgtccacagtgtgtgttgtgtaactcagtattttcaaatgct gacctcagaccatcaaaactgtcagaccattttaacagacagcatggtggtgtagctggg catgatctcaatagcctgaagcatatgccagcaccatctgatcagagtgaaaccttgaaa gcatttggagttgcatctcatgaggatactttattacaagcatcgtatcaatttgcgtat ttatgtgccaaggagaagaatcctcatacagtagctgaaaagttagtgaaaccttgtgca ctggaaatagcacaaatagttttgggaccagatgcacaaaaaaagcttcagcaggtaccc ttatcagatgatgtgatccattctagaattgatgaaatgagccaggatatcttacagcaa gttctagaagatatcaaagccagtcctcttaaagtgggtattcagcttgctgagacaact gacatggatgactgcagtcagctaatggcatttgtgcgctatataaaagaaagagagatc gtagaagaatttctcttctgtgaaccattgcagctatccatgaaaggaatagatgtgttc aatctcttcagagacttctttctgaagcataagatagcacttgatgtatgtggctctgtt tgtactgatggtgcctcctctatgctaggagaaaattccgagtttgttgcctacgtgaaa aaagagatacctcatatcgtagttacacattgtttattgaatcctcatgcacttgtcata aagacattgcctacaaaactgagggatgctctgtttactgtggtgagggtaataaatttc atcaaagggagagctccaaatcatcgcctatttcaggctttctttgaagaaattggcata gaatatagtgtcctccttttccatactgaaatgaggtggctttcccgaggccaaatactt actcacatttttgaaatgtatgaagaaataaatcagtttcttcaccacaagagcagtaat ttagttgatggctttgaaaataaagagtttaaaattcacctagcataccttgcagattta ttcaaacacttaaatgaactcagtgcatctatgcagaggactgggatgaacacagtatca gctagagagaagttatctgcttttgttaggaagtttccattttggcaaaaacgaattgag aaaagaaattttaccaattttccttttcttgaagaaataattgtttcagataatgaaggc atattcattgcagctgaaataacactgcatctgcaacaattgagcaacttcttccatgga tatttttccattggagatcttaatgaggcaagtaaatggatattggatccatttcttttt aatatcgattttgttgatgatagttatttaatgaaaaatgatcttgctgaattacgagct agtggccaaatcctaatggaatttgagacaatgaagcttgaggatttctggtgtgcccaa ttcacagcatttccaaacctggcaaagacagctctagaaatccttatgccatttgcaact acatacctttgtgagttgggattttcatcacttttacatttcaaaacaaagtccagaagc tgctttaatctgagtgatgatatccgtgtggctatttcaaaaaaagttcctcgtttctcg gacatcattgaacaaaagctacagctacagcagaagtcactgtaa >gi568815593r:160303288_160515294|GENSCAN_predicted_peptide_7|535_aa MSATVVDAVNAAPLSGSKEMSLEEPKKMTREDWRKKKELEEQRKLGNAPAEVDEEGKDIN PHIPQYISSVPWYIDPSKRPTLKHQRPQPEKQKQFSSSGEWYKRGVKENSIITKYRKGAC ENCGAMTHKKKDCFERPRRVGAKFTGTNIAPDEHVQPQLMFDYDGKRDRWNGYNPEEHMK IVEEYAKVDLAKRTLKAQKLQEELASGKLVEQAEKDHNSEDEDEDKYADDIDMPGQNFDS KRRITVRNLRIREDIAKYLRNLDPNSAYYDPKTRAMRENPYANAGKNPDEVSYAGDNFVR YTGDTISMAQTQLFAWEAYDKGSEVHLQADPTKLELLYKSFKVKKEDFKEQQKESILEKY GGQEHLDAPPAELLLAQTEDYVEYSRHGTVIKGQERAVACSKYEEDVKIHNHTNSEECII NEITGEESVKKPQTLMELHQEKLKEEKKKKKKKKKKHRKSSSDSDDEEKKHEKLKKALNA EEARLLHVKETMQIDERKRPYNSMYETREPTEEEMEAYRMKRQRPDDPMASFLGQ >gi568815593r:160303288_160515294|GENSCAN_predicted_CDS_7|1608_bp atgtcagccacagttgtagatgcagttaatgctgcacccctatcggggtccaaagaaatg agtttggaagaaccaaagaagatgaccagagaggactggagaaagaagaaggagctagaa gaacagcgaaaattgggcaatgctcctgcagaagttgatgaagaaggaaaagacatcaac ccccatattcctcagtatatttcttcagtgccatggtatattgatccttcaaaaagacct actttaaaacaccagagaccacaaccagaaaaacaaaagcagttcagctcatctggagaa tggtacaagaggggtgtaaaagagaattccataattactaagtaccgcaaaggagcatgt gaaaattgtggggccatgacacacaaaaagaaagactgctttgagagacctaggcgagtt ggagccaaatttacaggtactaatatagctccagatgaacatgtccagcctcaactgatg tttgactatgatgggaagagggatcggtggaatggctacaatccagaagaacacatgaaa attgttgaagagtatgccaaagttgatttggcaaaacgaacattgaaagcccagaaactc caagaggaattagcctcaggaaaattagtggaacaggctgaaaaagatcataatagtgaa gatgaggatgaagataaatatgcagatgatattgacatgcctggacagaattttgactcc aagagacgaattactgtccggaatctcaggattcgagaagatattgcaaaatatttgagg aatttagatccaaattctgcctactatgatccaaaaactagagcaatgagagagaatcct tatgccaatgcaggaaagaatccagatgaagtgagttatgctggagataactttgttagg tacacaggagataccatttcaatggctcagacacagttgtttgcatgggaagcctatgac aagggatctgaagtgcatctacaggcagatcctacaaagctagagctgttgtataagtcc ttcaaagtcaaaaaagaagatttcaaagaacagcagaaagaaagcatcctggaaaagtat ggtggccaagaacatttggatgcccctccagctgaattgcttttagcccagactgaagac tatgtggagtactcaagacatgggacagtcatcaaaggacaggagcgggctgttgcctgc tctaagtatgaggaggatgtgaagatccacaatcacacaaactctgaggagtgtattata aatgagataactggggaagaatctgtgaaaaaacctcaaaccctcatggagctgcatcaa gaaaaactgaaagaggaaaagaagaagaagaaaaagaaaaagaagaagcatcgaaagagc agttcagatagtgatgatgaagaaaagaagcatgaaaaattgaaaaaggcactgaacgca gaggaggcccgccttcttcatgtcaaggagaccatgcagattgatgagaggaagcggcct tacaatagcatgtatgaaactcgagaacctactgaagaggaaatggaggcatatagaatg aaacgtcagaggccagatgaccccatggcctctttccttggacagtag >gi568815593r:160303288_160515294|GENSCAN_predicted_peptide_8|417_aa MATLIYVDKENGEPGTRVVAKDGLKLGSGPSIKALDGRSQVSTPRFGKTFDAPPALPKAT RKALGTVNRATEKSVKTKGPLKQKQPSFSAKKMTEKTVKAKSSVPASDDAYPEIEKFFPF NPLDFESFDLPEEHQIAHLPLSGVPLMILDEERELEKLFQLGPPSPVKMPSPPWESKAFA LTMLGCVWSSPPLYWEQDECSEKAEKDNKYSQNALSRALSLTSLGSVGGTSIGLPISPAT QLTSFNKTGVERLVEGRALLPPPPDIDKYLLAARSVLSASHASSFGLSEKRHGVWNMFGY NNKVIEDNITPNCDESRQYSCCAFIRQYAIEFFGPFLLTDPVHLSGAVSCIHKPSVAVEY TLTERNEELRVEFWQVVKEVLLVLSNRTMTVSILFAFDEDETDMGDLVNICESYVSP >gi568815593r:160303288_160515294|GENSCAN_predicted_CDS_8|1254_bp atggctactctgatctatgttgataaggaaaatggagaaccaggcacccgtgtggttgct aaggatgggctgaagctggggtctggaccttcaatcaaagccttagatgggagatctcaa gtttcaacaccacgttttggcaaaacgttcgatgccccaccagccttacctaaagctact agaaaggctttgggaactgtcaacagagctacagaaaagtctgtaaagaccaagggaccc ctcaaacaaaaacagccaagcttttctgccaaaaagatgactgagaagactgttaaagca aaaagctctgttcctgcctcagatgatgcctatccagaaatagaaaaattctttcccttc aatcctctagactttgagagttttgacctgcctgaagagcaccagattgcgcacctcccc ttgagtggagtgcctctcatgatccttgacgaggagagagagcttgaaaagctgtttcag ctgggccccccttcacctgtgaagatgccctctccaccatgggaatccaaagcctttgca cttaccatgctcggctgtgtctggtcctctcctccgctctactgggagcaggatgagtgc agtgagaaggctgaaaaggacaataaatactcccagaatgctctctcaagagccctctct ctcaccagtcttggttctgttggtggcaccagcataggcttacctatttctcctgcaact cagctgacctcttttaataaaactggggtggagaggttggtggaagggagggcgctgtta ccaccgccacctgacattgataagtacttactggctgccagatcagtgctaagtgcctca cacgcatcgtcctttggactctcagagaaacgccatggggtgtggaacatgtttggttac aacaataaagttattgaagacaatataacacccaattgtgatgagtccagacagtactca tgctgtgcatttattagacagtatgccatagagttctttggtcccttcctgttgacagat cctgtgcatctttcaggagctgtttcatgcatacacaagcctagtgtggcagttgagtat actttaacagaaaggaatgaagagctaagggtggaattttggcaggtggtgaaggaggta ctgttggttctttcaaatcgcaccatgacagtcagtattctctttgcctttgatgaggat gagactgatatgggagacttggtgaatatctgtgagagttacgtgtctccctga >gi568815593r:160303288_160515294|GENSCAN_predicted_peptide_9|40_aa TVIFMKAKNLSLLVIIISPVPRTVTHIAGIPKRVVECQNE >gi568815593r:160303288_160515294|GENSCAN_predicted_CDS_9|123_bp actgtcattttcatgaaggcaaagaacctgtctctcttggtcatcatcatatccccagta cctcgcactgtgacacacattgctggcattccaaaaagagttgttgaatgccagaatgaa tga