GENSCAN 1.0    Date run: 6-Nov-116    Time: 15:44:19

Sequence gi568815591r:129310116_129510370 : 200255 bp : 41.73% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Init +  35933  35979   47  1  2   69  116    34 0.733   4.51
1.02  Term +  42081  42291  211  0  1  107   49   111 0.620   4.88
1.03  PlyA +  42761  42766    6                               1.05

2.00  Prom +  46852  46891   40                              -5.15
2.01  Init +  65755  65811   57  0  0   69   94    31 0.805   3.16
2.02  Intr +  69526  69634  109  0  1   78  109   103 0.839  10.34
2.03  Intr +  78941  79084  144  0  0   64   98   179 0.649  15.83
2.04  Intr +  79519  79619  101  2  2   75   68    75 0.979   3.11
2.05  Intr +  87107  87209  103  1  1   97   83   160 0.722  15.23
2.06  Intr +  90175  90269   95  2  2  115   64    54 0.816   4.46
2.07  Intr +  93264  93370  107  2  2   66  110    86 0.978   6.69
2.08  Intr +  94982  95098  117  2  0  103   30   110 0.968   5.36
2.09  Intr +  95721  95784   64  0  1  104   57    52 0.982   1.50
2.10  Intr +  96263  96351   89  1  2  138  107   113 0.915  16.05
2.11  Intr +  99361  99431   71  1  2   44   80   117 0.175   4.41
2.12  Term +  99592  99617   26  2  2   78   39    31 0.182  -5.39
2.13  PlyA +  99710  99715    6                              -3.94

3.02  PlyA -  99723  99718    6                               1.05
3.01  Sngl - 100255  99998  258  1  0   39   44   401 0.595  25.68
3.00  Prom - 107347 107308   40                              -6.15

4.00  Prom + 107874 107913   40                              -6.95
4.01  Init + 108824 108883   60  1  0   46  111    36 0.613   3.00
4.02  Intr + 112680 112823  144  2  0  108   94    77 0.535   9.86
4.03  Intr + 114948 115026   79  2  1  102  106   102 0.996  11.61
4.04  Term + 116328 116560  233  1  2   68   46   149 0.886   4.35
4.05  PlyA + 116685 116690    6                               1.05

5.00  Prom + 117460 117499   40                              -8.75
5.01  Init + 124358 124551  194  1  2   93   45   152 0.601   9.75
5.02  Intr + 124788 125003  216  0  0   43  105    88 0.404   2.70
5.03  Intr + 125993 126144  152  2  2  106   68    10 0.893  -0.31
5.04  Intr + 126227 126530  304  0  1   53   50   177 0.333   5.12
5.05  Intr + 133554 133716  163  0  1   73   36    83 0.111   0.66
5.06  Intr + 133909 133983   75  0  0  108   86    57 0.114   6.29
5.07  Intr + 135295 135390   96  0  0   54  101    66 0.110   3.79
5.08  Intr + 135590 135787  198  1  0   53   83   101 0.097   4.73
5.09  Term + 135836 136087  252  1  0   57   38   134 0.093  -0.05
5.10  PlyA + 136126 136131    6                              -1.95

6.03  PlyA - 136244 136239    6                              -0.45
6.02  Term - 137088 136759  330  0  0    9   48   336 0.955  15.37
6.01  Init - 137732 137352  381  2  0   31   77   336 0.031  23.72
6.00  Prom - 138420 138381   40                              -3.45

7.00  Prom + 138898 138937   40                              -9.75
7.01  Init + 139410 139473   64  2  1   94   22    61 0.114   1.46
7.02  Intr + 141498 141632  135  1  0   59  113   148 0.142  14.02
7.03  Intr + 143112 143232  121  1  1  101   24   105 0.587   4.03
7.04  Intr + 144027 144095   69  0  0   94  109    71 0.988   7.18
7.05  Intr + 144306 144412  107  0  2   79   60   103 0.986   5.54
7.06  Intr + 145129 145256  128  2  2  130  101    71 0.989  12.08
7.07  Intr + 146324 146527  204  1  0  116   51   212 0.998  18.77
7.08  Intr + 148100 148335  236  1  2  116   96   295 0.963  28.66
7.09  Intr + 148597 148662   66  1  0   59   92    84 0.859   3.00
7.10  Intr + 149402 149654  253  2  1   79   76    64 0.561   0.81
7.11  Intr + 150186 150257   72  2  0   80   85   104 0.905   7.98
7.12  Intr + 152851 152925   75  0  0   46   91    61 0.449   0.99
7.13  Intr + 153929 154026   98  1  2   91   78    50 0.529   2.19
7.14  Intr + 154176 154252   77  0  2  117   68    56 0.713   4.74
7.15  Intr + 154513 154835  323  2  2   83   89   163 0.466  10.55
7.16  Intr + 155752 155880  129  0  0   53   71    69 0.733   1.67
7.17  Intr + 160472 160600  129  1  0   42  107   121 0.844   9.37
7.18  Intr + 170670 170774  105  2  0   58  101    56 0.856   3.39
7.19  Intr + 172727 172931  205  1  1   48  101   181 0.939  13.25
7.20  Intr + 175464 175666  203  1  2   97   22   200 0.426  12.38
7.21  Term + 186053 186103   51  1  0  113   44    82 0.622   2.75
7.22  PlyA + 186376 186381    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Intr - 134281 134176  106  2  1   53   75    84 0.811   1.95
S.002 Intr - 137753 137352  402  2  0   53   77   349 0.961  24.00
S.003 Intr - 138256 138096  161  2  2   50   93    92 0.809   4.69
S.004 Intr + 141503 141632  130  1  1   48  113   150 0.826  12.85

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:129310116_129510370|GENSCAN_predicted_peptide_1|85_aa
MVTEKIIREDLWDKRRETMVIQVWASGRYFLENERSEPVSRKLTVFVVNDKIQAFKWKLE
FWKTQTRHHELDSFQSSCPRCRPRI

>gi568815591r:129310116_129510370|GENSCAN_predicted_CDS_1|258_bp
atggtaacagaaaagattatcagagaggatttgtgggataagagaagagagaccatggtt
attcaggtttgggcatctggcagatattttcttgaaaatgaacgaagtgagcctgtctca
aggaaactgacagtatttgttgtcaatgataaaattcaagctttcaaatggaaattagaa
ttttggaaaacccaaacccgccaccatgagcttgacagcttccagtcaagctgtccccgg
tgtagacctcgtatttga

>gi568815591r:129310116_129510370|GENSCAN_predicted_peptide_2|360_aa
MLGSKKKYIVNGNSGIKAQIQFADQKQEFNKRPTKIGRRSLSRSISQSSTDSYSSAASYT
DSSDDETSPRDKQQKNSKGSSDFCVKNIKQAEFGRREIEIAEQEMPALMALRKRAQGEKP
LAGAKIVGCTHITAQTAVLMETLGALGAQCRWAACNIYSTLNEVAAALAESGFPVFAWKG
ESEDDFWWCIDRCVNVEGWQPNMILDDGGDLTHWIYKKYPNMFKKIKGIVEESVTGVHRL
YQLSKAGKLCVPAMNVNDSVTKQKFDNLYCCRESILDGLKRTTDMMFGGKQVVVCGYGEV
GKGCCAALKAMGSIVYVTEIDPICALQACMDGFRLVKLNEVIRQVDIVITCTELRPKVYD

>gi568815591r:129310116_129510370|GENSCAN_predicted_CDS_2|1083_bp
atgctgggcagcaagaagaaatacattgttaatggcaactctgggattaaggcccagatc
cagtttgctgaccagaagcaagaattcaacaaacgtcccaccaaaattggacgtcgctct
ttgtctcgttccatttctcagtcatctactgacagctacagctcagcggcttcatataca
gatagctctgatgatgagacatcgcccagggacaagcagcaaaagaactctaagggaagc
agtgacttctgtgttaagaacatcaaacaggcagagtttggacgaagagaaattgaaatt
gctgaacaagaaatgcctgcattgatggctttgaggaagagagctcaaggagaaaagcct
ttggctggagccaaaatcgtgggttgcacacacatcactgctcagactgctgtgcttatg
gaaactctgggtgctctgggggcccagtgccgatgggctgcctgcaacatctattccact
ctcaatgaagtggctgctgctctagcagaaagtggatttcctgtttttgcctggaaggga
gagtcagaagatgacttttggtggtgtatcgatagatgtgtgaatgtggagggctggcag
ccaaacatgatcttggatgatggaggggatcttacccactggatttataaaaagtatccc
aacatgtttaagaaaatcaagggcatagtagaggagagtgttactggagttcacaggctg
taccaactgtccaaagctgggaagctgtgtgttccagccatgaatgtcaatgactcagtc
accaaacagaaatttgacaacctctactgttgccgtgaatcaattcttgatggacttaaa
aggacaacagacatgatgtttggtggaaagcaagtggtagtctgtggctatggagaggtg
gggaaagggtgctgtgctgccctgaaagccatgggctccattgtgtatgtaactgaaatt
gaccccatctgtgccctgcaagcctgtatggatggatttcgactggtgaaattaaatgag
gtcatccgacaagtggacattgttattacctgtacagagctccgaccaaaggtatacgat
tag

>gi568815591r:129310116_129510370|GENSCAN_predicted_peptide_3|85_aa
MLRRKPTRLELKLDDIEEFENIRKDLETRKKQKEDVEVVGGSDGEGAIGLSSDPKSREQM
INDRIGYKPQPKPNNRSSQFGSLEF

>gi568815591r:129310116_129510370|GENSCAN_predicted_CDS_3|258_bp
atgctgagacggaaaccaacacgcctagagctaaagcttgatgacattgaagagtttgag
aacattcgaaaggacctggagacccgtaagaaacagaaggaagatgtggaagttgtagga
ggcagtgatggagaaggagccattgggcttagcagtgatcccaagagccgggaacaaatg
atcaatgatcggattggttataaaccccaacccaagcccaataatcgttcatctcaattt
ggaagtcttgaattttag

>gi568815591r:129310116_129510370|GENSCAN_predicted_peptide_4|171_aa
MRKRGKSVPVNSVKLWENDLWGLLWLMLYLLYVFQASLRTPELTWERVRSQVDHVIWPDG
KRIVLLAEALALIELYNAPEGRYKQDVYLLPKKMDEYVASLHLPTFDAHLTELTDEQAKY
LGLNKNGPFKPNYYRCLGLPRNQGHLGSDELPALELVFASPTTYAYMNSEK

>gi568815591r:129310116_129510370|GENSCAN_predicted_CDS_4|516_bp
atgagaaaaagagggaaatcagtaccagtcaactcagtgaaactgtgggagaatgatttg
tggggcttgctatggctaatgctgtacttgctgtatgtgttccaggcgagtctgcggaca
ccagaactgacctgggagcgagtgagatctcaagttgaccatgtgatatggcctgatggc
aagaggatagtactgctggcagaggctcttgccttgatagagctttacaatgctcctgag
ggtcgctataagcaggatgtctacctgttgcccaagaagatggatgagtatgtggccagc
ctacacctgcctacctttgatgcccacttgacagagctgacagatgaacaggccaagtat
ctgggactcaacaagaatgggcccttcaagcctaattactacaggtgccttgggctcccc
agaaatcagggacacctgggcagtgatgagcttcctgcactggaattggtctttgcatcc
ccaaccacatatgcttacatgaactcagaaaaatga

>gi568815591r:129310116_129510370|GENSCAN_predicted_peptide_5|549_aa
MEDPAAPGTGGPPANGNGNGGGKGKQAAPKGREAFRSQRRESEVRSPESFGWGPETPAGG
KRRLSPLKQKFRSQFIDGGTGAERLSDWLEGWEAGAQPQESRIGEWGNRSPPPRLPAAGP
WASRGTSARSLPVPFPPDIVLVQMFFWGDEPFINSTIIYRGLSASHCFRCQGYISEQDRV
FAHGETNFAICSPPVKSSSGKGSESTVLCFCQYFAVSRDSKAFHLHRGLPADCLQLHSSC
LEMNNCCDHLYLGSLPAKSLAGKISFPLPSAAAGLLLPASTSPITTVPRDGIVPGWVDKA
GVAALRNRSDEIKLSVLQSSSWPSPNVEGCLLCCSSQLAGRGQNCIVTLRTWNSPITGGA
LKKISRLKFPPSGAKTSRPAERNVTGTGSKNFATRSLGVMLLQDDGEHGKTMSMSPLPHF
FSHEMSALVRGNALWNTMTVDEAFHESMDGSLGRSIACRIGKPLSGKRSNTINLPPGSWL
ITLRNGALSRAQCWSLLLANWALSSGRSQVNLGEWKSMLLSPCVTSVPATVATLLMGPLD
NDRGAGERG

>gi568815591r:129310116_129510370|GENSCAN_predicted_CDS_5|1650_bp
atggaggaccccgccgcgcctgggaccgggggcccgcccgcaaatggcaatggcaacggc
ggcggcaaagggaagcaggcggcgcccaagggccgcgaagcgttccgaagccagcggcgg
gagtcagaggtgaggagcccggaaagcttcggctggggcccggagacgcccgccggcggg
aagcggcggctgagccccctgaagcagaaattccgatcccagtttatagacgggggaaca
ggagcggagaggttaagtgactggcttgaggggtgggaggctggagctcagccgcaggaa
agcaggattggcgagtggggcaaccgcagtccccctcccaggctgcctgcggcgggcccc
tgggccagccgagggacttctgcccgttccctgcccgtccccttccccccagacattgtt
ctcgttcaaatgtttttttggggagatgagccattcattaattcaacaattatttatcga
ggactttctgccagccactgttttaggtgccagggatatatcagtgaacaagatagggtc
ttcgcccatggggagacaaattttgcaatctgttccccacctgtcaagtcttcttcaggc
aaaggctctgagagtacagtcctttgcttttgccagtactttgcagtgtccagagactcg
aaagctttccacctgcaccgcggtttacctgctgattgtctgcagctgcactcttcatgc
ctggaaatgaacaactgctgcgatcatctctatctgggctcattgcctgccaaaagcctg
gctggaaagattagctttcctctaccttctgctgcagcaggactccttttacctgccagt
acatcccctattaccacagtacccagagatgggatagttcctggttgggttgacaaagca
ggagttgcagccctgagaaacaggagtgatgagattaagctctctgtgctgcagtcctcc
tcttggccctcaccaaatgttgaaggctgccttctctgctgcagcagccagctggcagga
agaggacagaattgtatagttacactgagaacctggaattcaccaataacaggaggtgct
ttgaagaagatttcaagactcaagtttcctccctctggagctaaaacctctaggccagca
gaacgtaatgtcacgggaacaggaagcaaaaattttgctactagatcactaggggtgatg
ctgcttcaggatgatggggaacatggtaagaccatgagcatgagcccactgccacacttc
tttagccatgaaatgagtgccttggtcagaggcaatgctttgtggaataccatgacggtg
gatgaggcattccatgagtccatggatggtagtcttggcagaagcattgcttgcaggata
ggcaaacccttatctggaaagaggtccaatacaatcaacctgccaccaggtagctggctg
atcaccctgaggaatggtgccttatccagggctcagtgttggtctctgctgctagcaaat
tgggcactcagcagtggccgtagccaggtcaaccttggtgagtggaagtccatgttactg
agcccatgcgtaacttccgtccctgccacagtggccactttgctcatgggcccattggac
aatgacagaggtgctggggaaagaggctga

>gi568815591r:129310116_129510370|GENSCAN_predicted_peptide_6|236_aa
MSVDYRKLNQVVTPIAAAVLDVVSLLEQINTSSGTWYVVIDLASAFFSIPVHKARQKQFA
CSWLGQQYTFTVLPQRYIYSLALSHNLIRRDLDRFLLPQDITLVHYIDDIMLIGSSEQEV
ANAVDLLWGPEQEKALQQAQAVVQAALPLGPYDPADPMVLEVSAADGDAVWCLWQAPRDE
SQWRPLGFWSKALPSSADNYFTFERQLLACYWSLVETERLTMGHQVTMRPQLPIMN

>gi568815591r:129310116_129510370|GENSCAN_predicted_CDS_6|711_bp
atgtcagtggattatcgtaagcttaaccaagtggtgactccaattgcagctgctgtacta
gatgtggtttcattgctcgagcaaattaacacatcttctggtacctggtacgtggtcatt
gacttggcaagtgcctttttctccattcctgtccataaggcccgccagaagcaatttgcc
tgcagctggctgggacagcaatatacctttactgtcctaccccagaggtatatctattct
ctggctttgagtcataatcttattcggagagaccttgatcgctttttgcttccacaagat
atcacattggtccattacattgatgacattatgctgattggatccagtgagcaagaagta
gcaaatgcagtggacttattgtggggtccagaacaagagaaggctctgcaacaggcccag
gctgttgtgcaagctgctctgccacttgggccatatgacccagcagatccaatggtgctt
gaggtgtcagcggcagatggggatgctgtttggtgcctttggcaggctcccagagatgaa
tcacagtggaggcctttaggattttggagcaaggccctgccatcttctgcagataactac
tttacttttgagagacagctcttggcctgttactggtctttggtggaaactgaacgtttg
actatgggtcatcaagtcaccatgcgacctcaactgcctatcatgaactga

>gi568815591r:129310116_129510370|GENSCAN_predicted_peptide_7|949_aa
MGQGGDIATTGDGETVPPGKKVQGKEWLELEEDAQKAYIMGLLDRLEVVSRERRLKVARA
VLYLAQGTFGECDSEVDVLHWSRYNCFLLYQMGTFSTFLELLHMEIDNSQACSSALRKPA
VSIADSTELRVLLSVMYLMVENIRLERETDPCGWRTARETFRTELSFSMHNEEPFALLLF
SMVTKFCSGLAPHFPIKKVLLLLWKVVMFTLGGFEHLQTLKVQKRAELGLPPLAEDSIQV
VKSMRAASPPSYTLDLGESQLAPPPSKLRGRRGSRRQLLTKQDSLDIYNERDLFKTEEPA
TEEEEESAGDGERTLDGELDLLEQDPLVPPPPSQAPLSAERVAFPKGLPWAPKVRQKDIE
HFLEMSRNKFIGFTLGQDTDTLVGLPRPIHESVKTLKQVTGVGSQSSGIGSHQSRSDYQA
PGYWVGLLYQTLSGPSAFLLQSFGQIGKIRSLFTYSVSYIQHKYISIADVQIKNEEELEK
CPMSLGEEVVPETPCEILYQGMLYSLPQYMIALLKILLAAAPTSKAKTDSINILADVLPE
EMPAFCLALFSSELWIWRVTPVPATGCPEHEAGHRCEQAQGDYCKEYLYPASATPQTLQT
QPYLPGEQLGALLHRSWVFARPYLSKEGCKVPFNPDEHLSAFSLSSLRESSHPLHELEVG
LWSGNCPRKVQMSVSLKSESSLLVEEQVLRVQRGFCRVYVHGPINQGIFPLSYIRRHLWI
QPIPAVAIYLKPVLISASYSISVLDYPCCTIQDLPELTTESLEAGDNSQFCWRNLFSCIN
LLRLLNKLTKWKHSRTMMLVVFKSAPILKRALKVKQAMLQLYVLKLLKLQTKYLGRQWRK
SNMKTMSAIYQKVRHRMNDDWAYGNDIDARPWDFQAEECTLRANIEAFNSRRYDRPQDSE
FSPVDNCLQSVLGQRLDLPEDFHYSYELWLEREINAKAFFRAKAEDPFG

>gi568815591r:129310116_129510370|GENSCAN_predicted_CDS_7|2850_bp
atgggtcagggaggagatattgccactactggggatggggaaactgttcctcctggcaaa
aaagtgcagggcaaggaatggctggagttggaagaagatgcccaaaaggcctatataatg
ggactcttggaccggctagaggtggtcagtagggaacggcggctgaaggtggcccgggct
gttctctacctggcccaaggtacttttggggaatgtgattcagaggtcgatgtgctacac
tggtccaggtacaactgcttcctgctgtatcagatggggaccttctccaccttcctggag
ctactccacatggaaattgacaacagccaggcctgtagcagtgcccttcggaaaccagct
gtctccatagctgatagcacagagctcagggtgctgctgagtgttatgtacctaatggtg
gaaaatattcgcctggagcgagagacagacccctgtgggtggagaacagcccgggagacc
ttccgcactgaattaagcttctccatgcataatgaggagccttttgcccttttactcttc
tccatggttaccaagttctgcagtggcctggctcctcacttccccataaagaaggtcctg
ctcctgctctggaaggtggtcatgtttaccctcggtggatttgagcatctgcagactctc
aaagtacagaagcgggcagaattgggcctgcctccactggctgaagacagtatccaggtg
gtgaagagcatgcgtgctgcctccccgccctcttacactcttgacctgggagagtctcag
ctggcacccccaccctccaagctgcgaggccgccgtggctctcgaaggcaactcctcact
aagcaggacagcctggacatctacaatgaaagggatctcttcaagactgaggagcccgcc
acagaggaggaagaggagtctgctggtgatggagaacgaaccttggatggagagctagac
ctgctagagcaggaccctctggtgccacctccaccctcacaggcacccctctctgctgag
cgggtggcttttcccaagggcctgccctgggccccaaaggtcagacagaaggacattgag
cacttcttggagatgagcaggaacaagttcatcggattcaccctggggcaggacacagat
acattggttggattacccaggcccatccatgagagtgtgaagaccctaaagcaggtgact
ggggtgggctctcagtcttcagggatagggagccatcaaagcaggtcagattaccaggct
cctggatactgggtggggcttttataccagactctttctggaccttctgcattcctcctg
cagtcctttggccaaataggaaaaattaggtctctctttacttactcagtttcctatatt
cagcacaagtatatctccatcgcagatgtgcagatcaagaatgaagaggagctggagaag
tgccctatgtctttgggggaagaggtggtaccagagacgccatgtgaaatcctctaccag
ggaatgctgtacagccttccgcagtatatgatcgctctgcttaagattctgctggctgca
gctcccacctctaaggctaagacagactctatcaatatcctggcagatgtcctacctgag
gagatgccggctttctgcttggctctttttagctcagaactctggatctggagggtgaca
ccagtgccagcgactggctgcccagagcatgaagctgggcatcgatgtgaacaggcacaa
ggagattattgtaaagagtatctctaccctgcttctgctactcctcaaacacttcaaact
caaccatatctaccaggtgagcagctaggagcacttctccacaggtcttgggtgtttgcc
aggccctatctctcaaaggaaggatgcaaagtcccttttaatcctgatgagcatctttca
gccttttctttgtcctctctcagggaatccagccacccattgcatgagcttgaggtaggg
ttatggagtgggaactgtcctagaaaggtgcagatgtctgtgtcccttaagtcagaatca
agcttgctggtagaagagcaggtcctcagagtacagagaggcttctgtagggtctatgtc
catggccctatcaaccagggtatttttccactttcatacattcgaagacatctttggata
cagccaattcctgctgttgccatttacctaaagcctgtccttatctctgcatcttacagc
atctcagtcctggattatccttgctgtaccatccaggatttgccggagcttactactgaa
agtctggaagctggagacaacagccagttctgctggaggaacctcttttcctgcatcaac
ctcctgaggctgctcaataaactgaccaaatggaaacattcccggaccatgatgctggta
gtgtttaaatcggcaccaatcttaaagcgggccctcaaggtcaaacaggccatgctgcaa
ctttatgtcctaaagctactaaagttacagaccaagtacctggggcgccaatggaggaaa
agcaacatgaaaaccatgtcagccatttaccagaaagtgcgtcaccgcatgaacgatgac
tgggcttacgggaatgacatcgatgccagaccatgggacttccaagcagaagaatgtacc
ttgagggccaacattgaggcttttaacagccgtcgctatgacagaccccaggactctgag
ttttcacctgtggataactgcttgcagagcgtactggggcagaggttggatctgcctgaa
gatttccactattcatatgagctctggctcgagagagagatcaatgctaaggctttcttc
cgtgcaaaggctgaagatccatttggttga