GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:36:47 Sequence gi568815575f:77799554_78005258 : 205705 bp : 39.98% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 465 460 6 1.05 1.02 Term - 23291 23115 177 2 0 -18 47 195 0.395 1.50 1.01 Init - 23496 23347 150 0 0 89 103 79 0.405 9.59 1.00 Prom - 24575 24536 40 -4.85 2.02 PlyA - 26479 26474 6 1.05 2.01 Sngl - 45506 44835 672 2 0 49 48 273 0.705 15.32 2.00 Prom - 45599 45560 40 -6.15 3.14 PlyA - 45764 45759 6 1.05 3.13 Term - 46838 46353 486 2 0 -56 32 479 0.507 21.61 3.12 Intr - 56037 55948 90 0 0 94 79 77 0.989 6.67 3.11 Intr - 57320 57180 141 2 0 79 97 106 0.999 10.23 3.10 Intr - 57944 57804 141 2 0 72 101 115 0.731 10.83 3.09 Intr - 71372 71255 118 1 1 56 95 153 0.968 12.35 3.08 Intr - 76044 75890 155 0 2 99 26 101 0.544 2.95 3.07 Intr - 95903 95612 292 1 1 67 32 160 0.034 4.51 3.06 Intr - 96285 96151 135 2 0 55 -6 164 0.178 2.46 3.05 Intr - 100033 99929 105 0 0 -5 55 163 0.319 2.11 3.04 Intr - 108829 108761 69 0 0 29 97 84 0.304 0.78 3.03 Intr - 111210 110916 295 1 1 6 63 190 0.279 3.44 3.02 Intr - 119049 118887 163 0 1 76 61 80 0.101 2.73 3.01 Init - 121721 121617 105 2 0 79 115 34 0.547 5.47 3.00 Prom - 123814 123775 40 -4.95 4.00 Prom + 124945 124984 40 -5.35 4.01 Init + 131656 131776 121 0 1 56 74 71 0.053 2.90 4.02 Term + 163070 163293 224 0 2 132 49 251 0.342 21.70 4.03 PlyA + 165316 165321 6 1.05 5.02 PlyA - 165641 165636 6 1.05 5.01 Sngl - 170085 169321 765 0 0 111 49 1053 0.993 99.54 5.00 Prom - 170333 170294 40 -8.45 6.00 Prom + 171251 171290 40 -12.23 6.01 Init + 171592 171639 48 0 0 71 97 21 0.736 2.30 6.02 Intr + 172068 172208 141 2 0 80 115 129 0.946 14.43 6.03 Intr + 188689 189178 490 0 1 76 90 452 0.737 35.85 6.04 Intr + 189680 190405 726 0 0 70 91 492 0.994 38.17 6.05 Intr + 198925 199131 207 2 0 59 100 218 0.981 18.23 6.06 Intr + 203520 203683 164 1 2 47 81 228 0.343 16.77 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:77799554_78005258|GENSCAN_predicted_peptide_1|108_aa MERNQRKKAENSKNQNTSSLPKGHNSSPAREQNWTENEFDELTEVGFRRQKNINDLMALK NTAQELHEAYTSINSRIDQVEERISEIEDQLNEIKQEDKIREKKSEKK >gi568815575f:77799554_78005258|GENSCAN_predicted_CDS_1|327_bp atggagagaaaccagcgcaaaaaggctgaaaactccaaaaaccagaacacctcttctctt ccaaagggtcacaactcctcaccagcaagggaacaaaactggacggagaatgagtttgat gaattgacagaagtaggcttcagaaggcagaagaacataaatgacctgatggcgctgaaa aacacagcacaggaacttcatgaagcatacacaagtatcaatagccgaatcgatcaagtg gaagaaaggatatcagagattgaagatcaactcaatgaaataaagcaggaagacaagatt agagaaaaaaagagtgaaaagaaatga >gi568815575f:77799554_78005258|GENSCAN_predicted_peptide_2|223_aa MGDFNTPLSTLDRSTRQKVNKDIQELNSALHQADLIDIYRTLHPKSTEYTFFAAPHNTYS KIDHIVGSKALLSKSKRTEIITNCLSDHGAIKLELRIKKLTQNHSTTWKLNSLLLNDYWV HNKMKAEIKMFFETENKDTTYQNLWDTFKPVCRGKFIALNAHKRKQERSKIDTLTSQLKE IEKQEQTHSKASRRQEITKIRSELKEIDTKNPSKKSVNPGAGF >gi568815575f:77799554_78005258|GENSCAN_predicted_CDS_2|672_bp atgggagactttaacaccccactgtcaacattagacagatccacgagacagaaggttaac aaggatatccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaatatacattctttgcagcaccacacaacacctattcc aaaattgaccacatagttggaagtaaagcactcctcagcaaaagtaaaagaacagaaatt ataacaaactgtctctcagaccacggtgcaatcaaactagaactcaggattaagaaactc actcagaaccactcaactacatggaaactgaacagcctgctcctgaatgactactgggta cataacaaaatgaaggcagaaataaagatgttctttgaaaccgagaacaaagacacaaca taccagaatctctgggacacatttaaaccagtgtgtagagggaaatttatagcactaaat gcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaagaa atagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaagatc agatcagaactgaaggaaatagacacaaaaaacccttcaaaaaaatcagtgaatcctgga gctggtttttga >gi568815575f:77799554_78005258|GENSCAN_predicted_peptide_3|764_aa MNKVTELSNSMPYLRTTTGMAEAYGMNGNNKEIVKLRFPLGKEEQHSQQLLLRNYTVSIC PDLMTLRLKEAPPYNKVGHHHNKRLGGDKGSNKAHTIGEVTVAAAAAAAATTPPSASLFP PPAAAASRLYPSLPRLPPLPGDAPAGSRAATGKRATGHVKRNAWSRASFSCPLLPPGRHA ASSGSLLGDIKPGGWELNICTSTTTTVPVSRRFSALLTKGNIVKVKLELRQYSCCFGANS LQVPLKILHEGGEFQGLLNESLKDSVLATSSHFGLRKLRNISQLRMRAWKAPSRSKVSLI EGRGANMAARWRFWCVSVTMVVALLIVCDVPSASAQRKKEVRTRFPAAWAFPNDWGLRGS VRLFPAPFPCRFYAFGGFNYAVFGVVEFMVLSEKVSQLMEWTNKRPVIRMNGDKFRRLVK APPRNYSVIVMFTALQLHRQQADEEFQILANSWRYSSAFTNRIFFAMVDFDEGSDVFQML NMNSAPTFINFPAKGKPKRGDTYELQVRGFSAEQIARWIADRTDVNIRVIRPPNYAGPLM LGLLLAVIGGLVYLRRSNMEFLFNKTGWAFAALCFVLAMTSGQMWNHIRGPPYAHKNPHT GHVLKEEVRTHGKEVKNLEKRLDEWLTRITNAKKSLKDLMELKTTARELRDECRSLSSRF DQLEERVSVMEDQMNEMKQEEKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDKENG TKLENTAGYYPGELPQSSKAGEHSNSGDTENATKIFLEKSNSKT >gi568815575f:77799554_78005258|GENSCAN_predicted_CDS_3|2295_bp atgaataaagttacagagttgtcaaacagcatgccatatttaaggaccaccactggtatg gctgaggcatatggcatgaatggtaacaacaaggagattgtaaagctaagattcccattg ggaaaggaagaacagcacagccagcaactacttctcaggaactatactgtcagtatctgc cctgatctcatgactctgagattaaaagaagctcctccttataacaaggttggacatcac cataataaaagattgggaggagacaaaggctccaacaaagctcacacaatcggagaagtc acagtagctgcggctgcggcggcggcagcaacaactcctccctctgcttctcttttccca ccccccgccgcggcggccagccgcctttacccctccctccctcgactacccccacttccg ggtgatgcccctgccggaagcagggccgccacgggaaagagagcgactggtcacgtgaaa aggaacgcgtggtctagagccagcttctcctgccccctgctgcccccgggccgacacgca gcctcgagtgggtctttgttaggtgacattaagccaggaggatgggaactcaacatctgt acctctacaactaccacagtacccgtgtctagacgatttagtgcgcttttgaccaaggga aacatcgtgaaggtgaagctagaactgcggcaatacagctgctgctttggtgccaattcg cttcaggtacccttgaaaatattacatgaaggcggcgaatttcaaggacttctaaatgag tctttgaaagattctgttctagcaacttcaagccattttggactccgaaaactccgcaat atttcacaactgcgcatgcgtgcttggaaagcacctagccggagcaaagtttcacttata gaagggagaggagcgaacatggcagcgcgttggcggttttggtgtgtctctgtgaccatg gtggtggcgctgctcatcgtttgcgacgttccctcagcctctgcccaaagaaagaaggag gtgagaacgcggtttccagcagcatgggcttttcccaatgactggggcttaagagggtct gttcgcctcttcccagccccctttccctgccgcttctatgcctttggtggcttcaattac gcggttttcggagttgtggaattcatggtgttatctgaaaaggttagtcagctgatggaa tggactaacaaaagacctgtaataagaatgaatggagacaagttccgtcgccttgtgaaa gccccaccgagaaattactccgttatcgtcatgttcactgctctccaactgcatagacag caagctgatgaagaattccagatcctggcaaactcctggcgatactccagtgcattcacc aacaggatattttttgccatggtggattttgatgaaggctctgatgtatttcagatgcta aacatgaattcagctccaactttcatcaactttcctgcaaaagggaaacccaaacggggt gatacatatgagttacaggtgcggggtttttcagctgagcagattgcccggtggatcgcc gacagaactgatgtcaatattagagtgattagacccccaaattatgctggtccccttatg ttgggattgcttttggctgttattggtggacttgtgtatcttcgaagaagtaatatggaa tttctctttaataaaactggatgggcttttgcagctttgtgttttgtgcttgctatgaca tctggtcaaatgtggaaccatataagaggaccaccatatgcccataagaatccccacacg ggacatgtgctaaaggaggaagttcgaacccatggcaaagaagttaaaaaccttgaaaaa aggttagacgaatggctaactagaataaccaatgcaaagaagtccttaaaggacctgatg gagctgaaaaccacggcacgagagctacgtgatgaatgcagaagcctcagtagccgattc gatcagctggaagaaagggtatcagtgatggaagatcaaatgaatgaaatgaagcaagaa gagaagtttagagaaaaaagaataaaaagaaacgaacaaagcctccaagaaatatgggac tatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgacaaagagaatgga accaagttggaaaacactgcaggatattatccaggagaacttccccaatctagcaaggca ggcgaacattcaaattcaggagatacagagaacgccacaaagatattcctcgagaagagc aactccaagacataa >gi568815575f:77799554_78005258|GENSCAN_predicted_peptide_4|114_aa MIVNEHAAFKHLFNKAHLAPPLIHSTLSGHSTCFREHRVGDPSSASSLGIAVSLGRPVLS RSSSGTVDLLEEVGLQIRDTAFSSTKLLEAISTVSAQVEELAVKCTENARFLKT >gi568815575f:77799554_78005258|GENSCAN_predicted_CDS_4|345_bp atgattgttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccattcaactctgagtggacacagcacatgtttcagagagcacagggttggg gacccttcttcagcatcttccctgggcattgctgtgagtttaggccggcccgttttgagc aggagcagcagcggaacagtagacctgctggaggaagtggggctgcagatcagagacaca gcattttcgtcaaccaaacttcttgaagccatatctacagtatcagctcaagtggaagag cttgccgtcaaatgtacggaaaatgcacgtttccttaaaacatga >gi568815575f:77799554_78005258|GENSCAN_predicted_peptide_5|254_aa MAAYKLVLIRHGESTWNLENRFSCWYDADLSPAGHEEAKRGGQALRDAGYEFDICLTSVQ KRVIRTLWTVLDAIDQMWLPVVRTWRLNERHYGGLTGLNKAETAAKHGEAQVKIWRRSYD VPPPPMEPDHPFYSNISKDRRYADLTEDQLPSYESPKDTIARALPFWNEEIVPQIKEGKR VLIAAHGNSLQGIAKHVEGLSEEAIMELNLPTGIPIVYELDKNLKPIKPMQFLGDEETVC KAIEAVAAQGKAKK >gi568815575f:77799554_78005258|GENSCAN_predicted_CDS_5|765_bp atggccgcctacaaactggtgctgatccggcacggcgagagcacatggaacctggagaac cgcttcagctgctggtacgacgccgatctgagcccggcgggccacgaggaggcgaagcgc ggcgggcaggcgctacgagatgctggctatgagtttgacatctgcctcacctcagtgcag aagagagtgatccggaccctctggacagtgctagatgccattgatcagatgtggctgcca gtggtgaggacttggcgcctcaatgagcggcactatgggggtctaaccggtctcaataaa gcagaaactgctgcaaagcatggtgaggcccaggtgaagatctggaggcgctcctatgat gtcccaccacctccgatggagcccgaccatcctttctacagcaacatcagtaaggatcgc aggtatgcagacctcacagaagatcagctaccctcctatgagagtccgaaggatactatt gccagagctctgcccttctggaatgaagaaatagttccccagatcaaggaggggaaacgt gtactgattgcagcccatggcaacagcctccagggcattgccaagcatgtggagggtctc tctgaagaggctatcatggagctgaacctgccgactggtattcccatcgtctatgaattg gacaagaacttgaagcctatcaagcccatgcagtttctgggggatgaagagacggtgtgc aaagccatagaagctgtggctgcccagggcaaggccaagaagtga >gi568815575f:77799554_78005258|GENSCAN_predicted_peptide_6|592_aa MTGTEKGLVSGRRGGQECNEEIKMDPSMGVNSVTISVEGMTCNSCVWTIEQQIGKVNGVH HIKVSLEEKNATIIYDPKLQTPKTLQEAIDDMGFDAVIHNPDPLPVLTDTLFLTVTASLT LPWDHIQSTLLKTKGVTDIKIYPQKRTVAVTIIPSIVNANQIKELVPELSLDTGTLEKKS GACEDHSMAQAGEVVLKMKVEGMTCHSCTSTIEGKIGKLQGVQRIKVSLDNQEATIVYQP HLISVEEMKKQIEAMGFPAFVKKQPKYLKLGAIDVERLKNTPVKSSEGSQQRSPSYTNDS TATFIIDGMHCKSCVSNIESTLSALQYVSSIVVSLENRSAIVKYNASSVTPESLRKAIEA VSPGLYRVSITSEVESTSNSPSSSSLQKIPLNVVSQPLTQETVINIDGMTCNSCVQSIEG VISKKPGVKSIRVSLANSNGTVEYDPLLTSPETLRGAIEDMGFDATLSDTNEPLVVIAQP SSEMPLLTSTNEFYTKGMTPVQDKEEGKNSSKCYIQVTGMTCASCVANIERNLRREEGIY SILVALMAGKAEVRYNPAVIQPPMIAEFIRELGFGATVIENADEGDGVLELV >gi568815575f:77799554_78005258|GENSCAN_predicted_CDS_6|1776_bp atgactggcacagagaagggtttggtgagtggtagaagaggaggccaggaatgtaatgag gaaatcaaaatggatccaagtatgggtgtgaattctgttaccatttctgttgagggtatg acttgcaattcctgtgtttggaccattgagcagcagattggaaaagtgaatggtgtgcat cacattaaggtatcactggaagaaaaaaatgcaactattatttatgaccctaaactacag actccaaagaccctacaggaagctattgatgacatgggctttgatgctgttatccataat cctgaccctctccctgttttaactgacaccttgtttctgactgttacggcgtcactgact ttgccatgggaccatatccaaagcacattgctgaagaccaagggtgtgacagacattaaa atttaccctcagaaaagaactgtagcagtgacaataatcccttctatagtgaatgccaat cagataaaagagctggttccagaactcagtttagatactgggacactggagaaaaagtca ggagcttgtgaagatcatagtatggctcaagctggtgaagtcgtgctgaagatgaaagtg gaagggatgacctgccattcatgtactagcactattgaaggaaaaattgggaaactgcaa ggtgttcagcgaattaaagtctccctggacaatcaagaagctactattgtttatcaacct catcttatctcagtagaggaaatgaaaaagcagattgaagctatgggctttccagcattt gtcaaaaagcagcccaagtacctcaaattgggagctattgatgtagaacgtctaaagaac acaccagttaaatcctcagaagggtcacagcaaaggagtccatcatataccaatgattca acagccactttcatcattgatggcatgcattgtaaatcatgtgtgtcaaatattgaaagt actttatctgcactccaatatgtaagcagcatagtagtttctttagagaataggtctgcc attgtgaagtataatgcaagctcagtcactccagaatccctgagaaaagcaatagaggct gtatcaccggggctatatagagttagtatcacaagtgaagttgagagtacctcaaactct ccctccagctcatctcttcagaagattcctttgaatgtagttagccagcctctgacacaa gaaactgtgataaacattgatggcatgacttgtaattcctgtgtgcagtctattgagggt gtcatatcaaaaaagccaggtgtaaaatccatacgagtctcccttgcaaatagcaatggg actgttgagtatgatcctctactaacctctccagaaacgttgagaggagcaatagaagac atgggatttgatgctaccttgtcagacacgaatgagccgttggtagtaatagctcagcct tcatcggaaatgccgcttttgacttcaactaatgaattttatactaaagggatgacacca gttcaagacaaggaggaaggaaagaattcatctaagtgttacatacaggtcactggcatg acttgcgcttcctgtgtagcaaacattgaacggaatttaaggcgggaagaaggaatatat tctatacttgtggccctgatggctggcaaggcagaagtaaggtataatcctgctgttata caacccccaatgatagcagagttcatccgagaacttggatttggagccactgtgatagaa aatgctgatgaaggagatggtgttttggaacttgtt