GENSCAN 1.0 Date run: 8-Nov-116 Time: 10:15:25 Sequence gi568815575r:77770808_77995410 : 224603 bp : 40.39% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 14800 14889 90 0 0 119 41 126 0.805 7.74 1.02 PlyA + 16079 16084 6 1.05 2.03 PlyA - 16343 16338 6 1.05 2.02 Term - 19265 19131 135 2 0 104 37 80 0.148 1.54 2.01 Init - 52242 52093 150 0 0 89 103 79 0.406 9.59 2.00 Prom - 53321 53282 40 -4.85 3.02 PlyA - 55225 55220 6 1.05 3.01 Sngl - 74252 73581 672 2 0 49 48 273 0.705 15.32 3.00 Prom - 74345 74306 40 -6.15 4.14 PlyA - 74510 74505 6 1.05 4.13 Term - 75584 75099 486 2 0 -56 32 479 0.507 21.61 4.12 Intr - 84783 84694 90 0 0 94 79 77 0.989 6.67 4.11 Intr - 86066 85926 141 2 0 79 97 106 0.999 10.23 4.10 Intr - 86690 86550 141 2 0 72 101 115 0.731 10.83 4.09 Intr - 100118 100001 118 1 1 56 95 153 0.968 12.35 4.08 Intr - 104790 104636 155 0 2 99 26 101 0.544 2.95 4.07 Intr - 124649 124358 292 1 1 67 32 160 0.034 4.51 4.06 Intr - 125031 124897 135 2 0 55 -6 164 0.178 2.46 4.05 Intr - 128779 128675 105 0 0 -5 55 163 0.319 2.11 4.04 Intr - 137575 137507 69 0 0 29 97 84 0.304 0.78 4.03 Intr - 139956 139662 295 1 1 6 63 190 0.279 3.44 4.02 Intr - 147795 147633 163 0 1 76 61 80 0.101 2.73 4.01 Init - 150467 150363 105 2 0 79 115 34 0.547 5.47 4.00 Prom - 152560 152521 40 -4.95 5.00 Prom + 153691 153730 40 -5.35 5.01 Init + 160402 160522 121 0 1 56 74 71 0.053 2.90 5.02 Term + 191816 192039 224 0 2 132 49 251 0.342 21.70 5.03 PlyA + 194062 194067 6 1.05 6.02 PlyA - 194387 194382 6 1.05 6.01 Sngl - 198831 198067 765 0 0 111 49 1053 0.993 99.54 6.00 Prom - 199079 199040 40 -8.45 7.00 Prom + 199997 200036 40 -12.23 7.01 Init + 200338 200385 48 0 0 71 97 21 0.736 2.30 7.02 Intr + 200814 200954 141 2 0 80 115 129 0.946 14.43 7.03 Intr + 217435 217924 490 0 1 76 90 452 0.737 35.85 7.04 Intr + 218426 219151 726 0 0 70 91 492 0.816 38.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:77770808_77995410|GENSCAN_predicted_peptide_1|29_aa FLGPKISKNDRDTKDRLRLRGLNPTGTKE >gi568815575r:77770808_77995410|GENSCAN_predicted_CDS_1|90_bp tttctaggcccaaagattagcaaaaatgaccgggataccaaagaccgcctccgacttcga ggtctcaatcccacaggcaccaaagaataa >gi568815575r:77770808_77995410|GENSCAN_predicted_peptide_2|94_aa MERNQRKKAENSKNQNTSSLPKGHNSSPAREQNWTENEFDELTEVGFRRQVLYLLFPSKT NKHIHKLRIYGQESEKGLGCLLWQFEEKCEDVYQ >gi568815575r:77770808_77995410|GENSCAN_predicted_CDS_2|285_bp atggagagaaaccagcgcaaaaaggctgaaaactccaaaaaccagaacacctcttctctt ccaaagggtcacaactcctcaccagcaagggaacaaaactggacggagaatgagtttgat gaattgacagaagtaggcttcagaaggcaggtactgtatctcttattcccttccaaaaca aataagcacattcacaagttacgcatttatggtcaagagtcagagaaaggtctaggttgc cttctgtggcaatttgaagaaaaatgtgaagatgtgtaccagtaa >gi568815575r:77770808_77995410|GENSCAN_predicted_peptide_3|223_aa MGDFNTPLSTLDRSTRQKVNKDIQELNSALHQADLIDIYRTLHPKSTEYTFFAAPHNTYS KIDHIVGSKALLSKSKRTEIITNCLSDHGAIKLELRIKKLTQNHSTTWKLNSLLLNDYWV HNKMKAEIKMFFETENKDTTYQNLWDTFKPVCRGKFIALNAHKRKQERSKIDTLTSQLKE IEKQEQTHSKASRRQEITKIRSELKEIDTKNPSKKSVNPGAGF >gi568815575r:77770808_77995410|GENSCAN_predicted_CDS_3|672_bp atgggagactttaacaccccactgtcaacattagacagatccacgagacagaaggttaac aaggatatccaggaattgaactcagctctgcaccaagcggacctaatagacatctacaga actctccaccccaaatcaacagaatatacattctttgcagcaccacacaacacctattcc aaaattgaccacatagttggaagtaaagcactcctcagcaaaagtaaaagaacagaaatt ataacaaactgtctctcagaccacggtgcaatcaaactagaactcaggattaagaaactc actcagaaccactcaactacatggaaactgaacagcctgctcctgaatgactactgggta cataacaaaatgaaggcagaaataaagatgttctttgaaaccgagaacaaagacacaaca taccagaatctctgggacacatttaaaccagtgtgtagagggaaatttatagcactaaat gcccacaagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaagaa atagagaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaagatc agatcagaactgaaggaaatagacacaaaaaacccttcaaaaaaatcagtgaatcctgga gctggtttttga >gi568815575r:77770808_77995410|GENSCAN_predicted_peptide_4|764_aa MNKVTELSNSMPYLRTTTGMAEAYGMNGNNKEIVKLRFPLGKEEQHSQQLLLRNYTVSIC PDLMTLRLKEAPPYNKVGHHHNKRLGGDKGSNKAHTIGEVTVAAAAAAAATTPPSASLFP PPAAAASRLYPSLPRLPPLPGDAPAGSRAATGKRATGHVKRNAWSRASFSCPLLPPGRHA ASSGSLLGDIKPGGWELNICTSTTTTVPVSRRFSALLTKGNIVKVKLELRQYSCCFGANS LQVPLKILHEGGEFQGLLNESLKDSVLATSSHFGLRKLRNISQLRMRAWKAPSRSKVSLI EGRGANMAARWRFWCVSVTMVVALLIVCDVPSASAQRKKEVRTRFPAAWAFPNDWGLRGS VRLFPAPFPCRFYAFGGFNYAVFGVVEFMVLSEKVSQLMEWTNKRPVIRMNGDKFRRLVK APPRNYSVIVMFTALQLHRQQADEEFQILANSWRYSSAFTNRIFFAMVDFDEGSDVFQML NMNSAPTFINFPAKGKPKRGDTYELQVRGFSAEQIARWIADRTDVNIRVIRPPNYAGPLM LGLLLAVIGGLVYLRRSNMEFLFNKTGWAFAALCFVLAMTSGQMWNHIRGPPYAHKNPHT GHVLKEEVRTHGKEVKNLEKRLDEWLTRITNAKKSLKDLMELKTTARELRDECRSLSSRF DQLEERVSVMEDQMNEMKQEEKFREKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDKENG TKLENTAGYYPGELPQSSKAGEHSNSGDTENATKIFLEKSNSKT >gi568815575r:77770808_77995410|GENSCAN_predicted_CDS_4|2295_bp atgaataaagttacagagttgtcaaacagcatgccatatttaaggaccaccactggtatg gctgaggcatatggcatgaatggtaacaacaaggagattgtaaagctaagattcccattg ggaaaggaagaacagcacagccagcaactacttctcaggaactatactgtcagtatctgc cctgatctcatgactctgagattaaaagaagctcctccttataacaaggttggacatcac cataataaaagattgggaggagacaaaggctccaacaaagctcacacaatcggagaagtc acagtagctgcggctgcggcggcggcagcaacaactcctccctctgcttctcttttccca ccccccgccgcggcggccagccgcctttacccctccctccctcgactacccccacttccg ggtgatgcccctgccggaagcagggccgccacgggaaagagagcgactggtcacgtgaaa aggaacgcgtggtctagagccagcttctcctgccccctgctgcccccgggccgacacgca gcctcgagtgggtctttgttaggtgacattaagccaggaggatgggaactcaacatctgt acctctacaactaccacagtacccgtgtctagacgatttagtgcgcttttgaccaaggga aacatcgtgaaggtgaagctagaactgcggcaatacagctgctgctttggtgccaattcg cttcaggtacccttgaaaatattacatgaaggcggcgaatttcaaggacttctaaatgag tctttgaaagattctgttctagcaacttcaagccattttggactccgaaaactccgcaat atttcacaactgcgcatgcgtgcttggaaagcacctagccggagcaaagtttcacttata gaagggagaggagcgaacatggcagcgcgttggcggttttggtgtgtctctgtgaccatg gtggtggcgctgctcatcgtttgcgacgttccctcagcctctgcccaaagaaagaaggag gtgagaacgcggtttccagcagcatgggcttttcccaatgactggggcttaagagggtct gttcgcctcttcccagccccctttccctgccgcttctatgcctttggtggcttcaattac gcggttttcggagttgtggaattcatggtgttatctgaaaaggttagtcagctgatggaa tggactaacaaaagacctgtaataagaatgaatggagacaagttccgtcgccttgtgaaa gccccaccgagaaattactccgttatcgtcatgttcactgctctccaactgcatagacag caagctgatgaagaattccagatcctggcaaactcctggcgatactccagtgcattcacc aacaggatattttttgccatggtggattttgatgaaggctctgatgtatttcagatgcta aacatgaattcagctccaactttcatcaactttcctgcaaaagggaaacccaaacggggt gatacatatgagttacaggtgcggggtttttcagctgagcagattgcccggtggatcgcc gacagaactgatgtcaatattagagtgattagacccccaaattatgctggtccccttatg ttgggattgcttttggctgttattggtggacttgtgtatcttcgaagaagtaatatggaa tttctctttaataaaactggatgggcttttgcagctttgtgttttgtgcttgctatgaca tctggtcaaatgtggaaccatataagaggaccaccatatgcccataagaatccccacacg ggacatgtgctaaaggaggaagttcgaacccatggcaaagaagttaaaaaccttgaaaaa aggttagacgaatggctaactagaataaccaatgcaaagaagtccttaaaggacctgatg gagctgaaaaccacggcacgagagctacgtgatgaatgcagaagcctcagtagccgattc gatcagctggaagaaagggtatcagtgatggaagatcaaatgaatgaaatgaagcaagaa gagaagtttagagaaaaaagaataaaaagaaacgaacaaagcctccaagaaatatgggac tatgtgaaaagaccaaatctacgtctgattggtgtacctgaaagtgacaaagagaatgga accaagttggaaaacactgcaggatattatccaggagaacttccccaatctagcaaggca ggcgaacattcaaattcaggagatacagagaacgccacaaagatattcctcgagaagagc aactccaagacataa >gi568815575r:77770808_77995410|GENSCAN_predicted_peptide_5|114_aa MIVNEHAAFKHLFNKAHLAPPLIHSTLSGHSTCFREHRVGDPSSASSLGIAVSLGRPVLS RSSSGTVDLLEEVGLQIRDTAFSSTKLLEAISTVSAQVEELAVKCTENARFLKT >gi568815575r:77770808_77995410|GENSCAN_predicted_CDS_5|345_bp atgattgttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg cccttaatccattcaactctgagtggacacagcacatgtttcagagagcacagggttggg gacccttcttcagcatcttccctgggcattgctgtgagtttaggccggcccgttttgagc aggagcagcagcggaacagtagacctgctggaggaagtggggctgcagatcagagacaca gcattttcgtcaaccaaacttcttgaagccatatctacagtatcagctcaagtggaagag cttgccgtcaaatgtacggaaaatgcacgtttccttaaaacatga >gi568815575r:77770808_77995410|GENSCAN_predicted_peptide_6|254_aa MAAYKLVLIRHGESTWNLENRFSCWYDADLSPAGHEEAKRGGQALRDAGYEFDICLTSVQ KRVIRTLWTVLDAIDQMWLPVVRTWRLNERHYGGLTGLNKAETAAKHGEAQVKIWRRSYD VPPPPMEPDHPFYSNISKDRRYADLTEDQLPSYESPKDTIARALPFWNEEIVPQIKEGKR VLIAAHGNSLQGIAKHVEGLSEEAIMELNLPTGIPIVYELDKNLKPIKPMQFLGDEETVC KAIEAVAAQGKAKK >gi568815575r:77770808_77995410|GENSCAN_predicted_CDS_6|765_bp atggccgcctacaaactggtgctgatccggcacggcgagagcacatggaacctggagaac cgcttcagctgctggtacgacgccgatctgagcccggcgggccacgaggaggcgaagcgc ggcgggcaggcgctacgagatgctggctatgagtttgacatctgcctcacctcagtgcag aagagagtgatccggaccctctggacagtgctagatgccattgatcagatgtggctgcca gtggtgaggacttggcgcctcaatgagcggcactatgggggtctaaccggtctcaataaa gcagaaactgctgcaaagcatggtgaggcccaggtgaagatctggaggcgctcctatgat gtcccaccacctccgatggagcccgaccatcctttctacagcaacatcagtaaggatcgc aggtatgcagacctcacagaagatcagctaccctcctatgagagtccgaaggatactatt gccagagctctgcccttctggaatgaagaaatagttccccagatcaaggaggggaaacgt gtactgattgcagcccatggcaacagcctccagggcattgccaagcatgtggagggtctc tctgaagaggctatcatggagctgaacctgccgactggtattcccatcgtctatgaattg gacaagaacttgaagcctatcaagcccatgcagtttctgggggatgaagagacggtgtgc aaagccatagaagctgtggctgcccagggcaaggccaagaagtga >gi568815575r:77770808_77995410|GENSCAN_predicted_peptide_7|469_aa MTGTEKGLVSGRRGGQECNEEIKMDPSMGVNSVTISVEGMTCNSCVWTIEQQIGKVNGVH HIKVSLEEKNATIIYDPKLQTPKTLQEAIDDMGFDAVIHNPDPLPVLTDTLFLTVTASLT LPWDHIQSTLLKTKGVTDIKIYPQKRTVAVTIIPSIVNANQIKELVPELSLDTGTLEKKS GACEDHSMAQAGEVVLKMKVEGMTCHSCTSTIEGKIGKLQGVQRIKVSLDNQEATIVYQP HLISVEEMKKQIEAMGFPAFVKKQPKYLKLGAIDVERLKNTPVKSSEGSQQRSPSYTNDS TATFIIDGMHCKSCVSNIESTLSALQYVSSIVVSLENRSAIVKYNASSVTPESLRKAIEA VSPGLYRVSITSEVESTSNSPSSSSLQKIPLNVVSQPLTQETVINIDGMTCNSCVQSIEG VISKKPGVKSIRVSLANSNGTVEYDPLLTSPETLRGAIEDMGFDATLSX >gi568815575r:77770808_77995410|GENSCAN_predicted_CDS_7|1407_bp atgactggcacagagaagggtttggtgagtggtagaagaggaggccaggaatgtaatgag gaaatcaaaatggatccaagtatgggtgtgaattctgttaccatttctgttgagggtatg acttgcaattcctgtgtttggaccattgagcagcagattggaaaagtgaatggtgtgcat cacattaaggtatcactggaagaaaaaaatgcaactattatttatgaccctaaactacag actccaaagaccctacaggaagctattgatgacatgggctttgatgctgttatccataat cctgaccctctccctgttttaactgacaccttgtttctgactgttacggcgtcactgact ttgccatgggaccatatccaaagcacattgctgaagaccaagggtgtgacagacattaaa atttaccctcagaaaagaactgtagcagtgacaataatcccttctatagtgaatgccaat cagataaaagagctggttccagaactcagtttagatactgggacactggagaaaaagtca ggagcttgtgaagatcatagtatggctcaagctggtgaagtcgtgctgaagatgaaagtg gaagggatgacctgccattcatgtactagcactattgaaggaaaaattgggaaactgcaa ggtgttcagcgaattaaagtctccctggacaatcaagaagctactattgtttatcaacct catcttatctcagtagaggaaatgaaaaagcagattgaagctatgggctttccagcattt gtcaaaaagcagcccaagtacctcaaattgggagctattgatgtagaacgtctaaagaac acaccagttaaatcctcagaagggtcacagcaaaggagtccatcatataccaatgattca acagccactttcatcattgatggcatgcattgtaaatcatgtgtgtcaaatattgaaagt actttatctgcactccaatatgtaagcagcatagtagtttctttagagaataggtctgcc attgtgaagtataatgcaagctcagtcactccagaatccctgagaaaagcaatagaggct gtatcaccggggctatatagagttagtatcacaagtgaagttgagagtacctcaaactct ccctccagctcatctcttcagaagattcctttgaatgtagttagccagcctctgacacaa gaaactgtgataaacattgatggcatgacttgtaattcctgtgtgcagtctattgagggt gtcatatcaaaaaagccaggtgtaaaatccatacgagtctcccttgcaaatagcaatggg actgttgagtatgatcctctactaacctctccagaaacgttgagaggagcaatagaagac atgggatttgatgctaccttgtcagnn