GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:31:12 Sequence gi568815594r:152222884_152452730 : 229847 bp : 37.84% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4155 4388 234 2 0 58 -11 164 0.497 0.46 1.02 Intr + 5315 5429 115 1 1 90 98 33 0.758 3.60 1.03 Term + 7558 7682 125 2 2 102 42 120 0.967 6.37 1.04 PlyA + 7782 7787 6 1.05 2.00 Prom + 14497 14536 40 -1.65 2.01 Init + 21969 22163 195 2 0 69 108 172 0.301 16.55 2.02 Term + 24719 25129 411 1 0 33 38 199 0.110 3.66 2.03 PlyA + 25303 25308 6 1.05 3.00 Prom + 25912 25951 40 -5.05 3.01 Init + 29425 29602 178 0 1 71 75 72 0.383 3.67 3.02 Intr + 33345 33535 191 1 2 110 103 94 0.432 11.48 3.03 Term + 33752 33997 246 1 0 71 52 119 0.206 1.41 3.04 PlyA + 34272 34277 6 1.05 4.03 PlyA - 34359 34354 6 1.05 4.02 Term - 35456 35299 158 0 2 63 39 142 0.077 3.91 4.01 Init - 36293 36230 64 2 1 66 44 75 0.461 2.26 4.00 Prom - 42777 42738 40 -3.75 5.10 PlyA - 43576 43571 6 1.05 5.09 Term - 47990 47793 198 2 0 73 48 111 0.521 2.02 5.08 Intr - 48424 48242 183 1 0 63 72 165 0.931 11.46 5.07 Intr - 49687 49655 33 1 0 72 111 23 0.538 0.50 5.06 Intr - 53068 53029 40 0 1 80 103 26 0.286 0.61 5.05 Intr - 56584 56462 123 0 0 72 33 106 0.178 2.28 5.04 Intr - 56843 56787 57 1 0 100 77 65 0.207 3.68 5.03 Intr - 59614 59479 136 2 1 116 -30 115 0.490 1.21 5.02 Intr - 60704 60556 149 1 2 101 61 110 0.974 8.56 5.01 Init - 62454 62204 251 0 2 86 99 127 0.929 10.30 5.00 Prom - 65013 64974 40 -7.15 6.00 Prom + 70221 70260 40 -5.35 6.01 Init + 70939 70999 61 0 1 63 66 76 0.435 3.05 6.02 Intr + 76523 76705 183 0 0 65 52 112 0.231 4.14 6.03 Intr + 81989 82123 135 0 0 73 55 66 0.062 1.42 6.04 Intr + 84873 85016 144 1 0 47 49 180 0.074 9.33 6.05 Term + 95464 95627 164 2 2 110 36 78 0.175 1.92 6.06 PlyA + 96151 96156 6 1.05 7.19 PlyA - 96171 96166 6 1.05 7.18 Term - 100266 99998 269 1 2 91 49 248 0.980 15.87 7.17 Intr - 101511 101301 211 0 1 51 115 157 0.999 12.26 7.16 Intr - 103348 103123 226 0 1 114 110 90 0.684 10.86 7.15 Intr - 105506 105325 182 2 2 8 116 108 0.997 3.34 7.14 Intr - 106902 106789 114 0 0 102 106 51 0.990 8.02 7.13 Intr - 107985 107849 137 1 2 24 88 77 0.888 0.67 7.12 Intr - 109836 109713 124 0 1 50 86 132 0.879 8.44 7.11 Intr - 115053 114919 135 0 0 60 88 47 0.673 1.74 7.10 Intr - 119539 119471 69 1 0 44 25 124 0.208 0.16 7.09 Intr - 124188 124047 142 2 1 107 61 90 0.903 7.63 7.08 Intr - 127241 127159 83 2 2 85 85 20 0.931 -1.08 7.07 Intr - 159483 159306 178 2 1 13 114 106 0.502 4.70 7.06 Intr - 170864 170796 69 1 0 25 83 96 0.011 0.18 7.05 Intr - 178645 178569 77 1 2 98 45 47 0.053 -1.21 7.04 Intr - 188989 188420 570 1 0 102 87 783 0.855 71.63 7.03 Intr - 203855 203749 107 0 2 50 41 95 0.227 0.01 7.02 Intr - 205615 205557 59 0 2 46 99 53 0.287 -0.19 7.01 Init - 210569 210424 146 2 2 123 36 74 0.340 5.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 129742 129587 156 1 0 59 106 61 0.923 4.96 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594r:152222884_152452730|GENSCAN_predicted_peptide_1|157_aa ISSLDPEQGRELTGHQPDGGRPGNSCALSATPRAIPDPVSCCGSSWMWSYLLFPFAASKA GEMGGAATGAHSPSPLGLQTLKERDNDQCTANTIENKIKQPGAVTHACNPSILEGQANLV PVLPPDLLEQAAVSVGDCPQSVLVWPELGIQKGYGST >gi568815594r:152222884_152452730|GENSCAN_predicted_CDS_1|474_bp ataagcagtttggacccagaacagggaagggagttgaccggtcaccagccagatggtggc aggccagggaactcctgtgctctttctgctaccccacgggccatccctgatcccgtgtcc tgctgtggcagctcctggatgtggagttacctgctattccctttcgcagcatcaaaagca ggggagatgggaggagctgccactggcgcccacagcccttctcctcttggccttcaaaca ttaaaagaaagagacaatgatcaatgcacagcaaacaccattgaaaacaaaataaagcag ccaggtgcagtgactcatgcctgtaatcccagcattttggaaggccaagcaaacctggtt cctgtactgcctcctgatctcctggagcaggctgctgtgtcagtgggtgattgtccccag tcagtgttggtgtggccagaactgggaatccagaagggctatggctccacatag >gi568815594r:152222884_152452730|GENSCAN_predicted_peptide_2|201_aa MPCQLLAQQFQLQAALIQSDALAPDLMPAPGSVLVAADVPSPLSSSPLLYPLPLNEKHEE DCEGQQRQRLPSSGHRNKSFICQPGCSGPGTQPASLSVEITQPGSLSPARIPGARASLQR QHLISLGPQQTQSGIQLPPPERGGQCLGLTSSCRPSFWRPNSPFPSECGRASQGGHVTPH PPKVEFPLAVTPLNVSGVLPS >gi568815594r:152222884_152452730|GENSCAN_predicted_CDS_2|606_bp atgccctgtcagctccttgcccagcagttccagctgcaggctgccctgatccagtctgat gctctggcaccggatctgatgccagccccaggctcagtcctggttgcagctgacgttccc tctcctctttcttcttctcccctcctttaccctctccctctcaatgaaaaacatgaagag gattgtgaggggcaacaaagacagcgcctcccaagctcaggacacaggaacaagtcattc atctgccagccaggatgcagcggccccgggacccagccagcgagcttgtcagtagaaatt actcagcctggcagcctctctcccgcccgcattcccggagccagagcctctctgcagagg cagcacctgatctccctggggccgcagcagacgcagtctggaatccagctccctccgcca gagaggggtggccaatgccttggcctgacctccagctgccgccccagcttctggaggccc aattctccctttccgtctgaatgtggaagagcaagccaagggggccatgtcaccccacat cctccaaaagtggagttcccgttggcagtgacccccctgaatgtgtccggggtgctcccg agctaa >gi568815594r:152222884_152452730|GENSCAN_predicted_peptide_3|204_aa MIAEWYYESMFSFIRNCQTIFQRGCTILHSHQQGMRVPVAPYSHGPLVLSVFWILAILIV PAYDLSSSATSLAAASQSSLLVPPYFPIILPLECHKAQDVASLLCHHSLGGFQQSQGLLT HQHDKILGVVFVFFSLIYPISSPLAKTVGCSSKYIENLTTFHWLHCQHPDVRCHPLSSGG SRLPAGLLLPPLPAQKLRSTQQTQ >gi568815594r:152222884_152452730|GENSCAN_predicted_CDS_3|615_bp atgattgctgaatggtattatgagagcatgtttagttttataagaaactgccaaactatc ttccaaagaggctgtaccattttgcattcccatcagcaaggaatgagagttcctgttgct ccatattctcacggacctttggtgttgtcagtgttctggattttggccattctaatagtc cctgcttatgacctctcttcttctgcgacatcactggctgctgcttctcagtcttctttg ctggttcctccttacttccccatcatcctgccattggagtgccataaggctcaggacgtg gcttctcttctgtgtcaccactccctaggtggcttccagcagtctcaaggccttctcaca caccagcatgacaaaatccttggagtcgtctttgtctttttttctctcatataccctata tcctctccattagcaaagactgttggctgttcctccaaatatattgagaatcttaccact ttccactggcttcactgccagcaccccgatgtgagatgccatcccctgtcctccggagga agccgcctccctgcgggtctcctgcttccaccgcttcccgctcagaagctgcgctcaaca cagcagacacagtga >gi568815594r:152222884_152452730|GENSCAN_predicted_peptide_4|73_aa MSTQGALLEVLTKPSTQMKTNGHLSGKVERHGRDALKDALQLLSIVILWFYNEDQGEVGL WNLWTIKGFRQPF >gi568815594r:152222884_152452730|GENSCAN_predicted_CDS_4|222_bp atgtccactcaaggagccctgctggaggtgctcacaaaaccttcaacacagatgaaaacg aatggacacttaagtgggaaggtggaacgccacggacgagatgcactcaaagatgccctc cagctccttagcattgtgattctttggttctataatgaagatcaaggagaagtaggatta tggaatttgtggacaataaaaggatttaggcagcccttttag >gi568815594r:152222884_152452730|GENSCAN_predicted_peptide_5|389_aa MTHWRTRPGSQPPRVSVGLGGSPVLPGGAEITLIVQKLYDGDRLSLEVQQETEASELYGA PTVCQAPCNGLLWRPAPAIFTTLRHMDGPGSHYLQQTNAGTENQTRHLLTYKWELNNENT WTQGEEKHTLGPAGTSQERELEKLYPKGHNADRCVGDDCVNEESPPQRRTFPLLQYFSNL AAVGGDAPGDTLKVWVQWIIRSQCRARPYPPKRPTLLPRAYRMWGRRGLLRLTPTLELRG TRGDSGPWNLDSKGVRAQCRQDQFEMVDDNRRLRSGVAAEQEPGGESMAQEGEEGSPVAS EAKGPKAATETRSVDSGGQCREQWLWGLLIPFVLSLNGHPQLLLAEDHSPIVHGEPSASR SIHSGFYIDYLGHCGILNCTPCRRLLVSS >gi568815594r:152222884_152452730|GENSCAN_predicted_CDS_5|1170_bp atgactcactggagaaccaggccaggatcccaaccaccccgtgtctcagtgggcctgggc gggagccctgtgctgccaggaggagcagagataacattgattgtccagaagctctacgat ggagacaggttgtcattggaagtccaacaagagacagaggcaagtgagctttatggagca cctactgtgtgccaggccccatgcaacgggcttctgtggaggcctgcccctgccatcttc acaaccctcaggcacatggatggacctggaagccattatcttcagcaaactaatgcagga acagaaaaccaaacacggcatcttctcacttacaagtgggagctgaacaatgagaataca tggacacagggagaggaaaaacacacactggggcctgcaggaacatcacaagaaagggaa ttagagaagttgtaccccaaggggcacaatgctgatcgttgtgtaggagatgactgtgtc aatgaagagagtccacctcagagaaggacatttccccttctgcagtatttctccaacttg gctgcagttggaggcgatgccccaggggacactctcaaggtctgggtgcagtggataatc agaagccagtgcagggcccgcccataccctccgaagagacccaccctcctgcccagggcg tatcggatgtgggggagaagaggccttctcaggctcacccccacgctggagctaagggga accagaggagactctggtccttggaacttggactcgaaaggagtgagagctcagtgtaga caggaccagtttgaaatggttgatgacaacaggagactgaggtcaggagttgcagcagag caagaaccaggtggggaatctatggctcaggaaggagaagaggggagtccagtggcttct gaggcaaaggggcctaaggcagccactgaaaccaggagtgtggattcaggaggacaatgc agggagcaatggctctggggacttctgataccctttgtgcttagcttgaatggacacccc caactgctgcttgcagaggaccatagtcccattgttcatggagaaccatcagcatctaga agtattcacagtgggttctatattgactaccttggacattgtgggatcctgaattgtact ccatgtaggagactgctggtgtcttcctaa >gi568815594r:152222884_152452730|GENSCAN_predicted_peptide_6|228_aa MAPQLASCVHAQLQLCLIGAVTCIEDSKLLHVGDCESLTQSDLLSEGCEFESKKKRNIRK KPKQRSTSSSGIDGGRLNPGCDQGAPPDPPWNMESCRPSLAAGLLPPSLGERPLGASPPI LRAGELVFEMKDHSDQHRRVNVQILVQQVWNRTQEPAFPSSSLVRVLLPQDTLSMCNVYL REKCTENQGSAGSGEAALDPHFTLGKRKAKRSFVASNQFREMEEFTTD >gi568815594r:152222884_152452730|GENSCAN_predicted_CDS_6|687_bp atggcccctcagctggctagctgtgttcatgcacagcttcagctctgcctgatcggagca gtgacctgtattgaggactccaagttgctccatgttggagattgtgaatcactgacccaa tcagacctactctcggagggctgtgaatttgagagtaagaaaaagagaaacattaggaaa aaaccgaaacagagaagcacaagcagtagtggcatagacggaggaaggctgaatcctggg tgtgaccagggggcaccaccagatcccccttggaacatggagtcatgtagaccgagcctg gcagcgggactgctccctccttctttgggagagaggcccctgggagcatccccacctatc ctccgggcaggggaattggtatttgagatgaaggaccattcagatcagcaccggagggtg aatgtacagattctggttcagcaggtctggaacaggacccaagagcctgcatttccaagc agctctctggtgagggtgctgctgccgcaggacactctgagtatgtgcaacgtgtatctc agggagaagtgtacagaaaaccagggctcagctgggtctggggaagctgcccttgatccc catttcactttagggaagagaaaagcaaagaggtcttttgtagcctctaatcaattcagg gaaatggaagagtttacaacagactag >gi568815594r:152222884_152452730|GENSCAN_predicted_peptide_7|965_aa MEMQIKTTMPYYFPSTRIAKIKRSQSPLNVGENRKQLKLSYMAGGSVKCPRTVEIHSIVV KDKPFLAPVLAEGPLHLSVIRVGKRPTRFHLLGGRLFGGIPLLQNVKTFASSDSLAKVQE VASWLLEMNQELLSVGSKRRRTGGSLRGNPSSSQVDEEQMNRVVEEEQQQQLRQQEEEHT ARNGEVVGVEPRPGGQNDSQQGQLEENNNRFISVDEDSSGNQEEQEEDEEHAGEQDEEDE EEEEMDQESDDFDQSDDSSREDEHTHTNSVTNSSSIVDLPVHQLSSPFYTKTTKPLIATD PFTVSIVLLFPECHIVGIIHFWLLKSDANEEWKYEEGTGEFEEEYSSTKRSFKMSKPGKP TLNHGLVPVDLKSAKEPLPHQTVMKIFSISIIAQGLPFCRRRMKRKLDHGSEVRSFSLGK KPCKVSEYTSTTGLVPCSATPTTFGDLRAANGQGQQRRRITSVQPPTGLQEWLKMFQNSV ILAVLDRHLGTTGLDQASHCSWSGPEKLLALDELIDSCEPTQVKHMMQVIEPQFQRDFIS LLPKELALYVLSFLEPKDLLQAAQTCRYWRILAEDNLLWREKCKEEGIDEPLHIKRRKVI KPGFIHSPWKSAYIRQHRIDTNWRRGELKSPKVLKGHDDHVITCLQFCGNRIVSGSDDNT LKVWSAVTGKCLRTLVGHTGGVWSSQMRDNIIISGSTDRTLKVWNAETGECIHTLYGHTS TVRCMHLHEKRVVSGSRDATLRVWDIETGQCLHVLMGHVAAVRCVQYDGRRVVSGAYDFM VKVWDPETETCLHTLQGHTNRVYSLQFDGIHVVSGSLDTSIRVWDVETGNCIHTLTGHQS LTSGMELKDNILVSGNADSTVKIWDIKTGQCLQTLQGPNKHQSAVTCLQFNKNFVITSSD DGTVKLWDLKTGEFIRNLVTLESGGSGGVVWRIRASNTKLVCAVGSRNGTEETKLLVLDF DVDMK >gi568815594r:152222884_152452730|GENSCAN_predicted_CDS_7|2898_bp atggagatgcagattaaaaccacaatgccatactactttccatccacaagaatagctaaa attaaaagaagtcagtcaccactgaatgttggtgaaaataggaagcagctgaaactctct tacatggctggtggaagtgtaaaatgtcccaggacagtggaaatacacagtattgtggtc aaagataaaccattcttggctccagtcctggctgagggcccactacacttgtctgttatc cgagtgggcaaacgacctacccgttttcatctgctgggcggccggttatttggggggatc cccctgttacagaatgtgaaaacctttgcatcttctgatagtctagccaaggtccaagaa gtagcaagctggcttttggaaatgaatcaggaactgctctctgtgggcagcaaaagacga cgaactggaggctctctgagaggtaacccttcctcaagccaggtagatgaagaacagatg aatcgtgtggtagaggaggaacagcaacagcaactcagacaacaagaggaggagcacact gcaaggaatggtgaagttgttggagtagaacctagacctggaggccaaaatgattcccag caaggacagttggaagaaaacaataatagatttatttcggtagatgaggactcctcagga aaccaagaagaacaagaggaagatgaagaacatgctggtgaacaagatgaggaggatgag gaggaggaggagatggaccaggagagtgacgattttgatcagtctgatgatagtagcaga gaagatgaacatacacatactaacagtgtcacgaactccagtagtattgtggacctgccc gttcaccaactctcctccccattctatacaaaaacaacaaaacctctcatcgccactgat cctttcaccgtctccatagttttgctttttccagaatgtcatatagttggaattatacat ttttggttattgaaatcagatgcaaatgaggaatggaagtatgaagaaggtactggtgaa tttgaggaagagtattcgtcaaccaagaggagttttaaaatgtcaaaaccgggaaaacct actctaaaccatggcttggttcctgttgatcttaaaagtgcaaaagagcctctaccacat caaactgtgatgaagatatttagcattagcatcattgcccaaggcctccctttttgtcga agacggatgaaaagaaagttggaccatggttctgaggtccgctctttttctttgggaaag aaaccatgcaaagtctcagaatatacaagtaccactgggcttgtaccatgttcagcaaca ccaacaacttttggggacctcagagcagccaatggccaagggcaacaacgacgccgaatt acatctgtccagccacctacaggcctccaggaatggctaaaaatgtttcagaactcagta attcttgctgtccttgatcggcatttgggaaccactggcctggaccaagcctctcactgt agctggagtggaccagagaaattgcttgctttagatgaactcattgatagttgtgaacca acacaagtaaaacatatgatgcaagtgatagaaccccagtttcaacgagacttcatttca ttgctccctaaagagttggcactctatgtgctttcattcctggaacccaaagacctgcta caagcagctcagacatgtcgctactggagaattttggctgaagacaaccttctctggaga gagaaatgcaaagaagaggggattgatgaaccattgcacatcaagagaagaaaagtaata aaaccaggtttcatacacagtccatggaaaagtgcatacatcagacagcacagaattgat actaactggaggcgaggagaactcaaatctcctaaggtgctgaaaggacatgatgatcat gtgatcacatgcttacagttttgtggtaaccgaatagttagtggttctgatgacaacact ttaaaagtttggtcagcagtcacaggcaaatgtctgagaacattagtgggacatacaggt ggagtatggtcatcacaaatgagagacaacatcatcattagtggatctacagatcggaca ctcaaagtgtggaatgcagagactggagaatgtatacacaccttatatgggcatacttcc actgtgcgttgtatgcatcttcatgaaaaaagagttgttagcggttctcgagatgccact cttagggtttgggatattgagacaggccagtgtttacatgttttgatgggtcatgttgca gcagtccgctgtgttcaatatgatggcaggagggttgttagtggagcatatgattttatg gtaaaggtgtgggatccagagactgaaacctgtctacacacgttgcaggggcatactaat agagtctattcattacagtttgatggtatccatgtggtgagtggatctcttgatacatca atccgtgtttgggatgtggagacagggaattgcattcacacgttaacagggcaccagtcg ttaacaagtggaatggaactcaaagacaatattcttgtctctgggaatgcagattctaca gttaaaatctgggatatcaaaacaggacagtgtttacaaacattgcaaggtcccaacaag catcagagtgctgtgacctgtttacagttcaacaagaactttgtaattaccagctcagat gatggaactgtaaaactatgggacttgaaaacgggtgaatttattcgaaacctagtcaca ttggagagtggggggagtgggggagttgtgtggcggatcagagcctcaaacacaaagctg gtgtgtgcagttgggagtcggaatgggactgaagaaaccaagctgctggtgctggacttt gatgtggacatgaagtga