GENSCAN 1.0 Date run: 8-Nov-116 Time: 14:28:58 Sequence gi568815575f:2652275_2911452 : 259178 bp : 45.66% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 198 237 40 -2.66 1.01 Init + 2563 2611 49 0 1 83 32 67 0.398 1.71 1.02 Term + 9366 9391 26 1 2 128 52 24 0.465 1.19 1.03 PlyA + 10779 10784 6 1.05 2.04 PlyA - 11885 11880 6 1.05 2.03 Term - 20884 20823 62 2 2 89 51 38 0.503 -1.83 2.02 Intr - 24784 24698 87 2 0 106 69 48 0.462 4.64 2.01 Init - 34074 33984 91 0 1 88 80 27 0.254 0.55 2.00 Prom - 35014 34975 40 -4.16 3.00 Prom + 35649 35688 40 -2.86 3.01 Init + 39087 39153 67 2 1 110 88 248 0.995 26.23 3.02 Intr + 62148 62180 33 1 0 114 105 15 0.831 3.99 3.03 Intr + 67387 67431 45 2 0 108 101 57 0.986 7.38 3.04 Intr + 68082 68150 69 1 0 57 111 61 0.947 4.45 3.05 Intr + 70353 70400 48 1 0 116 87 37 0.915 5.05 3.06 Intr + 73986 74099 114 1 0 104 115 95 0.963 14.22 3.07 Intr + 85926 85982 57 1 0 107 99 86 0.851 10.46 3.08 Intr + 88409 88452 44 0 2 80 94 -4 0.825 -2.64 3.09 Intr + 88861 88943 83 0 2 81 121 3 0.839 1.34 3.10 Term + 89350 89461 112 1 1 61 47 135 0.444 4.73 3.11 PlyA + 89606 89611 6 1.05 4.00 Prom + 90816 90855 40 -1.56 4.01 Init + 100001 100061 61 1 1 91 94 44 0.897 4.71 4.02 Intr + 101632 101748 117 2 0 80 110 19 0.826 3.74 4.03 Term + 102353 102465 113 0 2 38 47 80 0.381 -2.38 4.04 PlyA + 102950 102955 6 1.05 5.00 Prom + 108538 108577 40 -3.76 5.01 Init + 110152 110215 64 0 1 52 105 100 0.931 9.41 5.02 Intr + 115278 115398 121 1 1 73 58 50 0.863 0.05 5.03 Intr + 116084 116217 134 2 2 61 80 83 0.898 5.09 5.04 Intr + 117467 117499 33 0 0 91 91 15 0.551 0.29 5.05 Intr + 118276 118317 42 2 0 99 119 69 0.998 9.31 5.06 Intr + 122442 122465 24 1 0 99 109 25 0.910 3.70 5.07 Intr + 129792 129854 63 1 0 108 63 88 0.153 6.99 5.08 Intr + 142261 142329 69 2 0 106 96 87 0.791 10.45 5.09 Intr + 145036 145112 77 2 2 109 64 62 0.705 5.13 5.10 Intr + 146988 147105 118 2 1 59 62 110 0.655 5.54 5.11 Intr + 154427 154498 72 0 0 108 45 32 0.280 0.28 5.12 Intr + 155911 155946 36 2 0 126 88 5 0.587 2.53 5.13 Intr + 159062 159178 117 0 0 104 115 44 0.969 9.14 5.14 Term + 160236 160360 125 1 2 105 39 30 0.552 -1.65 5.15 PlyA + 161715 161720 6 1.05 6.05 PlyA - 161767 161762 6 1.05 6.04 Term - 167636 167521 116 0 2 50 39 110 0.906 1.03 6.03 Intr - 171760 171736 25 1 1 93 115 26 0.870 3.40 6.02 Intr - 177569 177309 261 2 0 108 49 185 0.947 14.28 6.01 Init - 177916 177860 57 1 0 96 23 52 0.380 -1.02 6.00 Prom - 179646 179607 40 -7.96 7.00 Prom + 186775 186814 40 -1.96 7.01 Init + 190875 191080 206 2 2 79 110 209 0.853 18.54 7.02 Intr + 201706 201880 175 1 1 70 105 241 0.960 23.94 7.03 Intr + 202719 202881 163 2 1 73 127 146 0.999 16.55 7.04 Intr + 204224 204350 127 0 1 63 92 82 0.983 5.84 7.05 Intr + 207569 207791 223 2 1 76 98 138 0.918 11.73 7.06 Intr + 209248 209448 201 0 0 72 95 73 0.839 5.88 7.07 Term + 228778 228939 162 0 0 92 36 324 0.829 25.54 7.08 PlyA + 229986 229991 6 1.05 8.05 PlyA - 230031 230026 6 1.05 8.04 Term - 255358 254997 362 2 2 122 42 282 0.989 21.70 8.03 Intr - 256568 256447 122 1 2 60 79 76 0.994 4.04 8.02 Intr - 257705 257543 163 0 1 91 98 121 0.997 12.43 8.01 Intr - 258519 258385 135 1 0 64 113 80 0.992 8.64 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 129792 129874 83 1 2 108 47 115 0.824 7.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:2652275_2911452|GENSCAN_predicted_peptide_1|24_aa MLLELHKAVPLIGSTPGAVFGKEQ >gi568815575f:2652275_2911452|GENSCAN_predicted_CDS_1|75_bp atgcttctggagctccacaaagctgtccctttaattggcagcacgcctggtgcggtgttt gggaaggagcagtga >gi568815575f:2652275_2911452|GENSCAN_predicted_peptide_2|79_aa MEFHHVGQAGLELLTSTDLPASASRSAGITGSDQDTCTFCLQVLVTGKSMNRILIDVCAG LQGVTVFGDMVFKKVINIK >gi568815575f:2652275_2911452|GENSCAN_predicted_CDS_2|240_bp atggagtttcaccacgttggccaggctggtctcgaactcctgacctcaactgatctgccg gcctcggcctcccgaagtgctggtattacagggtcagatcaggacacctgcacattctgt ttgcaggttctcgtcaccggcaaatcaatgaaccgaattttaattgatgtgtgtgcagga ctgcagggtgtcactgtgtttggagatatggtctttaaaaaggtgattaatataaaatga >gi568815575f:2652275_2911452|GENSCAN_predicted_peptide_3|223_aa MARGAALALLLFGLLGVLVAAPDGGFDLSDALPGDDFDLGDAVVDGENDDPRPPNPPKPM PNPNPNHPSSSGSFSDADLADGVSGGEADAPGVIPGIVGAVVVAVAGAISSFIAYQKKKL CFKENAEQGEVDMESHRNANAEPAEIKPLAPESCRNCVHNLGCLQITCPSSTSETPGSPD FFLIKIRYSSVSRYSSKSRYSLHPDTVLPKMYKMYRIFPTLKT >gi568815575f:2652275_2911452|GENSCAN_predicted_CDS_3|672_bp atggcccgcggggctgcgctggcgctgctgctcttcggcctgctgggtgttctggtcgcc gccccggatggtggtttcgatttatccgatgcccttcctggggatgactttgacttagga gatgctgttgttgatggagaaaatgacgacccacgaccaccgaacccacccaaaccgatg ccaaatccaaaccccaaccaccctagttcctccggtagcttttcagatgctgaccttgcg gatggcgtttcaggtggagaagccgacgccccaggcgtgatccccgggattgtgggggct gtcgtggtcgccgtggctggagccatctctagcttcattgcttaccagaaaaagaagcta tgcttcaaagaaaatgcagaacaaggggaggtggacatggagagccaccggaatgccaac gcagagccagctgaaataaaaccactggctcctgaaagttgtaggaactgtgtccacaat cttggctgtttacaaatcacgtgtccatcgagcacgtctgaaacccctggtagccccgac ttctttttaattaaaataagatactcctctgtatccagatactcctctaaatccaggtac tccctacatccagatactgtacttcctaagatgtacaagatgtaccgcattttcccaaca ctgaagacttga >gi568815575f:2652275_2911452|GENSCAN_predicted_peptide_4|96_aa MESWWGLPCLAFLCFLMHARVQCSINYVRYATPHYKVDCVLDDLSIRWLMKSVLSTLKAG RLQAEEQGEPVQVPKLKNLESDVPASTMGERCRPED >gi568815575f:2652275_2911452|GENSCAN_predicted_CDS_4|291_bp atggagagctggtggggacttccctgtcttgcgttcctgtgttttctaatgcacgcccga gtacaatgttcaataaattatgtgaggtatgcaacacctcattataaagttgactgtgtg ctagatgatttgtccatccgttggctgatgaaaagtgttctgagcactctgaaggcaggc cgtctgcaggctgaggagcaaggagagccagtccaggttccaaaactgaagaacttggag tctgatgttccagcatccaccatgggagaaagatgtaggccagaagactag >gi568815575f:2652275_2911452|GENSCAN_predicted_peptide_5|364_aa MGSEEQIMHQGGLLSSGVCSPVENKPRESNGVMCNPTLETETPELALSTWDISQPRGLTP DPCLDWNCWQPWRGDVAQIQEADTVPVPQPSLSSLDHDDQERWRRPDTFIRLTAFGSGQR DFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPDSGGSYFNDVDRDDGRYPPRPRPR PPAGGGGGGYSSYGNSDNTHGTPTSGHAMALVSGVPFFVSMLSIEKNDETSVKDLFAKVK VKGAPQETGRGGYRLNSRYGNTYGKGIMSYRICGDHHSTYGNPEGNMVAKIVSPIVSVVV VTLLGAAASYFKLNNRRNCFRTHGNQKGVPIPTVFALGFAFGLKNPKTFPCAPSAGEWAV STVS >gi568815575f:2652275_2911452|GENSCAN_predicted_CDS_5|1095_bp atgggttccgaggagcagatcatgcaccagggcggcctgctcagctctggtgtctgcagc cccgtggaaaataaaccccgtgaaagcaatggcgtcatgtgcaacccgaccttggagaca gagaccccagagttagcgttgtctacctgggacatctcccagcccaggggtcttactcca gacccgtgtctcgactggaactgctggcagccttggaggggagatgttgcacagatccaa gaggctgacacggttcctgtcccccagcccagtctgagcagcttggaccatgatgatcaa gagcgctggaggaggccagacacattcattagattaactgcttttggatcaggtcaaaga gactttgatttggcagatgcccttgatgaccctgaacccaccaagaagccaaactcagat atctacccaaagccaaaaccaccttactacccacagcccgagaatcccgacagcggtgga agttacttcaatgatgtggaccgtgatgacggacgctacccgcccaggcccaggccacgg ccgcctgcaggaggtggcggcggtggctactccagttatggcaactccgacaacacgcac ggtacccctacatctgggcatgcgatggccctggtgtctggtgttcccttctttgtgtcc atgttgtcaatcgagaaaaatgacgagacgagcgtaaaagatttatttgccaaagttaag gttaagggcgcgccccaggagacaggaagagggggctatagactcaactctcgttatgga aatacttatggtaaaggaattatgtcttacagaatatgtggagatcaccattcaacgtat ggcaatccagaaggcaatatggtagcaaaaatcgtgtctcccatcgtatccgtggtggtg gtgacactgctgggagcagcagccagttatttcaaactaaacaataggagaaattgtttc aggacccatggaaaccagaaaggggtgcccattcctacggtgtttgctttaggatttgca tttggcctcaaaaatcctaaaaccttcccatgcgctccctctgctggtgaatgggcagta tcaaccgttagctaa >gi568815575f:2652275_2911452|GENSCAN_predicted_peptide_6|152_aa MVSGAAGSGRGPADLRPGASPATPLRLNTHAPSNDNAPGTPPDLSGVSSCPAAPRTCLEF WASSSSTPAGAPRSPLSTPWTTPALPDASICTSSELGLPLPAPPGQEAMESLAAALYEVH NWAQIKVVVEAFRILKEAEKALNDTPKTNFYV >gi568815575f:2652275_2911452|GENSCAN_predicted_CDS_6|459_bp atggtcagtggcgccgcgggcagtgggcgcggacctgcagacctgcgcccgggagcatca cctgcgaccccgctgcgtcttaacacccatgcgcccagcaacgacaacgcccccgggaca ccacccgacctcagcggtgtcagcagctgccccgccgccccgcgcacctgcttggagttc tgggcatcctcttcctccaccccagctggcgccccccggtcacccctctccactccctgg accacccctgcgctcccggacgccagtatctgcacctcctccgagcttgggctgcctctg cccgcgccccccggccaggaagccatggagagcttggcagcagctttgtatgaagttcat aactgggctcagatcaaagttgtggtggaagcctttagaattctgaaggaagcagaaaag gcattgaatgacacccccaagaccaacttttatgtttaa >gi568815575f:2652275_2911452|GENSCAN_predicted_peptide_7|418_aa MAPSPLLSLRSVTLVFLLIFTVTDQAFVTLATNDIYCQGALVLGQSLRRHRLTRKLVVLI TPQVSSLLRVILSKVFDEVIEVNLIDSADYIHLAFLKRPELGLTLTKLHCWTLTHYSKCV FLDADTLVLSNVDELFDRGEFSAAPDPGWPDCFNSGVFVFQPSLHTHKLLLQHAMEHGSF DGADQGLLNSFFRNWSTTDIHKHLPFIYNLSSNTMYTYSPAFKQFGSSAKVVHFLGSMKP WNYKYNPQSGSVLEQGSASSSQHQAAFLHLWWTVYQNNVLPLYKSVQAGEARASPGHTLC HSDVGGPCADSASGVGEPCENSTPSAGVPCANSPLGSNQPAQGLPEPTQIVDETLSLPEG RRSEDVDLAVSVSQISIEEKVKELSPEEERRKWEEGRIDYMGKDAFARIQEKLDRFLQ >gi568815575f:2652275_2911452|GENSCAN_predicted_CDS_7|1257_bp atggccccgtcacccctgctgtccctcaggagtgtaactctggtgtttctgttgattttc acagtgactgatcaggcttttgtcacactagccaccaatgacatctactgccagggcgcc ctggtcctggggcagtcactgaggagacacaggctgacgaggaagctggtggtgttgatc actcctcaggtgtccagcctgctcagggtcatcctctcgaaggtgttcgatgaagtcatt gaagtgaatctaatcgatagtgccgactacatccacctggcctttctgaagagacctgag ctcgggctcaccctcaccaagcttcactgttggactctcactcactacagcaagtgtgtc ttcctggatgcagacactctggtgctgtccaatgtcgatgagctgtttgacaggggagag ttttctgcggccccggaccccggatggccggattgcttcaatagcggggtgtttgtcttc cagccttctctccacacgcataaactcctgctacagcacgccatggaacacggcagcttt gacggggcagaccaaggcttactgaatagtttcttcaggaactggtcgaccacagacatc cacaagcacctgccgttcatctataacttgagtagtaacacgatgtacacttacagccct gccttcaagcaattcggttccagtgcaaaggtcgtccactttttggggtccatgaaacct tggaactacaagtacaatccacagagtggctcggtgttggagcaaggctcagcgtccagc agccagcaccaggcggcattccttcatctctggtggacggtctaccagaacaacgtgctg cccctttataaaagcgtccaagcgggggaagcacgcgcgtctcctggtcacacactttgc cacagtgatgtgggggggccgtgtgcggattcagcctctggtgttggagagccgtgtgaa aattcaacacccagtgcgggcgtgccgtgtgcaaattcaccactgggttctaaccagcct gctcagggccttccggagccgacccagatagtggatgagaccctgtccctacctgaagga cgccgttcagaagatgtcgacctggccgtctctgtttcccagatctccatcgaagagaag gtgaaggaattgagccccgaggaagagaggaggaagtgggaggaaggccgtatcgactac atggggaaggacgcgtttgctcgcatccaggagaagctggaccggttcctgcagtaa >gi568815575f:2652275_2911452|GENSCAN_predicted_peptide_8|260_aa XKVLNAIEDNGLKNSTFTYFTSDHGGHLEARDGHSQLGGWNGIYKGGKGMGGWEGGIRVP GIFHWPGVLPAGRVIGEPTSLMDVFPTVVQLVGGEVPQDRVIDGHSLVPLLQGAEARSAH EFLFHYCGQHLHAARWHQKDSGSVWKVHYTTPQFHPEGAGACYGRGVCPCSGEGVTHHRP PLLFDLSRDPSEARPLTPDSEPLYHAVIARVGAAVSEHRQTLSPVPQQFSMSNILWKPWL QPCCGHFPFCSCHEDGDGTP >gi568815575f:2652275_2911452|GENSCAN_predicted_CDS_8|783_bp ngtaaggttcttaatgccatcgaagacaatggtttaaagaactcaacattcacgtatttc acctctgaccatggaggacatttagaggcaagagatggacacagccagttagggggatgg aacggaatttacaaaggtgggaagggcatgggaggatgggaaggtgggatccgcgtgccc gggatcttccactggccgggggtgctcccggccggccgagtgattggagagcccacgagc ctgatggacgtgttccctactgtggtccagctggtgggtggcgaggtgccccaggacagg gtgattgatggccacagcctggtacccttgctgcagggagctgaggcacgctcggcacat gagttcctgtttcattactgtgggcagcatcttcacgcagcacgctggcaccagaaggac agtggaagcgtctggaaggttcattacacgaccccgcagttccaccccgagggagcgggg gcctgctacggccgaggcgtctgcccatgctccggggagggcgtgacccatcacagaccc cctttgctctttgacctctccagggacccctccgaggcacggcccctgacccccgactcc gagcccctgtaccacgccgtgatagcaagggtaggtgccgcggtgtcggagcatcggcag accctgagtcctgtgccccagcagttttccatgagcaacatcctgtggaagccgtggctg cagccgtgctgcggacatttcccgttctgttcatgccacgaggatggggatggcaccccc tga