GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:45:25 Sequence gi568815575f:2591361_2838256 : 246896 bp : 45.90% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1997 2051 55 1 1 81 36 55 0.097 1.08 1.02 Intr + 14099 14280 182 0 2 111 66 43 0.058 3.99 1.03 Intr + 17951 18143 193 1 1 4 43 137 0.149 -0.13 1.04 Intr + 20754 20858 105 1 0 72 99 58 0.362 5.49 1.05 Intr + 27380 27424 45 0 0 74 73 60 0.363 1.48 1.06 Intr + 28069 28137 69 2 0 53 111 43 0.595 2.25 1.07 Intr + 31330 31374 45 2 0 89 74 61 0.627 3.18 1.08 Intr + 31989 32093 105 1 0 69 111 44 0.612 4.99 1.09 Term + 35237 35274 38 0 2 87 47 19 0.108 -4.80 1.10 PlyA + 36549 36554 6 1.05 2.02 PlyA - 37249 37244 6 1.05 2.01 Sngl - 41292 40801 492 0 0 64 46 177 0.743 7.16 2.00 Prom - 43356 43317 40 -4.16 3.06 PlyA - 44467 44462 6 1.05 3.05 Term - 49720 49545 176 2 2 37 41 124 0.374 0.52 3.04 Intr - 57500 57344 157 2 1 48 107 56 0.040 3.08 3.03 Intr - 64659 64580 80 1 2 99 44 20 0.002 -2.03 3.02 Intr - 85698 85612 87 1 0 106 69 48 0.461 4.64 3.01 Init - 94988 94898 91 2 1 88 80 27 0.254 0.55 3.00 Prom - 95928 95889 40 -4.16 4.00 Prom + 96563 96602 40 -2.86 4.01 Init + 100001 100067 67 1 1 110 88 248 0.995 26.23 4.02 Intr + 123062 123094 33 0 0 114 105 15 0.831 3.99 4.03 Intr + 128301 128345 45 1 0 108 101 57 0.986 7.38 4.04 Intr + 128996 129064 69 0 0 57 111 61 0.947 4.45 4.05 Intr + 131267 131314 48 0 0 116 87 37 0.915 5.05 4.06 Intr + 134900 135013 114 0 0 104 115 95 0.963 14.22 4.07 Intr + 146840 146896 57 0 0 107 99 86 0.851 10.46 4.08 Intr + 149323 149366 44 2 2 80 94 -4 0.825 -2.64 4.09 Intr + 149775 149857 83 2 2 81 121 3 0.839 1.34 4.10 Term + 150264 150375 112 0 1 61 47 135 0.444 4.73 4.11 PlyA + 150520 150525 6 1.05 5.00 Prom + 151730 151769 40 -1.56 5.01 Init + 160915 160975 61 0 1 91 94 44 0.897 4.71 5.02 Intr + 162546 162662 117 1 0 80 110 19 0.826 3.74 5.03 Term + 163267 163379 113 2 2 38 47 80 0.381 -2.38 5.04 PlyA + 163864 163869 6 1.05 6.00 Prom + 169452 169491 40 -3.76 6.01 Init + 171066 171129 64 2 1 52 105 100 0.931 9.41 6.02 Intr + 176192 176312 121 0 1 73 58 50 0.863 0.05 6.03 Intr + 176998 177131 134 1 2 61 80 83 0.898 5.09 6.04 Intr + 178381 178413 33 2 0 91 91 15 0.551 0.29 6.05 Intr + 179190 179231 42 1 0 99 119 69 0.998 9.31 6.06 Intr + 183356 183379 24 0 0 99 109 25 0.910 3.70 6.07 Intr + 190706 190768 63 0 0 108 63 88 0.153 6.99 6.08 Intr + 203175 203243 69 1 0 106 96 87 0.791 10.45 6.09 Intr + 205950 206026 77 1 2 109 64 62 0.705 5.13 6.10 Intr + 207902 208019 118 1 1 59 62 110 0.655 5.54 6.11 Intr + 215341 215412 72 2 0 108 45 32 0.280 0.28 6.12 Intr + 216825 216860 36 1 0 126 88 5 0.587 2.53 6.13 Intr + 219976 220092 117 2 0 104 115 44 0.969 9.14 6.14 Term + 221150 221274 125 0 2 105 39 30 0.552 -1.65 6.15 PlyA + 222629 222634 6 1.05 7.04 PlyA - 222681 222676 6 1.05 7.03 Term - 228550 228435 116 2 2 50 39 110 0.906 1.03 7.02 Intr - 232674 232650 25 0 1 93 115 26 0.870 3.40 7.01 Intr - 238483 238223 261 1 0 108 49 185 0.947 14.28 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 190706 190788 83 0 2 108 47 115 0.824 7.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:2591361_2838256|GENSCAN_predicted_peptide_1|278_aa MALKNDDACVCADLTKFSGALVMNPMMGEEIYFSLTATLPLRQATGSTSSCSVDVLQLEF ELISSSLPSEWSGIEPIFQSSAGSAPPGPWGRFRPAAFARALHSRTISARLGRAVARGAV LALRLLGLLRRLVAAPVSERRDPGTDDCNSHVLQTGDPAPPNAPKPKPDPNPNRPGFTGA DFDLADAFRDGGNNDPAPLNSPKLKPNANPEQPGFIGDDFDLADALHDKGNCTHNLNSHL LQTDEPAPLNPPKPKPNPNPKQPDSTGDDFDFTDVSSW >gi568815575f:2591361_2838256|GENSCAN_predicted_CDS_1|837_bp atggcgctgaagaatgatgatgcctgtgtgtgtgcggacctgaccaaattctcaggggcc ttagtcatgaacccaatgatgggtgaggaaatatatttctctctcactgctaccctgcca ttacgacaggccacaggcagcacttccagctgctccgtggatgtgctgcaacttgaattt gaacttatttcaagttctttgccttcagaatggtcaggaatagagcccatcttccagtcc tccgcgggctccgccccacccggcccgtgggggaggttccgtcccgccgccttcgcccgc gcgctgcactcgcggacgatctctgctcgccttgggcgcgccgtggcgcgcggggccgtg ctggcgctgcggctcctcggcctgctccgccgtctggtggccgccccggtgagcgagcgg cgggatccgggtactgatgattgcaactctcatgttttacaaacaggtgacccagcacca cctaatgcccccaaaccaaagccagatccaaaccccaaccgacctggtttcactggggct gactttgacttagcagatgcctttcgtgatggaggaaataacgacccagcacctcttaat tcacccaagctgaagccaaatgcgaaccctgagcagcctggattcattggggatgacttt gacttagcagatgccttacatgacaaaggaaactgcactcataatctcaactctcatctt ttacaaacagacgagccagcaccactgaacccacccaaaccaaagccaaatccaaacccc aagcagcctgattccaccggggatgactttgatttcacagatgtttcttcatggtga >gi568815575f:2591361_2838256|GENSCAN_predicted_peptide_2|163_aa MSLRTRKCYNYPYELDGSKCRYEPGNTTIFHMNQKVLQCHYEPGSVTMSLKLRSATIIPM NQEVLQCPCKPGSATIIHMNQEVSQCPYEPGSATIICMNLEMLQCPYEPGSATIIRMNQK VLQCPCKLGSSTIIGMKEEVLQCPCEPGSAIIHMNQEVLQLSG >gi568815575f:2591361_2838256|GENSCAN_predicted_CDS_2|492_bp atgtccctacgaaccaggaagtgctacaattatccctatgaactagacggttccaaatgt cgttatgaaccaggaaatactacaattttccatatgaaccaaaaagtgttgcagtgtcac tatgaaccaggaagtgtcacaatgtccctcaaactaagaagtgctacaattatccctatg aaccaggaagtattgcaatgtccctgcaaaccaggaagtgctacaattatccatatgaac caggaagtgtcacaatgtccctatgaaccaggaagtgctacaattatctgtatgaacctg gaaatgttacaatgtccctatgaaccaggaagtgctacaattatccgtatgaaccagaaa gtgttacaatgtccctgcaaattaggaagttctacaattatcggaatgaaagaggaagtg ttacaatgtccctgtgaaccaggaagtgctattatccatatgaaccaggaagtgctacaa ttatctggatga >gi568815575f:2591361_2838256|GENSCAN_predicted_peptide_3|196_aa MEFHHVGQAGLELLTSTDLPASASRSAGITGSDQDTCTFCLQVLVTGKSMNRILIDVCAD LPSLMIQSPAHSPLWVMRAGDYAANRREFLTRHPQPLTCGLSLLMFSQWWEMRPANGTLY RTTVLNCWEDKARSQIKSGHFQPHNGTVVRTSSTRDTRNTKECGTIQQSRIQRHEWCLLR AFRAANEQEMTAGISQ >gi568815575f:2591361_2838256|GENSCAN_predicted_CDS_3|591_bp atggagtttcaccacgttggccaggctggtctcgaactcctgacctcaactgatctgccg gcctcggcctcccgaagtgctggtattacagggtcagatcaggacacctgcacattctgt ttgcaggttctcgtcaccggcaaatcaatgaaccgaattttaattgatgtgtgtgcagat ttaccctcactgatgatccagtcacctgctcactctcctctttgggtcatgagagctggt gactatgcagccaacaggagggaattcttgacaagacatccccagcccctgacttgtgga ttgagcctgctgatgttcagtcagtggtgggaaatgagacctgctaatgggaccctgtac cgcaccacggtgctgaactgttgggaagataaagcaagaagtcagataaagtctggacat ttccagcctcacaatggcactgtggttagaacttcatcaacaagagacacgcgtaacacc aaggaatgtggcacaattcagcagtcccgaatacagcggcatgaatggtgccttctaaga gcattcagagcagctaacgagcaagagatgacggctggcatttctcaataa >gi568815575f:2591361_2838256|GENSCAN_predicted_peptide_4|223_aa MARGAALALLLFGLLGVLVAAPDGGFDLSDALPGDDFDLGDAVVDGENDDPRPPNPPKPM PNPNPNHPSSSGSFSDADLADGVSGGEADAPGVIPGIVGAVVVAVAGAISSFIAYQKKKL CFKENAEQGEVDMESHRNANAEPAEIKPLAPESCRNCVHNLGCLQITCPSSTSETPGSPD FFLIKIRYSSVSRYSSKSRYSLHPDTVLPKMYKMYRIFPTLKT >gi568815575f:2591361_2838256|GENSCAN_predicted_CDS_4|672_bp atggcccgcggggctgcgctggcgctgctgctcttcggcctgctgggtgttctggtcgcc gccccggatggtggtttcgatttatccgatgcccttcctggggatgactttgacttagga gatgctgttgttgatggagaaaatgacgacccacgaccaccgaacccacccaaaccgatg ccaaatccaaaccccaaccaccctagttcctccggtagcttttcagatgctgaccttgcg gatggcgtttcaggtggagaagccgacgccccaggcgtgatccccgggattgtgggggct gtcgtggtcgccgtggctggagccatctctagcttcattgcttaccagaaaaagaagcta tgcttcaaagaaaatgcagaacaaggggaggtggacatggagagccaccggaatgccaac gcagagccagctgaaataaaaccactggctcctgaaagttgtaggaactgtgtccacaat cttggctgtttacaaatcacgtgtccatcgagcacgtctgaaacccctggtagccccgac ttctttttaattaaaataagatactcctctgtatccagatactcctctaaatccaggtac tccctacatccagatactgtacttcctaagatgtacaagatgtaccgcattttcccaaca ctgaagacttga >gi568815575f:2591361_2838256|GENSCAN_predicted_peptide_5|96_aa MESWWGLPCLAFLCFLMHARVQCSINYVRYATPHYKVDCVLDDLSIRWLMKSVLSTLKAG RLQAEEQGEPVQVPKLKNLESDVPASTMGERCRPED >gi568815575f:2591361_2838256|GENSCAN_predicted_CDS_5|291_bp atggagagctggtggggacttccctgtcttgcgttcctgtgttttctaatgcacgcccga gtacaatgttcaataaattatgtgaggtatgcaacacctcattataaagttgactgtgtg ctagatgatttgtccatccgttggctgatgaaaagtgttctgagcactctgaaggcaggc cgtctgcaggctgaggagcaaggagagccagtccaggttccaaaactgaagaacttggag tctgatgttccagcatccaccatgggagaaagatgtaggccagaagactag >gi568815575f:2591361_2838256|GENSCAN_predicted_peptide_6|364_aa MGSEEQIMHQGGLLSSGVCSPVENKPRESNGVMCNPTLETETPELALSTWDISQPRGLTP DPCLDWNCWQPWRGDVAQIQEADTVPVPQPSLSSLDHDDQERWRRPDTFIRLTAFGSGQR DFDLADALDDPEPTKKPNSDIYPKPKPPYYPQPENPDSGGSYFNDVDRDDGRYPPRPRPR PPAGGGGGGYSSYGNSDNTHGTPTSGHAMALVSGVPFFVSMLSIEKNDETSVKDLFAKVK VKGAPQETGRGGYRLNSRYGNTYGKGIMSYRICGDHHSTYGNPEGNMVAKIVSPIVSVVV VTLLGAAASYFKLNNRRNCFRTHGNQKGVPIPTVFALGFAFGLKNPKTFPCAPSAGEWAV STVS >gi568815575f:2591361_2838256|GENSCAN_predicted_CDS_6|1095_bp atgggttccgaggagcagatcatgcaccagggcggcctgctcagctctggtgtctgcagc cccgtggaaaataaaccccgtgaaagcaatggcgtcatgtgcaacccgaccttggagaca gagaccccagagttagcgttgtctacctgggacatctcccagcccaggggtcttactcca gacccgtgtctcgactggaactgctggcagccttggaggggagatgttgcacagatccaa gaggctgacacggttcctgtcccccagcccagtctgagcagcttggaccatgatgatcaa gagcgctggaggaggccagacacattcattagattaactgcttttggatcaggtcaaaga gactttgatttggcagatgcccttgatgaccctgaacccaccaagaagccaaactcagat atctacccaaagccaaaaccaccttactacccacagcccgagaatcccgacagcggtgga agttacttcaatgatgtggaccgtgatgacggacgctacccgcccaggcccaggccacgg ccgcctgcaggaggtggcggcggtggctactccagttatggcaactccgacaacacgcac ggtacccctacatctgggcatgcgatggccctggtgtctggtgttcccttctttgtgtcc atgttgtcaatcgagaaaaatgacgagacgagcgtaaaagatttatttgccaaagttaag gttaagggcgcgccccaggagacaggaagagggggctatagactcaactctcgttatgga aatacttatggtaaaggaattatgtcttacagaatatgtggagatcaccattcaacgtat ggcaatccagaaggcaatatggtagcaaaaatcgtgtctcccatcgtatccgtggtggtg gtgacactgctgggagcagcagccagttatttcaaactaaacaataggagaaattgtttc aggacccatggaaaccagaaaggggtgcccattcctacggtgtttgctttaggatttgca tttggcctcaaaaatcctaaaaccttcccatgcgctccctctgctggtgaatgggcagta tcaaccgttagctaa >gi568815575f:2591361_2838256|GENSCAN_predicted_peptide_7|133_aa SPATPLRLNTHAPSNDNAPGTPPDLSGVSSCPAAPRTCLEFWASSSSTPAGAPRSPLSTP WTTPALPDASICTSSELGLPLPAPPGQEAMESLAAALYEVHNWAQIKVVVEAFRILKEAE KALNDTPKTNFYV >gi568815575f:2591361_2838256|GENSCAN_predicted_CDS_7|402_bp tcacctgcgaccccgctgcgtcttaacacccatgcgcccagcaacgacaacgcccccggg acaccacccgacctcagcggtgtcagcagctgccccgccgccccgcgcacctgcttggag ttctgggcatcctcttcctccaccccagctggcgccccccggtcacccctctccactccc tggaccacccctgcgctcccggacgccagtatctgcacctcctccgagcttgggctgcct ctgcccgcgccccccggccaggaagccatggagagcttggcagcagctttgtatgaagtt cataactgggctcagatcaaagttgtggtggaagcctttagaattctgaaggaagcagaa aaggcattgaatgacacccccaagaccaacttttatgtttaa