GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:22:08 Sequence gi568815597r:23409883_23630793 : 220911 bp : 49.54% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.31 Intr - 7508 7367 142 0 1 86 66 148 0.783 11.81 1.30 Intr - 8127 8022 106 0 1 77 66 161 0.970 12.59 1.29 Intr - 9257 9195 63 2 0 108 75 76 0.960 7.21 1.28 Intr - 12952 12749 204 1 0 96 43 96 0.937 5.30 1.27 Intr - 14802 14683 120 0 0 42 89 162 0.354 12.39 1.26 Intr - 21243 21153 91 2 1 92 74 10 0.321 -0.00 1.25 Intr - 22036 21814 223 2 1 72 77 95 0.406 3.99 1.24 Intr - 23390 23195 196 2 1 85 77 93 0.942 6.89 1.23 Intr - 23811 23543 269 1 2 100 98 166 0.528 16.05 1.22 Intr - 24487 24372 116 0 2 106 75 180 0.999 18.59 1.21 Intr - 24736 24651 86 1 2 35 80 110 0.999 3.52 1.20 Intr - 26146 25969 178 0 1 49 94 192 0.978 15.82 1.19 Intr - 26772 26678 95 0 2 65 75 65 0.999 1.66 1.18 Intr - 27162 27029 134 1 2 103 73 164 0.999 16.76 1.17 Intr - 27438 27248 191 2 2 109 67 252 0.999 24.43 1.16 Intr - 27715 27542 174 0 0 86 69 191 0.779 16.05 1.15 Intr - 28952 28858 95 2 2 70 -23 163 0.528 2.16 1.14 Intr - 29348 29279 70 1 1 65 89 28 0.831 -0.22 1.13 Intr - 31335 31220 116 0 2 15 114 97 0.693 4.25 1.12 Intr - 31591 31505 87 1 0 78 60 107 0.991 7.07 1.11 Intr - 31848 31773 76 2 1 39 92 99 0.990 4.92 1.10 Intr - 32389 32304 86 1 2 79 58 77 0.989 2.42 1.09 Intr - 32730 32619 112 2 1 70 94 97 0.992 8.88 1.08 Intr - 41646 41597 50 0 2 107 101 45 0.982 5.18 1.07 Intr - 42889 42815 75 1 0 73 109 33 0.888 3.61 1.06 Intr - 46144 45999 146 2 2 105 81 205 0.955 21.50 1.05 Intr - 46312 46240 73 1 1 77 111 67 0.999 6.88 1.04 Intr - 74270 74123 148 1 1 76 78 325 0.082 30.54 1.03 Intr - 89838 89808 31 1 1 63 95 29 0.004 -1.71 1.02 Intr - 96303 96203 101 2 2 125 45 59 0.968 5.05 1.01 Init - 96777 96740 38 0 2 98 80 111 0.982 8.79 1.00 Prom - 97328 97289 40 -7.96 2.09 PlyA - 98987 98982 6 -0.45 2.08 Term - 100266 99998 269 1 2 82 48 299 0.934 20.96 2.07 Intr - 106645 106453 193 1 1 138 96 160 0.995 20.87 2.06 Intr - 109226 109134 93 2 0 61 78 140 0.913 10.46 2.05 Intr - 111189 111006 184 2 1 64 -24 189 0.784 5.09 2.04 Intr - 112174 111955 220 2 1 137 59 473 0.938 46.66 2.03 Intr - 114606 114501 106 0 1 98 80 29 0.894 2.89 2.02 Intr - 120314 120248 67 1 1 47 91 32 0.343 -1.69 2.01 Init - 120911 120568 344 2 2 56 80 242 0.461 14.93 2.00 Prom - 122482 122443 40 -3.06 3.03 PlyA - 122634 122629 6 1.05 3.02 Term - 149137 149078 60 1 0 99 41 64 0.494 0.60 3.01 Init - 149544 149245 300 0 0 90 115 426 0.928 42.75 3.00 Prom - 154280 154241 40 -4.26 4.00 Prom + 172946 172985 40 -3.56 4.01 Init + 179182 179246 65 0 2 62 61 57 0.018 1.02 4.02 Intr + 197550 197619 70 0 1 106 73 72 0.076 6.68 4.03 Term + 203160 203327 168 2 0 59 47 139 0.939 4.78 4.04 PlyA + 206138 206143 6 1.05 5.03 PlyA - 207652 207647 6 1.05 5.02 Term - 209808 209588 221 1 2 37 43 191 0.963 6.70 5.01 Init - 211249 211186 64 1 1 60 89 19 0.940 0.52 5.00 Prom - 213624 213585 40 -8.16 6.03 PlyA - 213753 213748 6 1.05 6.02 Term - 216523 216028 496 0 1 64 38 1605 0.834 147.14 6.01 Init - 217310 217243 68 2 2 79 84 68 0.547 4.06 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 55044 55117 74 2 2 71 82 79 0.839 6.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:23409883_23630793|GENSCAN_predicted_peptide_1|1231_aa MVATRLASLLAPGFLKGLAAHTLPPTLYISGSNCSPQTSSSSITWKYPSTDPGRENSSTL APAMPEQFSVAEFLAVTAEDLSSPAGAAAFAAKMPRYRGAALAREEILEGDQAILQRIKK AVRAIHSSGLGHVENEEQYREAVESLGNSHLSQNSHELSTGFLNLAVFTREVAALFKNLI QNLNNIVSFPLDSLMKGQLRDGRQDSKKQLEKAWKDYEAKMAKLEKERDRARVTGGIPGE VAQDMQRERRIFQLHMCEYLLKAGESQMKQGPDFLQSLIKFFHAQHNFFQDGWKAAQSLF PFIEKLAASVHALHQAQEDELQKLTQLRDSLRGTLQLESREGQEHLSRKNSGCGYSIHQH QGNKQFGTEKVGFLYKKSDGIRRVWQKRKCGVKYGCLTISHSTINRPPVKLTLLTCQVRP NPEEKKCFDLVTRECLRVRLVRDTSPCGPSGFGGGSLGSGKASSISVGQKPPSLPTDNRT YHFQAEDEHECEAWVSVLQNSKDEALSSAFLGEPSAGPGSWGSAGHDGEPHDLTKLLIAE VKSRPGNSQCCDCGAADPTWLSTNLGVLTCIQCSGVHRELGVRFSRMQSLTLDLLGPSEL LLALNMGNTSFNEVMEAQLPSHGGPKPSAESDMGTRRDYIMAKYVEHRFARRCTPEPQRL WTAICNRDLLSVLEAFANGQDFGQPLPGPDAQAPEELVLHLAVKVANQASLPLVDFIIQN GGHLDAKAADGNTALHYAALYNQPDCLKLLLKGRALVGTVNEAGETALDIARKKHHKECE ELVRTRGGKGPLTLSLSLLSPQPSEPPSLCFWQLEQAQAGTFAFPLHVDYSWVISTEPGS DSEEDEEEKRCLLKLPAQAHWASGRLDISNKTYETVASLGAATPQGESEDCPPPLPVKNS SRTLVQGCARHASGDRSEVSSLSSEAPETPESLGSPASSSSLMSPLEPGDPSQAPPNSEE GLREPPGTSRPSLTSGTTPSEMYLPVRFSSESTRSYRRGARSPEDGPSARQPLPRRNVPA LLLPLCPSPRRASRANMGQEEELLRIAKKLEKMVARKNTAKATQPSLSGPRVSSSVKWEE EMVSFGDGFLWKCPELSPEKMQVWPQGPCSMEPVYSHVCAWALGSVWEGALDLLKKLHSC QMSIQLLQTTRIGVAVNGVRKHCSDKEVVSLAKVLIKNWKRLLDSPGPPKGEKGEEREKA KKKEKGLECSDWKPEAGLSPPRKKREDPKTS >gi568815597r:23409883_23630793|GENSCAN_predicted_CDS_1|3693_bp atggtggccacacgcctggccagcctgctggctcctggcttcctgaaaggtcttgctgcc cacactctccctcccaccctctacatcagtgggtccaactgtagtcctcagaccagcagc agcagtatcacctggaaatatccttccacagatcctggaagagagaacagctccacgctc gcgcccgccatgccggagcagttcagcgtcgccgagttcctggccgtcaccgcggaggac ctcagctccccggctggggccgccgccttcgccgccaagatgccccggtaccgaggggcg gcgctggcgcgggaggagatcttggaaggagaccaagccatcctgcagagaataaagaag gctgtgcgggcaatccatagctccggccttggccatgtggagaatgaagagcagtaccga gaggccgtggaatccttaggcaacagccacctgtcccagaacagccatgagctgtccaca ggcttcctaaacttggccgtgttcacccgcgaggttgctgcgctcttcaagaacctgatt cagaacttgaacaacattgtctctttccccctggacagtctgatgaaggggcagctgagg gacggtcgacaggattccaaaaaacagctggagaaggcatggaaggactatgaagccaaa atggccaagctggagaaggagcgcgatcgggccagggtgacaggagggatccctggggag gtggcccaggacatgcagagagagcggcgcatcttccagctgcacatgtgtgagtatctg ctcaaagccggggagagccagatgaagcaaggtcctgacttccttcagagcctcatcaag ttcttccacgcccagcacaactttttccaagatggctggaaggctgcccagagcctgttc cccttcatcgagaagctggcggcctcagtacatgcactccatcaggcccaggaggacgag ctacagaagctgacccagctccgggactccctccgagggacactgcagcttgagagcaga gagggacaggaacacctgagccggaagaactcaggatgtggctatagcatccaccagcac caaggcaacaagcagtttgggacggagaaagtgggctttctatacaagaaaagtgacgga attcgaagagtctggcagaaaaggaagtgtggagtcaagtatggctgcctgaccatctca cacagcacgataaaccggcccccggtgaagctgaccctgctgacgtgccaagtgaggcca aaccctgaggagaaaaagtgcttcgacctggtgacccgtgagtgcttgagagtccgactt gtcagggacacatctccatgtggccccagcggctttggtgggggctccctgggctccggg aaggccagcagcatttctgtgggtcagaagcccccttctctcccgacagacaaccggacg taccactttcaggcagaggacgagcacgagtgtgaggcgtgggtgtcagtgttgcagaac agcaaggacgaagccctgagcagcgccttcctcggggagcccagcgctggcccggggtcc tgggggtccgccggccatgatggggagccgcacgacctcacaaagctgctcatcgcggag gtgaagagcaggcctgggaatagccagtgctgcgactgcggggctgcagaccccacgtgg ctcagcaccaacctgggcgtgctcacctgcatccagtgctcgggcgtccaccgcgaactg ggcgtgcgcttttcgcgcatgcagtcactcaccttggacctgctgggcccctccgagttg ttgctggccttgaacatgggaaacacgagcttcaatgaggtcatggaggcccagctaccc tcacacggcggccctaaaccctcagctgagagtgacatgggcacccgcagggactacatt atggccaagtatgtggagcataggtttgcacgccggtgcacacctgagcctcagcgactc tggacagccatttgcaacagggacctcctgtcggtactggaggcctttgccaatgggcag gactttggacagccgctgccagggcctgatgcacaggcacctgaagaactcgtcttgcat ttggctgtcaaagtcgccaaccaggcttccctgcctctggtggatttcatcatccagaac ggtggtcacctggatgccaaggctgctgacgggaacacggctctgcactacgcagcactc tacaaccagcccgactgcctcaagctgctgctgaaggggagagctttggttggcacagta aatgaagcaggcgagacagctctggacatagccaggaagaagcaccacaaggagtgtgag gagctggtgaggactcggggaggcaaggggcccctgactctttctctctccctgctgagc ccccaaccctctgaaccaccaagcctttgtttttggcagctggagcaggcccaggcgggg acctttgccttccctctacatgtggactactcctgggtaatttccacagagcctggctct gacagtgaggaggatgaggaagagaagcgctgcttgctgaagctcccggcccaggctcac tgggccagtgggaggctggacatcagcaacaagacctatgagactgtcgccagcctggga gcagccacccctcagggcgagagtgaggactgtcccccgcccttgccagtcaaaaactct tctcggactttggtccaagggtgtgcaagacatgccagtggagatcgttctgaagtctcc agcctgagttcagaggcccctgagacccctgagagcctgggcagtccagcctcctcctcc agtctgatgagccccttggaacctggggatcccagccaagccccacccaactctgaagag ggcctccgagagcccccaggcacctccagacccagcctgacatccgggaccaccccttcg gagatgtacctccccgtcagattcagctccgagagcactcgctcctatcggcggggggcg cggagccctgaagatggtccctcagccaggcagcctctgcccagaaggaacgtgccggcc ctactgctgcccctgtgcccctcgccccgccgggcgtcgcgggccaacatgggccaggaa gaggagctgctgaggatcgccaaaaagctggagaagatggtggccaggaagaacacggca aaagccactcaaccttccctttctggacctcgagtttcctcttctgtgaaatgggaagag gaaatggtgtcttttggagatggattcctgtggaagtgcccagagctgtcccctgaaaag atgcaagtctggccgcagggtccctgcagcatggagccagtttacagccacgtctgcgcc tgggctctcggcagtgtctgggaaggggccctggaccttctgaagaagctgcacagctgc cagatgtccatccagctactacagacaaccaggattggagttgctgttaatggggtccgc aagcactgctcagacaaggaggtggtgtccttggccaaagtccttatcaaaaactggaag cggctgctagactcccctggacccccaaaaggagaaaaaggagaggaaagagaaaaggca aagaagaaggaaaaagggcttgagtgttcagactggaagccagaagcaggcctttctcca ccaaggaaaaaacgagaagaccccaaaaccagn >gi568815597r:23409883_23630793|GENSCAN_predicted_peptide_2|491_aa MLQGPRALASAAGQTPKVVPAMSPTELWPSGLSSPQLCPATATYYTPLYPQTAPPAAAPG TCLDATPHGPEGQVVRCLPAGRLPVMLGAPTASPYAFPGVGKEGGVPEIQFEWSRNQYWV PAIVQAKSMAWGHCGTLAKRKLDLEGIGRPVVPEFPTPKGKCIRVDGLPSPKTPKSPGEK TRYDTSLGLLTKKFIYLLSESEDGVLDLNWAAEVLDVQKRRIYDITNVLEGIQLIRKKAK NNIQWVGRGMFEDPTRPGKQQQLGQELKELMNTEQALDQLIQSCSLSFKHLTEDKANKRY PPWLGEGDIRAVGNFKEQTVIAVKAPPQTRLEVPDRTEDNLQIYLKSTQGPIEVYLCPEE VQEPDSPSEEPLPSTSTLCPSPDSAQPSSSTDPSIMEPTASSVPAPAPTPQQAPPPPSLV PLEATDSLLELPHPLLQQTEDQFLSPTLACSSPLISFSPSLDQDDYLWGLEAGEGISDLF DSYDLGDLLIN >gi568815597r:23409883_23630793|GENSCAN_predicted_CDS_2|1476_bp atgctgcaagggccccgggccttggcttcggccgctgggcagaccccgaaggtggtgccc gcgatgagccccacagagctgtggccatccggcctcagcagcccccagctctgcccagct actgctacctactacacaccgctgtacccgcagacggcgcctcccgcagcggcgccaggc acctgcctcgacgccactccccacggacccgagggccaagttgtgcgatgcctgccggca ggccggctgccggtaatgctgggggcccccaccgcttccccctatgcttttccaggtgtg ggaaaggagggaggggtgcctgagatccagtttgagtggagcagaaaccagtactgggtg cctgctattgtccaggcaaagagtatggcctggggacactgtgggactttggccaaaagg aagctggatctggaggggattgggaggcccgtcgtccctgagttcccaacccccaagggg aagtgcatcagagtggatggcctccccagccccaaaacccccaaatcccccggggagaag actcggtatgacacttcgctggggctgctcaccaagaagttcatttacctcctgagcgag tcagaggatggggtcctggacctgaactgggccgctgaggtgctggacgtgcagaagcgg cgcatctatgacatcaccaacgtgctggaaggcatccagctcatccgcaagaaggccaag aacaacatccagtgggtaggcaggggaatgtttgaagaccccaccagacctgggaagcag caacagctggggcaggagctgaaggagctgatgaacacggagcaggccttggaccagctc atccagagctgctctctgagcttcaagcacctgactgaggacaaggccaacaagagatat cctccttggttgggggaaggtgatatccgtgctgttggcaactttaaggagcagacagtg attgccgtcaaggcccctccgcagacgagactggaagtgcccgacaggactgaggacaac ctgcagatatatctcaagagcacccaagggcccatcgaagtctacctgtgcccagaggag gtgcaggagccggacagtccttccgaggagcctctcccctctacctccaccctctgcccc agccctgactctgcccagcccagcagcagcaccgaccctagcatcatggagcccacagca tcctcagtgccagcaccagcgccaaccccccagcaggccccaccgcctccatccctggtc cccttggaggctactgacagcctgctggagctgccgcacccactcctgcagcagactgag gaccagttcctgtccccgaccctggcgtgcagctcccctctgatcagcttctccccatcc ttggaccaggacgactacctgtggggcttggaggcgggtgagggcatcagcgatctcttc gactcctacgaccttggggacctgttgattaattga >gi568815597r:23409883_23630793|GENSCAN_predicted_peptide_3|119_aa MKALSPVRGCYEAVCCLSERSLAIARGRGKGPAAEEPLSLLDDMNHCYSRLRELVPGVPR GTQLSQVEILQRVIDYILDLQVVLAEPAPGPPDGPHLPIQTAELTPELVISNDKRSFCH >gi568815597r:23409883_23630793|GENSCAN_predicted_CDS_3|360_bp atgaaggcgctgagcccggtgcgcggctgctacgaggcggtgtgctgcctgtcggaacgc agtctggccatcgcccggggccgagggaagggcccggcagctgaggagccgctgagcttg ctggacgacatgaaccactgctactcccgcctgcgggaactggtacccggagtcccgaga ggcactcagcttagccaggtggaaatcctacagcgcgtcatcgactacattctcgacctg caggtagtcctggccgagccagcccctggaccccctgatggcccccaccttcccatccag acagccgagctcactccggaacttgtcatctccaacgacaaaaggagcttttgccactga >gi568815597r:23409883_23630793|GENSCAN_predicted_peptide_4|100_aa MQREHGPEPLVRFLRKGMGEVGTYLSTVPDFNGTGSIGGQNIQGYETQIVMITGIVVQNN TGDDDNTTIMEGAYFHWLNNYPNAHMEGAWGFKKRLLEAG >gi568815597r:23409883_23630793|GENSCAN_predicted_CDS_4|303_bp atgcaaagagagcatggcccagagcctttagtgcggtttttgcggaaaggaatgggcgag gtaggaacatacctgtccacagtccctgacttcaatggcacagggtccattgggggccag aacattcaaggctatgaaacccagattgttatgattactggtattgttgtccaaaacaat actggagacgatgacaatacaacaatcatggagggtgcttatttccactggctcaacaat tatcccaacgcccatatggagggggcctggggcttcaagaagaggcttctcgaggcagga tga >gi568815597r:23409883_23630793|GENSCAN_predicted_peptide_5|94_aa MDWIGKLDEELRSRPHLGQYPGPSPTPTRAHPPRRRRHRLRTCRACCGEARLCCGGGGGG GGWRGGDADSHSRPYRLLPASPPELRHQALGALP >gi568815597r:23409883_23630793|GENSCAN_predicted_CDS_5|285_bp atggattggattggaaagctggatgaggagctgaggagcagacctcacctgggacaatat ccaggtccctcgcccacgcccacgcgggcgcacccgccgcgccgacgccgccaccggctg cgcacctgccgcgcttgctgcggggaagccaggctctgctgtggcggcggcggcggcggc ggcggctggcggggaggagacgcggactcccactcgcggccctatcgcttgctccccgcc tccccgccagagctgcgccaccaggctctgggcgcgctcccatga >gi568815597r:23409883_23630793|GENSCAN_predicted_peptide_6|187_aa MRLEVQELGPAGGTWQVTFPLKGILIIPIIIPIIIIPIIIITITIIPIIIIPITIIPITI IPTIIIPIITIIPIIIIPITIIPITIIPTIIIPIIIITIIIIPIIITITTIIIIPNITII IIPIIITITITIIIIPITIITITINITIITITIIPIITIIITITIIPIIIILLLITIIVI CMEASLY >gi568815597r:23409883_23630793|GENSCAN_predicted_CDS_6|564_bp atgaggctggaagtgcaggaacttggaccggcggggggaacctggcaggtcacattccct ctcaagggaatcctcatcatccccatcatcatccccatcatcatcatccccatcatcatc atcaccatcaccatcatccccatcatcatcatccccatcaccatcatccccatcaccatc atccccaccatcatcatccccatcatcaccatcatccccatcatcatcatccccatcacc atcatccccatcaccatcatccccaccatcatcatccccatcatcatcatcaccatcatc atcatccccatcatcatcaccatcaccaccatcatcatcatccccaacatcaccatcatc atcatccccatcatcatcaccatcaccatcaccatcatcatcatccccatcaccatcatc accatcaccatcaacatcaccatcatcaccatcaccatcatccccatcatcaccatcatc atcaccatcaccatcatccctatcattatcatcctcctcctgatcaccataattgtcatc tgcatggaagctagcctatattga