GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:02:11 Sequence gi568815589f:36069503_36271266 : 201764 bp : 44.71% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 13863 14060 198 1 0 118 92 -15 0.122 1.22 1.02 Intr + 18192 18302 111 1 0 34 44 135 0.464 4.05 1.03 Intr + 18405 18459 55 1 1 122 78 -37 0.642 -3.26 1.04 Intr + 21662 21841 180 2 0 103 110 22 0.746 4.88 1.05 Intr + 30829 31041 213 1 0 128 108 19 0.932 5.73 1.06 Intr + 32592 32728 137 0 2 76 89 35 0.960 2.61 1.07 Intr + 35641 35781 141 2 0 96 116 34 0.989 7.32 1.08 Intr + 40455 40577 123 1 0 84 62 56 0.891 3.16 1.09 Intr + 42803 42974 172 0 1 98 115 11 0.688 3.80 1.10 Intr + 47483 47675 193 2 1 107 92 104 0.990 12.19 1.11 Intr + 49255 49465 211 0 1 80 65 147 0.966 10.09 1.12 Intr + 51161 51234 74 0 2 67 94 -10 0.579 -3.37 1.13 Intr + 52031 52186 156 1 0 108 109 176 0.963 21.91 1.14 Intr + 53322 53426 105 2 0 92 92 63 0.930 7.51 1.15 Intr + 70198 70342 145 0 1 75 50 71 0.007 1.86 1.16 Intr + 78284 78392 109 0 1 60 79 178 0.935 13.44 1.17 Intr + 79045 79148 104 1 2 53 94 93 0.994 6.22 1.18 Intr + 81370 81447 78 2 0 52 105 41 0.678 1.72 1.19 Term + 92860 93020 161 2 2 124 32 250 0.959 21.10 1.20 PlyA + 93441 93446 6 -1.75 2.00 Prom + 93470 93509 40 -6.76 2.01 Sngl + 100001 101767 1767 1 0 94 41 2604 0.927 250.94 2.02 PlyA + 102540 102545 6 1.05 3.00 Prom + 103535 103574 40 -4.86 3.01 Init + 121555 121771 217 0 1 108 80 405 0.876 39.05 3.02 Intr + 128049 128086 38 1 2 64 92 47 0.850 0.68 3.03 Intr + 129477 129594 118 2 1 38 87 92 0.962 4.14 3.04 Intr + 134566 134677 112 2 1 89 82 112 0.984 10.14 3.05 Term + 142101 142272 172 0 1 104 49 300 0.989 25.00 3.06 PlyA + 142534 142539 6 1.05 4.15 PlyA - 142577 142572 6 1.05 4.14 Term - 148098 147863 236 1 2 95 41 271 0.995 19.58 4.13 Intr - 148797 148681 117 1 0 89 99 80 0.919 9.64 4.12 Intr - 150518 150336 183 0 0 79 12 92 0.159 0.46 4.11 Intr - 153364 153275 90 2 0 70 110 45 0.797 4.87 4.10 Intr - 154000 153871 130 1 1 80 82 28 0.986 1.67 4.09 Intr - 157956 157746 211 2 1 88 93 71 0.985 6.52 4.08 Intr - 159606 159519 88 1 1 56 97 52 0.982 1.93 4.07 Intr - 164630 164418 213 0 0 75 68 222 0.973 17.59 4.06 Intr - 165835 165734 102 2 0 37 58 84 0.516 0.45 4.05 Intr - 167482 167330 153 2 0 68 91 82 0.902 6.54 4.04 Intr - 176980 176529 452 0 2 64 84 470 0.826 37.15 4.03 Intr - 179895 179690 206 0 2 70 86 153 0.709 11.30 4.02 Intr - 189393 189080 314 1 2 50 81 190 0.050 10.40 4.01 Init - 195721 195472 250 1 1 63 39 173 0.059 7.53 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 67278 67129 150 0 0 61 55 198 0.899 5.97 S.002 Init - 169010 168944 67 2 1 64 58 50 0.860 0.83 S.003 Sngl - 170722 170405 318 1 0 71 38 166 0.889 5.88 S.004 Sngl - 195721 195422 300 1 0 63 48 212 0.808 10.49 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:36069503_36271266|GENSCAN_predicted_peptide_1|888_aa XGSVCCSYAGHHTNCREYCQAIFRTDSSPGPSQIKAVENYCASISPQLIHCVNNYTQSYP MRNPTDSLYCCDRAEDHACQNACKRILMSKKTEMEIVDGLIEGCLDGAKLHCCSKANTST CRELCTKLYSMSWGNTQSWQEFDRFCEYNPVEVSMLTCLADVREPCQLGCRNLTYCTNFN NRPTELFRSCNAQSDQGAMNDMKLWEKGSIKMPFINIPVLDIKKCQPEMWKAIACSLQIK PCHSKSRGSIICKSDCVEILKKCGDQNKFPEDHTAESICELLSPTDDLKNCIPLDTYLRP STLGNIVEEVTHPCNPNPCPANELCEVNRKGCPSGDPCLPYFCVQGHGTSFSIDCNVCSC FAGNLVCSTRLCLSEHSSEDDRRTFTGLPCNCADQFVPVCGQNGRTYPSACIARCVGLQD HQFEFGSCMSKDPCNPNPCQKNQRCIPKPQVCLTTFDKFGCSQYECVPRQLACDQVQDPV CDTDHMEHNNLCTLYQRGKSLSYKGPCQPFCRATEPVCGHNGETYSSVCAAYSDRVAVDY YGDCQAVGVLSEHSSVAECASVKCPSLLAAGCKPIIPPGACCPLCAGMLRVLFDKEKLDT IAKVTNKKPITVLEILQKIRMHVSVPQCDVFGYFSIESEIVILIIPVDHYPKALQIEACN KEAEKIESLINSDSPTLASHVPLSALIISQIHPRAAWHNRRYCKIITQLAQLKFQKTLGL VIEDSQILLPAVVSLLGLASKQFHNEVLKAHNEYRQKHGVPPLKLCKNLNREAQQYSEAL ASTRILKHSPESSRGQCGENLAWASYDQTGKEVADRWYSEIKNYNFQQPGFTSGTGHFTA MVWKNTKKMGVGKASASDGSSFVVARYFPAGNVVNEGFFEENVLPPKK >gi568815589f:36069503_36271266|GENSCAN_predicted_CDS_1|2667_bp ntgggctcggtttgttgcagttatgcaggtcatcacacaaactgccgagaatactgtcaa gccatttttcgaacagactcttctcctggtccatctcagataaaagcagtggaaaattat tgcgcctctattagtccacaattaatacattgtgtgaacaattatactcaatcttatcca atgaggaacccaacggatagtttatattgctgtgacagagctgaagaccatgcttgccaa aatgcctgcaagagaatcctgatgtccaagaaaacggaaatggagattgttgatggtctc atcgagggttgcctcgatggggctaaattgcattgttgttctaaagcaaacacttcaaca tgtagggaactctgcactaaactttacagcatgagctggggcaatacacagagttggcaa gagtttgatcgcttttgtgaatataatccagtggaagtgtccatgttgacctgtttagcg gatgtccgggaaccttgccagttgggctgtagaaaccttacttactgtactaattttaac aacaggccaacagaacttttcaggagttgtaatgcacagtcagatcaaggagccatgaat gacatgaagttgtgggagaaaggaagcataaagatgccatttatcaatatacctgttctt gatattaaaaagtgccagccagagatgtggaaagcaatagcttgttcactgcagattaaa ccttgtcatagtaaatctcggggaagtattatttgcaaatcagattgtgtggagattctt aaaaaatgtggagaccagaacaaattccctgaagaccacacagctgaaagtatttgtgag cttctgtcacctacagatgatctgaagaattgtatacctttggatacatacctcaggcca agtactttaggtaacattgtagaagaagtgactcatccctgtaacccaaatccttgccct gccaatgagctctgtgaagtaaaccgaaaaggatgtccatctggagatccctgtcttcca tacttttgtgttcaaggtcatggaacatcctttagtattgactgcaatgtctgttcttgt tttgctggcaatttggtgtgctctacccgcctttgcctcagtgagcacagttcagaagat gaccgtcgtaccttcacaggtctgccctgtaactgtgcagatcagtttgtccctgtatgt gggcagaatgggcgcacttaccccagtgcctgcattgctcgctgtgtgggcctccaagac catcagtttgagtttggatcatgcatgtcaaaggatccatgtaatcctaatccctgccaa aaaaaccaaaggtgcatacccaaaccacaggtctgcctgacgacttttgataaatttgga tgtagccagtatgagtgtgtaccaagacagctcgcgtgtgaccaggtccaagatcctgtt tgtgacacagaccacatggagcacaacaatctctgcactttataccaaagaggaaaaagc ctctcttacaaaggtccctgccagcccttttgcagagcaaccgagcccgtatgtgggcac aatggtgagacctacagcagtgtgtgtgctgcctactcggatcgcgtggcagtcgattac tatggggactgccaggccgtcggagtcctctcagagcacagctccgtcgccgagtgtgct tctgtcaagtgtccttcgctcttggcagctggatgcaaacccatcatcccaccgggtgct tgttgcccattatgtgctgggatgttaagagttttatttgacaaagaaaaactggatact attgctaaggtaacaaataaaaagccaataacagttctggaaatacttcagaaaatccgc atgcacgtgtctgtcccacagtgtgatgtgtttggatacttcagcattgaatcagaaatt gtgatcctgatcattcccgtcgatcactatccaaaagctctgcagattgaagcctgcaat aaagaagcagagaagattgagtcccttatcaactctgacagcccgactttggcgtcccat gtccctctctctgccctcatcatttcccagatacaccccagagctgcttggcataacagg cgttattgtaaaatcatcacacagctggcacagttgaaattccagaagacgcttggccta gtgatcgaagactcccagatcctgctcccagctgtggtttctctgctgggacttgcttcc aaacagtttcataatgaggtcctgaaggcccacaatgagtaccggcagaagcacggcgtc cccccactgaagctctgcaagaacctcaaccgggaggctcaacagtattctgaggccctg gccagcacgaggatcctcaagcacagcccggagtccagccgtggccagtgtggggagaac cttgcatgggcatcctatgatcagacaggaaaggaggtggctgatagatggtacagtgaa atcaagaactataacttccagcagcctggcttcacctcggggactggacacttcacggcc atggtatggaagaacaccaagaagatgggcgtggggaaggcgtccgcaagtgacgggtcc tcctttgtggtggccagatacttcccagcggggaatgttgtcaatgagggcttcttcgaa gaaaacgtcctgccgccgaagaagtaa >gi568815589f:36069503_36271266|GENSCAN_predicted_peptide_2|588_aa MKLEFTEKNYNSFVLQNLNRQRKRKEYWDMALSVDNHVFFAHRNVLAAVSPLVRSLISSN DMKTADELFITIDTSYLSPVTVDQLLDYFYSGKVVISEQNVEELLRGAQYFNTPRLRVHC NDFLIKSICRANCLRYLFLAELFELKEVSDVAYSGIRDNFHYWASPEGSMHFMRCPPVIF GRLLRDENLHVLNEDQALSALINWVYFRKEDREKYFKKFFNYINLNAVSNKTLVFASNKL VGMENTSSHTTLIESVLMDRKQERPCSLLVYQRKGALLDSVVILGGQKAHGQFNDGVFAY IIQENLWMKLSDMPYRAAALSATSAGRYIYISGGTTEQISGLKTAWRYDMDDNSWTKLPD LPIGLVFHTMVTCGGTVYSVGGSIAPRRYVSNIYRYDERKEVWCLAGKMSIPMDGTAVIT KGDRHLYIVTGRCLVKGYISRVGVVDCFDTSTGDVVQCITFPIEFNHRPLLSFQQDNILC VHSHRQSVEINLQKVKASKTTTSVPVLPNSCPLDVSHAICSIGDSKVFVCGGVTTASDVQ TKDYTINPNAFLLDQKTGKWKTLAPPPEALDCPACCLAKLPCKILQRI >gi568815589f:36069503_36271266|GENSCAN_predicted_CDS_2|1767_bp atgaaattggaattcacggagaaaaactacaatagcttcgtgctgcagaacctgaacaga cagaggaaacgcaaagagtactgggacatggccctgagtgtggacaaccacgtcttcttt gcacatcgcaatgtgctggctgctgtctccccactggtgaggagcctcatctccagcaat gacatgaagaccgctgatgagcttttcatcaccattgacaccagttacctgagcccggtc acagtggaccagcttctggactacttctatagcggcaaggtggtgatctccgagcaaaat gtggaggagctgcttcgtggggctcagtatttcaacacaccacgccttcgagttcactgt aacgacttccttattaagtccatctgccgtgccaactgcttgcgctacctcttcttggct gagctgtttgagctcaaagaggtatcagacgtagcttactctggcattcgggacaacttc cactactgggccagtcctgagggctccatgcacttcatgcgctgtccacctgttatcttt ggccgcctgctccgtgatgaaaaccttcacgtgctcaatgaagaccaggcgctcagcgca ctcatcaattgggtgtacttccggaaggaggatcgggagaagtatttcaagaagttcttc aattacatcaatctcaatgctgtctccaataagacgctggtgtttgccagcaacaagctg gtgggcatggagaacacctcatcccatacaaccctgattgagagtgtcctgatggaccgc aagcaggagcggccatgcagcctgctggtctaccagcggaaaggggccctgcttgattcc gtggtcatcctcggtggccagaaggcccacggccagttcaatgatggagtgtttgcttat atcatccaggagaacctgtggatgaagctctcagacatgccctatcgggcagcagcactt agtgccacctctgctggtcgctacatctacatctctggtggcaccactgagcagatttca gggctgaagacagcctggcggtatgacatggatgacaactcctggaccaagttgcctgac ctgcccatcgggcttgtcttccacaccatggtgacctgtggggggacagtgtactcagtg ggcgggagcattgccccaaggcggtatgtctccaacatctatcgctatgatgagcggaag gaagtctggtgcctggcaggaaagatgagcatccccatggatggcaccgccgtgatcact aaaggagacaggcatctgtacattgtcactggacggtgcttggtgaaaggttatatctcc cgggtcggggtagtggactgctttgacaccagcactggggacgtggtccagtgtatcacc ttccccattgagttcaaccatcggcccctgctctctttccaacaggacaacatcctctgc gtgcacagccaccggcagagtgtggaaatcaatctgcagaaggtgaaggcaagcaagacg accacctcagtgcctgtcttgcccaacagctgccccttggatgtgtcccatgctatatgc tccattggagacagcaaggtgtttgtatgtgggggtgtcaccactgccagcgatgtccag acaaaggactacaccatcaatccaaatgccttcttgctggaccaaaagacaggcaagtgg aagaccctggctcctccaccagaggcactggactgtcctgcctgctgtctagccaagcta ccttgcaagattcttcaaaggatttaa >gi568815589f:36069503_36271266|GENSCAN_predicted_peptide_3|218_aa MAELDPFGAPAGAPGGPALGNGVAGAGEEDPAAAFLAQQESEIAGIENDEAFAILDGGAP GPQPHGEPPGGPDAVDGVMNGEYYQESNGPTDSYAAISQVDRLQSEPESIRKWREEQMER LEALDANSRKQEAEWKEKAIKELEEWYARQDEQLQKTKANNRAAEEAFVNDIDESSPGTE WERVARLCDFNPKSSKQAKDVSRMRSVLISLKQAPLVH >gi568815589f:36069503_36271266|GENSCAN_predicted_CDS_3|657_bp atggctgagctggatccgttcggcgcccctgccggcgcccctggcggtcccgcgctgggg aacggagtggccggcgccggcgaagaagacccggctgcggccttcttggcgcagcaagag agcgagattgcgggcatcgagaacgacgaggccttcgccatcctggacggcggcgccccc gggccccagccgcacggcgagccgccggggggtccggatgctgttgatggagtaatgaat ggtgaatactaccaggaaagtaatggtccaacagacagttatgcagctatttcacaagtg gatcgattgcagtcagagcctgaaagtatccgtaaatggagagaagaacaaatggaacgc ttggaagcccttgatgccaattctcggaagcaagaagcagagtggaaagaaaaggcaata aaggagctagaagaatggtatgcaagacaggacgagcagctacagaaaacaaaagcaaac aacagggcagcagaagaagcctttgtaaatgacattgacgagtcgtccccaggcactgag tgggaacgggtggcccggctgtgtgactttaaccccaagtctagcaagcaggccaaagat gtctcccgcatgcgctcagtcctcatctccctcaagcaggccccgctggtgcactga >gi568815589f:36069503_36271266|GENSCAN_predicted_peptide_4|914_aa MEENRRTQRLVFISIRTNPGTCRAGTTASLQPDREQQWAPRWIRSTADTLPDPEGWKSAA GLRLRQTAVVDNERKLSLSGNKHAVTARVNVRDRCGVAALHDSIVAVLVYEITEGTRGAR NPSLGARPKQLPSGKTRASVRVSPVEEDASGREHKLAGQPRSSSSAAEAMEDGPLVSTLP APQNTAGKELYFKNLSKRNKQIMEKNGNNRKLRVCVATCNRADYSKLAPIMFGIKTEPEF FELDVVVLGSHLIDDYGNTYRMIEQDDFDINTRLHTIVRGEDEAAMVESVGLALVKLPDV LNRLKPDIMIVHGDRFDALALATSAALMNIRILHIEGGEVSGTIDDSIRHAITKLAHYHV CCTRSAEQHLISMCEDHDRILLAGCPSYDKLLSAKNKDYMSIIRMWLGDDVKSKDYIVAL QHPVTTDIKHSIKMFELTLDALISFNKRTLVLFPNIDAVQDSLRKMDYRSLKPKRGFQLN RNLNLGIVVVPSGSKEMVRVMRKKGIEHHPNFRAVKHVPFDQFIQLVAHAGCMIGNSSCG VREVGAFGTPVINLGTRQIGRETGENVLHVRDADTQDKILQALHLQFGKQYPCSKIYGDG NAVPRILKFLKSIDLQEPLQKKFCFPPVKENISQDIDHILETLSALAVDLGGTNLRVAIV SMKGEIVKKYTQFNPKTYEERINLILQMCVEAAAEAVKLNCRILGVDNDGNCAALAERKF GQGKGLENFVTLITGTGIGGGIIHQHELIHGSSFCAAELGHLVVSLDGPDCSCGSHGCIE AYASGMALQREAKKLHDEDLLLVEGMSVPKDEAVGALHLIQAAKLGNAKAQSILRTAGTA LGLGVVNILHTMNPSLVILSGVLASHYIHIVKDVIRQQALSSVQDVDVVVSDLVDPALLG AASMVLDYTTRRIY >gi568815589f:36069503_36271266|GENSCAN_predicted_CDS_4|2745_bp atggaagagaaccgtagaacccagcgactagtgttcatctcaattaggacgaacccaggc acttgccgtgcaggaacaacggcaagccttcagcccgatcgggagcagcagtgggcgcct cgctggatcaggagcacagcagacaccctgccagatccggaggggtggaagtcagcggcg ggtctgcgactgcggcaaacagcagtggtggacaacgagcgaaagcttagcttgagcggt aacaaacacgcagtcacggcaagggttaacgtcagggaccgctgtggggtggccgcgcta cacgacagtatagttgcggtcctggtttatgaaataactgagggaacaagaggcgcaaga aatccctccttgggtgcaagaccaaaacaactacccagcgggaagactcgggcttcagtg cgtgtgtcgccagtggaggaggacgcttcggggcgggagcacaagctggcaggacagccc cgcagcagctccagcgcggcagaggccatggaagatggtccgctggtcagcaccctgcct gcgcctcaaaataccgccgggaaggaactctattttaagaacctctcaaaacgaaacaag caaatcatggagaagaatggaaataaccgaaagctgcgggtttgtgttgctacttgtaac cgtgcagattattctaaacttgccccgatcatgtttggcattaaaaccgaacctgagttc tttgaacttgatgttgtggtacttggctctcacctgatagatgactatggaaatacatat cgaatgattgaacaagatgactttgacattaacaccaggctacacacaattgtgagggga gaagatgaggcagccatggtggagtcagtaggcctggccctagtgaagctgccagatgtc cttaatcgcctgaagcctgatatcatgattgttcatggagacaggtttgatgccctggct ctggccacatctgctgccttgatgaacatccgaatccttcacattgaaggtggggaagtc agtgggaccattgatgactctatcagacatgccataacaaaactggctcattatcatgtg tgctgcacccgcagtgcagagcagcacctgatatccatgtgtgaggaccatgatcgcatc cttttggcaggctgcccttcctatgacaaacttctctcagccaagaacaaagactacatg agcatcattcgcatgtggctaggtgatgatgtaaaatctaaagattacattgttgcacta cagcaccctgtgaccactgacattaagcattccataaaaatgtttgaattaacattggat gcacttatctcatttaacaagcggaccctagtcctgtttccaaatattgacgcagtacag gactccctccggaaaatggattatcgaagccttaagccaaagaggggctttcaactgaat cgaaacctgaatcttggaatagtagtggtgccttctgggagcaaagagatggttcgagtg atgcggaagaagggcattgagcatcatcccaactttcgtgcagttaaacacgtcccattt gaccagtttatacagttggttgcccatgctggctgtatgattgggaacagcagctgtggg gttcgagaagttggagcttttggaacacctgtgatcaacctgggaacacgtcagattgga agagaaacaggggagaatgttcttcatgtccgggatgctgacacccaagacaaaatattg caagcactgcaccttcagtttggtaaacagtacccttgttcaaagatatatggggatgga aatgctgttccaaggattttgaagtttctcaaatctatcgatcttcaagagccactgcaa aagaaattctgctttcctcctgtgaaggagaatatctctcaagatattgaccatattctt gaaactctaagtgccttggccgttgatcttggcgggacgaacctccgagttgcaatagtc agcatgaagggtgaaatagttaagaagtatactcagttcaatcctaaaacctatgaagag aggattaatttaatcctacagatgtgtgtggaagctgcagcagaagctgtaaaactgaac tgcagaattttgggagtagacaatgatggcaactgtgctgccctggcggaaaggaaattt ggccaaggaaagggactggaaaactttgttacacttatcacaggcacaggaatcggtggt ggaattatccatcagcatgaattgatccacggaagctccttctgtgctgcagaactgggc caccttgttgtgtctctggatgggcctgattgttcctgtggaagccatgggtgcattgaa gcatacgcctctggaatggccttgcagagggaggcaaaaaagctccatgatgaggacctg ctcttggtggaagggatgtcagtgccaaaagatgaggctgtgggtgcgctccatctcatc caagctgcgaaacttggcaatgcgaaggcccagagcatcctaagaacagctggaacagct ttgggtcttggggttgtgaacatcctccataccatgaatccctcccttgtgatcctctcc ggagtcctggccagtcactatatccacattgtcaaagacgtcattcgccagcaggccttg tcctccgtgcaggacgtggatgtggtggtttcggatttggttgaccccgccctgctgggt gctgccagcatggttctggactacacaacacgcaggatctactag