GENSCAN 1.0	Date run: 3-Nov-116	Time: 21:26:28

Sequence gi568815583r:66236961_66452947 : 215987 bp : 44.45% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +  15506  15603   98  1  2   89   57   141 0.944  11.04
 1.02 Intr +  16092  16215  124  0  1   40   73   104 0.975   4.69
 1.03 Intr +  18139  18324  186  0  0   78   33   130 0.806   6.39
 1.04 Term +  34112  34240  129  1  0  -29   29   379 0.999  18.48
 1.05 PlyA +  34460  34465    6                               1.05

 2.03 PlyA -  34870  34865    6                               1.05
 2.02 Term -  42083  41959  125  0  2   63   55   103 0.858   3.05
 2.01 Init -  43173  43116   58  0  1   78   60    34 0.509   1.07
 2.00 Prom -  46136  46097   40                              -2.36

 3.00 Prom +  55972  56011   40                              -5.96
 3.01 Init +  56637  56775  139  2  1  103   49   245 0.974  22.40
 3.02 Intr +  58028  58181  154  0  1   85   59    44 0.888   0.33
 3.03 Intr +  69864  69992  129  0  0   79   80    74 0.962   5.51
 3.04 Intr +  71749  71884  136  1  1   90   63   116 0.961   9.87
 3.05 Intr +  74764  74940  177  0  0   59   94   104 0.917   8.22
 3.06 Intr +  77079  77157   79  2  1   39   31    71 0.569  -4.38
 3.07 Intr +  78076  78255  180  2  0   52   99   213 0.835  18.64
 3.08 Intr +  81489  81658  170  1  2   77   68    70 0.954   3.57
 3.09 Intr +  83611  83772  162  0  0  109   71    93 0.995   9.87
 3.10 Intr +  85727  85974  248  1  2   75   90   228 0.984  17.96
 3.11 Intr +  88871  89404  534  2  0   64   18   573 0.967  39.94
 3.12 Intr +  92010  92164  155  0  2  100   20   186 0.766  12.72
 3.13 Intr +  92261  92439  179  0  2   71   71    89 0.915   5.14
 3.14 Intr +  94915  95060  146  0  2   78   95    65 0.497   5.28
 3.15 Term +  96091  96352  262  1  1   29   54   136 0.450  -0.70
 3.16 PlyA +  96821  96826    6                               1.05

 4.06 PlyA -  96973  96968    6                               1.05
 4.05 Term - 100221  99998  224  1  2   62   38   181 0.975   7.58
 4.04 Intr - 104396 104190  207  0  0   85   54    60 0.642   1.35
 4.03 Intr - 112163 112100   64  2  1   84   95    29 0.964   1.49
 4.02 Intr - 112477 112355  123  1  0   82  111    41 0.821   6.58
 4.01 Init - 115881 115855   27  0  0   89   95    34 0.562   3.66
 4.00 Prom - 116689 116650   40                              -4.16

 5.08 PlyA - 117940 117935    6                               1.05
 5.07 Term - 142915 142452  464  2  2   58   41   436 0.821  31.12
 5.06 Intr - 148396 148298   99  2  0   -2  110    73 0.519   0.58
 5.05 Intr - 149852 149690  163  2  1  104   77    91 0.570   9.15
 5.04 Intr - 150460 150258  203  2  2   83   61    48 0.196   0.60
 5.03 Intr - 153591 153541   51  1  0   71   99    20 0.138   0.28
 5.02 Intr - 157389 157326   64  0  1   40   97    54 0.146  -0.21
 5.01 Init - 171357 171295   63  0  0   65   93    35 0.452   2.85
 5.00 Prom - 172904 172865   40                              -5.56

 6.00 Prom + 173068 173107   40                              -2.36
 6.01 Init + 195756 195764    9  2  0   81  116    10 0.779   3.04
 6.02 Intr + 197978 198277  300  1  0   86  113   322 0.871  31.43
 6.03 Intr + 199786 199932  147  0  0  112   80   244 0.999  26.43
 6.04 Intr + 207696 207747   52  2  1  100   94    40 0.719   4.28
 6.05 Term + 212893 214514 1622  2  2   19   49   548 0.148  34.11
 6.06 PlyA + 215325 215330    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  48743  48841   99  1  0   28   38   156 0.973   2.73

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583r:66236961_66452947|GENSCAN_predicted_peptide_1|178_aa
MTTRETVTLASQRGLRGTDPETRLPLDGARESSKSVHGTCWGLAAAGRIAGRRRKPAASP
QALPPDPGGEGAGAPSGVGREERRGVSVPMESTLRPDTTRILIVPPLISGARNFCGTQPE
AETEVQPEAEPLLSPWEEEEEEEEDEKEEEEEEEEEEEEEEEEEEEEEEEEEENSALA

>gi568815583r:66236961_66452947|GENSCAN_predicted_CDS_1|537_bp
atgaccactcgggagacagtcacactggcctcgcagcgcgggctgcggggcaccgatccc
gagacgcggctgccgctggatggagcccgcgagtccagcaaaagcgtgcatggcacttgc
tggggcttagcggccgcggggcgcatcgccggccgccgccgaaaacctgctgcgtccccg
caggctctgcctcccgaccccggcggggaaggcgccggtgcacccagtggagtcgggcgg
gaggagagacgtggggtctccgttcccatggagtccacgctgcggcccgacaccaccagg
attctcatagttccacccctaatttctggggccagaaatttctgtggtacacaaccagaa
gcagaaacagaagtacaaccagaagcagaacctctgctcagcccctgggaagaagaagaa
gaagaggaggaggacgagaaggaggaagaggaagaggaagaagaagaagaagaggaagag
gaagaagaagaagaagaggaagaagaagaggaagaagaaaattcagcattagcttag

>gi568815583r:66236961_66452947|GENSCAN_predicted_peptide_2|60_aa
MESVASMGAKAFWKSVKNVQTAVIQFQSHFLNCRLVVSTCMCSSLYSCPKDSTPVPDRIY

>gi568815583r:66236961_66452947|GENSCAN_predicted_CDS_2|183_bp
atggagagtgttgcctcaatgggggctaaagccttttggaagtcagtgaagaatgttcag
acagcagtgatccagttccagtcccacttcctcaactgccggctggtggtttccacctgc
atgtgctcatctctttactcatgccctaaggactcaacaccagtgcctgataggatctac
tga

>gi568815583r:66236961_66452947|GENSCAN_predicted_peptide_3|949_aa
MLQKREKVLLLRTFQGRTLRIVREHYLRPCVPCHSPLCPQPAACSHDGKLLSSDVTHYVI
PDWKVVQDYLEILEFPELKGIIFMQTACQAVQHQRGRRQYNKLRNLLKDARHDCILFANE
FQQCCYLPRERGESMEKWQTRSIYNAAVWYYHHCQDRMPIVMVTEDEEAIQQYGSETEGV
FVITFKNYLDNFWPDLKAAHELCDSILQSRRERENESQESHGKEYPEHLPLEVLEAGIKS
GRYIQGILNVNKHRAQIEAFVRLQGASSKDSDLVSDILIHGMKARNRSIHGDVVVVELLP
KNEWKGRTVALCENDCDDKASGESPSEPMPTGRVVGILQKNWRDYVVTFPSKEEVQSQGK
NAQKILVTPWDYRIPKIRISTQQAETLQDFRVVVRIDSWESTSVYPNGHFVRVLGRIGDL
EGEIATILVENSISVIPFSEAQMCEMPVNTPESPWKVSPEEEQKRKDLRKSHLVFSIDPK
GCEDVDDTLSVRTLNNGNLELGVHIADVTHFVAPNSYIDIEARTRYAVSIMWELDKASYE
IKKVWYGRTIIRSAYKLFYEAAQELLDGNLSVVDDIPEFKDLDEKSRQAKLEELVWAIGK
LTDIARHVRAKRDGCGALELEGVEVCVQLDDKKNIHDLIPKQPLEVHETVAECMILANHW
VAKKIWESFPHQALLRQHPPPHQEFFSELRECAKAKGFFIDTRSNKTLADSLDNANDPHD
PIVNRLLRSMATQAMSNALYFSTGSCAEEEFHHYGLALDKYTHFTSPIRRYSDIVVHRLL
MAAISKDKKMEIKGNLFSNKDLEELCRHINNRNQAAQHSQKQSTELFQCMYFKDKDPATE
ERCISDGVIYSIRTNGVLLFIPRLEIISNKPYKIPNTELIHQSSPLLKSELVKEVTKSVE
EAQLAQEVKVNIIQEEYQEYRQTKGRSLYTLLEEIRDLALLDVSNNYGI

>gi568815583r:66236961_66452947|GENSCAN_predicted_CDS_3|2850_bp
atgctgcagaagcgggagaaggtgctgctgctgaggaccttccagggccgcacgctgcgg
atcgtgcgcgagcactacctgcggccctgcgtgccctgccacagcccgctctgcccgcag
cccgccgcctgcagccacgatgggaaactcttgtctagtgatgtgactcattacgtgatc
ccagactggaaagttgttcaagattatcttgagatccttgagtttcctgagttgaaggga
attattttcatgcagacagcttgtcaagctgtgcagcatcaaagaggcaggagacagtat
aacaaactgcgaaacctgctgaaggatgcgcgtcatgattgcattctctttgctaatgaa
ttccagcaatgctgctatctgccacgggaaagaggagagtccatggagaagtggcagacc
aggagcatatacaacgcagctgtttggtactatcatcactgccaggacaggatgccaatt
gttatggtgacagaagatgaagaggcaattcagcagtatggaagtgaaacagaaggagta
ttcgtgattactttcaagaattacctggacaatttctggcctgatttaaaagctgcccac
gagctttgtgattctatccttcagtctcgacgggagagagagaatgagagtcaggagagc
catgggaaggagtacccagaacatcttcccctggaagtgttagaagctgggattaaatct
ggacgctatatccagggaattctgaatgtcaacaaacacagagcccaaatagaagctttt
gttcgacttcaaggagccagcagtaaagattcagatttagtcagtgacatcctaatccac
gggatgaaggctcgaaaccgctcaattcatggagatgtggtagttgtggagctgcttcct
aaaaatgaatggaaaggaagaaccgtagccctgtgtgagaatgactgtgacgacaaggct
tcgggcgagtccccaagtgagcccatgcctacaggtcgagtggtgggcatacttcagaag
aactggcgggattatgtggtgacatttccgtccaaagaagaggtccaatctcagggcaaa
aatgctcagaaaatcctggttacaccttgggattacagaattcccaaaattcgaattagc
actcagcaagcagaaaccctccaggacttcagggtggtcgtgcgcatcgattcctgggag
tcaacatctgtgtatccaaatggacattttgtgcgtgttttaggaagaatcggagatctg
gaaggggaaattgcaaccatcctggtggaaaacagtatttcagttattcctttctcagaa
gctcagatgtgtgagatgccagtaaacacaccagaaagtccctggaaggtgagtcctgaa
gaggaacaaaaacgtaaagacttgaggaaaagccatctcgtattcagcattgaccccaaa
ggttgtgaagatgtggatgacacactctcagtcagaaccttaaataatggcaacctggaa
cttggggtccacatcgcagatgtaacacactttgtggcaccaaattcttacattgatatt
gaagctagaacaaggtatgctgtaagcatcatgtgggaactggataaagcctcttatgaa
attaagaaagtgtggtatggcagaaccattattcgatcagcatacaaactgttctatgaa
gcagcccaagaactactggatggaaacttaagcgttgttgatgatattccagaattcaaa
gacttggatgagaagagcagacaagccaagctggaggagttggtgtgggcaattggaaag
ctgaccgacatagctcgccatgtcagagctaaacgagacggatgtggtgccctggaactg
gaaggggtagaggtttgcgtacagctagatgacaaaaagaacattcacgacctcatcccc
aagcagcccctggaagtccacgagacagtggctgaatgcatgatcctggccaaccactgg
gtcgccaaaaagatctgggagagcttccctcatcaggccttgctgcgccagcaccctcct
ccacaccaggagttcttttcagaactccgggaatgtgctaaagccaaaggcttcttcata
gatacacggtccaataaaacactggctgattctctggataatgcgaacgacccccacgat
cccattgtgaacaggctactgcgctccatggccacgcaggccatgtcgaatgctctgtac
ttctccaccggatcctgtgcggaggaggagttccatcattacggtcttgcattagataaa
tatacccactttacttctccaataagaagatattcagatattgtagtacaccgcttgtta
atggcagccatttcaaaagataagaaaatggaaattaagggaaatctgttcagcaacaaa
gatcttgaggaattatgcagacatatcaacaacagaaaccaagcagcacagcattctcag
aagcagtctactgagctcttccagtgcatgtacttcaaagacaaagaccctgccaccgag
gagcgttgcatatctgacggagttatttattcaattagaacaaatggtgtgcttctattt
ataccaagacttgaaataattagtaacaaaccatacaagataccaaatacagaacttatt
catcagagttcccccttgctgaagagtgagttagtgaaagaagtaactaaatctgtggaa
gaagctcagcttgcccaagaagtcaaagtaaacatcattcaggaggaatatcaagaatat
cgccaaacaaagggaaggagcctatacacacttctagaggagatacgggacctagctctc
ctggatgtttcaaacaattatggaatatga

>gi568815583r:66236961_66452947|GENSCAN_predicted_peptide_4|214_aa
MVKELSLMKAEDLKMLIRHMEHWAHRLFPKLQFEDFIDRVEYLGSKKEVQTCLKRIRLDL
PILHEDFVSNNDEVAENNEHDVTSTELDPFLTNLSESEMFASELSRSLTEEQQQRIERNK
QLALERRQAKLLSNSQTLGNDMLMNTPRAHTVEEVNTDEDQKEESNGLNEDILDNPCNDA
IANTLNEEETLLDQSFKNVQQQLDATSRNITEAR

>gi568815583r:66236961_66452947|GENSCAN_predicted_CDS_4|645_bp
atggtgaaggaactgagcctgatgaaggctgaagacttgaagatgctaatcagacacatg
gagcactgggcacataggctattccctaaactgcagtttgaggattttattgacagagtt
gaatacctgggaagtaaaaaggaagttcagacctgtttaaaacgaattcgacttgatctc
cctattttacatgaagattttgttagcaataatgatgaagttgcggagaataatgaacat
gatgtcacttctactgaattagatccctttctgacaaacttatctgaaagtgagatgttt
gcttctgagttaagtagaagcctaacagaagagcaacaacaaagaattgagagaaataaa
caactggccttggaaagaaggcaggcaaagctgctgagtaatagtcagaccctaggaaat
gatatgttaatgaatacacccagggcacacacggttgaagaggttaatactgatgaggat
caaaaggaggagtcaaatggattaaacgaagacattctggacaatccatgtaatgatgct
attgccaatactttaaatgaagaggaaacactgctggaccagtcttttaaaaatgtgcaa
cagcaacttgatgctacatccagaaatattactgaagctagataa

>gi568815583r:66236961_66452947|GENSCAN_predicted_peptide_5|368_aa
MPGVVSILSLLDKEQLKSPVTMCVPYKPGRTADAMNYTLNAISPVLPNSISELGKGYVRE
LVPLTAEPSGAGFSWMGVGFFLGILDPGNALPTPGGGQHLSLPPAALPRARCAGPSPGRG
PRGCLQRTTNGGKPRRQMTRKYVTGARRTAWGRGPSRTPWRPLSPRGKQVDPAEKKSVSL
CVSKTLGVQNKGNVYRTKETSTPGYPQPGVGDLAENVNITLKGHTVIVNGPRGTLRRDFN
HINVELSLLGKKKKEALVDKWWGNRKELATVRTICSHVQNMIKGVTLGFRYKMRSVYAYF
PIIVVIPENGSVVEIRNFLGEKYIRRVRMRPGVACSVSQAQKDELIFEGNDIELVSNSAA
LIQQATTV

>gi568815583r:66236961_66452947|GENSCAN_predicted_CDS_5|1107_bp
atgcccggtgttgtctctattctaagtctgttggataaagagcagctgaagagccccgtc
actatgtgtgttccttacaaacctggcagaactgctgatgctatgaactacaccctgaat
gcaataagtcccgtattacccaactccatctctgaactgggtaagggctatgttagagag
ctggtcccgttaactgcagagccgtcgggggccgggttcagctggatgggcgtcggcttc
ttcttgggcattttggacccgggtaacgcgcttccaactccggggggagggcagcacctc
tcgcttcctcccgctgcgctgccccgcgcccgctgcgcaggaccaagtccgggccgcggg
ccccggggctgccttcagcggacaaccaatgggggcaagcctcggcggcagatgacgcgg
aaatacgtcacgggagcgcggcgcactgcctgggggcggggtccgtcgcggacgccgtgg
cgccctctgtcgccccgaggcaagcaggtggacccagctgagaagaagtctgtgtcgctg
tgtgtgtcaaagacattaggtgtacagaacaaaggaaacgtgtacagaacaaaggaaaca
tctacacctggttatccacagccaggtgttggtgacctagcagaaaatgtcaacattact
ctgaagggacacacagttatcgtgaatggccccagaggaaccctgcggagggacttcaat
cacatcaatgtagaactcagtcttcttggaaagaaaaaaaaagaggctctggttgacaaa
tggtggggtaacagaaaagaactggctaccgttcggactatttgtagtcatgtacagaac
atgatcaagggtgttacactgggcttccgttacaagatgaggtctgtgtatgcttacttc
cccatcatcgtcgttatcccggagaatgggtctgttgttgaaatccgaaatttcttgggt
gaaaaatacatccgcagggtgcggatgagaccaggtgttgcttgttcagtatctcaagcc
cagaaagatgaattaatctttgaaggaaatgacattgagcttgtttcaaattcagcggct
ttgattcagcaagccacaacagtttaa

>gi568815583r:66236961_66452947|GENSCAN_predicted_peptide_6|709_aa
MVKTWSFLSMIGVLLWVDFSGDSIDLCSPLWNRTNLEALQKKLEELELDEQQRKRLEAFL
TQKQKVGELKDDDFEKISELGAGNGGVVFKVSHKPSGLVMARKLIHLEIKPAIRNQIIRE
LQVLHECNSPYIVGFYGAFYSDGEISICMEHMVIKGLTYLREKHKIMHRAWTTRAKLSKI
LNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAE
KAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDIPTANIIPNGQKLEAFPLKTGTRQGCP
LSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLI
SNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDLFK
ENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPMTFFTELEK
TTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNRDIDQWNRT
EPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSR
WIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTA
KETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELKQIYKKKTTPSKSG

>gi568815583r:66236961_66452947|GENSCAN_predicted_CDS_6|2130_bp
atggttaagacctggagctttctttccatgataggagtacttctttgggttgacttctct
ggtgacagtattgacttgtgctccccactttggaacaggaccaacttggaggccttgcag
aagaagctggaggagctagagcttgatgagcagcagcgaaagcgccttgaggcctttctt
acccagaagcagaaggtgggagaactgaaggatgacgactttgagaagatcagtgagctg
ggggctggcaatggcggtgtggtgttcaaggtctcccacaagccttctggcctggtcatg
gccagaaagctaattcatctggagatcaaacccgcaatccggaaccagatcataagggag
ctgcaggttctgcatgagtgcaactctccgtacatcgtgggcttctatggtgcgttctac
agcgatggcgagatcagtatctgcatggagcacatggtaataaaaggcctgacatatctg
agggagaagcacaagatcatgcacagagcctggacaacaagagcaaaactgtcaaaaatc
ctcaataaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgat
caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta
atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaa
aaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt
gatgggacgtatttcaaaataataagagctatctatgacatacccacagccaatatcata
ccgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct
ctctcaccactcctattcaacatagtgttggaagttctggccagggcaatcaggcaggag
aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagac
gacatgattgtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctgata
agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta
tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct
tcaaagagaataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaag
gagaactacaaaccactgctcaaggaaataaaagaggacacaaacaaatggaagaacatt
ccatgctcatgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatt
tacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaa
actactttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagc
caaaagaacaaagctggaggcatcacactacctgacttcaaactatactacaaggctaca
gtaaccaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaaca
gagccctcagaaataatgccacatatctacaactatctgatctttgacaaacctgagaaa
aacaagcaatggggaaaggattccctatttaataaatggtgctgggaaaactggctagcc
atatgtagaaagctgaaactggatcccttccttacaccttatacaaaaatcaattcaaga
tggattaaagatttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggc
attaccattcaggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagcaatg
gcaacaaaagccaaaattgacaaatgggatctaattaaactcaagagcttctgcacagca
aaagaaactaccatcagagtgaacaggcaacctacaacatgggagaaaattttcgcaacc
tactcatctgacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaag
aaaaaaacaaccccatcaaaaagtgggtga