GENSCAN 1.0    Date run: 6-Nov-116    Time: 22:05:10

Sequence gi568815579r:50052612_50263194 : 210583 bp : 48.82% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.23 PlyA -   2316   2311    6                               1.05
 1.22 Term -   7388   6105 1284  2  0   59   42  1657 0.968 149.80
 1.21 Intr -   7981   7869  113  2  2   76   75    93 0.953   6.90
 1.20 Intr -  13773  13675   99  1  0   97   94    43 0.736   5.88
 1.19 Intr -  19749  19588  162  1  0  115   95     2 0.216   3.55
 1.18 Intr -  24351  24167  185  2  2   91   28    67 0.013   0.43
 1.17 Intr -  41001  40810  192  2  0   81   53   112 0.066   5.61
 1.16 Intr -  55536  55345  192  2  0   81   53   112 0.318   5.61
 1.15 Intr -  60890  60699  192  1  0   81   53   112 0.395   5.61
 1.14 Intr -  66240  66049  192  2  0   81   53   112 0.363   5.61
 1.13 Intr -  71592  71401  192  2  0   81   50   112 0.323   5.31
 1.12 Intr -  76927  76736  192  0  0   81   50   112 0.456   5.31
 1.11 Intr -  81377  81204  174  1  0  107   61    27 0.486   0.95
 1.10 Intr -  82261  82070  192  0  0   89   53   112 0.172   6.41
 1.09 Intr -  86750  86577  174  1  0  107   61    49 0.045   3.15
 1.08 Intr -  90330  90284   47  0  2   64  -14   109 0.021  -4.49
 1.07 Intr -  91469  91416   54  2  0    6  109    70 0.131   0.08
 1.06 Intr -  95651  95522  130  1  1   18   94   136 0.084   7.90
 1.05 Intr - 102115 101989  127  2  1   44   80   244 0.738  18.94
 1.04 Intr - 105737 105657   81  0  0   90   99    40 0.956   4.91
 1.03 Intr - 106969 106883   87  2  0  113   56    62 0.975   5.44
 1.02 Intr - 110202 110128   75  1  0  122   97    59 0.995   9.69
 1.01 Init - 110583 110352  232  0  1   97   65   474 0.970  42.42
 1.00 Prom - 111327 111288   40                              -7.76

 2.00 Prom + 112263 112302   40                              -5.96
 2.01 Init + 117815 117877   63  1  0   73   79    59 0.242   4.65
 2.02 Intr + 135556 135696  141  0  0   78  115    78 0.430  10.05
 2.03 Intr + 141518 141622  105  1  0   16   94    62 0.026   0.01
 2.04 Intr + 146853 146878   26  2  2  111   32    27 0.062  -3.78
 2.05 Intr + 147480 147558   79  0  1   67   99    75 0.127   6.05
 2.06 Intr + 151031 151060   30  1  0  117  117    11 0.590   5.23
 2.07 Intr + 152791 152943  153  0  0   63   68    43 0.293   0.07
 2.08 Intr + 157752 158159  408  2  0  139   83   640 0.869  63.06
 2.09 Intr + 165004 165160  157  0  1  105   96   333 0.999  35.38
 2.10 Intr + 170472 170499   28  1  1  120   94     9 0.998   1.87
 2.11 Intr + 170636 170738  103  2  1   99   79    91 0.998   9.48
 2.12 Intr + 172974 173066   93  2  0  108   69   173 0.999  17.56
 2.13 Intr + 174292 174378   87  0  0  103  -10   182 0.158  10.07
 2.14 Intr + 177892 178012  121  0  1   73   86   236 0.870  21.97
 2.15 Intr + 179319 179459  141  1  0  113   65   220 0.550  22.52
 2.16 Intr + 191631 191726   96  1  0  103   84   130 0.605  14.08
 2.17 Intr + 194393 194511  119  0  2   77   96   103 0.626  10.18
 2.18 Intr + 196376 196528  153  1  0   69   36   357 0.995  28.87
 2.19 Intr + 197015 197212  198  1  0  114  105   523 0.919  56.15
 2.20 Intr + 197904 198077  174  2  0   61   97   272 0.999  25.54
 2.21 Intr + 200028 200142  115  2  1   80   89   179 0.995  17.22
 2.22 Intr + 204688 204875  188  2  2   86   96   261 0.997  26.11
 2.23 Intr + 206533 206654  122  0  2   92   89   270 0.999  26.79
 2.24 Intr + 208035 208104   70  0  1  117  101   140 0.993  17.38
 2.25 Intr + 208864 209024  161  0  2  119   80   320 0.928  33.09

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 90330 90196 135 0 0 64 42 115 0.837 2.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815579r:50052612_50263194|GENSCAN_predicted_peptide_1|1455_aa MPLALTLLLLSGLGAPGGWGCLQCDPLVLEALGHLRSALIPSRFQLEQLQARAGAVLMGM EGPFFRDYALNVFVGKVETNQLDLVASFVKNQTQHLMGNSLKDEPLLEELVTLRANVIKE FKKVLISYELKGLLKEEVLDCLHCQRITPKCIHKKYCFVDRQPRVALQYQMDSKYPRNQA LLGILISVSLAVFVFVVIVVSYRDLQFRDRDCSGAFRDPAEEGKAWLSSGIRLDARNSRT RPEQAHAQNPAMANGTLMAEGQESVTFKDVAVDFTLEEGRHGTQGPHWPWGLGDDRGEPL VIGSLTLRKIKGPTGQPRHPEPLWLRPVARALGPSGHTGLLTVPVAAGMCPHRASVLAVP TARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVWGRHGTQGPHWPWGLGDVRGE PLVIGSLTLRKIKGPTGQPRHPEPLWLRPVARALGPSGHTGLLTVPVAAGMCPHRASVLA VPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVRHTGLLTVPVAAGMCPHRA SVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVRHTGLLTVPVAAGMC PHRASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVCHTGLLTVPVA AGMCPHRASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVCHTGLLT VPVAAGMCPHRASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDVVCHT GLLTVPVAAGMCPHRASVLAVPTARTPSCTICTVQYGRLWPHMATEQLNMAGPNQDFQDV VWQSCTPPWELEMTDVHYPDRFATRGSHTAGFWQERNKENYSEEEKRGTQKGRPRSTLPD LDARPLRPQPCPPTPSLPSPRPQPCPGLVLPLHLSTAPLLVSLPTFLDLSFSVFMFPWAG LPGSKSDVISRVDRGEELRQEREARRVTCPDSETSSVTIRETPRKKSISEDSASPVEGPL RAMLGDAQHQRIHTGEKPYKGKDCGKAFTQSSSLIKHQRCHTGEKPYKCGQCGKFYSQVS HLTRHQKIHTGEKPYQCGECGKAFCHTSSLTQHQTTHTGEKPYKCNKCGKTFSHSSSLTE HQRVHTGEKPYECTDCGKAFSHSSSLTQHQRIHTGEKPYACHECGKAYTQISHLMRHQST HVGEKPYVCNECGKAFSHTSSFTQHQTIHTGEKPYKCTECGKTFSQNSSLTHHQRIHTGE KPYECKVCGKAYTQISHLIQHQRIHTGERPYEGRECGKAFSQSTHLIEHQKIHTGEKPYK CKECGKTFSHNSSLTQHQKIHTGEKPYACKECGKAFNQSIHLIQHQRIHTGEKPYKCSDC GRTYTQISHLLQHQKVHTGSKRYTCEECGEGFSWSSHLTEHQRLHTGQNAYICDDFEKAF AWGTQLADHQRTHAD >gi568815579r:50052612_50263194|GENSCAN_predicted_CDS_1|4368_bp atgcctctggctttgacccttctgctgctctcgggcttgggcgcccccggaggctggggc tgcctgcagtgcgaccccttggtgctggaggccctgggtcacctgcgctccgccctcatc cccagtcgcttccagttggagcagctgcaggcgcgcgccggggccgtgctgatgggcatg 
gaggggcctttcttccgggactacgcgctgaacgtgtttgtggggaaagtggagacaaat caactggaccttgtggcgtcctttgtcaagaaccaaacgcagcacttaatgggtaactct ctgaaagatgagcctctgctggaagagctggtgaccctcagggcgaatgtgatcaaggaa ttcaagaaagttttaatttcatatgaattaaaaggcttgctaaaagaagaggtgttggac tgtttacattgccagaggatcactcccaagtgtatccacaaaaagtactgctttgtcgac cggcaaccccgcgtggccctgcagtaccagatggacagcaaatacccgaggaaccaggcg ctgttgggcatcctcatttctgtgtctctggctgtctttgtcttcgtggtcatcgtggtc tcgtaccgggacttacagttccgagacagggactgctcaggcgcgttccgggacccggcg gaggaagggaaggcctggctgtccagcggaatccgcctggacgcacggaactcgcggacc cgaccggagcaggcacatgcccagaacccagcaatggccaatgggaccttgatggccgag ggccaggaatcagtgaccttcaaggatgtggcggtggacttcaccctggaggaggggcgc catggaacgcagggccctcactggccctggggactgggtgacgacaggggggagcctctg gtgattggctccctcaccctgcgtaagatcaaagggcctacaggacagccccgacacccg gagccattgtggctccggccggttgcgcgggccctcggaccctcaggccacacggggctc ctcactgttcccgtagcagcaggcatgtgcccccacagggcctctgtactggctgttccc actgcccgaacaccctcatgcaccatctgcactgtccaatacggccgcctctggccacac atggctactgagcagttgaacatggctggtccaaaccaagatttccaagacgtcgtatgg gggcgccatggaacgcagggccctcactggccctggggactgggtgacgtcaggggtgag cctctggtgattggctccctcaccctgcgtaagatcaaagggcctacaggacagccccga cacccggagccattgtggctccggccggttgcgcgggccctcggaccctcaggccacacg gggctcctcactgttcccgtagcagcaggcatgtgcccccacagggcctctgtactggct gttcccactgcccgaacaccctcatgcaccatctgcactgtccaatacggccgcctctgg ccacacatggctactgagcagttgaacatggctggtccaaaccaagatttccaagacgtc gtacgccacacggggctcctcactgttcccgtagcagcaggcatgtgcccccacagggcc tctgtactggctgttcccactgcccgaacaccctcatgcaccatctgcactgtccaatac ggccgcctctggccacacatggctactgagcagttgaacatggctggtccaaaccaagat ttccaagacgtcgtacgccacacggggctcctcactgttcccgtagcagcaggcatgtgc ccccacagggcctctgtactggctgttcccactgcccgaacaccctcatgcaccatctgc actgtccaatacggccgcctctggccacacatggctactgagcagttgaacatggctggt ccaaaccaagatttccaagacgtcgtatgccacacggggctcctcactgttcccgtagca gcaggcatgtgcccccacagggcctctgtactggctgttcccactgcccgaacaccctca tgcaccatctgcactgtccaatacggccgcctctggccacacatggctactgagcagttg 
aacatggctggtccaaaccaagatttccaagacgtcgtatgccacacggggctcctcact gttcccgtagcagcaggcatgtgcccccacagggcctctgtactggctgttcccactgcc cgaacaccctcatgcaccatctgcactgtccaatacggccgcctctggccacacatggct actgagcagttgaacatggctggtccaaaccaagatttccaagacgtcgtatgccacacg gggctcctcactgttcccgtagcagcaggcatgtgcccccacagggcctctgtactggct gttcccactgcccgaacaccctcatgcaccatctgcactgtccaatacggccgcctctgg ccacacatggctactgagcagttgaacatggctggtccaaaccaagatttccaagacgtc gtatggcaaagctgcaccccaccatgggaactggaaatgacagatgttcactacccagac cgctttgcaactcggggtagccacacagccgggttctggcaagagagaaataaggagaat tattctgaggaggaaaaaaggggaacacagaaggggaggccccggtctacactgcctgat ttagatgccagacccctcaggccacagccctgccctcctacaccctccttaccatcccca cggccccagccctgcccaggcctcgtcttgcccctgcacctcagcacagcccccttgctg gtttcacttcccaccttcctggacctgtcgttctccgtctttatgttcccctgggcaggg ctgcctgggtccaaatccgacgtgatctcccgtgtggatcgaggggaagagctgaggcag gagagagaggcccggcgagttacctgcccagactcggagacaagctctgtgactattcga gagacacctcgaaaaaagagcatttctgaagattcagcttccccagtagagggaccttta agggccatgcttggagatgcccagcatcagaggattcacactggagagaagccctacaaa ggcaaggactgcgggaaggccttcacacagagctcatccctcatcaaacaccagcggtgc cacaccggggagaagccctataaatgcggccagtgtgggaagttctactcgcaggtctcc cacctcacccgccaccagaaaatccacacgggggagaagccctaccaatgtggagagtgc ggcaaggccttctgtcacacctcctccctgacccaacaccagaccacccacacgggggag aaaccctacaaatgtaacaagtgtgggaaaacgttcagccacagctcatccctgactgag caccagcgagtccacactggagaaaaaccctatgagtgcactgattgtggcaaagccttc agccacagctcgtccctgacccagcatcagcgaattcacactggcgagaagccctacgcg tgtcacgagtgcgggaaggcttacacgcagatttcccacctcatgaggcaccagagcact cacgtgggggaaaagccctatgtatgcaacgaatgtgggaaggctttcagccacacctca tcctttactcagcaccagaccatccacaccggtgagaagccctacaaatgtacagagtgt gggaaaaccttcagccagaactcctccctcacacaccaccagaggattcacacaggggag aaaccctatgaatgtaaagtgtgtgggaaagcctatacccagatctcccacctcattcaa caccagaggattcacactggagagaggccctacgagggcagagagtgtgggaaagccttc agccagagcacgcacctcattgagcaccagaagatccacactggcgagaagccctataag tgtaaggaatgtgggaaaaccttcagtcacaactcatccctcactcaacatcagaagatt 
cacaccggcgagaagccctacgcctgcaaggaatgtgggaaggccttcaaccagagcatc cacttaatccaacaccaaaggattcacactggagagaagccttacaagtgtagtgactgt gggagaacctatacccagatctcacaccttctccaacatcagaaggtccacactggcagc aaacgctacacatgtgaggagtgtggagagggtttcagctggagttcacacctgactgaa caccagaggcttcatactggccagaatgcctacatctgtgatgattttgagaaagccttt gcttggggcacacagcttgctgatcatcagagaacccatgctgattag >gi568815579r:50052612_50263194|GENSCAN_predicted_peptide_2|1044_aa MALSPSTDEDDEVVQLAPNRTPAFHSGGQMSLKARGTPAAMPGSKEPEANQRWSRSLRTP RPSEGDRQPPKMLGLHGMSRHIRSDSGYILKVTLTFADKLDVELIPTGPTEGLSRIDDIS NYEVNLEPGGHDDITSCQAEASGRPWKPPGRCWVTSEPGSRPVTRRGPALRWTAEVAAPE ENGDSRALTPSPRSGRRPQTMAAVTMSVPGRKAPPRPGPVPEAAQPFLFTPRGPSAGGGP GSGTSPQVEWTARRLVWVPSELHGFEAAALRDEGEEEAEVELAESGRRLRLPRDQIQRMN PPKFSKAEDMAELTCLNEASVLHNLRERYYSGLIYTYSGLFCVVINPYKQLPIYTEAIVE MYRGKKRHEVPPHVYAVTEGAYRSMLQDREDQSILCTGESGAGKTENTKKVIQYLAHVAS SPKGRKEPGVPGELERQLLQANPILEAFGNAKTVKNDNSSRFGKFIRINFDVAGYIVGAN IETCILSQPMGHLDSLCPDLLEKSRAIRQAKDECSFHIFYQLLGGAGEQLKADLLLEPCS HYRFLTNGPSSSPGQERELFQETLESLRVLGFSHEEIISMLRMVSAVLQFGNIALKRERN TDQATMPDNTAAQKLCRLLGLGVTDFSRALLTPRIKVGRDYVQKAQTKEQADFALEALAK ATYERLFRWLVLRLNRALDRSPRQGASFLGILDIAGFEIFQLTCPASLQLNSFEQLCINY TNEKLQQLFNHTMFVLEQEEYQREGIPWTFLDFGLDLQPCIDLIERPANPPGLLALLDEE CWFPKATDKSFVEKVAQEQGGHPKFQRPRHLRDQADFSVLHYAGKVDYKANEWLMKNMDP LNDNVAALLHQSTDRLTAEIWKDVEGIVGLEQVSSLGDGPPGGRPRRGMFRTVGQLYKES LSRLMATLSNTNPSFVRCIVPNHEKRAGKLEPRLVLDQLRCNGVLEGIRICRQGFPNRIL FQEFRQRYEILTPNAIPKGFMDGKQACEKMIQALELDPNLYRVGQSKIFFRAGVLAQLEE ERDLKVTDIIVSFQAAARGYLARS >gi568815579r:50052612_50263194|GENSCAN_predicted_CDS_2|3132_bp atggcattgtccccatctacagatgaggatgatgaggttgtacaacttgccccaaatcgg acacctgcattccattcgggcggacagatgtccctgaaggcccgagggacaccagccgct atgccaggctccaaagaacccgaggcaaaccaacgctggtcccggtctttgaggactccc cggcccagtgagggagaccgacagcctcccaaaatgctgggattacatggcatgagccgc cacatccgatcagattctggatatattttgaaggtcacattgacatttgctgataaactg gatgtagagctgatacccacaggacccacagagggactgagcagaattgatgacatctcc aattatgaggtcaatttagaacctggtgggcatgatgatatcacaagttgccaggccgaa 
gcctcgggacggccctggaagccgccgggcaggtgctgggtgacatcagagccgggctcc cgccctgtgacgcgacggggccccgccctgcgctggaccgcggaggtcgctgcacctgaa gagaacggggattcccgcgccctcaccccatctccccgctccggtcggcgcccccagacc atggcagccgtgaccatgtcggtgcccgggcggaaggcgccccccaggccgggcccagtg cccgaggcggcccagccgttcctgttcacgccccgcgggcccagcgcgggtggcgggcct ggctcgggcacctccccgcaggtggagtggacggcccggcgtctcgtgtgggtgccttcg gagcttcacgggttcgaggcggcggcgctgcgggacgaaggcgaggaggaggcggaggtg gagctggcggagagcgggaggcggctgcgactgccgcgggaccagatccagcgcatgaac ccgcccaagttcagcaaggccgaggacatggccgagctgacctgcctcaacgaggcctcg gtcctgcacaacctccgggagcggtactactccggcctcatctacacgtactccggcctt ttctgtgtggtcatcaacccgtacaagcagcttcccatctacacagaagccattgtggag atgtaccggggcaagaagcgccacgaggtgccaccccacgtgtacgcagtgaccgagggg gcctatcggagcatgctgcaggatcgtgaggaccagtccattctctgcactggagagtct ggagctgggaagacggaaaacaccaagaaggtcatccagtacctcgcccacgtggcgtcg tctccaaagggcaggaaggagccgggtgtccccggtgagctggagcggcagctgcttcag gccaaccccatcctagaggcctttggcaatgccaagacagtgaagaatgacaactcctcc cgattcggcaaattcatccgcatcaactttgatgttgccgggtacatcgtgggcgccaac attgagacctgtatcctctcacagcccatggggcaccttgactcgctgtgtccagacctg ctggagaagtcgcgggccatccgccaggccaaggacgagtgcagcttccacatcttctac cagctgctggggggcgctggagagcagctcaaagccgacctcctcctcgagccctgctcc cactaccggttcctgaccaacgggccgtcatcctctcccggccaggagcgggaactcttc caggagacgctggagtcgctgcgggtcctgggattcagccacgaggaaatcatctccatg ctgcggatggtctcagcagttctccagtttggcaacattgccttgaagagagaacggaac accgatcaagccaccatgcctgacaacacagctgcacagaagctctgccgcctcttggga ctgggggtgacggatttctcccgagccttgctcacccctcgcatcaaagttggccgagac tatgtgcagaaagcccagactaaggaacaggctgacttcgcgctggaggccctggccaag gccacctacgagcgcctcttccgctggctggttctgcgcctcaaccgggccttggaccgc agcccccgccaaggcgcctccttcctgggcatcctggacatcgcgggctttgagatcttc cagctcacgtgtcctgcgtccctgcagctgaactccttcgagcagctctgcatcaactac accaacgagaagctgcagcagctcttcaaccacaccatgttcgtgctggagcaggaggag taccagcgtgagggcatcccctggaccttcctcgactttggcctcgacctgcagccctgc atcgacctcatcgagcggccggccaacccccctggactcctggccctgctggatgaggag 
tgctggttcccgaaggccacagacaagtcgtttgtggagaaggtagcccaggagcagggc ggccaccccaagttccagcggccgaggcacctgcgggatcaggccgacttcagtgttctc cactacgcgggcaaggtcgactacaaggccaacgagtggctgatgaaaaacatggaccct ctgaatgacaacgtcgcagccttgctccaccagagcacagaccggctgacggcagagatc tggaaagacgtggagggcatcgtggggctggaacaggtgagcagcctgggcgacggccca ccaggtggccgcccccgtcggggtatgttccggacagtgggacagctctacaaggagtcc ctgagccgcctcatggccacactcagcaacaccaaccccagttttgtccgctgcattgtc cccaaccacgagaagagggccgggaagctggagccacggctggtgctggaccagcttcgc tgcaacggggtcctggagggcatccgcatctgtcgccagggcttccccaaccgcatcctc ttccaggagttccggcagcgatacgagatcctgacacccaatgccatccccaagggcttc atggatgggaagcaggcctgtgaaaagatgatccaggcgctggaactggaccccaacctc taccgcgtgggacagagcaagatcttcttccgggctggggtcctggcccagctggaagag gagcgagacctgaaggtcaccgacatcatcgtctccttccaggcagctgcccggggatac ctggctcgcagn