GENSCAN 1.0 Date run: 5-Nov-116 Time: 12:10:07 Sequence gi568815597f:28429867_28638004 : 208138 bp : 46.44% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 8519 8564 46 1 1 65 97 35 0.128 2.94 1.02 Intr + 29219 29392 174 0 0 81 100 65 0.242 6.91 1.03 Intr + 35819 35983 165 0 0 122 82 77 0.998 10.43 1.04 Intr + 36516 36902 387 1 0 78 111 120 0.641 8.06 1.05 Intr + 43688 44285 598 0 1 101 84 221 0.724 14.60 1.06 Intr + 46241 46425 185 2 2 -12 52 264 0.870 12.23 1.07 Intr + 50585 50738 154 0 1 50 82 96 0.248 4.33 1.08 Intr + 61822 61921 100 1 1 58 77 99 0.790 5.91 1.09 Intr + 66454 66541 88 0 1 70 77 14 0.379 -1.96 1.10 Term + 66668 66696 29 0 2 98 54 64 0.525 2.24 1.11 PlyA + 67795 67800 6 1.05 2.00 Prom + 74576 74615 40 -3.96 2.01 Init + 88290 88442 153 2 0 64 100 74 0.130 4.21 2.02 Intr + 99992 100073 82 1 1 50 110 96 0.240 7.21 2.03 Intr + 101937 102124 188 1 2 76 96 103 0.981 9.31 2.04 Intr + 102305 102484 180 1 0 31 90 232 0.999 17.76 2.05 Intr + 105184 105280 97 0 1 135 94 131 0.999 17.98 2.06 Intr + 105392 105514 123 0 0 102 123 137 0.999 19.16 2.07 Intr + 106005 106160 156 1 0 129 85 194 0.999 23.18 2.08 Intr + 106396 106515 120 2 0 104 97 56 0.996 8.57 2.09 Intr + 106881 107033 153 1 0 132 101 110 0.999 16.74 2.10 Term + 107966 108141 176 0 2 125 45 188 0.999 16.12 2.11 PlyA + 108336 108341 6 1.05 3.00 Prom + 122936 122975 40 -3.76 3.01 Init + 123245 123271 27 1 0 76 86 40 0.576 0.97 3.02 Intr + 123774 123871 98 2 2 101 106 173 0.979 19.21 3.03 Intr + 130767 130866 100 0 1 102 67 27 0.991 2.11 3.04 Intr + 131480 131532 53 1 2 91 96 67 0.993 5.51 3.05 Intr + 134837 134968 132 2 0 119 69 153 0.999 16.26 3.06 Intr + 137428 137547 120 1 0 70 75 120 0.994 8.51 3.07 Intr + 141310 141472 163 1 1 79 91 241 0.999 23.48 3.08 Intr + 142001 142034 34 1 1 99 116 59 0.999 7.60 3.09 Term + 147634 147770 137 2 2 128 42 196 0.999 17.18 3.10 PlyA + 148555 148560 6 -0.45 4.05 PlyA - 148681 148676 6 1.05 4.04 Term - 148814 148765 50 0 2 91 43 12 0.674 -5.53 4.03 Intr - 149721 149667 55 0 1 106 106 52 0.851 7.35 4.02 Intr - 150115 150046 70 0 1 107 98 29 0.823 4.98 4.01 Init - 151286 151228 59 2 2 58 94 37 0.563 2.08 4.00 Prom - 151366 151327 40 -7.26 5.00 Prom + 155874 155913 40 -0.36 5.01 Init + 162646 162878 233 0 2 111 80 400 0.985 39.23 5.02 Term + 163828 164251 424 1 1 134 41 508 0.954 45.47 5.03 PlyA + 165558 165563 6 1.05 6.07 PlyA - 166301 166296 6 1.05 6.06 Term - 173708 173673 36 2 0 120 41 39 0.652 -0.26 6.05 Intr - 175594 175506 89 2 2 104 94 71 0.909 8.99 6.04 Intr - 183495 183381 115 0 1 111 77 163 0.994 17.52 6.03 Intr - 188164 188087 78 1 0 48 94 123 0.994 8.65 6.02 Intr - 192299 192105 195 2 0 82 48 183 0.661 13.31 6.01 Init - 194151 194080 72 0 0 66 87 42 0.687 2.97 6.00 Prom - 195351 195312 40 -2.56 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:28429867_28638004|GENSCAN_predicted_peptide_1|641_aa MGQADVSRPVNPDAVEEADQPTTEPGMVLDSVEAGDTTPPTKRKSKFSGFGKIFKPWKWR KKKSSDKFKETSEGGEDPGKPSDAMLKNGHTTPIGNARSSSPVQVEEEPVRLASLRKAIP EEDLKKRLGSTGSQPNSEAESVPENVPKPPLLPPKRPLSSSHEASEGQAKDATSSGGTAR FIISTSITTAPAATTAATSLAKTVNLSVTPSPAPRTLPAAPASTNTTATPSLTHMVPAKQ PPIPPPKPAHRNSNPVIAELSQAINSGTLLSKPSPPLPPKRGIPSTSVPTLESAAAITTK TPSDEREKSTCSMGSELLPMISPRSPSPPLPTHIPPEPPRTPPFPAKTFQVVPEIEFPPS LDLHQEIPQQEDQKKEVPKRILDQNFGEPHIPSRLPPLPLHIRIQQALTSPLPMTPILEG SHRAHSLLFENSDSFSEDSSTLGRTRSLPITIEMLKVPDDEEEEEQTCPSTFSEEMTPTS VIPKLPQCLREEEEKESDSDSEGPIQYRDEEDEDESYQSALANKVKRKDTLAMKLNHRPS EPELNLNSWPCKSKEEWNEIRHQIGNTLIRKILRFNEYVEVTDAQDYDRRADKPWTKLTP ADKCWDYRCEPLRPAEKSFKKEADEEYQMQQKATIVHDAKG >gi568815597f:28429867_28638004|GENSCAN_predicted_CDS_1|1926_bp atgggacaagctgatgtctccagaccggtaaatccagatgcagttgaggaagcagaccag cccactacagagccaggcatggtcctggacagtgtggaagcaggagacacaacacctcct accaaaaggaagagcaagttctcaggctttggcaagatcttcaagccctggaaatggagg aaaaaaaaaagtagtgataaatttaaagagacttcagaaggtggtgaggatccaggaaag ccaagcgatgccatgttaaagaatggccataccacccccatagggaatgccagatcatct agtccagtccaagtagaggaagagccagtaagattagcaagtcttaggaaagctattcca gaagaggacctaaagaaacgactaggctcaactggaagccagcctaattctgaagcagag tctgttcctgagaatgtacccaaaccacctttacttcctcccaaaagacccttgtcctct tctcatgaagcaagtgaagggcaagcaaaggatgccacttcctctggcggcacggcaagg ttcatcatctccacctccatcaccacagcacccgctgccaccactgctgccacaagcctt gcaaagactgttaatctctctgtcaccccttccccagcacccaggactctgcctgctgct cctgccagcactaacactactgctaccccaagcctcactcatatggtccctgccaagcag ccccctatccctccccctaaaccagctcacagaaatagcaaccctgtcattgctgaactg tcccaagcaataaacagtggtacattgttatcaaaaccgtccccacccttaccacctaag agaggcattccatcaacctcagtacccaccttggagtctgctgctgccatcaccacaaaa acaccaagtgatgaaagagagaagagcacgtgttctatgggctcggaactactaccaatg atctcacctcgctctccgtcccccccactgcctactcatatacctccagagcctccacgc acccctccattccctgctaagacttttcaagttgtgccagaaattgagtttccaccatcc ttagatctacaccaggagattccccagcaggaagatcagaaaaaggaagtccccaagagg atactggaccagaactttggggagccccatataccctctaggctgcctccactcccactg catattcgaatccagcaggccctcaccagcccacttcccatgactcctattctggagggt tctcacagagctcattcgttgctttttgaaaacagtgacagcttttctgaggacagcagt acgctgggtcggaccaggtctcttcccatcactattgaaatgctaaaagttccagacgat gaagaagaagaggagcaaacctgtccatccacattcagtgaagaaatgacacctacctca gtcattcctaaattaccacagtgtctacgggaggaagaagagaaggagagcgactctgat tcagaaggtcccattcagtaccgagatgaagaagatgaagatgaaagctatcagagtgct ctcgccaacaaagtgaagaggaaagacacactggcaatgaagttgaaccacagacccagt gaaccagagttgaacctgaattcttggccttgtaaaagcaaggaggagtggaatgaaata cggcaccagattggaaacacactgatccggaagattctgaggtttaatgaatatgtagag gtaacagatgctcaagattatgaccggcgagccgacaaaccttggaccaaactgacccct gctgacaagtgctgggattacaggtgtgaaccattgcgcccagccgagaagagttttaag aaggaagcagatgaggagtatcagatgcaacagaaagctaccatcgtccatgatgccaaa ggttga >gi568815597f:28429867_28638004|GENSCAN_predicted_peptide_2|475_aa MQILGWPPGAAGLFVFLSHPGRGQRPAERMLWGSSRPGGERRSSLWPTCTKDRKMSPKRI AKRRSPPADAIPKSKKVKVSHRSHSTEPGLVLTLGQGDVGQLGLGENVMERKKPALVSIP EDVVQAEAGGMHTVCLSKSGQVYSFGCNDEGALGRDTSVEGSEMVPGKVELQEKVVQVSA GDSHTAALTDDGRVFLWGSFRDNNGVIGLLEPMKKSMVPVQVQLDVPVVKVASGNDHLVM LTADGDLYTLGCGEQGQLGRVPELFANRGGRQGLERLLVPKCVMLKSRGSRGHVRFQDAF CGAYFTFAISHEGHVYGFGLSNYHQLGTPGTESCFIPQNLTSFKNSTKSWVGFSGGQHHT VCMDSEGKAYSLGRAEYGRLGLGEGAEEKSIPTLISRLPAVSSVACGASVGYAVTKDGRV FAWGMGTNYQLGTGQDEDAWSPVEMMGKQLENRVVLSVSSGGQHTVLLVKDKEQS >gi568815597f:28429867_28638004|GENSCAN_predicted_CDS_2|1428_bp atgcagatcctgggatggccgccaggggccgccgggctctttgttttcctttctcacccg ggtcggggccagaggcctgcagagcgcatgctctggggcagttcgcggcccggcggggag cgccggagttccttgtggccgacgtgcaccaaggacaggaagatgtcacccaagcgcata gctaaaagaaggtcccccccagcagatgccatccccaaaagcaagaaggtgaaggtctca cacaggtcccacagcacagaacccggcttggtgctgacactaggccagggcgacgtgggc cagctggggctgggtgagaatgtgatggagaggaagaagccggccctggtatccattccg gaggatgttgtgcaggctgaggctgggggcatgcacaccgtgtgtctaagcaaaagtggc caggtctattccttcggctgcaatgatgagggtgccctgggaagggacacatcagtggag ggctcggagatggtccctgggaaagtggagctgcaagagaaggtggtacaggtgtcagca ggagacagtcacacagcagccctcaccgatgatggccgtgtcttcctctggggctccttc cgggacaataacggtgtgattggactgttggagcccatgaagaagagcatggtgcctgtg caggtgcagctggatgtgcctgtggtaaaggtggcctcaggaaacgaccacttggtgatg ctgacagctgatggtgacctctacaccttgggctgcggggaacagggccagctaggccgt gtgcctgagttatttgccaaccgtggtggccggcaaggcctcgaacgactcctggtcccc aagtgtgtgatgctgaaatccaggggaagccggggccacgtgagattccaggatgccttt tgtggtgcctatttcacctttgccatctcccatgagggccacgtgtacggcttcggcctc tccaactaccatcagcttggaactccgggcacagaatcttgcttcataccccagaaccta acatccttcaagaattccaccaagtcctgggtgggcttctctggtggccagcaccataca gtctgcatggattcggaaggaaaagcatacagcctgggccgggctgagtatgggcggctg ggccttggagagggtgctgaggagaagagcatacccaccctcatctccaggctgcctgct gtctcctcggtggcttgtggggcctctgtggggtatgctgtgaccaaggatggtcgtgtt ttcgcctggggcatgggcaccaactaccagctgggcacagggcaggatgaggacgcctgg agccctgtggagatgatgggcaaacagctggagaaccgtgtggtcttatctgtgtccagc gggggccagcatacagtcttattagtcaaggacaaagaacagagctga >gi568815597f:28429867_28638004|GENSCAN_predicted_peptide_3|287_aa MAASLWMGDLEPYMDENFISRAFATMGETVMSVKIIRNRLTGIPAGYCFVEFADLATAEK CLHKINGKPLPGATPAKRFKLNYATYGKQPDNSPEYSLFVGDLTPDVDDGMLYEFFVKVY PSCRGGKVVLDQTGVSKGYGFVKFTDELEQKRALTECQGAVGLGSKPVRLSVAIPKASRV KPVEYSQMYSYSYNQYYQQYQNYYAQWGYDQNTGSYSYSYPQYGYTQSTMQTYEEVGDDA LEDPMPQLDVTEANKEFMEQSEELYDALMDCHWQPLDTVSSEIPAMM >gi568815597f:28429867_28638004|GENSCAN_predicted_CDS_3|864_bp atggcggccagcctgtggatgggcgacctggaaccctacatggatgagaacttcatctcc agagcctttgccaccatgggggagaccgtaatgagcgtcaaaattatccgaaaccgcctc actgggatcccagctggctactgctttgtagaatttgcagatttggccacagctgagaag tgtttgcataaaattaatgggaaaccccttccaggagccacacctgcgaaacgttttaaa ctgaactatgccacttacgggaaacaaccagataacagccctgagtattccctctttgtg ggggacctgaccccggacgtggatgatggcatgctgtatgaattcttcgtcaaagtctac ccctcctgtcggggaggcaaggtggttttggaccagacaggcgtgtctaagggttatggt tttgtgaaattcacagatgaactggaacagaagcgagccctgacggagtgccagggagca gtgggactggggtctaagcctgtgcggctgagcgtggcaatccctaaagcgagccgtgta aagccagtggaatatagtcagatgtacagttatagctacaaccagtattatcagcagtac cagaactactatgctcagtggggctatgaccagaacacaggcagctacagctacagttac ccccagtatggctatacccagagcaccatgcagacatatgaagaagttggagatgatgca ttggaagaccccatgccacagctggatgtgactgaggccaacaaggagttcatggaacag agtgaggagctgtatgacgctctgatggactgtcactggcagcccctggacacagtgtct tcagagatccctgccatgatgtag >gi568815597f:28429867_28638004|GENSCAN_predicted_peptide_4|77_aa MTDLVYRDPGVLKQVVMMDRLPPSFLAVCDTEEVSLFLELCFKIHVTCKAVLICDYGPME LGQSLWEAEGKDPGHFR >gi568815597f:28429867_28638004|GENSCAN_predicted_CDS_4|234_bp atgactgacttagtctacagagatcccggcgtacttaagcaggtagtaatgatggacagg ttgcccccaagtttcctggcagtgtgtgatactgaggaggtgagcttgtttctggagctg tgctttaagattcatgttacatgtaaagctgtcctcatttgtgactatggacctatggag ttgggacaatctctatgggaagcagaaggcaaggaccccggtcattttaggtag >gi568815597f:28429867_28638004|GENSCAN_predicted_peptide_5|218_aa MEAEGCRYQFRVALLGDAAVGKTSLLRSYVAGAPGAPEPEPEPEPTVGAECYRRALQLRA GPRVKLQLWDTAGHERFRCITRSFYRNVVGVLLVFDVTNRKSFEHIQDWHQEVMATQGPD KVIFLLVGHKSDLQSTRCVSAQEAEELAASLGMAFVETSVKNNCNVDLAFDTLADAIQQA LQQGDIKLEEGWGGVRLIHKTQIPRSPSRKQHSGPCQC >gi568815597f:28429867_28638004|GENSCAN_predicted_CDS_5|657_bp atggaggccgagggctgccgctaccaatttcgggtcgcgctgctgggggacgcggcggtg ggcaagacgtcgctgctgcggagctacgtggcaggcgcgcctggcgccccggagccggag cccgagcccgagcccacggtgggcgccgagtgctaccgccgcgcgctgcagctgcgggcc gggccgcgggtcaagctgcaactctgggacaccgcgggccacgagcgcttcaggtgcatc accaggtccttttaccggaatgtggtgggtgtcctgctggtctttgatgtgacaaacagg aagtcctttgaacacatccaagactggcaccaggaggtcatggccactcagggcccggac aaggtcatcttcctgctggttggccacaagagtgacctgcagagcacccgctgtgtctca gcccaggaggccgaggagctagctgcctccctgggcatggccttcgtggagacctcggtt aaaaacaactgcaatgtggacctggcctttgacaccctcgctgatgctatccagcaggcc ctgcagcagggggacatcaagctagaagagggctgggggggtgtccggctcatccacaag acccaaatccccaggtcccccagcaggaagcagcactcaggcccatgccagtgttga >gi568815597f:28429867_28638004|GENSCAN_predicted_peptide_6|194_aa MAASHFTGLTAVADVIKDLDTQIALIGLGPHSSKKKQDLDKLYELKSKARQIMNQFGPSA LINLSNFSSIKPEPASTPPQGSMANSTAVVLTKKKLQDLVREVDPNEQLDEDVEEMLLQI ADDFIESVVTAACQLARHRKSSTLEVKDVQLHLERQWNMWIPGFGSEEIRPYKKACTTEA HKQRMALIRKTTKK >gi568815597f:28429867_28638004|GENSCAN_predicted_CDS_6|585_bp atggctgcctctcatttcaccgggctcacagctgttgctgatgtaattaaagatctagac actcagatagctttaattggccttggtcctcacagctccaaaaagaaacaggatctcgat aagctctatgagctgaagtccaaagctcggcagattatgaaccagtttggcccctcagcc ctaatcaacctctccaatttctcatccataaaaccggaaccagccagcacccctccacaa ggctccatggccaatagtactgcagtggtattgaccaagaagaaattacaggacttagta agagaagtggatcctaatgagcagttggatgaagatgtggaggagatgctgctgcagatt gctgatgattttatcgagagtgtggtgacagcagcctgtcagcttgcgcggcatcgcaag tctagcaccctggaggtgaaagatgtccagctgcatttagagcgccagtggaacatgtgg atcccaggatttggctctgaagaaatccgaccctacaaaaaagcttgcaccacagaagct cacaaacagagaatggcattgatccggaaaacaaccaagaaataa