GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:54:04 Sequence gi568815575f:120505407_120712261 : 206855 bp : 40.09% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 355 350 6 1.05 1.02 Term - 8946 8660 287 1 2 95 43 65 0.345 -2.72 1.01 Init - 14362 14035 328 1 1 70 71 202 0.783 14.23 1.00 Prom - 15938 15899 40 -7.65 2.00 Prom + 16233 16272 40 -3.65 2.01 Init + 16410 16558 149 2 2 83 60 110 0.347 7.31 2.02 Term + 19380 19530 151 0 1 87 41 86 0.474 0.20 2.03 PlyA + 20022 20027 6 1.05 3.24 PlyA - 20784 20779 6 -0.45 3.23 Term - 21450 21355 96 0 0 77 39 128 0.988 3.79 3.22 Intr - 24848 24696 153 2 0 33 100 117 0.970 6.75 3.21 Intr - 27188 27016 173 0 2 97 20 131 0.994 5.94 3.20 Intr - 29180 29075 106 2 1 101 58 69 0.996 4.07 3.19 Intr - 30537 30424 114 0 0 77 107 15 0.801 2.02 3.18 Intr - 31628 31521 108 2 0 62 94 62 0.927 3.76 3.17 Intr - 32069 32008 62 0 2 118 60 48 0.421 2.53 3.16 Intr - 33364 33254 111 2 0 66 78 103 0.989 6.53 3.15 Intr - 33966 33862 105 1 0 82 115 87 0.998 10.07 3.14 Intr - 35156 34964 193 2 1 53 77 166 0.998 10.14 3.13 Intr - 36314 36196 119 0 2 48 62 105 0.993 3.16 3.12 Intr - 37627 37560 68 0 2 124 86 32 0.993 4.23 3.11 Intr - 38403 38321 83 0 2 44 100 81 0.998 2.42 3.10 Intr - 38797 38708 90 1 0 118 53 107 0.999 9.47 3.09 Intr - 39237 39075 163 2 1 70 108 103 0.336 9.56 3.08 Intr - 49547 49471 77 2 2 70 49 41 0.080 -4.21 3.07 Intr - 52633 52518 116 2 2 58 91 128 0.966 9.35 3.06 Intr - 55219 54677 543 2 0 110 86 412 0.992 35.14 3.05 Intr - 55316 55276 41 1 2 130 61 47 0.821 3.15 3.04 Intr - 55592 55432 161 2 2 104 -2 134 0.585 3.86 3.03 Intr - 61414 61285 130 0 1 86 41 55 0.279 0.38 3.02 Intr - 63155 63059 97 0 1 16 105 31 0.066 -4.25 3.01 Init - 69271 69145 127 1 1 53 89 115 0.674 8.47 3.00 Prom - 69326 69287 40 -8.95 4.00 Prom + 69810 69849 40 -9.45 4.01 Init + 70482 70537 56 2 2 70 81 8 0.153 -0.69 4.02 Term + 76476 76800 325 0 1 16 52 910 0.981 73.45 4.03 PlyA + 76942 76947 6 1.05 5.04 PlyA - 80353 80348 6 1.05 5.03 Term - 98740 98604 137 2 2 -44 48 265 0.041 6.50 5.02 Intr - 109306 109235 72 2 0 81 100 50 0.034 3.96 5.01 Init - 112253 112190 64 2 1 53 37 43 0.035 -2.94 5.00 Prom - 112749 112710 40 -5.05 6.05 PlyA - 113297 113292 6 1.05 6.04 Term - 121765 120804 962 2 2 98 54 435 0.065 32.33 6.03 Intr - 124394 124312 83 1 2 44 15 33 0.030 -10.04 6.02 Intr - 124640 124511 130 0 1 71 100 128 0.175 11.13 6.01 Init - 131324 131255 70 2 1 90 74 18 0.067 1.89 6.00 Prom - 142102 142063 40 -3.45 7.02 PlyA - 142590 142585 6 1.05 7.01 Sngl - 150834 150415 420 0 0 36 48 214 0.924 8.25 7.00 Prom - 154578 154539 40 -0.65 8.03 PlyA - 154908 154903 6 1.05 8.02 Term - 155380 155272 109 0 1 88 47 64 0.156 -0.70 8.01 Init - 170514 170315 200 0 2 72 9 206 0.529 9.52 8.00 Prom - 175991 175952 40 -5.95 9.06 PlyA - 176243 176238 6 1.05 9.05 Term - 178785 178437 349 2 1 73 43 123 0.076 -0.73 9.04 Intr - 191091 190954 138 2 0 25 93 85 0.230 1.36 9.03 Intr - 191389 191199 191 1 2 70 76 143 0.443 8.76 9.02 Intr - 191622 191565 58 2 1 90 61 32 0.381 -1.33 9.01 Intr - 199719 199536 184 1 1 58 91 138 0.030 9.02 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 193843 194451 609 0 0 88 54 224 0.958 14.94 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:120505407_120712261|GENSCAN_predicted_peptide_1|204_aa MVDKLFDVLLDSVCQYFIEDFCIDVHMAGYPSETKLLEERSGSNICCSAIFAVLQPPLLI PRQTGSGVDLQQTPTDLQLRVLTVRRKTNKQKGRPCQNPICMSPSSKTKASCNCFWMSLS LLRARVLQFKRLRGWSGKVKGLTVVDRRMEEVGAWRGNRSRFKETELKIESGRRRSRSNG EEEVLAPVWGDLKFSEEANEVSSY >gi568815575f:120505407_120712261|GENSCAN_predicted_CDS_1|615_bp atggtggataaactttttgatgtgctgctagattcggtttgccagtattttattgaggat ttttgcatcgatgttcacatggccgggtacccctctgagacgaagcttctagaggaacga tcaggcagcaacatttgctgttcagcaatattcgctgttctgcagcctccgctgctgata cccaggcaaacagggtctggagtggacctccagcaaactccaacagacctgcagcttagg gtcctgactgttagaaggaaaactaacaaacagaaaggacgtccatgccaaaaccccatc tgtatgtcaccatcatcaaagaccaaagcctcatgtaattgcttttggatgtccctaagt ctcttgagagccagagttctacagttcaagagactaaggggctggagtgggaaggtaaaa ggtctcacagtagtggacagaaggatggaggaggtaggggcttggagggggaacagatca aggttcaaagaaactgaattaaagatcgaaagtggcagaaggaggagcagaagcaacggg gaggaagaggtcttagcaccagtttggggagatcttaagttttctgaagaagccaatgaa gtttcaagttattag >gi568815575f:120505407_120712261|GENSCAN_predicted_peptide_2|99_aa MDEAGDHHSQQTIARTENQTPYVLTHRCELNNENTWTQEGEHHTLGPVVGLANFSAETAS SFQRIKSLGEGTRFQDTEGDIKSLLLTGPTLSKSCLRVT >gi568815575f:120505407_120712261|GENSCAN_predicted_CDS_2|300_bp atggatgaagctggagaccatcattctcagcaaactatcgcaaggacagaaaaccaaaca ccatatgttctcactcataggtgcgaactgaacaacgagaacacttggacacaggaaggg gaacaccacactctggggcctgttgtgggtcttgctaatttttcagctgaaacagcatca agttttcaaagaattaagagcctcggggaggggacccgctttcaagatactgaaggtgac atcaagagtctcctcctaacaggaccaactctatctaaaagttgcttacgagtaacttga >gi568815575f:120505407_120712261|GENSCAN_predicted_peptide_3|1011_aa MGEDCWKSEFSVFYRRPKGRMMSQSSGSGDGNDDEATTSKDGGIGERNNAVGGNKGHLVI EAGKIHDLVIETSLRWFTGMTDKISELRNEVLEWLMYQVSNFRRCKINNFAKNMKFGWGE GEKEEELQPHRLAESLLQKGGERDRKAGEGDGTAVYLSASSFALPPRSAHSCIATFSPTR TQAYKRFSSPSPSAAAAAQEVRSATDGNTSTTPPTSAKKRKLNSSSSSSSNSSNEREDFD STSSSSSTPPLQPRDSASPSTSSFCLGVSVAASSHVPIQKKLRFEDTLEFVGFDAKMAEE SSSSSSSSSPTAATSQQQQLKNKSILISSVASVHHANGLAKSSTTVSSFANSKPGSAKKL VIKNFKDKPKLPENYTDETWQKLKEAVEAIQNSTSIKYNLEELYQKDYFEMFKFKQEDKG GKAISYNLDFRDMGLELFRAHIISDQKVQNKTIDGILLLIERERNGEAIDRSLLRSLLSM LSDLQIYQDSFEQRFLEETNRLYAAEGQKLMQEREVPEYLHHVNKRLEEEADRLITYLDQ TTQKSLIATVEKQLLGEHLTAILQKGLNNLLDENRIQDLSLLYQLFSRVRGGVQVLLQQW IEYIKAFGSTIVINPEKDKTMVQELLDFKDKVDHIIDICFLKNEKFINAMKEAFETFINK RPNKPAELIAKYVDSKLRAGNKEATDEELEKMLDKIMIIFRFIYGKDVFEAFYKKDLAKR LLVGKSASVDAEKSMLSKLKHGHSLQTGSGASEKLETWAGWGYMQNQNVPGNIELTVNIL TMGYWPTYVPMEVHLPPEMVKLQEIFKTFYLGKHSGRKLQWQSTLGHCVLKAEFKEGKKE LQVSLFQTLVLLMFNEGEEFSLEEIKQATGIEDGELRRTLQSLACGKARVLAKNPKGKDI EDGDKFICNDDFKHKLFRIKINQIQMKETVEEQASTTERVFQDRQYQIDAAIVRIMKMRK TLSHNLLVSEVYNQLKFPVKPADLKKRIESLIDRDYMERDKENPNQYNYIA >gi568815575f:120505407_120712261|GENSCAN_predicted_CDS_3|3036_bp atgggagaggactgctggaaaagtgaattctctgttttctatcgtagacccaaaggacgg atgatgtcacagtcatctggatcaggagatgggaatgatgatgaggctactacctctaaa gacggtggaattggagagagaaataatgctgttggtgggaataaaggacatttagtaata gaagcaggaaaaatacatgacttggtaatagaaacttccctgagatggttcacaggtatg acagataaaatctctgagctaaggaacgaagtgcttgaatggttgatgtaccaagtcagt aacttcaggcgatgtaagataaacaattttgctaaaaatatgaaatttggctggggagag ggagaaaaggaggaggagctgcagcctcacagactcgctgagtcgctcctgcagaaaggg ggggagagagatcgaaaagcaggggagggggacggcacggccgtttacctgtctgcctcc tcattcgctctcccccctcgttctgctcactcctgcattgctaccttctctcctacacgc acgcaggcatataaacgtttttcttcccccagtccctcagctgctgctgctgctcaggag gtcagatctgccactgatggtaataccagcaccactccgcccacctctgccaagaagaga aagttaaacagcagcagcagtagcagcagtaacagtagtaacgagagagaagactttgat tccacctcttcctcctcttccactcctcctttacaacccagggattcggcatccccttca acctcgtccttctgcctgggggtttcagtggctgcttccagccacgtaccgatacagaag aagctgcgttttgaagacaccctggagtttgtagggtttgatgcgaagatggctgaggaa tcctcctcctcctcctcctcatcttcaccaactgctgcaacatctcagcagcagcaactt aaaaataagagtatattaatctcttctgtggcttcggtgcatcatgcaaacggcctagcc aaatcttctaccaccgtctctagctttgctaacagcaaacctggctctgctaagaagtta gtgatcaagaactttaaagataagcctaaattaccagaaaactacacagatgaaacctgg caaaaactgaaagaagcagtggaagctattcagaatagtacttcaattaagtacaattta gaagaactctaccagaaagattattttgagatgttcaagttcaaacaggaggataaagga ggtaaagctatttcatataatctggacttcagggacatgggactggagttatttagggct catattataagtgatcagaaagtgcagaataagacaattgatggcattcttctcttgatt gagagggaaaggaatggtgaagcaattgatagaagtttacttcgaagccttttaagcatg ctgtctgatttgcaaatttatcaagattcttttgaacaacgatttttggaagaaactaac cggctctatgcagctgaaggccaaaaattaatgcaagaaagagaggttcctgaatatcta catcatgttaacaaacgtctagaagaagaagcagacagacttattacttacttagatcag accacccagaagtcattaattgctactgtagaaaaacaacttctaggtgaacacttaaca gcaattcttcagaaaggtttaaataacctccttgatgaaaaccgaattcaagatttgtct cttctgtatcagctcttcagtagagttcgaggtggagttcaggttcttttgcagcagtgg atcgaatatatcaaggcatttggcagcactattgtaattaatcctgaaaaagataaaacc atggttcaagaattgctggattttaaagataaggttgaccatataattgatatctgcttt ctgaagaatgagaaatttatcaatgccatgaaagaagcatttgaaacgttcattaacaaa agaccaaataaaccagctgaacttatagctaagtatgtagattcaaaacttcgtgcaggc aacaaagaagctacagatgaagaacttgagaaaatgttggataaaattatgatcatattt agatttatctatggcaaggatgtttttgaggccttctataagaaagatttagccaagcgc ctgttagtcggaaagagtgcatctgtagatgctgaaaaatcaatgctgtccaaacttaaa catggtcactccctgcagacaggcagtggagcaagtgaaaaactggagacttgggctggc tgggggtatatgcagaatcagaatgttccgggaaatattgagttaactgtgaatatcctg acaatgggctattggccgacatatgtgcctatggaagttcatttaccaccagagatggta aaacttcaggagattttcaagacattttacctaggcaaacatagtggcaggaaacttcag tggcagtcaaccctaggacactgtgtgttaaaagcagaatttaaagagggtaaaaaggaa ctccaggtctctctttttcaaacactggtgctgctaatgtttaatgagggagaggagttc agtttagaagagatcaagcaggcaactggaatagaggatggagagttaaggagaacactg cagtcattagcctgtggcaaagctagagttctggcgaaaaatccaaagggcaaagacatt gaagatggtgacaagttcatttgtaatgatgatttcaaacataaacttttcaggataaag atcaatcaaatccagatgaaagaaacggttgaagaacaagcaagcactacagaaagagta tttcaagacagacagtatcaaattgatgctgcaattgttcgaattatgaagatgagaaag acacttagccacaatctccttgtttcagaagtgtacaaccagttgaaatttccagtaaag cctgctgatcttaagaagagaatagaatctttaattgaccgggactacatggaaagagat aaagaaaatccaaaccagtacaactatattgcatag >gi568815575f:120505407_120712261|GENSCAN_predicted_peptide_4|126_aa MGRVRGIRFSTWSLLSGKSKKEKKKKKKKKKKKKKKKKKKKKKKKKKRKKRRREEEEKKK KKKKKKKKKKKKKKRRRRRKEEEEEEEEEEEEEEEEEEEEEEEEEEEILSNQKPSLEEAL SPCLSE >gi568815575f:120505407_120712261|GENSCAN_predicted_CDS_4|381_bp atggggagggtgaggggcatcaggttctccacgtggtctctgctgagcgggaaaagtaag aaagagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaag aagaagaagaagaagaagaagaggaagaaaagaagaagagaagaagaagagaagaagaag aagaagaagaagaagaagaagaagaagaagaagaagaagaaaagaagaagaagaagaaaa gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagaggaagaagaaatcctgagtaatcagaagccttccctggaagaggccctt tctccttgcctctctgagtga >gi568815575f:120505407_120712261|GENSCAN_predicted_peptide_5|90_aa MNFKGVIDYAFEVSYIHILAEKHKELQASNRVPPPPNRETSEFEARRPALTVGAVVVIRK TRASRALRLSVKSDYFRSSGLHVGAFAGSF >gi568815575f:120505407_120712261|GENSCAN_predicted_CDS_5|273_bp atgaactttaagggagtaattgattatgcttttgaagtctcctatattcacatattggct gaaaaacataaggagctacaggcctcaaatagggtgcctcctcccccaaatagggagaca tctgaatttgaagccagaaggccggcactgacagtcggagctgtagtcgttatccggaaa acgagagccagccgcgctttacggctctcagtcaaatctgactacttccgctcctctgga ctccacgtaggcgcttttgccggctccttttag >gi568815575f:120505407_120712261|GENSCAN_predicted_peptide_6|414_aa MPCFPFTFHYDCTFPEASLAMQNSRPTREHRLQLEQPGQERNGVVRQRERKPVRGCAFLS PSRSRRGCSGRIVSTGDRESSSWGRSLGPSEGYRGKMLSESSSFLKGVMLGSIFCALITM LGHIRIGHGNRMHHHEHHHLQAPNKEDILKISEDERMELSKSFRVYCIILVKPKDVSLWA AVKETWTKHCDKAEFFSSENVKVFESINMDTNDMWLMMRKAYKYAFDKYRDQYNWFFLAR PTTFAIIENLKYFLLKKDPSQPFYLGHTIKSGDLEYVGMEGGIVLSVESMKRLNSLLNIP EKCPEQGGMIWKISEDKQLAVCLKYAGVFAENAEDADGKDVFNTKSVGLSIKEAMTYHPN QVVEGCCSDMAVTFNGLTPNQMHVMMYGVYRLRAFGHIFNDALVFLPPNGSDND >gi568815575f:120505407_120712261|GENSCAN_predicted_CDS_6|1245_bp atgccttgcttccccttcaccttccactatgattgtacgtttcctgaggcttctctagcc atgcagaactcgagaccaacgagagaacaccgcctgcagctagaacagcctggtcaggag cgtaacggagtggtgcgccaacgtgagaggaaacccgtgcgcggctgcgctttcctgtcc ccaagccgttctagacgcgggtgcagcgggaggattgtcagcacaggagaccgagagtct tcttcctgggggcgctctcttggtccctcagagggatatcgaggaaaaatgctttctgaa agcagctcctttttgaagggtgtgatgcttggaagcattttctgtgctttgatcactatg ctaggacacattaggattggtcatggaaatagaatgcaccaccatgagcatcatcaccta caagctcctaacaaagaagatatcttgaaaatttcagaggatgagcgcatggagctcagt aagagctttcgagtatactgtattatccttgtaaaacccaaagatgtgagtctttgggct gcagtaaaggagacttggaccaaacactgtgacaaagcagagttcttcagttctgaaaat gttaaagtgtttgagtcaattaatatggacacaaatgacatgtggttaatgatgagaaaa gcttacaaatacgcctttgataagtatagagaccaatacaactggttcttccttgcacgc cccactacgtttgctatcattgaaaacctaaagtattttttgttaaaaaaggatccatca cagcctttctatctaggccacactataaaatctggagaccttgaatatgtgggtatggaa ggaggaattgtcttaagtgtagaatcaatgaaaagacttaacagccttctcaatatccca gaaaagtgtcctgaacagggagggatgatttggaagatatctgaagataaacagctagca gtttgcctgaaatatgctggagtatttgcagaaaatgcagaagatgctgatggaaaagat gtatttaataccaaatctgttgggctttctattaaagaggcaatgacttatcaccccaac caggtagtagaaggctgttgttcagatatggctgttacttttaatggactgactccaaat cagatgcatgtgatgatgtatggggtataccgccttagggcatttgggcatattttcaat gatgcattggttttcttacctccaaatggttctgacaatgactga >gi568815575f:120505407_120712261|GENSCAN_predicted_peptide_7|139_aa MASVQHLKTFGRFSRYHRSCLKDCSKIAKLINELIQDGQVRIGAHIRKGCGVEEEGRSRY KMPGPDSEEECLRLNYVAYQNKAALTEGSPRKQTNRRRSFHLGKLLSCAPPLTSTFWKLS TCLAQGSVLAYAVHYLQDL >gi568815575f:120505407_120712261|GENSCAN_predicted_CDS_7|420_bp atggccagtgttcaacatttaaagacatttgggagattcagcagataccacaggagctgt ttgaaggactgttcaaagattgcaaagctcatcaatgagttgattcaggatggtcaggtc agaattggagcacacattagaaagggctgtggagttgaagaggaagggagaagcaggtac aaaatgccagggcctgacagtgaggaagagtgtttgcggctcaactacgtggcctatcaa aataaagctgcccttactgagggatcccccagaaagcaaactaacagacgaaggagtttt cacctgggcaaacttctctcatgtgctcctcctcttacttcaacattttggaaattgtca acctgcttggctcaaggttctgttttagcatatgctgtccactacctccaagacctttga >gi568815575f:120505407_120712261|GENSCAN_predicted_peptide_8|102_aa MVDPPIAYTVHLEKLQTMPAHESNWKWGCTLQSHRVELPKTMRANLLHQCALDVTHGVKG DHFETLRPRVKGAAMIRSPDHWSHVTGNIHMQDIKGKTWKDR >gi568815575f:120505407_120712261|GENSCAN_predicted_CDS_8|309_bp atggtagatccaccaatagcttacaccgtgcacctggaaaagctgcagacaatgccagcc catgaaagcaactggaagtggggctgtaccctgcaaagccacagggtggagctgcccaag accatgagagccaacctcttgcatcagtgtgccctggatgtaacacatggagtcaaagga gatcattttgaaactttaagaccaagagtgaaaggagctgcaatgatcaggagcccagac cattggtcccacgtaactggtaacatccacatgcaggacataaaggggaaaacttggaaa gacagatga >gi568815575f:120505407_120712261|GENSCAN_predicted_peptide_9|306_aa XLLRPFNMIINIVNLQEGFECALVLKLTLAQNPPFNEEHVSGLMFPKAQFEKQDFRKIDL ARGGYVSEVFFVLKLWGVIQQLWGNLRAAKEEILVMVVAQGRLLDSWGLYAREMQVSNCS VQSAQDGGFVLWTQARGSLPGDEPRLRVARAVAEKLSNAPGGSAQGAAEFVLAQELWWQG VWRPRPGEPARRLNGQRWSPSNTGLSRETRKRLKILCLPPHQPREGEVCAGGAEQSARKE RPPPPPSPKSQESQAEATSTSNAGNTKEETGLQATELSTPSSQRPAPTPRVRKHNEELGW EHTSDC >gi568815575f:120505407_120712261|GENSCAN_predicted_CDS_9|921_bp ngacttctcagaccctttaacatgataataaacatagtgaatcttcaggagggatttgaa tgtgccctggttcttaaactcacattagcacaaaatcctccttttaatgaggagcatgta tcagggctaatgtttcccaaagcacagtttgagaaacaggattttaggaagattgatctg gcaagagggggatatgttagtgaagtatttttcgtgttgaagctttggggtgtgatccag cagctctggggcaatctcagagctgctaaggaagagatcttagtaatggttgtggcccag ggtcgtttgctcgactcctggggcctctatgccagagagatgcaggtcagcaattgctca gtgcagtcagcccaagatggagggtttgtgctgtggacccaagctaggggttccctgcct ggtgatgagccacgtctgagggtggcaagggcagtggcagagaagctttcaaatgcccct ggaggatctgcacagggagctgctgagtttgtactggctcaagagctctggtggcagggt gtctggaggcccaggcctggagaacctgcccgcagactgaatggacaaagatggagtcct tcaaatacaggcctttcaagagaaaccagaaagaggttgaagatattgtgtctcccacct caccaaccaagagaaggggaagtttgtgctggtggtgctgaacagtctgccaggaaggaa aggccaccgccaccaccttctcccaagagccaggagtcccaagcagaggctacctctacc agtaatgctggaaacacaaaagaagagacagggctccaagccacagaattgtccaccccc tcaagccagagaccagcaccgacaccaagggtcaggaagcacaatgaagagctgggttgg gagcacacaagtgattgctaa