GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:06:29 Sequence gi568815585f:60360467_60667638 : 307172 bp : 37.84% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 15662 15701 40 -3.65 1.01 Init + 20653 20695 43 0 1 50 97 51 0.250 2.83 1.02 Intr + 25167 25283 117 1 0 27 56 111 0.113 1.32 1.03 Intr + 27197 27251 55 0 1 60 63 62 0.094 -1.98 1.04 Intr + 35771 35825 55 2 1 68 52 86 0.451 1.06 1.05 Intr + 35913 36082 170 2 2 54 33 170 0.665 5.92 1.06 Intr + 36406 36653 248 1 2 52 36 281 0.621 15.48 1.07 Intr + 48776 49071 296 0 2 7 39 141 0.000 -3.10 1.08 Term + 65312 65380 69 1 0 122 36 89 0.092 4.06 1.09 PlyA + 66074 66079 6 1.05 2.00 Prom + 66283 66322 40 -5.85 2.01 Init + 68191 68285 95 0 2 20 89 88 0.180 2.00 2.02 Intr + 79222 79306 85 1 1 92 51 93 0.160 4.90 2.03 Intr + 86392 86468 77 0 2 45 92 62 0.073 -0.31 2.04 Intr + 87684 87740 57 0 0 58 110 64 0.074 2.68 2.05 Intr + 106772 106913 142 2 1 90 89 100 0.921 9.73 2.06 Intr + 123309 123380 72 2 0 97 95 77 0.994 7.98 2.07 Intr + 125333 125482 150 1 0 71 63 189 0.993 14.14 2.08 Intr + 133969 134109 141 0 0 55 95 128 0.839 9.83 2.09 Term + 134670 134696 27 2 0 89 40 23 0.529 -5.30 2.10 PlyA + 134870 134875 6 1.05 3.00 Prom + 136804 136843 40 -4.75 3.01 Init + 138318 138895 578 2 2 57 84 391 0.328 30.25 3.02 Intr + 139362 139536 175 0 1 -48 23 188 0.156 -2.28 3.03 Intr + 140168 140347 180 1 0 21 84 104 0.269 2.44 3.04 Intr + 145127 145670 544 1 1 25 77 299 0.265 14.04 3.05 Intr + 147288 147736 449 1 2 53 -44 222 0.271 -2.36 3.06 Intr + 149297 149453 157 1 1 117 60 249 0.639 23.66 3.07 Intr + 150164 150289 126 0 0 66 121 110 0.999 11.83 3.08 Intr + 167901 168751 851 1 2 76 86 569 0.388 45.65 3.09 Intr + 174642 174767 126 2 0 92 79 146 0.935 14.06 3.10 Term + 187415 187540 126 1 0 100 39 34 0.046 -3.00 3.11 PlyA + 189818 189823 6 1.05 4.03 PlyA - 189944 189939 6 1.05 4.02 Term - 192309 191928 382 2 1 7 40 218 0.616 2.23 4.01 Init - 192847 192627 221 1 2 75 86 150 0.978 11.65 4.00 Prom - 203828 203789 40 -3.65 5.00 Prom + 204984 205023 40 -5.05 5.01 Init + 224618 224701 84 1 0 66 43 114 0.195 5.57 5.02 Intr + 226724 226838 115 1 1 28 43 124 0.066 1.00 5.03 Intr + 239414 239485 72 0 0 68 84 65 0.512 2.56 5.04 Intr + 240034 240115 82 2 1 56 94 97 0.586 4.88 5.05 Intr + 247533 247651 119 0 2 72 26 82 0.330 -0.41 5.06 Intr + 250099 250158 60 2 0 112 93 42 0.677 4.89 5.07 Term + 259927 260084 158 2 2 84 37 75 0.181 -0.89 5.08 PlyA + 261379 261384 6 1.05 6.03 PlyA - 262434 262429 6 1.05 6.02 Term - 276817 276420 398 2 2 44 55 259 0.063 12.45 6.01 Init - 283039 282964 76 1 1 83 89 56 0.238 6.50 6.00 Prom - 290665 290626 40 -5.55 7.05 PlyA - 291427 291422 6 1.05 7.04 Term - 300379 299958 422 2 2 -2 43 382 0.404 19.17 7.03 Intr - 303038 302920 119 1 2 53 72 76 0.359 1.69 7.02 Intr - 303595 303438 158 1 2 38 32 134 0.581 0.69 7.01 Init - 306089 305925 165 2 0 70 73 101 0.800 6.48 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 277631 277459 173 2 2 61 47 163 0.882 8.46 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:60360467_60667638|GENSCAN_predicted_peptide_1|350_aa MGLSENRGEKLEKVTSEIKQPYLCQILMVKEITKIQKVEIETPSFTGGMWMPHWNQRAEA KAAPKGTCETTEWGRCPLEAFFPAIRDAAERESRTPNRAVASAVPLGDGGAHAHSGTHPA PQPSTARAKGIGGEAAVVARSGSTEPSAVLDAFPRSQPVSLLRPYPALSEVAACGRCKRH TFALRRTSSRLSKVCNSEKRPPKQVKAFGEEAKSLNRPDSSHQPVFGGSEPSWRTSVSVV QKGNVGLEPPHRVPTGALPSRAVRRRPLSSRLQNGRATDSLHCAPGKAADIQHQPVKAAG RKAVSCKATRAELSKTMETHLLHQCDLDPTQHEDNEDEDLNDGLFPHNEQ >gi568815585f:60360467_60667638|GENSCAN_predicted_CDS_1|1053_bp atgggtttaagtgaaaatagaggagagaaattggagaaagtaacctcagaaatcaagcag ccttacctctgccagattcttatggtcaaggaaattacaaagatccaaaaggtggaaata gaaactccatctttcactggaggcatgtggatgccacactggaaccagagggctgaagca aaagctgctccaaaaggaacttgtgagaccacagaatggggacgctgccccttagaagcc tttttccccgcaatacgggacgccgcagagcgtgagtcacggacgcccaatcgcgcggta gcgtccgccgtgccgctgggcgacggcggcgcgcatgcccactcagggacccaccctgcg ccccagccctctacagcgcgggcgaagggcatcggcggggaagcggctgtggttgccaga tctggcagtacagagccaagcgcggtcctggacgcgttccctcgcagccaacccgtcagt ctcctcaggccttatcctgccctctctgaggtggccgcctgtgggcgctgcaaacgccac acttttgctctacgaagaacctcctcgcgactttccaaggtgtgcaactcagaaaagcgc cccccaaaacaagtaaaagccttcggggaagaggcaaagtccctgaaccgtcctgactcc agccaccagccggtgtttggtggatcagagccctcatggagaacctctgttagtgtagtg cagaagggaaatgtggggttggagcccccacacagagtccctactggggcactgcctagt agagctgtgagaagaaggccactgtcctccagacttcagaatggtagagccactgacagc ttgcactgtgcacctggaaaagctgcagacattcaacaccagcctgtgaaagcagctggg aggaaggctgtatcctgtaaagccacaagggcagagctgtccaagaccatggaaacccac ctcttgcatcagtgtgacctggatcctactcaacatgaagacaatgaggatgaagacctt aatgatggtctctttccacataatgaacagtaa >gi568815585f:60360467_60667638|GENSCAN_predicted_peptide_2|281_aa MGLENIRLETYVLVHPETKPCGPTEKTVVLSRYLSDEGIEACTSSPDKVNVNDIILIALN HFTINDDYKCMTLESVTTDPLLTNIIIWELESGKGEWRADRSVISLNTPPGTKVKLSGIV DIKNGFLLLNDSNTTVLGGEVEHLIEKWELQRSLSKHNRSNIGTEGGPPPFVPFGQKCVS HVQVDSRELDRRKTLQVTMPVKPTNDNDEFEKQRTAAIAEVAKSKETKTFGGGGGGARSN LNMNAAGNRNREVLQKEKSTKSEGKHEGVYRELGTEYPKTA >gi568815585f:60360467_60667638|GENSCAN_predicted_CDS_2|846_bp atgggcttggagaacatacgtcttgagacatacgtcctcgtgcatcctgagacaaagcct tgtgggcccacagagaagacagtggttctgtccaggtatctttcagatgaaggcattgaa gcttgcacaagctctccagacaaagtcaatgtaaatgacatcatcctgattgctctcaat cattttaccattaatgatgactataaatgcatgactctagaatcagtcactactgatccg ttacttactaacattatcatttgggaacttgagagtgggaagggagaatggagagctgat agatctgtcatcagcctgaacacaccacctggaactaaagttaagctctcaggcattgtt gacataaaaaatggattcctgctcttgaatgactctaacaccacagttcttggtggtgaa gtggaacaccttattgagaaatgggagttacagagaagcttatcaaaacacaatagaagc aatattggaactgaaggtggaccaccgccttttgtgccttttggacagaagtgtgtatct catgtccaagtggatagcagagaacttgatcgaagaaaaacattgcaagttacaatgcct gtcaaacctacaaatgataatgatgaatttgaaaagcaaaggacggctgctattgctgaa gttgcaaagagcaaggaaaccaagacatttggaggaggtggtggtggtgctagaagtaat ctcaatatgaatgctgctggtaaccgaaatagggaagttttacagaaagaaaagtcaacc aaatcagagggaaaacatgaaggtgtctatagagaactgggtacagaatatcccaaaaca gcttaa >gi568815585f:60360467_60667638|GENSCAN_predicted_peptide_3|1103_aa MIKRFGDYWKLAQLMLIPGDPKCHCGPPVKVGAYGGQEIKGVLAQVQITVGPVGTRTHPV VISPVPECIIGIDILSSWQNPHTGSLTGRVRVIMVEKAKWKPLELPLPRKIVNQKQYCIP GGIAEISATIKDLKDTGVMIPTTSPFNSPIWPVQKTDGSWRITVDYQVVTPIAAAVPDVV SLLEQINTSPGTWQHITHLDVLLWRIYQVTQKAASFEWGPEQEKALQQVQAAVQAALPLG PYDPADPMVLEEKWRNVRLYTDLWAVANGLAGLSGTWKKHDWKIGDKEIWERGMLMDHSE WSKTVKMFVSHEHVLTQCKEAKKLEKRLEELLTGITSLEKNINDPMELKNIARELCEAYT SINSRIDKVEERISEIEDQLNEIKWKDKIREKRMKRNNESLQEIWDYVKRPNLHLIGVPE SDRKNGTKLENTLQDIVRKNFPNLARQANIQIQKIQRTPQKYSLRRATPRHIIIRFTKVE MKEKILREAREKVLDVLARAFRQQKEIKSIHIGREEVKLSVFADDMIVYLENLIVSVQNL LKLISNFSKLSGYKANVQKSQAFLYTGHIQTESQIMSELPFTIATKRIPRNTTYKGCEGP LQGELETIAQGNKRGYKRMEKYSMLTDRKNQYHENGHTAQSNVDEKALKHITEMGFSKEA SRQALMDNGNNLEAALNVLLTSNKQKPVMGPPLRGRGKGRGRIRSEDEEDLGNARPSAPS TLFDFLESKMGTLNVEEPKSQPQQLHQGQYRSSNTEQNGVKDNNHLRHPPRNDTRQPRNE KPPRFQRDSQNSKSVLEGSGLPRNRGSERPSTSSVSEVWAEDRIKCDRPYSRYDRTKDTS YPLGSQHSDGAFKKRDNSMQSRSGKGPSFAEAKENPLPQGSVDYNNQKRGKRESQTSIPD YFYDRKSQTINNEAFSGIKIEKHFNVNTDYQNPVRSNSFIGVPNGEVEMPLKGRRIGPIK PAGPVTAVPCDDKIFYNSGPKRRSGPIKPEKILESSIPMEYAKMWKPGDECFALYWEDNK FYRAEVEALHSSGMTAVVKFIDYGNYEEVLLSNIKPIQTEAWNYVLRNIKESLMHENSDS VEEKDFGRGKSLESCFLAHCSIS >gi568815585f:60360467_60667638|GENSCAN_predicted_CDS_3|3312_bp atgatcaaacgttttggggactactggaaactggctcagctgatgttgattccaggggac ccaaaatgtcactgtggtcctccagttaaagtaggggcttatggaggtcaggaaattaaa ggagttttagctcaggtccaaattacagtgggtccagtgggtacgcggactcatcctgtg gtcatttccccagtgccagaatgcataattggaatagacatacttagcagctggcagaac cctcacactggctccctgactggtagggtgagggtgattatggtggaaaaggccaaatgg aagccattagagctgcctctacctagaaaaatagtaaatcaaaaacaatattgcatccct ggagggattgcagagattagtgccaccatcaaggacctgaaagacacaggggtgatgatt cccaccacatccccattcaactctcccatttggcctgtgcagaagacagatggatcttgg agaattacagtggattatcaagtggtgactccaattgcagctgctgtaccagatgtggtt tcattgcttgagcaaattaacacatctcctggtacctggcaacacattactcatttggat gtgttactctggcggatttatcaagtgacccaaaaggctgccagttttgagtggggtcca gaacaggagaaggctctgcaacaggtccaggctgctgtgcaagctgctctgccacttggg ccatatgacccagcagatccaatggtgcttgaggagaaatggcgaaatgtgcgattatat actgatttatgggctgtagccaatggtttggctggattatcagggacttggaagaagcat gattggaaaattggtgacaaagaaatttgggaaagaggtatgttgatggaccactctgag tggtcaaaaactgtgaagatgtttgtatcccatgagcatgttctaacccaatgcaaggaa gctaagaaacttgaaaaaaggttagaggaattgctaactggaataactagtttagagaag aacataaatgacccgatggaattgaaaaacatagcacgagaactttgtgaagcatacaca agtatcaatagccgaattgataaagtggaagaaaggatatcagagattgaagatcaactt aatgaaataaagtggaaagacaagattagagaaaaaagaatgaaaaggaacaatgaaagc ctccaagaaatatgggactatgtgaaaagaccaaacctacatttgattggtgtacctgaa agtgacaggaagaatggaaccaagttggaaaacacacttcaggatattgtccggaagaac ttccccaacctagcaagacaggccaacattcaaattcagaaaatacagagaacaccacaa aaatactccttgagaagagcaaccccaagacacataatcatcagattcaccaaggttgaa atgaaggaaaaaatattaagggaagccagagagaaagtattggatgttctggccagggca ttcaggcaacagaaagaaataaagagtattcacataggaagagaggaagtcaaattgtct gtgtttgcagatgacatgattgtatatttagaaaacctcattgtctcagttcaaaatctc cttaagctgataagcaactttagcaaactctcaggatacaaagccaatgtgcaaaaatca caagcattcctatacactggtcatatacaaacagagagccaaatcatgagtgaactgcca ttcacaattgctacaaagagaatacctaggaatacaacttacaagggatgtgaaggacct cttcaaggagaactagaaaccattgctcaaggaaataagagaggatacaaacgaatggaa aaatattccatgctcacggataggaagaatcaatatcatgaaaatggccatactgcccaa agtaatgttgatgagaaagctctgaagcacataacggaaatgggcttcagtaaggaagca tcgaggcaagctcttatggataatggcaacaacttagaagcagcactgaacgtacttctt acaagcaataaacagaaacctgttatgggtcctcctctgagaggtagaggaaaaggcagg gggcgaataagatctgaagatgaagaggacctgggaaatgcaaggccatcagcaccaagc acattatttgatttcttggaatctaaaatgggaactttgaatgtggaagaacctaaatca cagccacagcagcttcatcagggacaatacagatcatcaaatactgagcaaaatggagta aaagataataatcatctgagacatcctcctcgaaatgataccaggcagccaagaaatgaa aaaccgcctcgttttcaaagagactcccaaaattcaaagtcagttttagaaggcagtgga ttacctagaaatagaggttctgaaagaccaagtacttcttcagtatctgaagtatgggct gaagacagaatcaaatgtgatagaccgtattctagatatgacagaactaaagatacttca tatcctttaggttctcagcatagtgatggtgcttttaaaaaaagagataactctatgcaa agcagatcaggaaaaggtccctcctttgcagaggcaaaagaaaatccacttcctcaagga tctgtagattataataatcaaaaacgtggaaaaagagaaagccaaacatctattcctgat tatttttatgacaggaaatcacaaacaataaataatgaagctttcagtggtataaaaatt gaaaaacattttaatgtaaatactgattatcagaatccagttcgaagtaatagtttcatt ggtgttccaaatggagaagtagaaatgccactgaaaggaagacgaataggacctattaag ccagcaggacctgtcacagctgtaccctgtgatgataaaatattttacaatagtgggccc aaacgaagatctgggccaattaagccagaaaaaatactagaatcatctattcctatggag tatgcaaaaatgtggaaacctggagatgaatgttttgcactttattgggaagacaacaag ttttaccgggcagaagttgaagccctccattcttcgggtatgacagcagttgttaaattc attgactacggaaactatgaagaggtgctactgagcaatatcaagcccattcaaacagag gcatggaattatgtgcttagaaacatcaaagaatctctaatgcatgaaaattcagacagt gttgaggagaaagattttggcagagggaagtcccttgaatcctgtttcttagctcactgc tccattagttag >gi568815585f:60360467_60667638|GENSCAN_predicted_peptide_4|200_aa MLIVNDDLGYLAEEISKQQSIQEVTWLILKAFSDLHSQRYGLKLKFMFKREAEHKDLENL QCEHGVEKKNPFSGAQKTRVEVWEPPLRFQRMYGKAWISRQKSAAGTEPSWRTSTRAIQR GNVGLQPTYRVPTGALPSEAVREGPPSSGPQNGRSTYSLHRVLGKATGTQHQTVKAVIRA APCRATGAELLKALGAYPLH >gi568815585f:60360467_60667638|GENSCAN_predicted_CDS_4|603_bp atgctgatagtgaatgatgatttagggtatctggctgaagaaatttctaagcagcaaagc attcaagaggtgacctggcttatcctgaaagcgttcagtgatctacattcacaaagatat ggtttgaaattgaaatttatgtttaaaagggaagcagagcataaagatttggaaaatttg cagtgtgagcatggggtagaaaagaaaaacccattttctggtgcacagaagacaagagtt gaggtttgggagcctccacttagatttcagaggatgtacgggaaagcctggatatccagg cagaagtctgctgcagggacagagccctcatggagaacctctactagggcaattcagagg ggaaatgtggggctccagcccacatacagagtccccactggggcactgcctagtgaagct gtgagagaagggccaccatcctctggaccccagaatggcagatccacctatagcttgcac cgtgtgcttggaaaagccacaggcactcaacaccagactgtgaaagcagtcataagggct gcaccctgcagagccacaggggcagagctgctcaaggccttaggagcctaccccttgcat tag >gi568815585f:60360467_60667638|GENSCAN_predicted_peptide_5|229_aa MPRANLGCPVQAPGHELAKRKAGTGLAQPQAPRADKQTNGRVMQQRRKEEKECLNVKRSL AGDGWRVCSELGKSAVKLHQQRMNIIEKEGEEEFKTQTYVKREDDVKTEERMATCKPRGS TEGRHQVPTDHLWIRRLGLPKQLLGVVKPITLSGDLAGLPICCAAVQLPLASEQCVEDTI LSQSELVKSNIAYFHSTPNPFCILSPPPEMPFAPPSSLKELLYVLQVLV >gi568815585f:60360467_60667638|GENSCAN_predicted_CDS_5|690_bp atgccccgtgctaaccttggctgccctgtccaggcaccaggccatgagttagcaaagcgc aaagccggcactgggcttgcacagccccaggctccaagagcagacaagcagacaaatgga agagtgatgcagcagagaaggaaagaagagaaggagtgtctgaatgtcaagaggagtttg gctggggatggttggagagtgtgttctgaacttgggaagagtgctgttaagctccatcag cagcgtatgaatattattgagaaagagggagaagaggaatttaagacacagacctacgta aagagggaagatgatgtgaagacagaggagaggatggccacctgcaagcccagaggcagc accgaagggaggcaccaggtgccaacagaccacctgtggatacgccgtctagggctcccc aaacaattgctgggggtggtaaagcccattactctcagcggagacctagcaggtctcccc atttgctgtgcagcggtacagctgccactagcttctgaacagtgtgtggaagatactatc ctgagccagtcagaacttgttaaatccaacattgcctatttccactcaacaccaaaccct ttttgtatactttctcctccacctgaaatgccctttgccccaccctcttcactcaaggag cttttatatgttcttcaggtcttggtttag >gi568815585f:60360467_60667638|GENSCAN_predicted_peptide_6|157_aa MDGAGSHYPQQTNAGTENQTLHVLTCSHLHQHLKEGGLAHCEDACSRFHRLCHAWRYGPK GTRCDQKEFHSGSSRVLITTDLLPRNIDMQRLSLVIKYNLPTNRQNYIHRISRGRWFDLK DADINMVTEEDKRTLRDIRTFYNTSIEEMPLDVADLI >gi568815585f:60360467_60667638|GENSCAN_predicted_CDS_6|474_bp atggatggagctggaagccattatcctcagcaaaccaatgcaggaaccgaaaaccaaaca ctgcatgttctcacttgcagtcatcttcatcaacacttgaaggaaggtggactggctcac tgcgaagatgcatgctcgagatttcaccgtctctgccatgcatggagatatggaccaaaa ggaacgcgatgtgatcagaaggagtttcattctggctctagcagagtattgattaccact gacctgctgcccagaaatattgatatgcaacgactttctttagtcatcaaatataacctt cccaccaacaggcaaaactatatccacagaatcagtcgaggtagatggtttgaccttaag gatgcggatattaacatggtgacagaagaagacaagaggactcttcgagacatcaggacc ttctacaacacctccattgaggaaatgcccctcgatgttgctgacctcatctga >gi568815585f:60360467_60667638|GENSCAN_predicted_peptide_7|287_aa MNAHFLKEDIYVAKKHGKMPIITNNQRNSNQIHNEMPSHTSWNGYYKKVKKQQMLGSKGS TTGSGNGREAFGCPGSSNPEKCRAAANGNVQPQVQWLHSRPELGGPACLKAVGSGAVAVA KMAGLSVNPGSSVPEKCKAVPSPRLRQGQSASSPNDRNTSPARGQNWTEDEMDKLTEEAF RKWVITNSAELKEHILSQCKEAENLDKRLQELPARITSLENINDLTEVKNTAQELREAYT SINSQIDQARERISEFEDHLAEIRQADKIREQRKKRNEQSSEKYRTV >gi568815585f:60360467_60667638|GENSCAN_predicted_CDS_7|864_bp atgaacgcacacttcttgaaagaagacatatatgtggccaagaaacatggtaaaatgcct atcatcactaataatcagagaaattcaaatcaaatccacaatgagatgccatctcatact agttggaatggctattataaaaaagtcaaaaaacaacagatgttgggcagcaagggcagt accactggcagtggcaatggcagagaggcttttggttgccccgggagctccaacccagag aaatgcagagctgctgccaatgggaatgttcagccgcaggtgcagtggctgcactcgagg cctgagctaggaggccctgcctgcctgaaggcagtagggtcaggggctgtggccgtggct aaaatggcgggcttgtctgttaaccctgggagctctgtcccagagaaatgcaaagctgtt cccagcccaagactcaggcagggccagagtgcctcttctccaaatgatcgcaacacctct ccagcaagagggcagaactggacagaggatgagatggacaaattgacggaagaagccttc agaaagtgggtaataacaaactctgctgagctaaaggaacacattctaagccaatgcaaa gaagctgagaaccttgataaaaggttgcaggagctgccagctagaataaccagtttagag aacataaatgacctgactgaggtgaaaaacacagcacaagaacttcgtgaagcatacaca agtatcaatagccaaattgatcaagcaagagaaagaatatcagagtttgaagaccatctt gctgaaataaggcaggcagacaagattagagaacaaagaaagaaaaggaatgaacaaagc tctgagaaatacaggactgtgtag