GENSCAN 1.0 Date run: 4-Nov-116 Time: 15:10:58

Sequence gi568815575f:118725245_118926450 : 201206 bp : 42.40% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1817   2145  329  2  2   43   51   245 0.350  10.72
 1.02 Intr +   4497   4735  239  1  2   49   53   128 0.798   1.81
 1.03 Intr +  14503  14614  112  0  1  125   45    41 0.320   2.53
 1.04 Intr +  15773  15912  140  0  2   55  115    -4 0.280  -1.74
 1.05 Intr +  21710  21848  139  1  1   48   87   143 0.793   9.32
 1.06 Intr +  24414  24534  121  1  1   48   97    48 0.333   0.33
 1.07 Intr +  32811  32998  188  0  2   57  103    87 0.922   5.51
 1.08 Term +  38959  39146  188  2  2   73   42   157 0.417   6.27
 1.09 PlyA +  41555  41560    6                               1.05

 2.00 Prom +  43415  43454   40                              -5.95
 2.01 Init +  44552  44776  225  1  0   92   74   230 0.775  19.37
 2.02 Term +  45133  45597  465  0  0   13   45   530 0.533  35.23
 2.03 PlyA +  46431  46436    6                              -0.45

 3.00 Prom +  47151  47190   40                              -6.15
 3.01 Init +  48667  48731   65  0  2   50  116    53 0.380   4.97
 3.02 Intr +  51183  51267   85  0  1   53   99    52 0.379   1.70
 3.03 Term +  66518  66610   93  1  0  110   45   139 0.966   8.65
 3.04 PlyA +  69264  69269    6                               1.05

 4.00 Prom +  72220  72259   40                              -5.65
 4.01 Init +  74003  74156  154  1  1   58   57   180 0.259  10.95
 4.02 Intr +  98444  98761  318  0  0   95  103   295 0.943  26.71
 4.03 Intr +  99644  99751  108  0  0   28   81   173 0.985   9.94
 4.04 Term +  99981 101209 1229  1  2   51   49  1013 0.680  84.47
 4.05 PlyA + 101504 101509    6                               1.05

 5.03 PlyA - 103338 103333    6                               1.05
 5.02 Term - 104546 104382  165  2  0   13   48   194 0.243   4.83
 5.01 Init - 111926 111924    3  2  0   93   95     0 0.147   1.25
 5.00 Prom - 113516 113477   40                              -4.35

 6.02 PlyA - 113627 113622    6                               1.05
 6.01 Sngl - 114472 114017  456  1  0   20   48   363 0.956  21.33
 6.00 Prom - 115860 115821   40                              -5.05

 7.04 PlyA - 117102 117097    6                               1.05
 7.03 Term - 128666 128594   73  1  1   83   53    67 0.244  -0.90
 7.02 Intr - 130583 130486   98  2  2   96  105    85 0.427   8.89
 7.01 Init - 139325 139014  312  2  0   50   70   109 0.061   2.67
 7.00 Prom - 151763 151724   40                              -5.85

 8.02 PlyA - 151975 151970    6                               1.05
 8.01 Sngl - 153095 152685  411  2  0   81   37   255 0.827  15.80
 8.00 Prom - 153179 153140   40                             -11.93

 9.02 PlyA - 153192 153187    6                              -0.45
 9.01 Sngl - 154089 153763  327  0  0  108   48   328 0.766  24.56
 9.00 Prom - 154577 154538   40                              -8.35

10.00 Prom + 154622 154661   40                              -4.25
10.01 Init + 161155 161885  731  0  2   51   72   382 0.025  27.20
10.02 Intr + 165542 165732  191  2  2   -4   81   183 0.007   6.71
10.03 Intr + 170316 170450  135  1  0   70   83    84 0.461   5.72
10.04 Term + 175381 175403   23  2  2   90   49    22 0.162  -4.00
10.05 PlyA + 176680 176685    6                               1.05

11.03 PlyA - 177578 177573    6                               1.05
11.02 Term - 180327 179899  429  0  0  -71   39  1181 0.896  91.62
11.01 Init - 191207 191142   66  2  0   73   77    58 0.419   4.32

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  98181  98241   61  2  1   73   67    30 0.873  -1.10
S.002 Term + 196971 197146  176  1  2   30   55   161 0.941   3.94

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:118725245_118926450|GENSCAN_predicted_peptide_1|485_aa
XTVLPVSPTPPSEVGLVLLTLESALANKGSEVKGWKPGLQTPRPRLFSADDSVLVSSRAQ
PLAPPRPRKCAGSGDAGSATRREEAQAAPGVGGDVVEEVLFGQRPDAGAAGAVISPILYT
KKLRHSATKTLAQIRIPGKVEELGFQPWQPDSRGHALNHCTHCFCPAEKTRVLSWTPPRV
AGQDGLGPTKISQSSSSSCLVSDLRGTRAAKSLSQYFFGKAIKKQLPETQPPVTNLSVSV
ENLCTVIWTWNPPEGASSNCSLWYFSHFGDKQDKKIAPETRRSIEVPLNERICLQVGSQC
STNESEKPSILVEKCISPPEGDPESAVTELQCIWHNLSYMKCSWLPGRNTSPDTNYTLYY
WHRSLEKIHQCENIFREGQYFGCSFDLTKVKDSSFEQHSVQIMVKDNAGKIKPSFNIVPL
TSRADYLLGALAFLLLVVSEEPVNYAGGFTFEQEPRDVNLQVDSASLEFLYMGDGPGEEG
HQFAT

>gi568815575f:118725245_118926450|GENSCAN_predicted_CDS_1|1458_bp
nncacagtactgccagtgtccccaactccacccagtgaagtagggctggtattactaacc
ctggagtcagcgctagcaaacaagggctcagaggtcaagggctggaagccaggtctccag
actcccaggccaaggctcttttccgcagatgactctgtcctggtgtccagcagggcacag
cccctagcacctccgaggccccgcaagtgtgctggcagcggcgacgccggctccgccacg
cggagggaggaggcccaggcagcgcccggagtgggcggcgacgtggtggaagaagttttg
ttcggtcagagaccggacgcgggagctgcgggtgctgttattagtcccattttgtacacg
aaaaagttgaggcacagtgcgaccaagacccttgcccaaattcgcatacctggcaaagtg
gaagagctaggtttccaaccctggcagcctgattccagaggccatgcccttaaccactgt
actcattgcttctgcccagcagaaaagacgagagtccttagctggacacctcccagggta
gcagggcaggacggcttgggacccacgaaaattagtcaaagctctagctccagctgttta
gtgtcagacctcaggggaacaagggctgctaagtctctttctcagtacttctttggaaaa
gctataaaaaaacaattaccagaaactcagccacctgtgacaaatttgagtgtctctgtt
gaaaacctctgcacagtaatatggacatggaatccacccgagggagccagctcaaattgt
agtctatggtattttagtcattttggcgacaaacaagataagaaaatagctccggaaact
cgtcgttcaatagaagtacccctgaatgagaggatttgtctgcaagtggggtcccagtgt
agcaccaatgagagtgagaagcctagcattttggttgaaaaatgcatctcacccccagaa
ggtgatcctgagtctgctgtgactgagcttcaatgcatttggcacaacctgagctacatg
aagtgttcttggctccctggaaggaataccagtcccgacactaactatactctctactat
tggcacagaagcctggaaaaaattcatcaatgtgaaaacatctttagagaaggccaatac
tttggttgttcctttgatctgaccaaagtgaaggattccagttttgaacaacacagtgtc
caaataatggtcaaggataatgcaggaaaaattaaaccatccttcaatatagtgccttta
acttcccgtgcagactatcttcttggtgccctggctttccttctcttagtggtctctgaa
gagccagtaaactatgcagggggatttacatttgaacaagagcctagagacgtaaatcta
caggttgattcggcaagtttggaatttctatatatgggagatggtccaggggaggaggga
catcaatttgcaacttga

>gi568815575f:118725245_118926450|GENSCAN_predicted_peptide_2|229_aa
MTWSTTAGGAHQLNTTTFTWQHLPAWQPLLLASIRLQLFFYVGLAFISLDLYYSSTSIKE
LEYNYTGDPGTSNCSTNYPIKFCNPPLVNGSLALAFHGTAPLPNWRWLVYDKLSPIPNNN
GFINQDFVVWMRMAALPTFRKLFRKLYGHIRQGNYSAGLPRCVYCVNITYNYLVRAFGGH
RLRIFSSILWMGGRNPFLCIAYVVVSSLCILTGFVMLVIYISYQDLPER

>gi568815575f:118725245_118926450|GENSCAN_predicted_CDS_2|690_bp
atgacctggagcaccacagccgggggcgcccaccagctcaacaccaccacattcacgtgg
cagcacctccctgcctggcaaccgctgctgttggccagcattaggctgcagctcttcttc
tacgtgggcctggccttcatcagcctggacctctattactcctccaccagcatcaaggag
ctggagtacaactacaccggcgacccgggcaccagcaactgctcgaccaactaccccatc
aagttctgcaacccaccactggtcaacggcagcctggcactggccttccatggcacagca
cccctgcccaactggcgctggctggtctacgacaagctcagccccatccccaacaacaac
ggcttcatcaaccaggacttcgtggtgtggatgcgcatggcagcgctgcccacgttccgc
aagctgttccgcaagctgtacgggcacatccgccagggcaactactcagctgggctgccg
cggtgtgtctactgtgtcaacatcacctacaactacctggtgcgtgcatttggcggccac
aggctccgcatcttcagcagcatcttgtggatgggtggcaggaaccccttcctgtgcatc
gcctatgtggtggtcagctccctctgcatccttactggctttgtcatgctggtcatctac
attagctaccaggacctaccagaaagatga

>gi568815575f:118725245_118926450|GENSCAN_predicted_peptide_3|80_aa
MLLIVPVIVAGAIIVLLLYLKRLKIIIFPPIPDPGKIFKEMFGDQNDDTLHWKKYDIYEK
QTKEETDSVVLIENLKKASQ

>gi568815575f:118725245_118926450|GENSCAN_predicted_CDS_3|243_bp
atgttactcattgttccagtcatcgtcgcaggtgcaatcatagtactcctgctttaccta
aaaaggctcaagattattatattccctccaattcctgatcctggcaagatttttaaagaa
atgtttggagaccagaatgatgatactctgcactggaagaagtacgacatctatgagaag
caaaccaaggaggaaaccgactctgtagtgctgatagaaaacctgaagaaagcctctcag
tga

>gi568815575f:118725245_118926450|GENSCAN_predicted_peptide_4|602_aa
MGSCAARASRTSTTPCSTAPSPIDHPRAEECERTVQDWQAAPPAAPVRDPRGENSPLPKA
VRGVRPRTSLIGRTSPLVTLFCLSWQRIKAAAAALGTALPRLCYPWLGGPAKQLLRAAPG
RLAAKEASVLCRLQGGDQKAESTAGDAGASAVHQRSTEFEATVTVPLGGLVGGMDVAAKS
TLLARRHRVALGGYTLSSSVMASIIARVGNSRRLNAPLPPWAHSMLRSLGRSLGPIMASM
ADRNMKLFSGRVVPAQGEETFENWLTQVNGVLPDWNMSEEEKLKRLMKTLRGPAREVMRV
LQATNPNLSVADFLRAMKLVFGESESSVTAHGKFFNTLQAQGEKASLYVIRLEVQLQNAI
QAGIIAEKDANRTRLQQLLLGGELSRDLRLRLKDFLRMYANEQERLPNFLELIRMVREEE
DWDDAFIKRKRPKRSESMVERAVSPVAFQGSPPIVIGSADCNVIEIDDTLDDSDEDVILV
ESQDPPLPSWGAPPLRDRARPQDEVLVIDSPHNSRAQFPSTSGGSGYKNNGPGEMRRARK
RKHTIRCSYCGEEGHSKETCDNESDKAQVFENLIITLQELTHTEMERSRVAPGEYNDFSE
PL

>gi568815575f:118725245_118926450|GENSCAN_predicted_CDS_4|1809_bp
atgggctcctgtgcggcccgagcctcccggacgagcactaccccctgctccacggcgccc
agtcccattgaccacccaagggctgaggaatgcgagcgcacagtgcaggactggcaggca
gctccacctgcagcccctgtgcgggatccacgaggcgaaaactcgcctctgcccaaggct
gtgcggggcgtccgcccccgcacttcgctgattggccgcactagcccgctcgtcacgctc
ttttgtctcagctggcagaggataaaagccgccgcggctgccttaggaacggcgctgcct
cgtctctgctacccctggttgggcggccctgcgaagcagctccttcgggcagccccgggt
cgcttagcggccaaggaggcttcagttctttgccgcctgcaaggcggagaccagaaggcg
gaatccacagctggcgacgcgggagcatctgctgtccaccagcggagcacagaatttgaa
gcaacagttaccgtccctcttggaggactggtgggagggatggatgtggctgcaaaaagc
accttgctagcacgcaggcatcgggtagctctgggaggttataccctatcgtcgtcagtc
atggctagcatcattgcacgtgtcggtaacagccggcggctgaatgcacccttgccgcct
tgggcccattccatgctgaggtccctggggagaagtctcggtcctataatggccagcatg
gcagacagaaacatgaagttgttctcggggagggtggtgccagcccaaggggaagaaacc
tttgaaaactggctgacccaagtcaatggcgtcctgccagattggaatatgtctgaggag
gaaaagctcaagcgcttgatgaaaacccttaggggccctgcccgcgaggtcatgcgtgtg
cttcaggcgaccaaccctaacctaagtgtggcagatttcttgcgagccatgaaattggtg
tttggggagtctgaaagcagtgtgactgcccatggtaaattttttaacaccctacaagct
caaggggagaaagcctccctttatgtgatccgtttagaggtgcagctccagaacgctatt
caggcaggcattatagctgagaaagatgcaaaccggactcgcttgcagcagctcctttta
ggcggtgagctgagtagggacctccgactcagacttaaggattttctcaggatgtatgca
aatgagcaggagcggcttcccaactttctggagttaatcagaatggtaagggaggaagag
gattgggatgatgcttttattaaacggaagcgtccaaaaaggtctgagtcaatggtggag
agggcagtcagccctgtggcatttcagggctccccaccgatagtgatcggcagtgctgac
tgcaatgtgatagagatagatgataccctcgacgactccgatgaggatgtgatcctggtg
gagtctcaggaccctccacttccatcctggggtgcccctcccctcagagacagggccaga
cctcaggatgaagtgctggtcattgattccccccacaattccagggctcagtttccttcc
accagtggtggttctggctataagaataacggtcctggggagatgcgtagagccaggaag
cgaaaacacacaatccgctgttcgtattgtggtgaggaaggccactcaaaagaaacctgt
gacaacgagagtgacaaggcccaggtttttgagaatttgatcatcactctccaggagctg
acccatactgagatggagaggtcaagagtggcccctggcgaatacaatgacttctctgag
ccactgtaa

>gi568815575f:118725245_118926450|GENSCAN_predicted_peptide_5|55_aa
MDQEEEEEDSAKETEKWASKGQEAGTVRGDVMETEGQDARKRGDCVKGHSETGFF

>gi568815575f:118725245_118926450|GENSCAN_predicted_CDS_5|168_bp
atggatcaggaagaagaagaagaagattcagcaaaggagaccgagaagtgggcatcaaaa
ggtcaggaagcgggaactgtgaggggggatgtcatggaaacggagggacaagatgccagg
aagcggggagattgtgtaaaagggcactcagagactggtttcttctga

>gi568815575f:118725245_118926450|GENSCAN_predicted_peptide_6|151_aa
MRLLPDFSRFRRCSPTVRPGHGETSRRWREETSPSQPLIPRSRPGIPRRSPTTLRVTPLS
RGRRPFRDAPLLLAPPGDQKRAELAPRGRACRSAGPYASRRRCLGPRGGNGPAGGTRACS
AKPRELTQARTGVGGSRFHPFLPSPEREKDL

>gi568815575f:118725245_118926450|GENSCAN_predicted_CDS_6|456_bp
atgcgactgctcccggacttcagtcgctttcgccgctgcagccccaccgtccggcccggg
catggcgaaacaagcaggcggtggagggaggaaaccagcccgagccaaccgctcattcct
cgcagccgtccggggatccccaggcggtcccctacgaccctgcgcgtgacccccctgtca
cgtggccgccgtccctttcgggatgcaccattgcttctggcgccccctggtgaccagaag
agagcagagctggcaccgcgagggcgagcctgccggtccgctggaccctacgcgagccgg
cggcgctgtctgggcccccgcggcggaaatggccccgccggtggcaccagggcgtgctcc
gcgaagcccagggagctgacccaagcccgaactggagtgggtggaagccgcttccaccct
ttcctcccttcccccgaaagggaaaaggacctctaa

>gi568815575f:118725245_118926450|GENSCAN_predicted_peptide_7|160_aa
MFLTALWALQPCCFFALSQVNHTLCHGLGVGGGSVRGVNKTDLPAGKDVWPNYESASKRR
EVIVEAYDSTRKLWHNQHHLPPSTCQFTAAVQKAYGLASRTVHKVKQALSVRLKRWSDIH
NPVSYRIRKSSAAIHSCWNLVLLGEDSQRSPYISDPKNLF

>gi568815575f:118725245_118926450|GENSCAN_predicted_CDS_7|483_bp
atgtttctgactgctttatgggctctgcagccttgctgcttctttgctttgagccaagtc
aatcatactctctgtcatggactgggggtggggggagggtcagtgagaggagtgaataaa
acagaccttcctgctggcaaagatgtgtggccaaactatgaatctgcaagcaaaagaagg
gaggtaattgtagaagcatatgacagcaccagaaagctctggcacaaccagcatcacctc
ccacccagcacctgccagttcacagcagctgttcagaaagcttatggccttgcgagcagg
acagttcacaaggtaaaacaagcactgtctgttagactgaagaggtggtctgatattcac
aatccagtctcttaccgaattcgaaaatcatcagcagctatacattcctgctggaatctg
gttctgttgggagaagacagtcagagaagtccttacatctctgaccccaaaaacctcttc
tga

>gi568815575f:118725245_118926450|GENSCAN_predicted_peptide_8|136_aa
MELFPCLGHFGLRFYAYNPLAGGLLTGKYKYEDKGRKQSGGHFFGNSWAETHRNRFWKEH
HFEAISLMQKATQAVYGVSAPSMTLATHPLVDVPPLTAAGCPQRRGHPGHVQPGEAGAEL
GSDGGRAHGPGCHGHF

>gi568815575f:118725245_118926450|GENSCAN_predicted_CDS_8|411_bp
atggagctcttcccctgccttgggcactttggactgaggttctatgcctacaaccctctg
gctgggggtctgctgaccggcaagtacaagtatgaggacaagggcaggaaacagtctggg
ggccacttctttgggaatagctgggctgagacccacaggaatcgcttctggaaggagcac
cactttgaggccatttccctgatgcagaaggccacacaggccgtgtatggtgtcagtgcc
cccagcatgaccttggccacccaccctctggtggatgtaccaccactcacagctgcaggg
tgcccacagagacgtggtcatcctgggcatgtccagcctggagaagctggagcagaactt
ggcagtgatggaggaagggcccatggacccggctgtcatggacacttttaa
>gi568815575f:118725245_118926450|GENSCAN_predicted_peptide_9|108_aa
MVMLSTASRTMPQAAIHCTSRSLPPEARMLAMSLPQPTQAALGILARRTMVLGIMEMGRH
MDTSASATAVHAFLERGHTELDMAFMYSDGQSKIILGGLRLGLGGGDC

>gi568815575f:118725245_118926450|GENSCAN_predicted_CDS_9|327_bp
atggtcatgttgagcaccgcgtctcgcaccatgccccaagccgccatccactgcacaagt
cgctccctgccacccgaggcccgcatgctcgccatgtcccttccacaaccaacgcaggcc
gccttgggcatcctggcccggcgcaccatggtgctgggcatcatggagatggggcgccac
atggacacatctgccagcgccacggccgtgcacgctttcctggagcgtggccacaccgag
ctggacatggccttcatgtacagcgatggccagtccaaaatcatcctgggcggcctgagg
ctggggctgggcggcggcgactgctga

>gi568815575f:118725245_118926450|GENSCAN_predicted_peptide_10|359_aa
MLPLREVPMGQGETGFVNAPLTSSEVRNFKKEMKPRLEDPLGLADQLDQFLGTSFYTWAE
MMSIMNILFTGEERGMIRRAAMTIWERQHPPRQGVLPTKQKFPNVDPKWDNNDPRDWTQM
QDLRELIIKGIKESTPRTQNVSKAFKIQQENEETPSAFLQRFLRKYSRLDLEDPVGQGLL
KVNFVTKSWPDITKKLQKSNEWNEKPVEELLREAQKVFVRREEEKQKQKARIMVSTVEKV
VRRRLVLDMILAEKGGVCVMLDGKCCTFIPNNIAPDGTITKVLQGLTTVANKLAENAGID
PFTDWLEECYSHEQPCLAVLTTSIQYRIVSSIQYNGKKGGKIALAQQEKEIKGKPSRRS

>gi568815575f:118725245_118926450|GENSCAN_predicted_CDS_10|1080_bp
atgctccctcttagagaagttcccatgggacagggagagactggctttgtaaatgctcct
cttacaagttctgaagttaggaatttcaagaaggaaatgaaaccacgcctagaagatccc
ctcggtttagcagaccagctggaccaattcctaggaaccagcttttacacctgggctgaa
atgatgtctatcatgaatattctgttcacaggagaagaaaggggaatgattaggagagca
gccatgaccatctgggagaggcaacaccctcccaggcaaggagtcttgccaaccaaacaa
aaatttccaaatgtcgatcctaaatgggataataatgatcccagggactggacccaaatg
caggacctcagggaactaataattaaagggatcaaagagtccactcctaggacacaaaat
gtctcaaaggcattcaagattcaacaagaaaatgaggaaactccctctgcattcctgcag
aggttcctgagaaaatactctagattagatctggaggacccagtagggcaaggccttttg
aaggttaactttgtaactaagagctggcctgacattacaaaaaaattacaaaagagtaat
gaatggaatgagaaaccagttgaagaattactgagggaagctcagaaggtctttgtaagg
agagaggaagagaagcagaaacaaaaagcgagaatcatggtttccactgtggaaaaggta
gtcagaagaagacttgtgctagacatgatactagcagaaaaagggggtgtatgtgttatg
ctagatgggaaatgttgtactttcattcccaacaatattgccccagatgggaccatcacc
aaagttttacaaggactgacaactgtagccaataaactggcagaaaatgctggaattgac
ccatttacagactggctagaagagtgctacagtcatgaacaaccgtgcctggctgttctc
actacttctattcaatatcgtattgtaagttctatccagtacaatggcaagaaaggggga
aaaatagctctagcacaacaagaaaaggaaataaaaggcaagccatcacggaggtcctaa

>gi568815575f:118725245_118926450|GENSCAN_predicted_peptide_11|164_aa
MGFSFILNDDEMTYDNKSDFKVETEAAERDTVEEEEGEEGEEEEEEEEEEEEEEEEEEEE
EEEEGGGGGGGGGGGGGGGGGGGEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEGEE
EEGEGEGEEEEKEEEGNSAIFDNMDGAGNIILSEINKTQKREAA

>gi568815575f:118725245_118926450|GENSCAN_predicted_CDS_11|495_bp
atggggttctctttcattttgaatgatgatgagatgacttacgataacaagtctgacttc
aaagtggagactgaggctgcagagcgagacactgtcgaagaagaagaaggagaagaggga
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gaagaagaagaaggaggaggaggaggaggaggaggaggaggaggaggaggaggaggagga
ggaggaggagaagaagaagaagaagaagaagaggaagaggaagaggaagaagaagaagag
gaagaggaagaggaagaggaagaagaagaagaagaagaagaagaagaagaaggagaagaa
gaagaaggggaaggggaaggggaggaagaggaaaaagaagaagaaggaaattctgccatt
ttcgacaacatggatggagctggaaacattatactaagtgaaataaacaagacacagaag
agagaagctgcatga