GENSCAN 1.0 Date run: 3-Nov-116 Time: 06:27:33 Sequence gi568815591f:75948161_76166407 : 218247 bp : 48.13% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 5829 6020 192 0 0 105 87 299 0.925 30.11 1.02 Intr + 24253 24301 49 1 1 122 113 44 0.969 8.98 1.03 Intr + 29745 29865 121 2 1 66 38 149 0.768 7.77 1.04 Intr + 31120 31223 104 2 2 103 31 -2 0.770 -4.51 1.05 Intr + 31291 31419 129 0 0 112 107 320 0.988 37.19 1.06 Intr + 32179 32328 150 0 0 80 101 296 0.987 30.46 1.07 Intr + 32888 33012 125 1 2 82 81 315 0.999 29.58 1.08 Intr + 33357 33446 90 0 0 83 99 102 0.999 9.91 1.09 Intr + 34064 34162 99 2 0 108 78 286 0.999 28.83 1.10 Intr + 35360 35476 117 2 0 99 81 223 0.999 22.28 1.11 Intr + 35578 35696 119 1 2 88 101 264 0.969 27.81 1.12 Intr + 36617 36798 182 0 2 113 91 488 0.999 51.19 1.13 Intr + 36898 37047 150 0 0 85 100 282 0.998 29.46 1.14 Intr + 37419 37689 271 2 1 63 96 491 0.659 44.51 1.15 Intr + 37763 37908 146 0 2 71 100 322 0.975 31.70 1.16 Intr + 37999 38081 83 0 2 71 92 142 0.810 11.34 1.17 Term + 38177 38321 145 2 1 94 50 362 0.999 30.38 1.18 PlyA + 38675 38680 6 1.05 2.23 PlyA - 38846 38841 6 1.05 2.22 Term - 39125 39012 114 2 0 113 54 50 0.999 2.67 2.21 Intr - 39268 39200 69 1 0 59 99 84 0.937 5.98 2.20 Intr - 39442 39378 65 2 2 126 113 17 0.999 6.44 2.19 Intr - 39650 39558 93 0 0 85 92 187 0.999 18.74 2.18 Intr - 39824 39763 62 1 2 80 70 48 0.997 0.58 2.17 Intr - 40181 40092 90 1 0 119 68 212 0.999 21.41 2.16 Intr - 40356 40261 96 2 0 91 80 272 0.999 25.82 2.15 Intr - 41064 41005 60 2 0 98 -4 118 0.732 1.45 2.14 Intr - 44100 43984 117 2 0 111 68 107 0.793 10.58 2.13 Intr - 44397 44279 119 0 2 104 -4 228 0.581 14.46 2.12 Intr - 46412 46330 83 0 2 24 71 138 0.033 5.06 2.11 Intr - 46718 46547 172 2 1 67 23 69 0.024 -2.18 2.10 Intr - 48439 48383 57 1 0 64 85 52 0.035 1.58 2.09 Intr - 52842 52730 113 1 2 119 85 25 0.605 5.40 2.08 Intr - 55695 55598 98 2 2 87 87 121 0.984 11.55 2.07 Intr - 57244 57099 146 1 2 83 89 143 0.989 12.98 2.06 Intr - 65727 65582 146 1 2 76 91 193 0.972 18.40 2.05 Intr - 73832 73691 142 2 1 79 98 209 0.985 20.93 2.04 Intr - 80543 80482 62 0 2 90 83 92 0.705 7.35 2.03 Intr - 82367 82261 107 1 2 60 106 21 0.257 0.96 2.02 Intr - 85885 85841 45 0 0 75 84 50 0.239 0.82 2.01 Init - 87378 87188 191 0 2 76 59 109 0.215 5.28 2.00 Prom - 88599 88560 40 -8.26 3.00 Prom + 89840 89879 40 -3.86 3.01 Init + 100001 100066 66 1 0 81 96 145 0.994 13.67 3.02 Intr + 100158 100235 78 2 0 53 64 75 0.701 1.35 3.03 Intr + 106670 106838 169 1 1 98 79 141 0.960 13.72 3.04 Intr + 109250 109333 84 0 0 70 72 49 0.631 1.29 3.05 Intr + 109809 109918 110 1 2 64 117 214 0.955 21.90 3.06 Intr + 112213 112338 126 0 0 50 116 219 0.999 21.78 3.07 Intr + 115355 115465 111 1 0 54 55 146 0.374 8.48 3.08 Intr + 116179 116278 100 0 1 74 80 99 0.993 7.38 3.09 Intr + 116642 116793 152 0 2 102 71 152 0.990 14.78 3.10 Term + 118119 118250 132 2 0 100 52 347 0.957 30.39 3.11 PlyA + 118408 118413 6 1.05 4.06 PlyA - 118432 118427 6 1.05 4.05 Term - 121242 121115 128 1 2 118 41 31 0.025 -0.16 4.04 Intr - 125846 125710 137 1 2 92 53 50 0.084 2.11 4.03 Intr - 138075 137946 130 1 1 9 109 100 0.003 3.95 4.02 Intr - 153248 153072 177 0 0 -9 106 137 0.661 5.69 4.01 Init - 155665 155653 13 1 1 81 97 15 0.654 2.06 4.00 Prom - 163459 163420 40 -3.66 5.02 PlyA - 163483 163478 6 -0.45 5.01 Sngl - 165318 164854 465 0 0 63 53 320 0.947 22.15 5.00 Prom - 170290 170251 40 -5.96 6.06 PlyA - 170602 170597 6 1.05 6.05 Term - 175998 175936 63 0 0 75 54 31 0.399 -3.71 6.04 Intr - 180622 180460 163 0 1 64 57 180 0.681 12.48 6.03 Intr - 181428 181319 110 0 2 45 27 78 0.361 -3.52 6.02 Intr - 186109 186049 61 0 1 63 115 47 0.383 3.64 6.01 Init - 202465 202422 44 1 2 113 51 79 0.089 4.93 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 6248 6338 91 2 1 69 43 87 0.846 -0.51 S.002 Init + 19922 19968 47 1 2 96 86 25 0.874 3.27 S.003 Init - 46410 46330 81 0 0 101 71 131 0.905 13.69 S.004 Init - 138103 137946 158 1 2 45 109 128 0.947 9.98 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591f:75948161_76166407|GENSCAN_predicted_peptide_1|757_aa XFMINMGDSHVDTSSTVSEAVAEEVSLFSMTDMILFSLIVGLLTYWFLFRKKKEEVPEFT KIQTLTSSVRESSFVEKMKKTILQSTFNWLNRIGCNSSPNGLWGIGFQAFQQLLEEEGGL TMAVTVRSKSQRNLEGTQSQEGKGAAWRAPACQACPGRNIIVFYGSQTGTAEEFANRLSK DAHRYGMRGMSADPEEYDLADLSSLPEIDNALVVFCMATYGEGDPTDNAQDFYDWLQETD VDLSGVKFAVFGLGNKTYEHFNAMGKYVDKRLEQLGAQRIFELGLGDDDGNLEEDFITWR EQFWPAVCEHFGVEATGEESSIRQYELVVHTDIDAAKVYMGEMGRLKSYENQKPPFDAKN PFLAAVTTNRKLNQGTERHLMHLELDISDSKIRYESGDHVAVYPANDSALVNQLGKILGA DLDVVMSLNNLDEESNKKHPFPCPTSYRTALTYYLDITNPPRTNVLYELAQYASEPSEQE LLRKMASSSGEGKELYLSWVVEARRHILAILQDCPSLRPPIDHLCELLPRLQARYYSIAS SSKVHPNSVHICAVVVEYETKAGRINKGVATNWLRAKEPAGENGGRALVPMFVRKSQFRL PFKATTPVIMVGPGTGVAPFIGFIQERAWLRQQGKEVGETLLYYGCRRSDEDYLYREELA QFHRDGALTQLNVAFSREQSHKVYVQHLLKQDREHLWKLIEGGAHIYVCGDARNMARDVQ NTFYDIVAELGAMEHAQAVDYIKKLMTKGRYSLDVWS >gi568815591f:75948161_76166407|GENSCAN_predicted_CDS_1|2274_bp nntttcatgatcaacatgggagactcccacgtggacaccagctccaccgtgtccgaggcg gtggccgaagaagtatctcttttcagcatgacggacatgattctgttttcgctcatcgtg ggtctcctaacctactggttcctcttcagaaagaaaaaagaagaagtccccgagttcacc aaaattcagacattgacctcctctgtcagagagagcagctttgtggaaaagatgaagaaa acgatcttgcaaagcacgttcaattggctgaaccgaattggctgcaactccagccccaac gggctctggggaataggatttcaggcttttcaacagctgctggaggaggagggtggactg acgatggctgtgacagtgagaagcaagtcccagaggaacttagaagggactcaaagccag gaaggaaagggggcggcctggagggcccccgcctgccaggcctgcccagggaggaacatc atcgtgttctacggctcccagacggggactgcagaggagtttgccaaccgcctgtccaag gacgcccaccgctacgggatgcgaggcatgtcagcggaccctgaggagtatgacctggcc gacctgagcagcctgccagagatcgacaacgccctggtggttttctgcatggccacctac ggtgagggagaccccaccgacaatgcccaggacttctacgactggctgcaggagacagac gtggatctctctggggtcaagttcgcggtgtttggtcttgggaacaagacctacgagcac ttcaatgccatgggcaagtacgtggacaagcggctggagcagctcggcgcccagcgcatc tttgagctggggttgggcgacgacgatgggaacttggaggaggacttcatcacctggcga gagcagttctggccggccgtgtgtgaacactttggggtggaagccactggcgaggagtcc agcattcgccagtacgagcttgtggtccacaccgacatagatgcggccaaggtgtacatg ggggagatgggccggctgaagagctacgagaaccagaagcccccctttgatgccaagaat ccgttcctggctgcagtcaccaccaaccggaagctgaaccagggaaccgagcgccacctc atgcacctggaattggacatctcggactccaaaatcaggtatgaatctggggaccacgtg gctgtgtacccagccaacgactctgctctcgtcaaccagctgggcaaaatcctgggtgcc gacctggacgtcgtcatgtccctgaacaacctggatgaggagtccaacaagaagcaccca ttcccgtgccctacgtcctaccgcacggccctcacctactacctggacatcaccaacccg ccgcgtaccaacgtgctgtacgagctggcgcagtacgcctcggagccctcggagcaggag ctgctgcgcaagatggcctcctcctccggcgagggcaaggagctgtacctgagctgggtg gtggaggcccggaggcacatcctggccatcctgcaggactgcccgtccctgcggcccccc atcgaccacctgtgtgagctgctgccgcgcctgcaggcccgctactactccatcgcctca tcctccaaggtccaccccaactctgtgcacatctgtgcggtggttgtggagtacgagacc aaggctggccgcatcaacaagggcgtggccaccaactggctgcgggccaaggagcctgcc ggggagaacggcggccgtgcgctggtgcccatgttcgtgcgcaagtcccagttccgcctg cccttcaaggccaccacgcctgtcatcatggtgggccccggcaccggggtggcacccttc ataggcttcatccaggagcgggcctggctgcgacagcagggcaaggaggtgggggagacg ctgctgtactacggctgccgccgctcggatgaggactacctgtaccgggaggagctggcg cagttccacagggacggtgcgctcacccagctcaacgtggccttctcccgggagcagtcc cacaaggtctacgtccagcacctgctaaagcaagaccgagagcacctgtggaagttgatc gaaggcggtgcccacatctacgtctgtggggatgcacggaacatggccagggatgtgcag aacaccttctacgacatcgtggctgagctcggggccatggagcacgcgcaggcggtggac tacatcaagaaactgatgaccaagggccgctactccctggacgtgtggagctag >gi568815591f:75948161_76166407|GENSCAN_predicted_peptide_2|748_aa MAVLGKAIMVTDNNVFIPHVFIDYYFAACKSGVLGEDLAVDKTGKTFPQKPGDNKQANKH NNFRMPDKCDFMNPHTPGSRMPGLLLCEPTELYNILNQATKLSRLTDPNYLCLLDVRSKW EYDESHVITALRVKKKNNEYLLPESVDLECVKYCVVYDNNSSTLEILLKDDDDDSDSDGD GKDLVPQAAIEYGRILTRLTHHPVYILKGGYERFSGTYHFLRTQKIIWMPQELDAFQPYP IEIVPGKVFVGNFSQACDPKIQKDLKIKAHVNVSMDTGPFFAGDADKLLHIRIEDSPEAQ ILPFLRHMCHFIEIHHHLGSVILIFSTQGISRSCAAIIAYLMHSNEQTLQRSWAYVKKCK NNMCPNRGLHDGPEPCWLHHAAGTVSAVQARGLQPSQSRSRPRVPGLATALAYGPAHTPP LSRIGWAMQPPPPGPLGDCLRDWEDLQQDFQNIQETHRLYRLKLEELTKLQNNCTSSITR QKKRLQELALALKKCKPSLPAEAEGAAQELENQMKERQGLFFDMEAYLPKKNGLYLSLVL GNVNVTLLSKQAKFAYKDEYEKFKLYLTIILILISFTCRFLLNSRVTDAAFNFLLVWYYC TLTIRESILINNGSRPDGLMYQKFRNQFLSFSMYQSFVQFLQYYYQSGCLYRLRALGERH TMDLTVEGFQSWMWRGLTFLLPFLFFGHFWQLFNALTLFNLAQDPQCKEWQVLMCGFPFL LLFLGNFFTTLRVVHHKFHSQRHGSKKD >gi568815591f:75948161_76166407|GENSCAN_predicted_CDS_2|2247_bp atggcggtgttgggaaaggccataatggtaactgacaataatgtattcattccacatgtg ttcattgactattactttgcagcatgtaaatctggggtattgggtgaggacctagcagtg gacaagacaggcaagaccttccctcagaaaccaggagacaataaacaagcaaataagcac aataacttcagaatgcctgacaaatgtgacttcatgaatccacatacaccaggcagcagg atgcctggtttgcttttatgtgaaccaacagagctttacaacatcctgaatcaggccaca aaactctccagattaacagaccccaactatctctgtttattggatgtccgttccaaatgg gagtatgacgaaagccatgtgatcactgcccttcgagtgaagaagaaaaataatgaatat cttctcccggagtctgtggacctggagtgtgtgaagtactgcgtggtgtatgataacaac agcagcaccctggagatactcttaaaagatgatgatgatgattcagactctgatggtgat ggcaaagatcttgtgcctcaagcagccattgagtatggcaggatcctgacccgcctcacc caccaccccgtctacatcctgaaagggggctatgagcgcttctcaggcacgtaccacttt ctccggacccagaagatcatctggatgcctcaggaactggatgcatttcagccatacccc attgaaatcgtgccagggaaggtcttcgttggcaatttcagtcaagcctgtgaccccaag attcagaaggacttgaaaatcaaagcccatgtcaatgtctccatggatacagggcccttt tttgcaggcgatgctgacaagcttctgcacatccggatagaagattccccggaagcccag attcttcccttcttacgccacatgtgtcacttcattgaaattcaccatcaccttggctct gtcattctgatcttttccacccaaggtatcagccgcagttgtgccgccatcatagcctac ctcatgcatagtaacgagcagaccttgcagaggtcctgggcctatgtcaagaagtgcaaa aacaacatgtgtccaaatcggggattgcatgacgggccagagccctgctggctgcaccac gcggctggcacagtcagtgcagtccaggcccgggggctgcagccgtcccagtcccggtcc aggccgcgagtcccggggctagcaaccgcccttgcatacggccccgcccataccccgccc ctgtcccggattggctgggccatgcagcccccgcccccgggcccgctgggcgactgcctg cgggactgggaggatctacagcaggacttccagaacatccaggagacccatcggctctac cgcctgaagctggaggagctgaccaaacttcagaacaattgcaccagctccatcacgcgg cagaagaagcggctccaggagctggccctcgccctgaagaaatgcaaaccctccctccca gcagaggccgagggggccgcacaggagctggagaaccagatgaaagagcgccaaggcctc ttctttgacatggaggcctatttgcctaagaagaatggattgtacctgagcctggttctg gggaacgtcaacgtcacgctcctgagcaagcaggctaagtttgcctacaaggacgagtat gagaagttcaagctctacctcaccatcatcctcatcctcatctccttcacttgccgcttc ctgctcaactccagggtgacagatgctgccttcaacttcctgctggtctggtactactgc accctgaccatccgggagagcatcctcatcaacaacggctcccggcccgacggtctcatg taccagaaattccggaaccaattcctctccttttccatgtaccagagcttcgtgcagttt ctccagtactactaccagagcggctgcctctaccgcctgcgggcgctgggcgagcggcac accatggacctcactgtggagggcttccagtcctggatgtggcggggcctcaccttcctg ctgccttttcttttctttggacacttctggcagctttttaacgcgctgacgttgttcaac ctggcccaggaccctcagtgcaaggagtggcaggtgcttatgtgcggctttcccttcctc ctccttttcctcggcaatttcttcaccaccctgagggttgtgcaccacaagtttcacagt cagcggcacgggagcaagaaggattga >gi568815591f:75948161_76166407|GENSCAN_predicted_peptide_3|375_aa MLSALARPASAALRRSFSTSAQASLDRRDARHWLETVAPGAGPDTGLENNAKVAVLGASG GIGQPLSLLLKNSPLVSRLTLYDIAHTPGVAADLSHIETKAAVKGYLGPEQLPDCLKGCD VVVIPAGVPRKPGMTRDDLFNTNATIVATLTAACAQHCPEAMICVIANPVNSTIPITAEV FKKHGVYNPNKIFGVTTLDIVRANTFVAELKGLDPARVNVPVIGGHAGKTIIPLISQVHA YDPVRGFECTPKVDFPQDQLTALTGRIQEAGTEVVKAKAGAGSATLSMAYAGARFVFSLV DAMNGKEGVVECSFVKSQETECTYFSTPLLLGKKGIEKNLGIGKVSSFEEKMISDAIPEL KASIKKGEDFVKTLK >gi568815591f:75948161_76166407|GENSCAN_predicted_CDS_3|1128_bp atgctctccgccctcgcccggcctgccagcgctgctctccgccgcagcttcagcacctcg gcccaggccagcctggaccgcagggatgcccggcactggctggagactgtggcgcccggg gcaggccctgacactggcctggagaacaatgctaaagtagctgtgctaggggcctctgga ggcatcgggcagccactttcacttctcctgaagaacagccccttggtgagccgcctgacc ctctatgatatcgcgcacacacccggagtggccgcagatctgagccacatcgagaccaaa gccgctgtgaaaggctacctcggacctgaacagctgcctgactgcctgaaaggttgtgat gtggtagttattccggctggagtccccagaaagccaggcatgacccgggacgacctgttc aacaccaatgccacgattgtggccaccctgaccgctgcctgtgcccagcactgcccggaa gccatgatctgcgtcattgccaatccggttaattccaccatccccatcacagcagaagtt ttcaagaagcatggagtgtacaaccccaacaaaatcttcggcgtgacgaccctggacatc gtcagagccaacacctttgttgcagagctgaagggtttggatccagctcgagtcaacgtc cctgtcattggtggccatgctgggaagaccatcatccccctgatctctcaggtacacgca tatgaccctgtgaggggcttcgagtgcacccccaaggtggactttccccaggaccagctg acagcactcactgggcggatccaggaggccggcacggaggtggtcaaggctaaagccgga gcaggctctgccaccctctccatggcgtatgccggcgcccgctttgtcttctcccttgtg gatgcaatgaatggaaaggaaggtgttgtggaatgttccttcgttaagtcacaggaaacg gaatgtacctacttctccacaccgctgctgcttgggaaaaagggcatcgagaagaacctg ggcatcggcaaagtctcctcttttgaggagaagatgatctcggatgccatccccgagctg aaggcctccatcaagaagggggaagatttcgtgaagaccctgaagtga >gi568815591f:75948161_76166407|GENSCAN_predicted_peptide_4|194_aa MEILILGLTEAVKEPYPVFESNPKFLYVEGLPDRIPFRSPPGLEFHDLKGSSMGVIKSNL LVKTKGATLDHCGSIDESEEWYGMDTYECTRAGVSIHRATEPEVSSRPRTVSSGPLQARL SIRAASSGPNPCLTTTTFGPAPGSLCRPQAPRKILIGEVLLDALPRHEVTLPSSGVPRYF LCVCLVDSQYLTRL >gi568815591f:75948161_76166407|GENSCAN_predicted_CDS_4|585_bp atggagatcctaattcttggactcactgaggcagtaaaagaaccatatcctgtgtttgaa tcaaatcccaagttcctgtacgtagaaggtttgccagacaggattccctttcgaagccct cctggtttggaattccatgacttgaaaggatcgtccatgggagtaataaaatcaaatttg ttggtaaaaactaagggtgctacacttgaccactgcggcagtatagatgagtctgaagaa tggtatgggatggatacttacgaatgcactcgagcaggggtctccatccacagggccaca gagccagaggtgagcagcaggcccagaactgtttccagtggccctctccaggcccggctc tccatccgggccgcgtccagcggtcccaacccctgcctcacgacaaccacgtttggccca gctcctggtagcctttgtaggccccaggctcctcgaaagatcttgataggggaagtcctc cttgatgctctgccaagacatgaagtaaccttgccctcctctggagtcccacggtacttc ctctgtgtttgtcttgtggattctcagtacctgactcgtctttag >gi568815591f:75948161_76166407|GENSCAN_predicted_peptide_5|154_aa MGEGKLKSSSRGSHGGSMREAGATRSPGVGIVSRPSPQFCTQLQNKRTVASTWKRMWIQT KWRTDTPSMDKILMEEVKLEEQLKEAVEEDKQALADTEGSEQSSQKLVEEGNMYSIQGFC KDSLEVADVLEKATQCVPEEEIKDNNPHLKNLSL >gi568815591f:75948161_76166407|GENSCAN_predicted_CDS_5|465_bp atgggcgaggggaaacttaaatcatcatcccggggcagccacggcggctccatgcgtgag gctggtgccacgcggtctcccggtgttggcattgtctccaggccttctcctcagttctgc acacagctacaaaacaaaagaacagttgccagcacttggaagaggatgtggatccaaacc aaatggaggacagatactccctctatggacaagatactcatggaagaagtcaagttagaa gagcagctgaaggaggctgtggaagaagataagcaagcactggcagatactgagggctca gagcagagcagccaaaaattggtggaggagggaaatatgtatagcattcagggcttctgc aaggactcgttagaggttgcagatgttttggagaaggcaacacagtgtgttccagaagaa gaaattaaagacaataaccctcacctgaagaacctctctctgtga >gi568815591f:75948161_76166407|GENSCAN_predicted_peptide_6|146_aa MPGFMQPGCVAAQSSHSCRQSTRLLGTVAQEKGEMPASQFITRQCSGYIQLVVKQTTGLH IRSSFFLECPFTLQWEEEAFASSQSSQGAQSLTFSKFEGKKTNEKTHEVTTVKKSSVRLP GSDQRRKCHAERNKGWENCELLIVNN >gi568815591f:75948161_76166407|GENSCAN_predicted_CDS_6|441_bp atgccgggcttcatgcagccgggctgcgtggctgcgcagagcagccatagctgcagacaa agcacacgactgctgggcactgtggctcaagagaagggggagatgccagcatcacagttc atcacacgccagtgctctggctacattcagcttgttgtgaagcaaaccacaggcttgcac attcgtagttccttcttcctggaatgtcctttcacactccagtgggaggaagaggccttt gccagcagtcagagcagccaaggggcccaatccctcacattctccaagtttgaaggaaag aaaaccaacgagaagacccacgaggttaccacagtgaagaaatcttcagtgcgtcttcca gggtcggatcaaagaaggaaatgtcatgccgagagaaacaaaggctgggaaaattgtgaa ctgctcattgtgaacaattga