GENSCAN 1.0 Date run: 3-Nov-116 Time: 04:21:59 Sequence gi568815576r:19423602_19624255 : 200654 bp : 48.32% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 7351 8520 1170 0 0 45 44 406 0.002 20.70 1.02 Intr + 8894 9006 113 1 2 6 100 -14 0.002 -9.22 1.03 Intr + 11135 11293 159 2 0 42 116 166 0.996 13.90 1.04 Term + 12037 12361 325 1 1 9 38 333 0.968 14.64 1.05 PlyA + 12452 12457 6 1.05 2.14 PlyA - 13129 13124 6 1.05 2.13 Term - 20678 20664 15 2 0 100 48 30 0.469 -1.56 2.12 Intr - 23944 23777 168 1 0 78 85 308 0.982 29.64 2.11 Intr - 24089 24064 26 0 2 26 109 40 0.072 -2.36 2.10 Intr - 24789 24537 253 0 1 90 41 104 0.115 2.91 2.09 Intr - 31229 31148 82 1 1 118 105 22 0.863 6.54 2.08 Intr - 32167 32079 89 1 2 85 100 26 0.875 2.27 2.07 Intr - 33033 32986 48 0 0 111 77 65 0.565 6.58 2.06 Intr - 33317 33252 66 2 0 8 98 81 0.413 0.20 2.05 Intr - 34538 34470 69 2 0 104 92 109 0.769 12.28 2.04 Intr - 41686 41601 86 2 2 64 105 79 0.388 6.74 2.03 Intr - 44402 44219 184 2 1 106 -51 202 0.081 7.46 2.02 Intr - 48207 48086 122 1 2 102 92 118 0.911 13.81 2.01 Init - 51989 51869 121 2 1 60 36 146 0.505 6.95 2.00 Prom - 53600 53561 40 -5.56 3.00 Prom + 54467 54506 40 -7.96 3.01 Init + 54957 55034 78 2 0 40 29 119 0.830 2.26 3.02 Intr + 55520 55588 69 1 0 78 92 89 0.838 7.68 3.03 Intr + 55867 55970 104 0 2 19 71 65 0.493 -3.13 3.04 Intr + 56337 56418 82 0 1 66 66 168 0.842 12.04 3.05 Intr + 56557 56616 60 0 0 64 109 89 0.957 7.53 3.06 Intr + 59089 59226 138 0 0 77 80 59 0.685 4.66 3.07 Intr + 60261 60404 144 2 0 49 105 237 0.995 21.98 3.08 Intr + 70621 70781 161 0 2 92 105 45 0.738 5.39 3.09 Intr + 72380 72428 49 2 1 71 123 105 0.766 10.98 3.10 Intr + 73785 73891 107 2 2 107 57 124 0.853 10.21 3.11 Intr + 75500 75550 51 2 0 131 61 60 0.917 5.62 3.12 Intr + 81761 81874 114 2 0 71 -10 208 0.161 8.86 3.13 Intr + 83779 83916 138 1 0 69 84 100 0.233 7.28 3.14 Intr + 84165 84263 99 0 0 76 32 142 0.958 6.63 3.15 Intr + 84929 85090 162 2 0 72 69 201 0.968 15.69 3.16 Intr + 91148 91286 139 2 1 72 55 210 0.988 16.57 3.17 Intr + 91364 91447 84 1 0 135 85 29 0.969 7.32 3.18 Intr + 92926 93044 119 0 2 51 55 243 0.831 16.56 3.19 Intr + 93216 93292 77 0 2 53 94 74 0.717 3.66 3.20 Term + 95616 95761 146 1 2 32 37 115 0.416 -1.13 3.21 PlyA + 96981 96986 6 1.05 4.02 PlyA - 97651 97646 6 -0.45 4.01 Sngl - 100909 99998 912 1 0 101 55 1205 0.924 112.80 4.00 Prom - 111363 111324 40 -7.16 5.09 PlyA - 111875 111870 6 1.05 5.08 Term - 122336 122134 203 0 2 139 42 80 0.736 6.05 5.07 Intr - 147841 147767 75 2 0 87 77 40 0.048 2.29 5.06 Intr - 148268 148119 150 0 0 57 95 47 0.020 2.43 5.05 Intr - 167120 167073 48 0 0 90 99 39 0.007 3.85 5.04 Intr - 174746 174410 337 2 1 111 47 90 0.032 2.29 5.03 Intr - 181512 181384 129 0 0 94 55 72 0.498 5.39 5.02 Intr - 191979 191839 141 0 0 4 39 139 0.026 1.15 5.01 Init - 197986 197867 120 1 0 73 80 84 0.219 4.32 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 8954 9006 53 1 2 99 100 98 0.989 10.75 S.002 Init - 24087 24064 24 0 0 104 109 35 0.809 6.86 S.003 Init + 169163 169240 78 1 0 72 98 81 0.872 6.57 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:19423602_19624255|GENSCAN_predicted_peptide_1|588_aa GPQFPNWGHCPQIQRGSNSPSCRFRIFILPLKGHKGDFIPVSPSGPSEHFSGDADQWKAL AGTKARSWEPGRGPSKDHRWAAEDSEQEPGAKMALAAVNPEHVLGLTSLSALRTKSAPAG FLAWAPGTGRQGRDRQDLRAPRLDSRPRPDSGLGLPRPLRALTIVVDPGRLQELHCSAAA AAGLRRAPGPSARPGHGATAAASSRATRPPAAARPAPSGPPRHRRPAPPSAATAATRARP PPPPPQPHPLRRSSSGGSRATPEVTARTCQIAPAGNRSPRGLGRPGSSAGRGFGSGPGAG APERAVRLDGGQRAGPAVATRAARRGRRGPGGVDRPSPALPRQSESDLLTSFRHSSTRAI GHLASAGLRAGAGMRSGLRTRTNAGDEPTLQGAPRAGLWRGARPEAARVAMTASVLRSIS LALRPTSGSEPLRKKKKVDPKKDQEAKERLKRKIRKLEKATQELIPIEDFITPLKFLDKA RERPQVELTFEETERRALLLKKWSLYKQQERKMERDTIRAMLEAQQEALEELQLESPKLH AEAIKRDPNLFPFEKEGPHYTPPIPNYQPPEGRYNDITKVYTQVEFKR >gi568815576r:19423602_19624255|GENSCAN_predicted_CDS_1|1767_bp ggaccacaattccccaactgggggcactgtccccagatccagaggggcagtaacagtccc tcttgcagatttcggatcttcatcctgccactgaaaggccacaagggggatttcatcccg gtttcaccttcaggaccgtctgagcacttctcgggcgatgcagaccagtggaaggcactg gctggcaccaaggcccgctcctgggagccaggccgcggcccctccaaggaccacaggtgg gcagcggaagactcagagcaagagccaggggcgaagatggccctcgctgcggtcaacccg gagcacgtgctggggctcaccagcctctccgcgctgaggacaaagtccgctcccgccggc ttcttggcttgggcaccgggtaccgggcgtcaggggcgagacaggcaggacttgcgcgcg ccccgactcgactccagaccccgacccgactccgggctcggcctcccgcgacccctgcgc gcactcaccattgtggttgacccaggtcggcttcaggagcttcattgttcggccgccgcc gccgccgggctgaggcgagcgccgggtccctcagcgcgcccgggccatggagccaccgcc gccgcttcctcccgcgccacccgccctccggccgccgcccgccccgcgccctcagggccg ccgcgccatcgccggcccgcgcccccctccgccgccacagccgccacccgcgctcggccg ccgccgccgccaccacagccgcatcccctgcgccgctcctcctcaggcggctcccgggca acgccggaagtcacggcgcgcacctgccaaatcgccccggcgggaaaccgctccccacgc ggactgggccgccccggctcctccgctggcaggggcttcgggtcgggcccgggcgcgggc gccccagaaagggcggttcgcctggacggcggacagcgagcggggcctgcagttgcaacc cgggccgcccgcagaggcaggcggggcccaggtggcgtggaccgcccgtcaccagctctg cctcgccagtctgagtccgacttattaactagcttccgtcattcatcaacacgcgctatt gggcatcttgcgagcgccgggctccgcgccggcgccggaatgcgatccgggcttcggact cgaacgaatgcgggggacgagccaaccctgcagggggcaccgcgggccggactgtggagg ggcgcacgcccggaagcggcgagggtagccatgacggcctccgtgctgcgaagtatctcg ctagccctgcgcccgactagcggatcagaacctcttcgaaaaaagaagaaggtagatcct aaaaaagaccaagaagcaaaggagcgcttgaaaaggaagatccgaaaactggaaaaggct actcaagagctaattcctattgaagattttattacccctctaaagttcttggataaagca agagagcggcctcaggtggagctcacctttgaggagactgagaggagagctctgcttctg aagaagtggtccttgtacaagcagcaagagcgtaagatggagagggacaccatcagggct atgctagaagcccagcaggaagctctggaggaactgcaactggaatccccgaagctccat gctgaggccatcaagcgggatcctaacctgttcccctttgagaaggaagggccacattac acaccaccgatccctaactaccaaccccctgaaggcaggtacaatgacatcaccaaggtg tacacacaagtggagtttaagagatag >gi568815576r:19423602_19624255|GENSCAN_predicted_peptide_2|442_aa MFDHPIPRVFQNRFSTQYRCFSVSMLAGPNDRSDVEKGGKSRLNITYPMLFKLTNKNSDR MTHCGVLEFVADEGICYLPHWMMQNLLLEEGGLVQVESVNLQVATYSKFQPQSPDFLDIT NPKAVYLFQISGVLLDKGECAGECVCRLENALRNFACLTTGDVIAINYNEKIYELRVMET KPDKAVSIIECDMNVDFDAPLGYKEPERQVQHEESTEGEADHSGYAGELGFRAFSGSGNR LDGKKKGVEPSPSPIKPGDIKRGIPNYEFKLGKITFIRNSRPLVKKVEENRTTDIQATPT PLTPATHRLHTFDRPMAVLGLHSQPPLQACVPWGGHSTLDPPYVMPNQSQLLCEFTSILS GRSSKTAGAADLGDMADGSGWQPPRPCEAYRAEWKLCRSARHFLHHYYVHGERPACEQWQ RDLASCRDWEERRNAEAQEKDE >gi568815576r:19423602_19624255|GENSCAN_predicted_CDS_2|1329_bp atgttcgaccaccctattcccagggtcttccaaaaccgcttctccacacagtaccgctgc ttctctgtgtccatgctagcagggcctaatgacaggtcagatgtggagaaaggagggaag agccgacttaacattacctatcccatgctgttcaaactgaccaataagaattcggaccgc atgacgcattgtggcgtgctggagtttgtggctgatgagggcatctgctacctcccacac tggatgatgcagaacttactcttggaagaaggcggcctggtccaggtggagagcgtcaac cttcaagtggccacctactccaaattccaacctcagagccctgacttcctggacatcacc aaccccaaagccgtgtatctttttcaaataagcggagttcttctagacaaaggagagtgt gctggcgaatgtgtctgcagattagaaaacgcacttaggaactttgcctgtctgaccacc ggggatgtgattgccatcaactataatgaaaagatctacgaactgcgtgtgatggagacc aaacccgacaaggcagtgtccatcattgagtgtgacatgaacgtggactttgatgctccc ctgggctacaaagaacccgaaagacaagtccagcatgaggagtcgacagaaggtgaagcc gaccacagtggctatgctggagagctgggcttccgcgctttctctggatctggcaataga ctggatggaaagaagaaaggggtagagcccagcccctccccaatcaagcctggagatatt aaaagaggaattcccaattatgaatttaaacttggtaagataactttcatcagaaattca cgtccccttgtcaaaaaggttgaagagaatagaactacagacatccaagccactcctacc ccgctcacccctgctacacaccgcctccatacctttgaccggcctatggctgtactcgga ttgcattctcagcctccacttcaagcctgcgttccctggggaggtcattcaaccctagat ccaccatatgtcatgcccaaccaatctcagttactttgcgagttcacctccatcctgagt ggacgctcaagcaagactgctggagctgcagatctgggagacatggcggacggcagcggc tggcagccgccgcgcccctgcgaggcctaccgcgccgagtggaagctctgccgcagcgcc aggcacttcctacaccactactacgtccacggcgagcggccggcctgcgaacagtggcag cgcgacctggccagctgccgcgactgggaggagcgccggaacgccgaggcccaggagaag gacgagtga >gi568815576r:19423602_19624255|GENSCAN_predicted_peptide_3|706_aa MYQQVLASIVIITEHNTAVQIRRHEPAMQRRNPADRSPSRRCRCRRAKPTASASAIEDSG GTKLQILLQTTPKTYCKRKCCPIGASARRPAAVAMFVSDFRKEFYEVVQSQRVLLFVASD VDALCACKILQFHYFILINCGANVDLLDILQPDEDTIFFVCDTHRPVNVVNVYNDTQIKL LIKQDDDLEVPAYEDIFRDEEEDEEHSGNDSDGSEPSEKRTRLEEGSGPTDFCQIGTFLG CGDKFLVHLLHLLFSLSLYQEIVEQTMRRRQRREWEARRRDILFDYEQYEYHGTSSAMVM FELAWMLSKDLNDMLWYVAPAAAVWEYLLPWWAIVGLTDQWVQDKITQMKYVTDVGVLQR HVSRHNHRNEDEENTLSVDCTRISFEPSLRLVLYQHWSLHDSLCNTSYTAARFKLWSVHG QKRLQEFLADMGLPLKQVKQKFQAMDISLKENLREMIEESANKFGMKDMRVQTFSIHFGF KHKFLASDVVFATMSLMESPEKDGSGTDHFIQALDSLSRSNLDKLYHGLELAKKQLRATQ QTIASCLCTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCK LLPLVMAAPLSMEHGTVTVVGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSDS LVLPSKATWSLLSHSLRVRRWARARRAAIRAFVCLAHEKAKLLAAA >gi568815576r:19423602_19624255|GENSCAN_predicted_CDS_3|2121_bp atgtaccagcaggtattagcttccatcgttatcattaccgaacacaacaccgctgtacag attcgaaggcacgaacccgcaatgcaacgaagaaaccccgccgaccgctctcccagccgc cgctgccgctgccgccgcgccaagccgactgcctctgcttcagccatcgaggactcgggc ggaactaagctacaaatccttcttcaaaccacaccaaaaacctattgcaaacgtaaatgc tgccccattggcgcgagcgccaggcgtccggccgccgtggctatgttcgtgtccgatttc cgcaaagagttctacgaggtggtccagagccagagggtccttctcttcgtggcctcggac gtggatgctctgtgtgcgtgcaagatccttcagtttcattattttattctcataaactgt ggagctaatgtagacctattggatattcttcaacctgatgaagacactatattctttgtg tgtgacacccataggccagtcaatgtcgtcaatgtatacaacgatacccagatcaaatta ctcattaaacaagatgatgaccttgaagttcccgcctatgaagacatcttcagggatgaa gaggaggatgaagagcattcaggaaatgacagtgatgggtcagagccttctgagaagcgc acacggttagaagaggggtctgggcctactgacttctgccaaattggaaccttcttgggc tgtggggataaattcctggtgcatttgctccaccttttgttctctttgtccctgtatcag gagatagtggagcaaaccatgcggaggaggcagcggcgagagtgggaggcccggagaaga gacatcctctttgactacgagcagtatgaatatcatgggacatcgtcagccatggtgatg tttgagctggcttggatgctgtccaaggacctgaatgacatgctgtggtacgtagcccct gcggcagctgtgtgggagtatttgttgccttggtgggccatcgttggactaacagaccag tgggtgcaagacaagatcactcaaatgaaatacgtgactgatgttggtgtcctgcagcgc cacgtttcccgccacaaccaccggaacgaggatgaggagaacacactctccgtggactgc acacggatctcctttgagcccagcctccgcctggtgctctaccagcactggtccctccat gacagcctgtgcaacaccagctataccgcagccaggttcaagctgtggtctgtgcatgga cagaagcggctccaggagttccttgcagacatgggtcttcccctgaagcaggtgaagcag aagttccaggccatggacatctccttgaaggagaatttgcgggaaatgattgaagagtct gcaaataaatttgggatgaaggacatgcgcgtgcagactttcagcattcattttgggttc aagcacaagtttctggccagcgacgtggtctttgccaccatgtctttgatggagagcccc gagaaggatggctcagggacagatcacttcatccaggctctggacagcctctccaggagt aacctggacaagctgtaccatggcctggaactcgccaagaagcagctgcgagccacccag cagaccattgccagctgcctttgcaccaacctcgtcatctcccaggggcctttcctgtac tgctctctcatggagggcactccagatgtcatgctgttctctaggccggcatccctaagc ctgctcagcaaacacctgctcaagtcctttgtgtgttcgacaaagaaccggcgctgcaaa ctgctgcccctggtgatggctgcccccctgagcatggagcatggcacagtgaccgtggtg ggcatccccccagagaccgacagctcggacaggaagaacttttttgggagggcgtttgag aaggcagcggaaagcaccagctcccggatgctgcacaaccattttgacctctcagactcc ctggtgctgccctcgaaggccacctggagcctcctgtcccacagccttcgtgtcaggcgc tgggcaagagcccgcagagcagccatccgggcttttgtgtgcttggctcatgagaaagca aagctgttggcagccgcttag >gi568815576r:19423602_19624255|GENSCAN_predicted_peptide_4|303_aa MTRARIGCFGPGGRARGTESAPEPSKRVPPGRSWQTQEVRQTRGANGLGPRAGSAGAKAP GPAQGAAQHGLGGSAGLRVRVSPLAMGSAALEILGLVLCLVGWGGLILACGLPMWQVTAF LDHNIVTAQTTWKGLWMSCVVQSTGHMQCKVYDSVLALSTEVQAARALTVSAVLLAFVAL FVTLAGAQCTTCVAPGPAKARVALTGGVLYLFCGLLALVPLCWFANIVVREFYDPSVPVS QKYELGAALYIGWAATALLMVGGCLLCCGAWVCTGRPDLSFPVKYSAPRRPTATGDYDKK NYV >gi568815576r:19423602_19624255|GENSCAN_predicted_CDS_4|912_bp atgacccgcgcacggattggctgcttcgggccggggggccgggcccgggggacagaatcc gcccccgaaccttcaaagagggtaccccccggcaggagctggcagacccaggaggtgcga cagacccgcggggcaaacggactggggccaagagccgggagcgcgggcgcaaaggcacca gggcccgcccagggcgccgcgcagcacggccttgggggttctgcgggccttcgggtgcgc gtctcgcctctagccatggggtccgcagcgttggagatcctgggcctggtgctgtgcctg gtgggctgggggggtctgatcctggcgtgcgggctgcccatgtggcaggtgaccgccttc ctggaccacaacatcgtgacggcgcagaccacctggaaggggctgtggatgtcgtgcgtg gtgcagagcaccgggcacatgcagtgcaaagtgtacgactcggtgctggctctgagcacc gaggtgcaggcggcgcgggcgctcaccgtgagcgccgtgctgctggcgttcgttgcgctc ttcgtgaccctggcgggcgcgcagtgcaccacctgcgtggccccgggcccggccaaggcg cgtgtggccctcacgggaggcgtgctctacctgttttgcgggctgctggcgctcgtgcca ctctgctggttcgccaacattgtcgtccgcgagttttacgacccgtctgtgcccgtgtcg cagaagtacgagctgggcgcagcgctgtacatcggctgggcggccaccgcgctgctcatg gtaggcggctgcctcttgtgctgcggcgcctgggtctgcaccggccgtcccgacctcagc ttccccgtgaagtactcagcgccgcggcggcccacggccaccggcgactacgacaagaag aactacgtctga >gi568815576r:19423602_19624255|GENSCAN_predicted_peptide_5|400_aa MASSRASSCSLSKAPSAWPRVTEEERPSSRAALALHQIRQSGKDSLSMEALVVIWEVSHI QDTSSQAQVHPDSRGPIVQDVAPVIMMVQTGPPQQVSLTTHSPGIILLRNVNHNDISCRG SKADKQRGNEVPAPQQQRTNCMVAMFMQSTLCSWMPASGTPSQHTTGCMDGGLASWTWGD KAAQQPLQSLRELFLKQVNLLWESRNTFAGTIQPLGHPNLKPPPPPRGAALARGRSGTAA NLGLSASAPARCSFYGDPGSQIIPQLLILAATTLQAVWAHGILSSLSPRTKILLAKGTQA SSGKLDLNGVCLRVPRISHLCSISTTIRPETLKGPAFPPKAARRSQFAASLTTVVGPPGN CSACLTKLLSGLKLATAAIPALAGGSFRAVASAWVPLTMP >gi568815576r:19423602_19624255|GENSCAN_predicted_CDS_5|1203_bp atggcctcaagccgggcttcctcctgcagcctctccaaggctccttctgcctggcccaga gtcacggaggaagagcgcccgtcttcacgggctgccctggcgctgcatcagatccgccag tctgggaaggactccctctccatggaagctctcgttgtgatctgggaagtgtcccacatc caagacaccagttcccaagcccaggtgcatcccgacagccgtgggccgattgttcaggac gtggcaccagtcatcatgatggtccagacaggccctccccaacaagtgtccctcactacc cactccccagggatcatcctactgcgaaatgtgaaccacaatgacatcagctgcagagga agcaaagcggataaacagagaggcaatgaggtgcctgcaccccagcagcagagaaccaac tgcatggtagccatgttcatgcaatccactctctgttcctggatgccagcatctggcacc ccgtcccagcacacaacaggatgcatggatggtggcctggcgtcatggacctggggagac aaggcagcccagcagcccctgcagagcctcagggaactgttcctaaaacaagtgaacttg ctttgggaatccaggaacacctttgctggaacaatccagcccttgggacaccctaatttg aagcccccaccaccacccaggggtgcagctttggcgaggggccggtcaggaacagctgct aatctaggcttgtctgcctcagccccagctcgctgcagcttctacggagacccagggtca caaataattccacagctgttaattttggcagccacgactctgcaggcagtatgggctcat ggtatactatcttcattgtcacctcggacaaagatcctgttggccaaaggcacacaggca tcttcaggaaaactagatttaaatggggtgtgtctccgagtcccccgaatcagtcacctg tgctccatctccaccaccatcagacctgagactctcaaaggacctgcctttccgcccaaa gctgcacggcggagccagtttgctgcctctctgaccactgtggtgggccctcccgggaac tgctctgcctgcctcacaaagctgctctcagggctaaagctggcaactgcagccatccct gcccttgccgggggctccttcagggccgtggcttctgcctgggtgcccttaactatgccc tga